亚洲在线久爱草,狠狠天天香蕉网,天天搞日日干久草,伊人亚洲日本欧美

為了賬號安全,請及時綁定郵箱和手機立即綁定
已解決430363個問題,去搜搜看,總會有你想問的

使用 pytesseract 執行 OCR 時出錯

使用 pytesseract 執行 OCR 時出錯

繁花不似錦 2021-11-16 15:45:53
FileNotFoundError: [WinError 2] The system cannot find the file specified.During handling of the above exception, another exception occurred: pytesseract.pytesseract.TesseractNotFoundError: tesseract is not installed or it's not in your path.  我正在使用 Pycharm 社區并嘗試為 OCR 安裝 tesseract。我的代碼如下:import cv2import numpy as npimport pytesseractfrom PIL import Imagefrom pytesseract import image_to_string# Path of working folder on Disksrc_path = "C:/Users/fsipl/Desktop/"def get_string(img_path):    # Read image with opencv    img = cv2.imread(img_path)    # Convert to gray    img = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)    # Apply dilation and erosion to remove some noise    kernel = np.ones((1, 1), np.uint8)    img = cv2.dilate(img, kernel, iterations=1)    img = cv2.erode(img, kernel, iterations=1)    # Write image after removed noise    cv2.imwrite(src_path + "removed_noise.png", img)    #  Apply threshold to get image with only black and white    #img = cv2.adaptiveThreshold(img, 255, cv2.ADAPTIVE_THRESH_GAUSSIAN_C, cv2.THRESH_BINARY, 31, 2)    # Write the image after apply opencv to do some ...    cv2.imwrite(src_path + "thres.png", img)    # Recognize text with tesseract for python    result = pytesseract.image_to_string(Image.open(src_path + "thres.png"))    # Remove template file    #os.remove(temp)    return resultprint('--- Start recognize text from image ---')print(get_string(src_path+"word_text.jpg"))print("------ Done -------")
查看完整描述

1 回答

?
Smart貓小萌

TA貢獻1911條經驗 獲得超7個贊

是的,我通過單行更改解決了問題。

我們必須提供pytesseract exe的可執行路徑

pytesseract.pytesseract.tesseract_cmd = 'C:\Program Files (x86)\Tesseract-OCR\tesseract.exe'

這是下面的代碼:

def get_string(img_path):

    # Read image with opencv

    img = cv2.imread(img_path)


    # Convert to gray

    img = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)


    # Apply dilation and erosion to remove some noise

    kernel = np.ones((1, 1), np.uint8)

    img = cv2.dilate(img, kernel, iterations=1)

    img = cv2.erode(img, kernel, iterations=1)


    # Write image after removed noise

    cv2.imwrite(src_path + "removed_noise.png", img)


    #  Apply threshold to get image with only black and white

    # img = cv2.adaptiveThreshold(img, 255, cv2.ADAPTIVE_THRESH_GAUSSIAN_C, cv2.THRESH_BINARY, 31, 2)


    # Write the image after apply opencv to do some ...

    cv2.imwrite(src_path + "thres.png", img)


    # Recognize text with tesseract for python

    pytesseract.pytesseract.tesseract_cmd = 'C:\\Program Files (x86)\\Tesseract-OCR\\tesseract.exe'

    result = pytesseract.image_to_string(Image.open(src_path + "thres.png"))


    # Remove template file

    # os.remove(temp)


    return result


查看完整回答
反對 回復 2021-11-16
  • 1 回答
  • 0 關注
  • 333 瀏覽
慕課專欄
更多

添加回答

舉報

0/150
提交
取消
微信客服

購課補貼
聯系客服咨詢優惠詳情

幫助反饋 APP下載

慕課網APP
您的移動學習伙伴

公眾號

掃描二維碼
關注慕課網微信公眾號