OCR in Python with the pytesseract library

pip install pytesseract



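Error: tesseract is not installed or it's not in your path

pytesseract is only a Python wrapper. The Tesseract OCR engine itself has to be installed separately, and its executable must either be on the system PATH or be assigned to pytesseract.pytesseract.tesseract_cmd.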
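One way to deal with this error from Python is to locate the Tesseract executable before making any OCR call. The following is a minimal sketch, not the author's method: the helper names find_tesseract and configure_pytesseract are hypothetical, and the Windows path in the comment is only an assumed typical install location that may differ on your machine.

```python
import shutil
from pathlib import Path


def find_tesseract(candidates=()):
    """Return the path of the tesseract executable, or None if not found.

    The system PATH is searched first; `candidates` is an optional list of
    extra locations to try (hypothetical, supplied by the caller).
    """
    found = shutil.which("tesseract")
    if found:
        return found
    for candidate in candidates:
        if Path(candidate).is_file():
            return str(candidate)
    return None


def configure_pytesseract(candidates=()):
    """Point pytesseract at the executable, or fail with a readable message."""
    # Imported here so find_tesseract() can be used without pytesseract.
    import pytesseract

    path = find_tesseract(candidates)
    if path is None:
        raise RuntimeError(
            "Tesseract executable not found: install Tesseract or add it to PATH"
        )
    # tesseract_cmd is the attribute pytesseract reads to launch the engine.
    pytesseract.pytesseract.tesseract_cmd = path
    return path


# Example call (assumed install location, adjust as needed):
# configure_pytesseract([r"C:\Program Files\Tesseract-OCR\tesseract.exe"])
```

Calling configure_pytesseract() once at program start is enough; later calls to pytesseract.image_to_string then use the configured executable.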

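Recognizing Chinese requires an additional language data file: download chi_sim.traineddata (Simplified Chinese) and place it in Tesseract's tessdata directory.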
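Before running OCR on Chinese text, it can help to confirm that the language data is actually installed. The sketch below assumes a pytesseract version that provides get_languages; the helper names missing_languages and check_languages are hypothetical and introduced only for illustration.

```python
def missing_languages(lang, installed):
    """Return the codes in a Tesseract lang string that are not installed.

    `lang` uses Tesseract's syntax, where several languages are joined
    with '+', for example 'chi_sim+eng'.
    """
    requested = [code for code in lang.split("+") if code]
    return [code for code in requested if code not in installed]


def check_languages(lang):
    """Query the local Tesseract install and report missing language data."""
    # Imported here so missing_languages() works without pytesseract.
    import pytesseract

    installed = pytesseract.get_languages(config="")
    return missing_languages(lang, installed)


# Example: check_languages('chi_sim') returns ['chi_sim'] when the
# Simplified Chinese data file has not been installed yet.
```

An empty list from check_languages means every requested language is available.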

Image: English.png



Image: Chinese.png



Recognition

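import pytesseract
from PIL import Image

# Load the two test images
im_en = Image.open('English.png')
im_ch = Image.open('Chinese.png')

# The default recognition language is English ('eng')
print('======== English text ========')
print(pytesseract.image_to_string(im_en), '\n\n')

# Simplified Chinese needs the chi_sim language data to be installed
print('======== Chinese text ========')
print(pytesseract.image_to_string(im_ch, lang='chi_sim'))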

Result
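image_to_string returns an ordinary Python string, so the output can be cleaned up before use. The sketch below rests on an assumption not shown in the original result: that the Chinese output may contain stray spaces between characters. The helper names strip_cjk_spaces and recognize_mixed are hypothetical, and the 'chi_sim+eng' setting is an illustrative choice for images that mix Chinese and English.

```python
import re

# A run of spaces or tabs that sits directly between two CJK ideographs.
_CJK_GAP = re.compile(r"(?<=[\u4e00-\u9fff])[ \t]+(?=[\u4e00-\u9fff])")


def strip_cjk_spaces(text):
    """Remove spaces between adjacent Chinese characters, keep all others."""
    return _CJK_GAP.sub("", text)


def recognize_mixed(image_path):
    """OCR an image that may contain both Chinese and English text."""
    # Imported here so strip_cjk_spaces() works without these packages.
    import pytesseract
    from PIL import Image

    with Image.open(image_path) as image:
        raw = pytesseract.image_to_string(image, lang="chi_sim+eng")
    return strip_cjk_spaces(raw)
```

Spaces next to Latin letters are left alone, so English words in the same line keep their separation.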

posted @ 2020-01-14 13:17  三个零