OCR文字识别-开源方案本地部署
1. 开源OCR识别程序:
https://github.com/DayBreak-u/chineseocr_lite https://github.com/alisen39/TrWebOCR
2. 本地测试,选择了TrWebOCR:
https://github.com/alisen39/TrWebOCR
3. 部署步骤:
--- 确保本地基础环境python3已经安装完成 [root@VM-24-11-centos mnt]# python3 Python 3.6.8 (default, Nov 16 2020, 16:55:22) [GCC 4.8.5 20150623 (Red Hat 4.8.5-44)] on linux Type "help", "copyright", "credits" or "license" for more information. >>> exit() [root@VM-24-11-centos mnt]# --- docker方式部署: --- 拉取镜像:docker pull mmmz/trwebocr:latest --- 开放端口,重启网络 systemctl restart network && systemctl restart docker --- 确保IPV4允许访问:sysctl net.ipv4.ip_forward docker run -itd --rm -p 8089:8089 --name trwebocr mmmz/trwebocr:latest
浏览器-访问测试:IP+端口:
命令行调用接口服务:
root@b3aaf9b09687:/# python3 Python 3.7.8 (default, Jun 30 2020, 18:36:05) [GCC 8.3.0] on linux Type "help", "copyright", "credits" or "license" for more information. >>> import requests >>> import base64 >>> def img_to_base64(img_path): ... with open(img_path, 'rb')as read: ... b64 = base64.b64encode(read.read()) ... return b64 >>> url = 'http://101.99.11.11:8089/api/tr-run/' >>> >>> img_b64 = img_to_base64('/mnt/no1.png') >>> img_b64 b'iVBORw0KGgoAAAANSUhEUgAAANwAAAASCAIAAAC/2icSAAAAt0lEQVR42u2Y0Q6AIAgA+f9f7Sd6aG2mhqQxKe+ejNyacENTNoBgCCkApARASkBKgHeklJP8dS1SzvQILvJ1qEuZpikbl2ktxx7BRb4Oz6Q8Bkr67OnunjlLysG1I6Vjp5woZfPRu1Mi5fekvMu7fjLTC9OMRJAyW6O++3OmnCZlWqrfS0mnDCqlpVp2q/p2c6RESmvE3gstRQolJX/fQe8p5YpyWhqfyT3l6lICICUAUgJSAnSzA9yWKoIsN4K/AAAAAElFTkSuQmCC' >>> res = requests.post(url=url, data={'img': img_b64}) >>> res <Response [200]> >>> >>> print(res) <Response [200]> >>> print(res.json()) {'code': 200, 'msg': '成功', 'data': {'raw_out': [[[62.49999237060547, 9.499998092651367, 130.99998474121094, 17.714282989501953, -0.0], '012920210000013088', 0.9998447299003601]], 'speed_time': 0.07, 'img_detected': ''}} >>> >>> >>> img_b64 = img_to_base64('/mnt/wz1.png') >>> res = requests.post(url=url, data={'img': img_b64}) >>> print(res.json()) {'code': 200, 'msg': '成功', 'data': {'raw_out': [[[54.0, 9.5, 112.0, 17.71428680419922, -0.0], '测试文字1', 0.952775502204895]], 'speed_time': 0.06, 'img_detected': ''}} >>> >>> img_b64 = img_to_base64('/mnt/wz2.png') >>> res = requests.post(url=url, data={'img': img_b64}) >>> print(res.json()) {'code': 200, 'msg': '成功', 'data': {'raw_out': [[[78.0, 9.5, 160.0, 17.71428680419922, 0.0], '测试文字2', 0.9663978261607034]], 'speed_time': 0.08, 'img_detected': ''}}
其他方案:
百度:https://gitee.com/paddlepaddle/PaddleOCR/tree/release/2.6
PDF识别:https://blog.csdn.net/pinkuang3943/article/details/119610920
https://aistudio.baidu.com/aistudio/projectdetail/3552051
https://aistudio.baidu.com/aistudio/projectdetail/3492909?channelType=0&channel=0