OCR文字识别-开源方案本地部署

1. 开源OCR识别程序:

https://github.com/DayBreak-u/chineseocr_lite
https://github.com/alisen39/TrWebOCR

2. 本地测试,选择了TrWebOCR

https://github.com/alisen39/TrWebOCR

3. 部署步骤:

--- 确保本地基础环境python3已经安装完成
[root@VM-24-11-centos mnt]# python3
Python 3.6.8 (default, Nov 16 2020, 16:55:22)
[GCC 4.8.5 20150623 (Red Hat 4.8.5-44)] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> exit()
[root@VM-24-11-centos mnt]#

--- docker方式部署:
--- 拉取镜像:docker pull mmmz/trwebocr:latest
--- 开放端口,重启网络
systemctl restart network && systemctl restart docker
--- 确保IPV4允许访问:sysctl net.ipv4.ip_forward
docker run -itd --rm -p 8089:8089 --name trwebocr mmmz/trwebocr:latest

浏览器-访问测试:IP+端口:

命令行调用接口服务:

root@b3aaf9b09687:/# python3
Python 3.7.8 (default, Jun 30 2020, 18:36:05)
[GCC 8.3.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import requests
>>> import base64
>>> def img_to_base64(img_path):
... with open(img_path, 'rb')as read:
... b64 = base64.b64encode(read.read())
... return b64
>>> url = 'http://101.99.11.11:8089/api/tr-run/'
>>>
>>> img_b64 = img_to_base64('/mnt/no1.png')
>>> img_b64
b'iVBORw0KGgoAAAANSUhEUgAAANwAAAASCAIAAAC/2icSAAAAt0lEQVR42u2Y0Q6AIAgA+f9f7Sd6aG2mhqQxKe+ejNyacENTNoBgCCkApARASkBKgHeklJP8dS1SzvQILvJ1qEuZpikbl2ktxx7BRb4Oz6Q8Bkr67OnunjlLysG1I6Vjp5woZfPRu1Mi5fekvMu7fjLTC9OMRJAyW6O++3OmnCZlWqrfS0mnDCqlpVp2q/p2c6RESmvE3gstRQolJX/fQe8p5YpyWhqfyT3l6lICICUAUgJSAnSzA9yWKoIsN4K/AAAAAElFTkSuQmCC'
>>> res = requests.post(url=url, data={'img': img_b64})
>>> res
<Response [200]>
>>>
>>> print(res)
<Response [200]>
>>> print(res.json())
{'code': 200, 'msg': '成功', 'data': {'raw_out': [[[62.49999237060547, 9.499998092651367, 130.99998474121094, 17.714282989501953, -0.0], '012920210000013088', 0.9998447299003601]], 'speed_time': 0.07, 'img_detected': ''}}
>>>
>>>
>>> img_b64 = img_to_base64('/mnt/wz1.png')
>>> res = requests.post(url=url, data={'img': img_b64})
>>> print(res.json())
{'code': 200, 'msg': '成功', 'data': {'raw_out': [[[54.0, 9.5, 112.0, 17.71428680419922, -0.0], '测试文字1', 0.952775502204895]], 'speed_time': 0.06, 'img_detected': ''}}
>>>
>>> img_b64 = img_to_base64('/mnt/wz2.png')
>>> res = requests.post(url=url, data={'img': img_b64})
>>> print(res.json())
{'code': 200, 'msg': '成功', 'data': {'raw_out': [[[78.0, 9.5, 160.0, 17.71428680419922, 0.0], '测试文字2', 0.9663978261607034]], 'speed_time': 0.08, 'img_detected': ''}}

 其他方案:

百度:https://gitee.com/paddlepaddle/PaddleOCR/tree/release/2.6

PDF识别:https://blog.csdn.net/pinkuang3943/article/details/119610920

https://aistudio.baidu.com/aistudio/projectdetail/3552051

https://aistudio.baidu.com/aistudio/projectdetail/3492909?channelType=0&channel=0

posted @ 2022-03-10 18:08  hello-Jesson  阅读(2578)  评论(0编辑  收藏  举报