OCR on variable-length images with TensorFlow (digits + letters, variable length n) - CNN+RNN+CTC

First, install the required libraries:

tensorflow_gpu==1.15.0
numpy
opencv_python

GitHub:

https://github.com/bai-shang/crnn_ctc_ocr_tf

Download the dataset:

http://www.robots.ox.ac.uk/~vgg/data/text/mjsynth.tar.gz
The archive is about 10 GB.

Then extract it. I estimate a full extraction takes about a day.

find ./mnt/ | xargs ls -d | grep jpg > image_list_all.txt

# use some of data to train and eval
cat image_list_all.txt | head -n 1000 > image_list.txt
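
On a machine without `find`, `xargs` and `grep` (the tfrecord command later in this post uses a Windows path), the same list could be built in Python. This is only a minimal sketch: `list_jpgs` and `write_list` are hypothetical helper names, not part of the crnn_ctc_ocr_tf repository. It also matches on the `.jpg` extension rather than on any path containing "jpg", and sorts the paths, so the first 1000 entries may differ from what the shell pipeline picks.

```python
import os


def list_jpgs(root, limit=None):
    """Walk `root` and return the sorted paths of all .jpg files.

    If `limit` is given, only the first `limit` paths are returned,
    mirroring `head -n 1000` in the shell version.
    """
    paths = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            if name.lower().endswith(".jpg"):
                paths.append(os.path.join(dirpath, name))
    paths.sort()
    return paths if limit is None else paths[:limit]


def write_list(paths, out_file):
    """Write one image path per line, like the shell redirection does."""
    with open(out_file, "w", encoding="utf-8") as f:
        for p in paths:
            f.write(p + "\n")


if __name__ == "__main__":
    # Assumes the archive was extracted under ./mnt/ as in the commands above.
    write_list(list_jpgs("./mnt/"), "image_list_all.txt")
    write_list(list_jpgs("./mnt/", limit=1000), "image_list.txt")
```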

This text file listing the images is the input used to generate the tfrecord files:

python create_synth90k_tfrecord.py --image_dir C:\Users\McKay\PycharmProjects\test8\tfdemo\data --anno_file ./image_list.txt --char_map_json_file ../char_map/char_map.json
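
To get a feel for what the conversion has to do with each entry in the list, here is a rough sketch of turning a file name into a label and then into integer ids. It assumes the MJSynth naming convention `<id>_<word>_<lexicon index>.jpg`. The `CHAR_MAP` below is a made-up mapping for illustration and may not match the layout of the repository's `char_map.json`; `label_from_filename` and `encode_label` are hypothetical names, not functions from `create_synth90k_tfrecord.py`.

```python
import os
import string

# Hypothetical character map: digits first, then lowercase letters.
# The real char_map.json in the repository may be laid out differently.
CHAR_MAP = {c: i for i, c in enumerate(string.digits + string.ascii_lowercase)}


def label_from_filename(path):
    """Extract the word from an MJSynth-style name such as '182_slinking_71711.jpg'."""
    stem = os.path.splitext(os.path.basename(path))[0]
    parts = stem.split("_")
    if len(parts) < 3:
        raise ValueError("unexpected file name: %r" % path)
    # Everything between the leading id and the trailing lexicon index.
    return "_".join(parts[1:-1])


def encode_label(label, char_map=CHAR_MAP):
    """Map each character to its integer id; labels are lowercased first."""
    return [char_map[c] for c in label.lower()]


if __name__ == "__main__":
    word = label_from_filename("./mnt/3000/7/182_slinking_71711.jpg")
    print(word, encode_label(word))
```

Because each label has as many ids as the word has characters, the labels are naturally variable-length, which is what the CTC loss is meant to handle.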

Next comes training.

Parameters:

--data_dir ../data/tfrecords/ --model_dir ./model/ --batch_size 32 --char_map_json_file ../char_map/char_map.json
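
As background on the CTC part named in the title: the network emits one class id per time step, and a variable-length string is recovered by collapsing repeats and dropping a special blank class. The following is a minimal sketch of that greedy decoding rule in plain Python, not the decoder the repository actually uses. The blank index (36, that is, one past a 36-character alphabet) is an assumption made for the example.

```python
def ctc_greedy_decode(frame_ids, blank):
    """Collapse a per-frame id sequence into a label sequence.

    Rule: merge consecutive duplicates, then remove blanks. A blank
    between two identical ids keeps them as two separate characters.
    """
    decoded = []
    prev = None
    for idx in frame_ids:
        if idx != prev and idx != blank:
            decoded.append(idx)
        prev = idx
    return decoded


if __name__ == "__main__":
    BLANK = 36  # assumed blank index for this illustration
    frames = [36, 10, 10, 36, 10, 11, 11, 36]
    print(ctc_greedy_decode(frames, BLANK))  # [10, 10, 11]
```

The input has eight frames but the output has three ids, which is how a fixed number of time steps can yield text of any shorter length.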

Without a GPU, 4 hours of training reached only 7% accuracy.

I stopped the run there and will retry when I have a GPU available.

posted @ 2020-02-12 21:25 McKay
