python中的验证码识别库PyTesser

PyTesser

PyTesser is an Optical Character Recognition module for Python. It takes as input an image or image file and outputs a string.

PyTesser uses the Tesseract OCR engine, converting images to an accepted format and calling the Tesseract executable as an external script. A Windows executable is provided along with the Python scripts. The scripts should work in other operating systems as well.

Dependencies

PIL is required to work with images in memory. PyTesser has been tested with Python 2.4 in Windows XP.

Usage Example

>>>from pytesser import* 
>>> image =Image.open('fnord.tif')  # Open image object using PIL 
>>>print image_to_string(image)     # Run tesseract.exe on image 
fnord 
>>>print image_file_to_string('fnord.tif') 
fnord

(more examples in README)

 

pytesser下载

http://code.google.com/p/pytesser/

Tesseract OCR engine下载:

http://code.google.com/p/tesseract-ocr/

PIL官方下载

http://www.pythonware.com/products/pil/

 

django-simple-captcha

https://github.com/mbi/django-simple-captcha

Django Simple Captcha is an extremely simple, yet highly customizable Django application to add captcha images to any Django form.

http://django-simple-captcha.googlecode.com/files/Captcha3.png

Features

  • Very simple to setup and deploy, yet very configurable
  • Can use custom challenges (e.g. random chars, simple maths, dictionary word, ...)
  • Custom generators, noise and filter functions alter the look of the generated image
  • Supports text-to-speech audio output of the challenge text, for improved accessibility

Requirements

  • Django 1.0+
  • A fairly recent version of the Python Imaging Library (PIL) compiled with FreeType support
  • Flite is required for text-to-speech (audio) output, but not mandatory

Documentation

Read the documentation online.

posted @ 2012-08-09 15:10  方倍工作室  阅读(15468)  评论(0编辑  收藏  举报