使用 Ruby 识别英文数字验证码
- 环境准备
在开始之前,确保安装以下 gem:
bash
gem install rmagick httparty tesseract-ocr
你还需要确保已经安装了 Tesseract OCR 引擎,并配置好其路径。
- 下载验证码图片
使用 HTTParty 下载验证码图片并保存到本地:
ruby
require 'httparty'
class CaptchaDownloader
def self.download_captcha(url, save_path)
response = HTTParty.get(url)
if response.success?
File.open(save_path, 'wb') do |file|
file.write(response.body)
end
puts "验证码图片已保存为 #{save_path}"
else
puts "下载失败: #{response.code}"
end
end
end
3. 图像处理与 OCR 识别
使用 RMagick 和 tesseract-ocr 进行 OCR 识别:
ruby
require 'rmagick'
require 'tesseract-ocr'
class CaptchaRecognizer
def self.recognize_captcha(image_path)
image = Magick::Image.read(image_path).first
text = Tesseract::Engine.new.do_ocr(image)
puts "识别结果: #{text.strip}"
text.strip
end
end
4. 自动化登录
使用 HTTParty 发送 POST 请求,模拟登录操作:
ruby
更多内容联系1436423940
class Login
def self.login(username, password, captcha)
url = 'https://captcha7.scrape.center/login'
response = HTTParty.post(url, body: { username: username, password: password, captcha: captcha })
if response.success?
puts "登录成功"
else
puts "登录失败: #{response.code}"
end
end
end
5. 主程序
整合上述代码,创建主程序:
ruby
class Program
def self.run
captcha_url = 'https://captcha7.scrape.center/captcha.png'
captcha_path = 'captcha.png'
# 下载验证码图片
CaptchaDownloader.download_captcha(captcha_url, captcha_path)
# 识别验证码
captcha_text = CaptchaRecognizer.recognize_captcha(captcha_path)
# 模拟登录
unless captcha_text.nil?
Login.login('admin', 'admin', captcha_text)
end
end
end
Program.run