OCR-CAPTCHA

Property	Value
Author	xiaolv
Model URL	https://huggingface.co/xiaolv/ocr-captcha
Base Model	DAMO Text Recognition Model

What is ocr-captcha?

OCR-CAPTCHA is a specialized optical character recognition model designed specifically for identifying and decoding various types of CAPTCHA verification codes. The model comes in two variants: a small model trained on 84,000 CAPTCHA images achieving nearly 100% accuracy, and a larger model trained on 1.35 million images with 93.95% accuracy.

Implementation Details

The model is fine-tuned on DAMO Academy's text recognition base model, specifically optimized for handling CAPTCHA recognition tasks. It supports multiple CAPTCHA formats including numeric-only, alphanumeric, and pure alphabetic (both uppercase and lowercase) combinations, with sequence lengths of 4, 5, or 6 characters.

Small model: 700MB dataset, 27 training epochs, ~100% accuracy
Large model: 11GB dataset, 1 training epoch, 93.95% accuracy
Supports multiple CAPTCHA types and lengths

Core Capabilities

Recognition of pure numeric CAPTCHAs
Processing of mixed alphanumeric codes
Handling of pure alphabetic sequences (case-sensitive)
Support for variable length sequences (4-6 characters)
High accuracy recognition across different CAPTCHA styles

Frequently Asked Questions

Q: What makes this model unique?

The model's specialization in CAPTCHA recognition and its high accuracy rates, particularly the small model's near-perfect performance, make it stand out. It's specifically optimized for common CAPTCHA formats found in real-world applications.

Q: What are the recommended use cases?

This model is ideal for automated CAPTCHA recognition in testing environments, web automation tasks, and accessibility tools. The small model is recommended for most use cases due to its superior accuracy and efficiency.

ocr-captcha