crnn-fa-printed-96-long

Maintained By
hezarai

crnn-fa-printed-96-long

PropertyValue
LicenseApache 2.0
LanguagePersian (Farsi)
Research PaperCRNN Paper
Downloads24,564

What is crnn-fa-printed-96-long?

This is an advanced Persian OCR (Optical Character Recognition) model based on the CRNN (Convolutional Recurrent Neural Network) architecture. It represents a significant improvement over its predecessor, specifically designed to handle printed Persian text with enhanced capabilities and broader character support.

Implementation Details

The model combines CNN and LSTM architectures, optimized for processing Persian text images. It features a revised input image size of 32x384 pixels and can handle sequences up to 96 characters, though it's optimized for texts around 50 characters in length.

  • 5X larger training dataset compared to previous version
  • Modified input dimensions (32x384)
  • Extended maximum output length to 96 characters
  • Intelligent handling of LTR characters within RTL text
  • Comprehensive support for numbers and special characters

Core Capabilities

  • High-accuracy Persian text recognition from printed documents
  • Automatic handling of mixed RTL/LTR text
  • Support for numbers and special characters
  • Optimized for word-level text detection
  • Suitable for fine-tuning on specific domains

Frequently Asked Questions

Q: What makes this model unique?

This model stands out for its enhanced capability to handle longer text sequences (up to 96 characters) and its sophisticated handling of mixed RTL/LTR text, making it particularly effective for real-world Persian document processing.

Q: What are the recommended use cases?

The model is primarily designed for printed/scanned documents and works best with text boxes containing up to 50 characters. It's recommended to use it in conjunction with a text detector model for optimal results in end-to-end OCR pipelines.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.