crnn-fa-printed-96-long

Property	Value
License	Apache 2.0
Language	Persian (Farsi)
Research Paper	CRNN Paper
Downloads	24,564

What is crnn-fa-printed-96-long?

This is an advanced Persian OCR (Optical Character Recognition) model based on the CRNN (Convolutional Recurrent Neural Network) architecture. It represents a significant improvement over its predecessor, specifically designed to handle printed Persian text with enhanced capabilities and broader character support.

Implementation Details

The model combines CNN and LSTM architectures, optimized for processing Persian text images. It features a revised input image size of 32x384 pixels and can handle sequences up to 96 characters, though it's optimized for texts around 50 characters in length.

5X larger training dataset compared to previous version
Modified input dimensions (32x384)
Extended maximum output length to 96 characters
Intelligent handling of LTR characters within RTL text
Comprehensive support for numbers and special characters

Core Capabilities

High-accuracy Persian text recognition from printed documents
Automatic handling of mixed RTL/LTR text
Support for numbers and special characters
Optimized for word-level text detection
Suitable for fine-tuning on specific domains

Frequently Asked Questions

Q: What makes this model unique?

This model stands out for its enhanced capability to handle longer text sequences (up to 96 characters) and its sophisticated handling of mixed RTL/LTR text, making it particularly effective for real-world Persian document processing.

Q: What are the recommended use cases?

The model is primarily designed for printed/scanned documents and works best with text boxes containing up to 50 characters. It's recommended to use it in conjunction with a text detector model for optimal results in end-to-end OCR pipelines.