table-transformer-structure-recognition-v1.1-all

Maintained By
microsoft

Table Transformer Structure Recognition v1.1

PropertyValue
Parameter Count28.8M
LicenseMIT
PaperView Paper
Downloads577,057
Tensor TypeF32

What is table-transformer-structure-recognition-v1.1-all?

The Table Transformer (TATR) is a specialized model designed for table structure recognition in documents. Developed by Microsoft, it builds upon the DETR (Detection Transformer) architecture and has been trained on comprehensive datasets including PubTables1M and FinTabNet.c. This version implements the "normalize before" approach, applying layer normalization before self- and cross-attention operations.

Implementation Details

Based on the Transformer architecture, this model incorporates object detection capabilities specifically optimized for table recognition. The model utilizes 28.8M parameters and operates with F32 tensor precision.

  • Implements DETR-based architecture with modified normalization
  • Trained on specialized table recognition datasets
  • Optimized for document table structure analysis

Core Capabilities

  • Table structure detection in documents
  • Complex table layout recognition
  • Integration-ready with inference endpoints
  • Robust object detection for tabular data

Frequently Asked Questions

Q: What makes this model unique?

This model stands out for its specialized focus on table structure recognition, implementing a modified DETR architecture with "normalize before" setting, making it particularly effective for document analysis tasks.

Q: What are the recommended use cases?

The model is ideal for automated document processing systems, academic paper analysis, financial document processing, and any application requiring accurate table structure recognition in digital documents.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.