legal-led-base-16384
Property | Value |
---|---|
Author | nsi319 |
Model Type | Longformer Encoder-Decoder (LED) |
Max Input Length | 16,384 tokens |
Training Data | SEC litigation releases (2700+ documents) |
Best ROUGE-1 Score | 55.69 |
What is legal-led-base-16384?
legal-led-base-16384 is a specialized language model designed for summarizing long legal documents. Built on the Longformer Encoder-Decoder (LED) architecture, it has been specifically trained on SEC litigation releases and complaints to provide accurate and coherent summaries of legal texts up to 16,384 tokens in length.
Implementation Details
The model utilizes the LED-base architecture optimized for the legal domain. It implements advanced attention mechanisms to handle long documents efficiently while maintaining context awareness throughout the text. The model achieves impressive performance metrics, with a ROUGE-1 score of 55.69 and ROUGE-2 score of 29.03.
- Supports document lengths up to 16,384 tokens
- Implements beam search with no_repeat_ngram_size=3
- Uses length penalty optimization for better summary generation
- Customizable minimum and maximum summary lengths
Core Capabilities
- Long-form legal document summarization
- High-precision summary generation (ROUGE-1 precision: 61.73)
- Efficient handling of complex legal terminology
- Flexible input length handling
Frequently Asked Questions
Q: What makes this model unique?
This model specializes in legal document summarization with significantly better performance compared to generic LED models. It shows a 26.5 percentage point improvement in ROUGE-1 scores over the base LED model when applied to legal texts.
Q: What are the recommended use cases?
The model is ideal for summarizing legal documents such as litigation releases, complaints, and other legal documentation where maintaining accuracy and capturing key legal points is crucial. It's particularly effective for long documents that would be challenging for standard summarization models.