legal-led-base-16384

Maintained By
nsi319

legal-led-base-16384

PropertyValue
Authornsi319
Model TypeLongformer Encoder-Decoder (LED)
Max Input Length16,384 tokens
Training DataSEC litigation releases (2700+ documents)
Best ROUGE-1 Score55.69

What is legal-led-base-16384?

legal-led-base-16384 is a specialized language model designed for summarizing long legal documents. Built on the Longformer Encoder-Decoder (LED) architecture, it has been specifically trained on SEC litigation releases and complaints to provide accurate and coherent summaries of legal texts up to 16,384 tokens in length.

Implementation Details

The model utilizes the LED-base architecture optimized for the legal domain. It implements advanced attention mechanisms to handle long documents efficiently while maintaining context awareness throughout the text. The model achieves impressive performance metrics, with a ROUGE-1 score of 55.69 and ROUGE-2 score of 29.03.

  • Supports document lengths up to 16,384 tokens
  • Implements beam search with no_repeat_ngram_size=3
  • Uses length penalty optimization for better summary generation
  • Customizable minimum and maximum summary lengths

Core Capabilities

  • Long-form legal document summarization
  • High-precision summary generation (ROUGE-1 precision: 61.73)
  • Efficient handling of complex legal terminology
  • Flexible input length handling

Frequently Asked Questions

Q: What makes this model unique?

This model specializes in legal document summarization with significantly better performance compared to generic LED models. It shows a 26.5 percentage point improvement in ROUGE-1 scores over the base LED model when applied to legal texts.

Q: What are the recommended use cases?

The model is ideal for summarizing legal documents such as litigation releases, complaints, and other legal documentation where maintaining accuracy and capturing key legal points is crucial. It's particularly effective for long documents that would be challenging for standard summarization models.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.