SummLlama3-70B

Maintained By
DISLab

SummLlama3-70B

PropertyValue
Parameter Count70.6B
Base ModelMeta-Llama-3-70B-Instruct
Tensor TypeBF16
Research PaperLink
AuthorDISLab

What is SummLlama3-70B?

SummLlama3-70B is an advanced text summarization model built upon Llama3-70B-Instruct, specifically optimized using Direct Preference Optimization (DPO) with over 100K summarization feedback instances. The model is designed to generate human-preferred summaries across seven distinct domains, including both dialogue and non-dialogue formats.

Implementation Details

The model leverages large-scale LLM-generated feedback instead of expensive human annotations, focusing on three critical aspects: faithfulness, completeness, and conciseness. According to automated evaluations, it achieves impressive scores across all metrics (Faithfulness: 0.950, Completeness: 0.632, Conciseness: 0.754).

  • Trained on seven domains: News, Lifestyle, Report, Medical, Daily Life, Interview, and Meeting
  • Implements Direct Preference Optimization (DPO) training methodology
  • Supports both dialogue and non-dialogue summarization tasks

Core Capabilities

  • Generates highly faithful summaries with minimal information manipulation
  • Maintains completeness by capturing all key information
  • Produces concise outputs focusing only on essential information
  • Outperforms base Llama3 models and achieves comparable results to GPT-4

Frequently Asked Questions

Q: What makes this model unique?

The model's unique strength lies in its optimization for human preferences using DPO training and its comprehensive coverage of multiple domains. It achieves superior performance in faithfulness (0.950) and conciseness (0.754) compared to both Llama3 variants and GPT-4.

Q: What are the recommended use cases?

The model is ideal for generating summaries of both conversational and non-conversational text across various domains. It's particularly effective for scenarios requiring high-fidelity summaries while maintaining conciseness.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.