SummLlama3-8B

Maintained By
DISLab

SummLlama3-8B

PropertyValue
Parameter Count8.03B
Model TypeSummarization
Base ModelMeta-Llama-3-8B-Instruct
PaperResearch Paper
Tensor TypeBF16

What is SummLlama3-8B?

SummLlama3-8B is a specialized summarization model built on Llama3-8B-Instruct, enhanced through Direct Preference Optimization (DPO) using over 100K summarization feedback data points. This model demonstrates remarkable performance, surpassing even the much larger Llama3-70B-Instruct and GPT-4o in automated evaluations while maintaining significantly faster inference speeds.

Implementation Details

The model leverages advanced training techniques focusing on three crucial aspects of summarization: faithfulness (0.980), completeness (0.697), and conciseness (0.959). It achieves these impressive metrics through careful optimization across seven distinct domains, including both dialogue and non-dialogue formats.

  • Training incorporates feedback from multiple domains: News, Lifestyle, Report, Medical, Daily Life, Interview, and Meeting
  • Utilizes Direct Preference Optimization (DPO) with LLM-generated feedback
  • Implements BF16 tensor format for efficient processing

Core Capabilities

  • Superior performance in human-preferred summary generation
  • Balanced handling of both dialogue and non-dialogue content
  • Exceptional faithfulness scores in human evaluation (0.980)
  • Efficient processing with smaller parameter count compared to alternatives

Frequently Asked Questions

Q: What makes this model unique?

SummLlama3-8B stands out for its ability to outperform much larger models while maintaining efficiency. It achieves this through specialized training on diverse summarization tasks and optimization for human preferences.

Q: What are the recommended use cases?

The model excels at generating summaries across various domains, making it ideal for summarizing news articles, medical documents, meeting transcripts, interviews, and general content where faithful and concise summaries are crucial.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.