SummLlama3-8B

Maintained By
DISLab

SummLlama3-8B

PropertyValue
Parameter Count8.03B
Base ModelMeta-Llama-3-8B-Instruct
Training MethodDirect Preference Optimization (DPO)
PaperResearch Paper
Tensor TypeBF16

What is SummLlama3-8B?

SummLlama3-8B is a specialized text summarization model that achieves remarkable performance, surpassing even the much larger Llama3-70B-Instruct and GPT-4o. Built on the Llama3-8B-Instruct architecture, it's been enhanced through Direct Preference Optimization using over 100,000 summarization feedback samples across diverse domains.

Implementation Details

The model has been trained across seven distinct domains, including four non-dialogue (News, Lifestyle, Report, Medical) and three dialogue domains (Daily Life, Interview, Meeting). It excels in three critical aspects of summarization: faithfulness (0.980), completeness (0.697), and conciseness (0.959), achieving an impressive average score of 0.879 in human evaluations.

  • Utilizes high-quality, multi-dimensional feedback from LLMs
  • Optimized for both dialogue and non-dialogue text summarization
  • Implements specialized prompt formatting for optimal results

Core Capabilities

  • Generates highly faithful summaries without information manipulation
  • Maintains completeness by capturing all key information
  • Produces concise outputs focusing on essential content
  • Processes both dialogue and non-dialogue texts effectively

Frequently Asked Questions

Q: What makes this model unique?

SummLlama3-8B stands out for its ability to outperform much larger models while maintaining high efficiency. It achieves this through specialized training on diverse summarization tasks and optimized feedback incorporation.

Q: What are the recommended use cases?

The model is ideal for summarizing various content types, from news articles and medical reports to conversations and interviews. It's particularly effective when faithful and concise summaries are crucial.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.