SummLlama3.1-70B

Maintained By
DISLab

SummLlama3.1-70B

PropertyValue
Parameter Count70.6B
Base ModelLlama-3.1-70B-Instruct
Training MethodDirect Preference Optimization (DPO)
PaperResearch Paper
Tensor TypeBF16

What is SummLlama3.1-70B?

SummLlama3.1-70B is a specialized large language model designed for generating human-preferred summaries across multiple domains. Built upon the Llama-3.1-70B-Instruct architecture, this model has been fine-tuned using Direct Preference Optimization (DPO) with over 100,000 summarization feedback samples.

Implementation Details

The model leverages advanced training techniques utilizing LLM-generated feedback instead of expensive human annotations. It has been optimized across seven distinct domains, including four non-dialogue domains (News, Lifestyle, Report, Medical) and three dialogue domains (Daily Life, Interview, Meeting).

  • Architecture based on Llama-3.1-70B-Instruct
  • Trained using DPO methodology
  • Optimized for BF16 tensor operations
  • Comprehensive coverage across 7 domains

Core Capabilities

  • Faithfulness: 94.2% accuracy in maintaining source information integrity
  • Completeness: 63.7% effectiveness in capturing key information
  • Conciseness: 90.9% efficiency in producing focused summaries
  • Overall performance improvement over base model (82.9% vs 67.0% average score)

Frequently Asked Questions

Q: What makes this model unique?

The model's unique strength lies in its optimization for human-preferred summaries, achieving superior performance in faithfulness and conciseness compared to both the base Llama3.1 model and GPT-4. It's particularly effective at maintaining information accuracy while producing concise summaries.

Q: What are the recommended use cases?

The model is ideal for summarizing various types of content, from short texts to lengthy documents, across both dialogue and non-dialogue formats. It's particularly effective for summarizing news articles, medical documents, meetings, and interviews while maintaining high faithfulness to the source material.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.