Llama3-TenyxChat-70B

Maintained By
tenyx

Llama3-TenyxChat-70B

PropertyValue
Parameter Count70.6B
LicenseMeta Llama 3 Community License
Base ModelLlama3-70B
Training FrameworkDirect Preference Optimization (DPO)
MT-Bench Score8.15

What is Llama3-TenyxChat-70B?

Llama3-TenyxChat-70B is a state-of-the-art language model developed by Tenyx Research, fine-tuned from Meta's Llama3-70B using advanced preference optimization techniques. It represents a significant achievement in open-source AI, achieving the highest MT-Bench score (8.15) among publicly available models.

Implementation Details

The model was trained using eight A100 GPUs (80GB) for fifteen hours, implementing Direct Preference Optimization on the UltraFeedback dataset. Tenyx's proprietary approach focuses on mitigating catastrophic forgetting while maintaining high performance across various benchmarks.

  • Uses BF16 tensor type for efficient computation
  • Compatible with standard Llama3 chat templates
  • Evaluated using GPT-4-preview-0125 for benchmarking
  • Demonstrates exceptional performance in both single and multi-turn conversations

Core Capabilities

  • Achieves 8.15 on MT-Bench, surpassing the base Llama3-70B-Instruct (7.96)
  • Scores 79.43 average on Open LLM Leaderboard benchmarks
  • Maintains consistent performance in multi-turn dialogues
  • Excels in GSM8K (91.21%) and HellaSwag (86.11%) tasks

Frequently Asked Questions

Q: What makes this model unique?

The model's distinctive feature is its ability to maintain high performance across multiple conversation turns while avoiding catastrophic forgetting, achieved through Tenyx's specialized fine-tuning approach.

Q: What are the recommended use cases?

The model excels in conversational AI applications, particularly in scenarios requiring sustained dialogue quality across multiple turns. It's suitable for both general-purpose chat applications and specialized tasks in reasoning, STEM, and humanities domains.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.