Athene-V2-Chat

Nexusflow

Powerful 72.7B parameter chat model rivaling GPT-4, excelling in math, coding, and instruction-following. Built on Qwen2.5, optimized via RLHF.

Property	Value
Parameter Count	72.7B
Model Type	Chat Model
Base Model	Qwen2.5-72B-Instruct
License	Nexusflow Research License
Tensor Type	BF16

What is Athene-V2-Chat?

Athene-V2-Chat is a state-of-the-art language model that achieves performance parity with GPT-4o across various benchmarks. Developed by Nexusflow, this 72.7B parameter model represents a significant advancement in open-weight LLMs, particularly excelling in chat, mathematics, and coding tasks. The model was refined through RLHF (Reinforcement Learning from Human Feedback) using Qwen-2.5-72B-Instruct as its foundation.

Implementation Details

The model utilizes the Transformers library and maintains compatibility with Qwen2.5's chat template. It employs BF16 precision for optimal performance and efficiency. Implementation requires minimal setup, with built-in support for auto-device mapping and dtype handling.

Seamless integration with Hugging Face Transformers
Efficient token processing and generation
Support for system prompts to enhance performance
Compatible with standard chat templates

Core Capabilities

Superior performance in mathematical computations
Advanced coding assistance and generation
Robust instruction following
Excellent performance in multi-turn conversations
Competitive performance in hard and longer queries

Frequently Asked Questions

Q: What makes this model unique?

Athene-V2-Chat distinguishes itself by matching or exceeding GPT-4o's performance across multiple benchmarks, particularly in mathematics and coding. It's currently the best open model according to Chatbot Arena, specifically outperforming GPT-4o-0513 in hard and math categories.

Q: What are the recommended use cases?

The model excels in chat applications, mathematical problem-solving, coding tasks, and complex instruction following. It's particularly effective when enhanced with system prompts for specialized tasks, though this isn't necessary for general chat evaluation.