Nous-Hermes-2-Yi-34B

NousResearch

A powerful 34B parameter LLM built on Yi-34B, fine-tuned on 1M GPT-4 generated entries. Excels in reasoning and achieves strong benchmark performance.

Property	Value
Parameter Count	34.4B
Base Model	01-ai/Yi-34B
License	Apache 2.0
Training Data	1M entries (GPT-4 generated)
Format	ChatML

What is Nous-Hermes-2-Yi-34B?

Nous-Hermes-2-Yi-34B is a state-of-the-art language model that represents a significant advancement in AI capabilities. Built on the Yi-34B architecture, this model has been fine-tuned on an extensive dataset of 1,000,000 entries, primarily generated by GPT-4, along with other high-quality data sources from across the AI landscape.

Implementation Details

The model utilizes the ChatML format for interactions, making it compatible with OpenAI endpoints and familiar to ChatGPT API users. It supports system prompts for enhanced steerability and runs on BF16 precision.

Achieves exceptional performance across multiple benchmarks including GPT4All, AGIEval, and BigBench
Implements a sophisticated prompt format supporting multi-turn dialogue
Available in quantized versions through GGUF format

Core Capabilities

Strong reasoning abilities demonstrated through benchmark scores
Excellent performance in tasks requiring logical deduction and analysis
Enhanced truthfulness compared to previous versions (60.34% on TruthfulQA)
Versatile dialogue capabilities with support for system-level instructions

Frequently Asked Questions

Q: What makes this model unique?

The model stands out for its significant improvements over previous Nous-Hermes versions, showing substantial gains across all benchmarks. It achieved a +5.95% average improvement over OpenHermes-2.5, with particularly strong gains in AGI Eval (+7.20%) and TruthfulQA (+7.30%).

Q: What are the recommended use cases?

The model excels in various applications including complex reasoning tasks, technical discussions, and multi-turn conversations. It's particularly well-suited for applications requiring strong logical reasoning and truthful responses, as evidenced by its benchmark performances.