Chronos Hermes 13B GGUF
| Property | Value |
|---|---|
| Parameter Count | 13B |
| Model Type | LLaMA-based |
| License | Other |
| Author | TheBloke (Quantized) / Austism (Original) |
What is chronos-hermes-13B-GGUF?
Chronos Hermes 13B GGUF is a GGUF-quantized release of chronos-hermes-13B, a LLaMA-based language model that merges Chronos-13B, known for creative storytelling, with Nous-Hermes-13B, known for instruction following, in a 75/25 ratio. The merge aims to generate detailed, coherent narratives while still adhering closely to user instructions.
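To make the 75/25 ratio concrete, here is a minimal sketch of what a weight merge of this kind could look like. It assumes a plain linear interpolation of matching parameters, which this card does not confirm as the method actually used, and it reads the ratio in the order the models are named (75% Chronos, 25% Nous-Hermes). The function name `merge_weights` and the toy dictionaries are hypothetical and stand in for real checkpoint tensors.

```python
def merge_weights(primary, secondary, primary_share=0.75):
    """Linearly interpolate two models given as {parameter_name: list of floats}.

    Assumption: a simple weighted average per parameter. The real merge
    procedure for chronos-hermes-13B is not described in this card.
    """
    if primary.keys() != secondary.keys():
        raise ValueError("models must have identical parameter names")

    secondary_share = 1.0 - primary_share
    merged = {}
    for name, a in primary.items():
        b = secondary[name]
        if len(a) != len(b):
            raise ValueError(f"shape mismatch for parameter {name!r}")
        # Each merged value is a weighted sum of the two source values.
        merged[name] = [primary_share * x + secondary_share * y
                        for x, y in zip(a, b)]
    return merged


# Toy stand-ins for two checkpoints with one parameter each.
chronos_like = {"layer.weight": [1.0, 0.0]}
hermes_like = {"layer.weight": [0.0, 4.0]}
merged = merge_weights(chronos_like, hermes_like)
```

With these toy values, each merged entry takes three quarters of its value from the first model and one quarter from the second.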
Implementation Details
The model is available in several GGUF quantization formats, from 2-bit to 8-bit precision, so users can trade file size and memory use against output quality. The recommended Q4_K_M variant (7.87GB) balances quality and resource usage.
- Multiple quantization options from Q2_K (5.43GB) to Q8_0 (13.83GB)
- Supports GPU layer offloading for optimized performance
- Uses the Alpaca prompt template format
- Compatible with llama.cpp and various third-party UIs
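The points above can be sketched in Python. This is an illustrative sketch, not an official recipe: `pick_quant`, `build_alpaca_prompt`, and `generate` are hypothetical helper names, the 2.5GB memory overhead and the 2048-token context are assumptions rather than figures from this card, and `model_path` must point at whichever GGUF file you downloaded. The prompt text follows the standard Alpaca template, and the `generate` helper relies on the `llama-cpp-python` package, which is imported lazily so the other helpers run without it.

```python
# File sizes in GB as quoted in this card (only these three are listed here).
QUANT_SIZES_GB = {"Q2_K": 5.43, "Q4_K_M": 7.87, "Q8_0": 13.83}

ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Response:\n"
)


def pick_quant(budget_gb, overhead_gb=2.5):
    """Return the largest listed quant whose file plus overhead fits the budget.

    overhead_gb is an assumed allowance for context and runtime buffers.
    Returns None when nothing fits.
    """
    fitting = {name: size for name, size in QUANT_SIZES_GB.items()
               if size + overhead_gb <= budget_gb}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)


def build_alpaca_prompt(instruction):
    """Wrap a user instruction in the Alpaca prompt template."""
    return ALPACA_TEMPLATE.format(instruction=instruction.strip())


def generate(model_path, instruction, n_gpu_layers=0, max_tokens=512):
    """Run one completion through llama-cpp-python.

    n_gpu_layers controls how many layers are offloaded to the GPU
    (0 keeps everything on the CPU).
    """
    from llama_cpp import Llama  # lazy import: optional dependency

    llm = Llama(model_path=model_path, n_gpu_layers=n_gpu_layers, n_ctx=2048)
    result = llm(
        build_alpaca_prompt(instruction),
        max_tokens=max_tokens,
        stop=["### Instruction:"],  # stop if the model starts a new turn
    )
    return result["choices"][0]["text"]


prompt = build_alpaca_prompt("Write the opening paragraph of a sea voyage story.")
```

A call such as `generate("/path/to/model.gguf", "Describe a storm at sea.", n_gpu_layers=32)` would then offload 32 layers to the GPU; the path and layer count here are placeholders to adjust for your own download and hardware.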
Core Capabilities
- Enhanced narrative generation and storytelling
- Improved instruction following compared to base Chronos
- Long-form, descriptive output generation
- Coherent and contextually appropriate responses
- Balanced creative writing with logical structure
Frequently Asked Questions
Q: What makes this model unique?
It combines Chronos's descriptive, creative writing with Hermes's stronger instruction following. The result is a model that stays creative while being easier to steer, which suits narrative tasks.
Q: What are the recommended use cases?
The model is best suited to creative writing, storytelling, and detailed descriptive content. It fits narrative-driven applications that need both creativity and coherence.