chronos-hermes-13B-GGUF

TheBloke

A 13B parameter LLaMA-based model optimized for storytelling and descriptive outputs, featuring GGUF quantization for efficient deployment and combining Chronos and Hermes capabilities.

Property	Value
Parameter Count	13B
Model Type	LLaMA-based
License	Other
Quantization Format	GGUF

What is chronos-hermes-13B-GGUF?

Chronos Hermes 13B GGUF is a sophisticated language model that combines the creative storytelling capabilities of Chronos with the instruction-following prowess of Nous-Hermes in a 75/25 merge ratio. This model has been specifically optimized and converted to the GGUF format by TheBloke, enabling efficient deployment across various platforms and use cases.

Implementation Details

The model comes in multiple quantization variants, ranging from 2-bit to 8-bit precision, allowing users to balance between model size and performance. The recommended Q4_K_M variant offers a balanced compromise at 7.87GB file size.

Implements the Alpaca prompt template for consistent interaction
Supports various deployment options including llama.cpp, text-generation-webui, and Python libraries
Offers GPU acceleration capabilities with adjustable layer offloading

Core Capabilities

Enhanced descriptive writing and narrative generation
Improved instruction following compared to base Chronos
Long-form content generation with coherent output
Efficient memory usage through GGUF quantization

Frequently Asked Questions

Q: What makes this model unique?

This model uniquely combines Chronos's descriptive and creative writing capabilities with Hermes's instruction-following abilities, resulting in a more controlled yet creative language model ideal for storytelling and narrative tasks.

Q: What are the recommended use cases?

The model excels at story writing, narrative development, and generating descriptive content. It's particularly well-suited for creative writing applications where both imagination and coherent structure are important.