patricide-12B-Unslop-Mell
Property | Value |
---|---|
Parameter Count | 12B |
Model Type | Merged Language Model |
Architecture | SLERP Merge |
Quantization | GGUF Available |
Author | redrix |
What is patricide-12B-Unslop-Mell?
patricide-12B-Unslop-Mell is a merged language model created using mergekit, combining TheDrummer/UnslopNemo-12B-v4.1 and inflatebot/MN-12B-Mag-Mell-R1. The model utilizes the SLERP merge method and is designed to inherit beneficial traits from both parent models while maintaining coherent outputs.
Implementation Details
The model was implemented using a bfloat16 dtype and employs a specific SLERP configuration with parameters t=[0, 0.5, 1, 0.5, 0]. It supports the ChatML template and offers multiple quantization options through GGUF, including static and weighted/Imatrix variants.
- Supports Q6_K GGUF Quantization
- Implements ChatML Template
- Uses SLERP merge methodology
- Available in multiple quantization formats
Core Capabilities
- Stable and coherent text generation
- Story writing and RP potential
- Performs well with Temperature 1 and Min-P of 0.1
- Multiple template support including ChatML
Frequently Asked Questions
Q: What makes this model unique?
The model combines the strengths of two established models using SLERP methodology, offering a balance between coherence and creativity. It provides stable outputs even in base configurations and supports various quantization options for different deployment needs.
Q: What are the recommended use cases?
The model is suitable for general text generation, with particular potential in story writing and role-playing applications. It performs well with standard temperature and Min-P settings, making it accessible for various use cases without extensive parameter tuning.