MistralThinker-v1.1
Property | Value |
---|---|
Base Model | Mistral-Small-24B-Base-2501 |
Author | Undi95 |
Model URL | Hugging Face |
Training Approach | DeepSeek R1 distillation |
What is MistralThinker-v1.1?
MistralThinker-v1.1 is a specialized language model derived from Mistral-Small-24B-Base-2501, specifically engineered for roleplay (RP) and creative storytelling applications. The model employs a unique DeepSeek R1 distillation process, with 40% of its training data focused on roleplay, storywriting, and character card content, while the remaining 60% covers broad language understanding, mathematics, and logical reasoning.
Implementation Details
The model utilizes the Mistral-V7 prompt format and features a doubled dataset size compared to its previous version. It implements a sophisticated chain-of-thought process inherited from DeepSeek R1, enhancing its narrative coherence and creative capabilities.
- Custom prompt format with system prompt support
- DeepSeek R1 thinking process integration
- Enhanced dataset with balanced RP and general knowledge content
- Flexible usage with or without system prompts
Core Capabilities
- Advanced roleplay and character interaction generation
- Rich narrative and story development
- Character lore and backstory creation
- Contextually aware dialogue generation
- Adaptable interaction styles for various use cases
Frequently Asked Questions
Q: What makes this model unique?
The model's distinctive feature is its specialized focus on roleplay and storytelling, combined with the DeepSeek R1 distillation process. The balanced dataset composition ensures both creative excellence and broad knowledge understanding, while maintaining coherent narrative generation.
Q: What are the recommended use cases?
MistralThinker-v1.1 excels in roleplay scenarios, creative writing, character development, and interactive storytelling. It's particularly effective when provided with clear context either through system prompts or initial user messages, making it ideal for both structured and freeform creative applications.