MistralThinker-v1.1

MistralThinker-v1.1

Undi95

MistralThinker-v1.1: Specialized Mistral-24B variant optimized for roleplay and storytelling, featuring DeepSeek R1 distillation and 40% RP-focused dataset.

PropertyValue
Base ModelMistral-Small-24B-Base-2501
AuthorUndi95
Model URLHugging Face
Training ApproachDeepSeek R1 distillation

What is MistralThinker-v1.1?

MistralThinker-v1.1 is a specialized language model derived from Mistral-Small-24B-Base-2501, specifically engineered for roleplay (RP) and creative storytelling applications. The model employs a unique DeepSeek R1 distillation process, with 40% of its training data focused on roleplay, storywriting, and character card content, while the remaining 60% covers broad language understanding, mathematics, and logical reasoning.

Implementation Details

The model utilizes the Mistral-V7 prompt format and features a doubled dataset size compared to its previous version. It implements a sophisticated chain-of-thought process inherited from DeepSeek R1, enhancing its narrative coherence and creative capabilities.

  • Custom prompt format with system prompt support
  • DeepSeek R1 thinking process integration
  • Enhanced dataset with balanced RP and general knowledge content
  • Flexible usage with or without system prompts

Core Capabilities

  • Advanced roleplay and character interaction generation
  • Rich narrative and story development
  • Character lore and backstory creation
  • Contextually aware dialogue generation
  • Adaptable interaction styles for various use cases

Frequently Asked Questions

Q: What makes this model unique?

The model's distinctive feature is its specialized focus on roleplay and storytelling, combined with the DeepSeek R1 distillation process. The balanced dataset composition ensures both creative excellence and broad knowledge understanding, while maintaining coherent narrative generation.

Q: What are the recommended use cases?

MistralThinker-v1.1 excels in roleplay scenarios, creative writing, character development, and interactive storytelling. It's particularly effective when provided with clear context either through system prompts or initial user messages, making it ideal for both structured and freeform creative applications.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026