MN-12B-Celeste-V1.9

Maintained By
nothingiisreal

MN-12B-Celeste-V1.9

PropertyValue
Parameter Count12.2B
Model TypeText Generation
ArchitectureMistral-based Transformer
LicenseApache 2.0
Context Length8K (trained) / 16K+ (inherited)

What is MN-12B-Celeste-V1.9?

MN-12B-Celeste-V1.9 is a specialized language model built on Mistral NeMo 12B Instruct, specifically engineered for story writing and roleplaying applications. The model represents a significant advancement in narrative AI, incorporating ChatML tokens and featuring improved NSFW capabilities alongside smarter and more active narration.

Implementation Details

The model was trained on a carefully curated mixture of datasets including Reddit Writing Prompts, Kalo's Opus 25K Instruct, and cleaned conversation logs. Training was conducted using LoRA+ methodology on an H100 SXM GPU, with specific optimizations for maintaining narrative coherence and character consistency.

  • Trained with 8K context window while inheriting Mistral's 16K+ capability
  • Implements BF16 tensor type for optimal performance
  • Features specialized sampling settings for both stable and creative outputs
  • Supports multiple quantization options including FP8, EXL2, and GGUF formats

Core Capabilities

  • Advanced story writing with human-like prose generation
  • Dynamic character interaction and roleplay
  • Flexible content generation from SFW to NSFW
  • OOC (Out of Character) steering support
  • Context-aware narrative development
  • Multiple API integration options

Frequently Asked Questions

Q: What makes this model unique?

The model's distinctive feature is its specialized training for narrative generation, combining multiple high-quality datasets with ChatML integration and advanced steering capabilities. It offers exceptional flexibility in narrative styles while maintaining coherence.

Q: What are the recommended use cases?

Primary use cases include creative writing, interactive storytelling, character-based roleplay, and narrative content generation. The model excels in both short-form and long-form storytelling, with particular strength in character consistency and narrative development.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.