mpt-7b-storywriter-4bit-128g

Maintained By
OccamRazor

MPT-7B-StoryWriter-4bit-128g

PropertyValue
Parameter Count1.07B
LicenseApache-2.0
Context Length65k+ tokens
ArchitectureModified decoder-only transformer
PaperALiBi Paper

What is mpt-7b-storywriter-4bit-128g?

This is a specialized version of the MPT-7B-StoryWriter model, optimized for 4-bit quantization specifically for use with KoboldAI. The model is designed for reading and writing fictional stories with extraordinarily long context lengths, capable of handling up to 65,000 tokens and potentially extrapolating beyond that using ALiBi technology.

Implementation Details

The model employs a modified decoder-only transformer architecture with several key optimizations. It utilizes FlashAttention for improved efficiency, ALiBi (Attention with Linear Biases) instead of traditional positional embeddings, and operates without biases. The model has been quantized to 4-bit precision while maintaining performance.

  • 32 layers with 32 attention heads
  • 4096 dimensional model (d_model)
  • 50,432 vocabulary size
  • Supports sequence lengths up to 65,536 tokens

Core Capabilities

  • Long-form story generation and continuation
  • Extended context understanding (65k+ tokens)
  • Memory-efficient operation through 4-bit quantization
  • Commercial usage permitted under Apache-2.0 license

Frequently Asked Questions

Q: What makes this model unique?

The model's ability to handle extremely long context lengths (65k+ tokens) through ALiBi technology, combined with 4-bit quantization for efficient deployment, makes it particularly suitable for long-form story generation tasks. The model can maintain coherence across very long sequences, as demonstrated by its ability to process the entire text of "The Great Gatsby" and generate coherent continuations.

Q: What are the recommended use cases?

This model is specifically optimized for creative writing applications, particularly long-form fiction. It excels at understanding and generating narrative content, making it ideal for story continuation, creative writing assistance, and long-form content generation.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.