Yi-34B-200K-RPMerge

Maintained By
brucethemoose

Yi-34B-200K-RPMerge

PropertyValue
Parameter Count34.4B
LicenseYi License
Tensor TypeBF16
Context Length200K
Merge MethodDARE TIES
Base PapersDARE, TIES

What is Yi-34B-200K-RPMerge?

Yi-34B-200K-RPMerge is a sophisticated model merge specifically designed for enhanced storytelling and long-context processing. It combines several carefully selected Yi-34B models, primarily using Vicuna format, to create a cohesive system capable of handling context lengths up to 40K+ tokens while maintaining high-quality narrative generation.

Implementation Details

The model utilizes the DARE TIES merge method, incorporating six major models including Tess-34B, Nous-Capybara-34B, and ChatAllInOne. Each component is weighted specifically to maintain optimal performance, with weights ranging from 0.05 to 0.19 and varying density parameters.

  • Implements Vicuna prompt format for consistency
  • Supports context lengths of 40K-90K on 24GB GPUs using exllamav2
  • Utilizes quadratic sampling for improved output quality
  • Features carefully tuned parameters to handle Yi's large tokenizer vocabulary

Core Capabilities

  • Extended context processing up to 200K tokens
  • Enhanced storytelling and narrative generation
  • Robust instruction-following abilities
  • Multi-character story support
  • Reduced likelihood of generating refusals
  • Efficient performance on consumer hardware with proper optimization

Frequently Asked Questions

Q: What makes this model unique?

This model stands out for its focused approach to storytelling while maintaining extended context capabilities. Unlike kitchen sink merges, it specifically uses Vicuna-format models to ensure consistency and avoid format conflicts.

Q: What are the recommended use cases?

The model excels at long-form narrative generation, creative writing, and extended context analysis. It's particularly well-suited for novel continuations, multi-character storytelling, and general instruction-following tasks requiring extended context understanding.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.