LamarckInfusion-14B-v2

LamarckInfusion-14B-v2

sometimesanotion

A sophisticated 14B parameter LLM merging Lamarck, Chocolatine, and Qwenvergence models using SLERP method with dual-slice architecture for enhanced stability and eloquence.

PropertyValue
Model Size14B parameters
Base ModelLamarck-14B-v0.7-Fusion
Merge MethodSLERP with dual-slice architecture
Model URLHugging Face Repository

What is LamarckInfusion-14B-v2?

LamarckInfusion-14B-v2 is an innovative language model that combines the strengths of three powerful models: Lamarck-14B-v0.7-Fusion, Chocolatine-2-14B-Instruct-v2.0.3, and Qwenvergence-14B-v12-Prose-DS. Using a sophisticated SLERP (Spherical Linear Interpolation) merge method with a unique dual-slice architecture, it achieves enhanced stability and eloquence while maintaining the core capabilities of its parent models.

Implementation Details

The model implements a novel two-slice SLERP merge configuration, with carefully calibrated attention and MLP layer weightings. The first slice (layers 0-24) combines Lamarck and Chocolatine models, while the second slice (layers 24-48) merges Lamarck with Qwenvergence, creating a balanced fusion of capabilities.

  • Advanced dual-slice architecture with separate attention and MLP weightings
  • Precision-engineered layer transitions using float32 processing and bfloat16 output
  • Optimized self-attention and MLP layer configurations across model depth

Core Capabilities

  • Enhanced stability from Chocolatine's instruction-following abilities
  • Improved prose generation from Qwenvergence's capabilities
  • Maintained eloquence from Lamarck's base characteristics
  • Balanced performance across different tasks due to strategic model fusion

Frequently Asked Questions

Q: What makes this model unique?

The model's dual-slice architecture and precise SLERP merge configuration sets it apart, allowing it to effectively combine the strengths of three different models while maintaining stability and coherence.

Q: What are the recommended use cases?

Given its balanced capabilities, the model is well-suited for general-purpose applications, particularly those requiring both instruction-following and high-quality prose generation.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026