gemma-2-baku-2b-it

Maintained By
rinna

Gemma 2 Baku 2B Instruct

Property          Value
Parameter Count   2.61B
Model Type        Instruction-tuned Language Model
Languages         Japanese, English
License           Gemma Terms of Use
Base Model        google/gemma-2-2b

What is gemma-2-baku-2b-it?

Gemma 2 Baku 2B Instruct is a bilingual instruction-tuned language model built on Gemma 2 and designed for both Japanese and English tasks. Its instruction-following ability comes from two steps: Chat Vector addition, which transfers the instruction-tuning delta of Google's Gemma 2 instruct model onto the base weights, followed by fine-tuning with ORPO (Odds Ratio Preference Optimization).

Implementation Details

The model is a 26-layer transformer with a hidden size of 2304. The chat vector is obtained by subtracting the parameters of google/gemma-2-2b from those of google/gemma-2-2b-it; this difference is then added to the Japanese continually pre-trained base model, rinna/gemma-2-baku-2b. The embedding layer is excluded from both the subtraction and the addition, so the base model's token embeddings stay unchanged.
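The chat vector arithmetic described above can be sketched with toy weights. This is a minimal illustration under stated assumptions, not rinna's actual merge script: the function name `add_chat_vector`, the `scale` argument, the `skip_substrings` filter, and the parameter names in the toy dictionaries are all hypothetical, and plain Python lists stand in for real tensors.

```python
from typing import Dict, List, Tuple

# Toy stand-in for a model state dict: parameter name -> flat list of weights.
StateDict = Dict[str, List[float]]


def add_chat_vector(
    target: StateDict,
    base: StateDict,
    instruct: StateDict,
    scale: float = 1.0,
    skip_substrings: Tuple[str, ...] = ("embed_tokens",),  # assumed name of the embedding layer
) -> StateDict:
    """Return target + scale * (instruct - base), leaving skipped layers untouched."""
    merged: StateDict = {}
    for name, weights in target.items():
        skipped = any(s in name for s in skip_substrings)
        if skipped or name not in base or name not in instruct:
            # Excluded (e.g. embeddings) or unmatched parameters are copied as-is.
            merged[name] = list(weights)
            continue
        merged[name] = [
            w + scale * (i - b)
            for w, b, i in zip(weights, base[name], instruct[name])
        ]
    return merged


# Hypothetical toy weights; real checkpoints hold tensors, not two-element lists.
target = {"model.embed_tokens.weight": [1.0, 1.0], "model.layers.0.mlp.weight": [0.5, -0.5]}
base = {"model.embed_tokens.weight": [0.0, 0.0], "model.layers.0.mlp.weight": [0.25, 0.25]}
instruct = {"model.embed_tokens.weight": [9.0, 9.0], "model.layers.0.mlp.weight": [0.75, 0.0]}

merged = add_chat_vector(target, base, instruct)
print(merged)
```

In this toy run the embedding entry keeps the values from `target`, while the MLP entry is shifted by the difference between the instruct and base weights.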

  • Eager attention is recommended for batch inference under bfloat16 precision
  • Uses the original google/gemma-2-2b-it tokenizer
  • Fine-tuned with ORPO on proprietary datasets
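The attention and precision settings above might translate into a loading routine like the following. This is a sketch, assuming the Hugging Face `transformers` library and PyTorch are installed and that the model is published under the repository id `rinna/gemma-2-baku-2b-it`; the helper names `build_load_kwargs` and `load_model_and_tokenizer` are hypothetical.

```python
MODEL_ID = "rinna/gemma-2-baku-2b-it"  # assumed Hugging Face repository id


def build_load_kwargs(batch_inference: bool = True) -> dict:
    """Extra keyword arguments for from_pretrained, depending on the inference mode."""
    kwargs = {}
    if batch_inference:
        # Eager attention is the setting recommended for batched bfloat16 inference.
        kwargs["attn_implementation"] = "eager"
    return kwargs


def load_model_and_tokenizer(model_id: str = MODEL_ID, batch_inference: bool = True):
    """Load tokenizer and model in bfloat16 (downloads weights on first call)."""
    # Imported lazily so the helper above can be used without these packages.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,
        **build_load_kwargs(batch_inference),
    )
    return tokenizer, model


print(build_load_kwargs(batch_inference=True))
```

Calling `load_model_and_tokenizer()` fetches the checkpoint, so it is left out of the snippet's top level; only the keyword arguments are printed here.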

Core Capabilities

  • Bilingual instruction following in Japanese and English
  • Instruction-following ability transferred through Chat Vector addition
  • Response quality refined with ORPO preference optimization
  • Efficient handling of conversational tasks
  • Support for both CPU and GPU inference

Frequently Asked Questions

Q: What makes this model unique?

This model combines Chat Vector addition with ORPO fine-tuning to give a Japanese-English bilingual model instruction-following ability while keeping the Gemma 2 architecture. The parameter merge leaves the embedding layer untouched, which is intended to preserve performance in both Japanese and English.

Q: What are the recommended use cases?

The model is particularly well-suited for bilingual applications requiring instruction following, conversational AI implementations, and text generation tasks in both Japanese and English. It's optimized for scenarios requiring nuanced understanding and generation in either language.
