Tanuki-8B-dpo-v1.0

Maintained By
weblab-GENIAC

Tanuki-8B-dpo-v1.0

PropertyValue
Parameter Count7.51B
LicenseApache 2.0
LanguagesJapanese, English
Training Data1.3T tokens
Model TypeTransformer-based LLM

What is Tanuki-8B-dpo-v1.0?

Tanuki-8B-dpo-v1.0 is a sophisticated bilingual language model developed by the GENIAC project at the University of Tokyo. This model represents a significant achievement in Japanese-English language modeling, having undergone extensive pre-training with 1.3T tokens and refined through DPO (Direct Preference Optimization) for enhanced conversational capabilities.

Implementation Details

The model utilizes a transformer architecture and supports BF16 precision. It's been specifically optimized for both Japanese and English language processing, with various quantized versions available including AWQ 4-bit, GPTQ 4-bit/8-bit, and GGUF formats.

  • Follows Japanese Alpaca prompt format
  • Supports both single-turn and multi-turn conversations
  • Implements chat template functionality
  • Offers streaming capabilities for real-time response generation

Core Capabilities

  • Strong performance in Japanese MT-Bench evaluation (7.24 average score)
  • Excels in humanities (9.1), STEM (9.35), and writing (9.05) tasks
  • Efficient handling of bilingual conversations
  • Specialized in instruction-following scenarios

Frequently Asked Questions

Q: What makes this model unique?

The model stands out for its specialized Japanese-English capabilities and extensive pre-training on 1.3T tokens, making it particularly effective for bilingual applications. Its DPO fine-tuning ensures high-quality conversational abilities.

Q: What are the recommended use cases?

The model is ideal for Japanese-English conversational AI applications, academic discussion, writing assistance, and general knowledge tasks. It shows particularly strong performance in humanities, STEM, and writing-related queries.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.