opus-mt-zh-vi
| Property | Value |
|---|---|
| Developer | Helsinki-NLP |
| Architecture | transformer-align |
| BLEU Score | 20.0 |
| chrF Score | 0.385 |
| Training Date | 2020-06-17 |
| Model URL | https://huggingface.co/Helsinki-NLP/opus-mt-zh-vi |
What is opus-mt-zh-vi?
opus-mt-zh-vi is a neural machine translation model that translates Chinese text into Vietnamese. Developed by Helsinki-NLP, it uses the transformer-align architecture and SentencePiece tokenization with a 32k vocabulary for both the source and target languages.
Implementation Details
The model's preprocessing pipeline consists of normalization followed by SentencePiece tokenization. It accepts Chinese text in simplified and traditional characters as well as several romanization systems, so it can handle a range of input types.
- Supports multiple Chinese variants including Mandarin, Cantonese, and Classical Chinese
- Applies SentencePiece tokenization (spm32k) on both the source and target sides
- Achieves a BLEU score of 20.0 on the Tatoeba test set
- Reports a chrF score of 0.385 (on a 0 to 1 scale)
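To give a feel for what a chrF value such as 0.385 measures, here is a minimal sketch of a character n-gram F-score in pure Python. It is a simplified, sentence-level illustration only: the function names `char_ngrams` and `simple_chrf` are hypothetical, and the reported score was presumably produced at corpus level by the developers' own evaluation tooling, so this sketch should not be expected to reproduce it exactly.

```python
from collections import Counter


def char_ngrams(text: str, n: int) -> Counter:
    """Count character n-grams after removing whitespace."""
    chars = "".join(text.split())
    return Counter(chars[i:i + n] for i in range(len(chars) - n + 1))


def simple_chrf(hypothesis: str, reference: str,
                max_n: int = 6, beta: float = 2.0) -> float:
    """Simplified sentence-level chrF on a 0-1 scale (illustrative only).

    Averages character n-gram precision and recall over n = 1..max_n,
    then combines them into an F-beta score. With beta = 2, recall
    counts more than precision.
    """
    precisions, recalls = [], []
    for n in range(1, max_n + 1):
        hyp = char_ngrams(hypothesis, n)
        ref = char_ngrams(reference, n)
        if not hyp or not ref:
            continue  # one of the strings is shorter than n characters
        overlap = sum((hyp & ref).values())  # clipped n-gram matches
        precisions.append(overlap / sum(hyp.values()))
        recalls.append(overlap / sum(ref.values()))
    if not precisions:
        return 0.0
    p = sum(precisions) / len(precisions)
    r = sum(recalls) / len(recalls)
    if p + r == 0:
        return 0.0
    return (1 + beta ** 2) * p * r / (beta ** 2 * p + r)
```

Calling `simple_chrf(model_output, reference_translation)` returns a value between 0 and 1, where identical strings score 1.0 and strings with no shared characters score 0.0. Because the metric works on characters rather than words, it gives partial credit for near matches that a word-level metric such as BLEU would miss.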
Core Capabilities
- Direct Chinese to Vietnamese translation
- Handles multiple Chinese writing systems and variants
- Suitable for both simplified and traditional Chinese input
- Intended for general-purpose translation tasks
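The capabilities above can be tried out with a short script. The following is a minimal sketch, assuming the `transformers`, `sentencepiece`, and `torch` packages are installed and that the checkpoint is loaded through the Marian classes in Hugging Face Transformers. The helper names `chunks` and `translate` and the default batch size are assumptions made for illustration, not part of the model's documentation.

```python
MODEL_NAME = "Helsinki-NLP/opus-mt-zh-vi"  # model id taken from the Model URL


def chunks(items, size):
    """Yield consecutive slices of at most `size` items."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def translate(texts, model_name=MODEL_NAME, batch_size=8):
    """Translate a list of Chinese strings into Vietnamese.

    Hypothetical helper: downloads the checkpoint on first use and
    decodes with the library's default generation settings.
    """
    # Imported lazily so the helpers above work without transformers installed.
    from transformers import MarianMTModel, MarianTokenizer

    tokenizer = MarianTokenizer.from_pretrained(model_name)
    model = MarianMTModel.from_pretrained(model_name)

    outputs = []
    for batch in chunks(list(texts), batch_size):
        # Tokenize with SentencePiece and pad to the longest sentence in the batch.
        encoded = tokenizer(batch, return_tensors="pt",
                            padding=True, truncation=True)
        generated = model.generate(**encoded)
        outputs.extend(
            tokenizer.batch_decode(generated, skip_special_tokens=True)
        )
    return outputs
```

A call such as `translate(["你好，世界。"])` returns a list with one Vietnamese string per input. Batching keeps memory use bounded when translating many sentences; the batch size of 8 is an arbitrary starting point and can be adjusted to the available hardware.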
Frequently Asked Questions
Q: What makes this model unique?
This model focuses on Chinese to Vietnamese translation and accepts a broad range of Chinese language variants and writing systems. It pairs the transformer-align architecture with a preprocessing pipeline of normalization and SentencePiece tokenization.
Q: What are the recommended use cases?
The model is best suited for general-purpose Chinese to Vietnamese translation. Because it handles multiple Chinese variants and writing systems, it fits applications that must accept Chinese text in varied forms.