opus-mt-zh-vi

Helsinki-NLP

Neural machine translation model for Chinese to Vietnamese translation, with a reported BLEU score of 20.0 and a chrF score of 0.385. Built by Helsinki-NLP using the transformer-align architecture.

Property: Value
Developer: Helsinki-NLP
Architecture: transformer-align
BLEU Score: 20.0
chrF Score: 0.385
Training Date: 2020-06-17
Model URL: https://huggingface.co/Helsinki-NLP/opus-mt-zh-vi

What is opus-mt-zh-vi?

opus-mt-zh-vi is a neural machine translation model that translates Chinese text into Vietnamese. Developed by Helsinki-NLP, it uses the transformer-align architecture and SentencePiece tokenization with a 32k vocabulary for both the source and target languages.
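As a rough illustration of how the model might be called, the sketch below loads the checkpoint named in the Model URL through the Hugging Face transformers library. It assumes the checkpoint loads with the Marian classes (MarianMTModel and MarianTokenizer), as OPUS-MT checkpoints on the Hub generally do, and that the transformers, torch, and sentencepiece packages are installed. The helper names batched and translate and the batch size are illustrative choices, not part of the model card.

```python
MODEL_NAME = "Helsinki-NLP/opus-mt-zh-vi"  # taken from the Model URL above


def batched(items, size):
    """Yield successive lists of at most `size` items (hypothetical helper)."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def translate(sentences, batch_size=8):
    """Translate a list of Chinese sentences into Vietnamese.

    Assumption: the checkpoint is a Marian model, so the Marian classes apply.
    """
    # Imported lazily so that `batched` stays usable without transformers installed.
    from transformers import MarianMTModel, MarianTokenizer

    tokenizer = MarianTokenizer.from_pretrained(MODEL_NAME)
    model = MarianMTModel.from_pretrained(MODEL_NAME)

    outputs = []
    for batch in batched(sentences, batch_size):
        # The tokenizer applies the SentencePiece model shipped with the checkpoint.
        encoded = tokenizer(batch, return_tensors="pt", padding=True, truncation=True)
        generated = model.generate(**encoded)
        outputs.extend(tokenizer.batch_decode(generated, skip_special_tokens=True))
    return outputs


# Example call (downloads the model weights on first use):
# print(translate(["你好，世界。", "我喜欢学习语言。"]))
```

Loading the tokenizer and model once and translating in batches keeps repeated calls cheap. For a long-running service, the two from_pretrained calls would normally be moved out of the function and cached.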

Implementation Details

Preprocessing consists of normalization followed by SentencePiece tokenization. The model accepts Chinese text in simplified and traditional script as well as several romanization systems, so it can handle a range of input types.

  • Supports multiple Chinese variants including Mandarin, Cantonese, and Classical Chinese
  • Uses SentencePiece tokenization (spm32k) on both the source and target sides
  • Achieves a BLEU score of 20.0 on the Tatoeba test set
  • Reports a chrF score of 0.385 (character n-gram F-score on a 0 to 1 scale)
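To make the chrF figure easier to interpret, here is a simplified sentence-level sketch of the metric: character n-gram precision and recall are averaged over n = 1..6 and combined into an F-score with beta = 2. It is not the evaluation script behind the reported 0.385; standard tools differ in details such as corpus-level aggregation and whitespace handling, and the function names below are illustrative.

```python
from collections import Counter


def char_ngrams(text, n):
    """Count character n-grams of `text`, ignoring spaces."""
    chars = text.replace(" ", "")
    return Counter(chars[i:i + n] for i in range(len(chars) - n + 1))


def chrf(hypothesis, reference, max_n=6, beta=2.0):
    """Simplified sentence-level chrF on a 0 to 1 scale."""
    precisions, recalls = [], []
    for n in range(1, max_n + 1):
        hyp = char_ngrams(hypothesis, n)
        ref = char_ngrams(reference, n)
        if not hyp or not ref:
            continue  # one of the strings is shorter than n characters
        overlap = sum((hyp & ref).values())  # clipped n-gram matches
        precisions.append(overlap / sum(hyp.values()))
        recalls.append(overlap / sum(ref.values()))
    if not precisions:
        return 0.0
    p = sum(precisions) / len(precisions)
    r = sum(recalls) / len(recalls)
    if p + r == 0:
        return 0.0
    b2 = beta ** 2
    # F-beta: with beta = 2, recall is weighted more heavily than precision.
    return (1 + b2) * p * r / (b2 * p + r)
```

An exact match scores 1.0 and strings with no shared characters score 0.0, so 0.385 sits well inside that range. For comparable published numbers, an established implementation such as sacreBLEU is the safer choice; note that it reports chrF on a 0 to 100 scale.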

Core Capabilities

  • Direct Chinese to Vietnamese translation
  • Handles multiple Chinese writing systems and variants
  • Suitable for both simplified and traditional Chinese input
  • Optimized for general-purpose translation tasks

Frequently Asked Questions

Q: What makes this model unique?

This model is trained specifically for the Chinese to Vietnamese pair and accepts several Chinese language variants and writing systems as input. It combines the transformer-align architecture with normalization and SentencePiece preprocessing.

Q: What are the recommended use cases?

The model is best suited for general-purpose Chinese to Vietnamese translation tasks. Because it handles multiple Chinese variants and writing systems, it is a good fit for applications that receive Chinese text in mixed scripts or formats.
