phobert-base

phobert-base

vinai

PhoBERT base model - State-of-the-art Vietnamese language model based on RoBERTa architecture, with 43 likes and 174K+ downloads

PropertyValue
AuthorVINAI
LicenseMIT
LanguageVietnamese
PaperResearch Paper

What is phobert-base?

PhoBERT-base is a state-of-the-art language model specifically designed for Vietnamese language processing. Named after "Phở", a popular Vietnamese dish, it represents one of the first public large-scale monolingual language models pre-trained for Vietnamese. Built on the RoBERTa architecture, which optimizes BERT's pre-training procedure, it achieves superior performance in Vietnamese NLP tasks.

Implementation Details

The model utilizes the RoBERTa architecture and is implemented using PyTorch. It employs advanced transformer-based technology and has been extensively pre-trained on Vietnamese text data.

  • Based on RoBERTa architecture for robust performance
  • Supports Fill-Mask task operations
  • Compatible with PyTorch framework
  • Optimized for Vietnamese language processing

Core Capabilities

  • Part-of-speech tagging
  • Dependency parsing
  • Named-entity recognition
  • Natural language inference
  • Masked language modeling

Frequently Asked Questions

Q: What makes this model unique?

PhoBERT is the first large-scale monolingual language model specifically designed for Vietnamese, achieving state-of-the-art performance across multiple NLP tasks. Its architecture is optimized based on RoBERTa, making it particularly effective for Vietnamese language processing.

Q: What are the recommended use cases?

The model is ideal for Vietnamese language processing tasks including part-of-speech tagging, dependency parsing, named-entity recognition, and natural language inference. It's particularly useful for researchers and developers working on Vietnamese NLP applications.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026