roberta-base-finetuned-cluener2020-chinese

Maintained By
uer

RoBERTa Base Chinese NER Model

PropertyValue
AuthorUER Team
Base ArchitectureRoBERTa
TaskNamed Entity Recognition
Training DataCLUENER2020
PaperCLUENER2020: Fine-grained NER for Chinese

What is roberta-base-finetuned-cluener2020-chinese?

This is a specialized Chinese language model based on RoBERTa architecture, fine-tuned specifically for Named Entity Recognition tasks. The model was developed by the UER team and trained on the CLUENER2020 dataset, making it particularly effective at identifying and classifying named entities in Chinese text.

Implementation Details

The model was fine-tuned using UER-py framework for 5 epochs with a sequence length of 512. It builds upon the pre-trained chinese_roberta_L-12_H-768 model and uses specialized training procedures to optimize NER performance. The training process includes automatic model saving based on development set performance metrics.

  • Sequence length: 512
  • Batch size: 32
  • Learning rate: 3e-5
  • Training epochs: 5

Core Capabilities

  • Accurate identification of Chinese named entities
  • Support for multiple entity types including addresses and company names
  • Easy integration with Hugging Face's transformers library
  • Optimized for production deployment

Frequently Asked Questions

Q: What makes this model unique?

This model combines the robust performance of RoBERTa with specialized training for Chinese NER tasks, making it particularly effective for identifying named entities in Chinese text. Its fine-tuning on CLUENER2020 provides it with high accuracy for practical applications.

Q: What are the recommended use cases?

The model is ideal for applications requiring Chinese named entity recognition, such as information extraction, content analysis, and automated text processing systems. It's particularly effective for identifying entities like company names, addresses, and other standard named entity categories in Chinese text.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.