twitter-xlm-roberta-base

twitter-xlm-roberta-base

cardiffnlp

Multilingual XLM-RoBERTa model trained on 198M tweets, optimized for Twitter text analysis and sentiment tasks across 30+ languages.

PropertyValue
Authorcardiffnlp
PaperXLM-T Paper
Downloads30,725
FrameworkPyTorch, TensorFlow

What is twitter-xlm-roberta-base?

twitter-xlm-roberta-base is a multilingual language model based on XLM-RoBERTa architecture, specifically trained on approximately 198 million tweets across more than 30 languages. This model represents a significant advancement in multilingual social media text analysis, particularly designed to handle the unique characteristics of Twitter content.

Implementation Details

The model builds upon the XLM-RoBERTa architecture and includes specialized preprocessing for Twitter-specific content, such as handling usernames and URLs. It supports various tasks including fill-mask operations and sentiment analysis, with particular emphasis on cross-lingual capabilities.

  • Pre-trained on 198M multilingual tweets
  • Supports text similarity computation across different languages
  • Includes specialized Twitter text preprocessing
  • Implements masked language modeling

Core Capabilities

  • Multilingual tweet analysis and understanding
  • Cross-lingual similarity computation
  • Sentiment analysis across multiple languages
  • Emoji and social media content handling
  • Support for over 30 languages

Frequently Asked Questions

Q: What makes this model unique?

This model's uniqueness lies in its specific training on Twitter data across multiple languages, making it particularly effective for social media analysis. Unlike traditional language models, it's optimized for handling informal language, emojis, and Twitter-specific content patterns.

Q: What are the recommended use cases?

The model is ideal for multilingual social media analysis, sentiment analysis, tweet similarity computation, and general natural language understanding tasks involving Twitter content. It's particularly useful for applications requiring cross-lingual capabilities in social media contexts.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026