robertuito-hate-speech

Property	Value
Base Model	RoBERTuito
Task	Hate Speech Detection
Language	Spanish
Performance	75.9% Macro F1 Score
Source	SemEval 2019 Task 5
Repository	GitHub

What is robertuito-hate-speech?

robertuito-hate-speech is a specialized Spanish language model designed for detecting hate speech in social media content, particularly Twitter. Built upon RoBERTuito, a RoBERTa model pre-trained on Spanish tweets, this model was specifically fine-tuned using the SemEval 2019 Task 5 HatEval corpus. It excels in multi-classification tasks related to hate speech detection.

Implementation Details

The model implements a three-way classification system that analyzes text for hate speech characteristics. It leverages the robust architecture of RoBERTuito, which was trained on over 500 million Spanish tweets, ensuring strong performance on social media content.

Built on RoBERTuito architecture, optimized for Spanish social media text
Trained using SemEval 2019 Task 5 dataset
Achieves state-of-the-art performance with 75.9% Macro F1 score
Outperforms other Spanish language models in hate speech detection

Core Capabilities

HS (Hate Speech): Determines if content contains hate speech
TR (Target): Identifies if the speech targets specific individuals
AG (Aggression): Analyzes the aggressive nature of the content
Cross-lingual capabilities for English-Spanish code-switching scenarios

Frequently Asked Questions

Q: What makes this model unique?

This model combines the power of RoBERTuito's Spanish social media understanding with specialized hate speech detection capabilities. Its three-dimensional analysis (HS, TR, AG) provides a comprehensive assessment of problematic content, making it particularly valuable for content moderation.

Q: What are the recommended use cases?

The model is ideal for social media content moderation, online harassment detection, and research on hate speech in Spanish-language social media. It's particularly effective for platforms dealing with user-generated content and organizations monitoring online discourse.