bert-base-italian-cased-sentiment
Property | Value |
---|---|
License | MIT |
Training Data | 45K pre-processed Italian tweets |
Accuracy | 82% |
Base Model | bert-base-italian-cased |
What is bert-base-italian-cased-sentiment?
This is a specialized BERT model developed by Neuraly for Italian sentiment analysis. Built upon the bert-base-italian-cased foundation, it's been fine-tuned on a dataset of 45,000 Italian tweets to perform three-way sentiment classification (negative, neutral, positive). The model achieves an impressive 82% accuracy on test data.
Implementation Details
The model utilizes the BERT architecture with Italian language specialization. Training was performed using AdamW optimizer with a 2e-5 learning rate and early stopping, running on GTX1080ti hardware. The preprocessing maintains most linguistic features, only removing @mentions, URLs, and emails to preserve semantic richness.
- Trained with batch size of 32 over 3 epochs
- Implements softmax activation for probability distribution
- Uses PyTorch framework for implementation
- Preserves case sensitivity for better language understanding
Core Capabilities
- Three-way sentiment classification (negative, neutral, positive)
- Optimized for Italian language text
- Handles complex sentence structures and informal language
- Suitable for social media content analysis
Frequently Asked Questions
Q: What makes this model unique?
This model specializes in Italian sentiment analysis, which is relatively rare in the NLP landscape. It's particularly effective with social media content while maintaining good performance across various domains despite being trained on tweets.
Q: What are the recommended use cases?
The model is ideal for Italian social media sentiment analysis, customer feedback processing, and general Italian text sentiment classification. While trained on tweets, it shows good generalization to other text types.