Brief-details: Russian language model for detecting 18 sensitive topics in text, including crime, discrimination, and social issues. Trained on manually and semi-automatically labeled data.
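A minimal sketch of how a multi-label topic detector like this is typically queried through the transformers text-classification pipeline; the model ID, example text, and 0.5 threshold below are placeholders, not details taken from this entry.

```python
from transformers import pipeline

# Placeholder checkpoint name; substitute the actual model for this entry.
clf = pipeline("text-classification", model="org/russian-sensitive-topics", top_k=None)

# top_k=None returns a score for every topic label, so each of the 18 topics
# can be thresholded independently (multi-label setup).
scores = clf(["Пример русского текста для проверки."])[0]
flagged = [s for s in scores if s["score"] > 0.5]
print(flagged)
```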
Brief Details: Russian text detoxification model based on ruT5-base (223M params) that converts toxic Russian text into neutral language while preserving its meaning.
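Since ruT5 is a T5-style encoder-decoder, detoxification is a plain sequence-to-sequence rewrite. A hedged sketch, assuming a hypothetical checkpoint name and standard generation settings:

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# Placeholder checkpoint name for the ruT5-based detoxifier described above.
model_name = "org/ruT5-base-detox"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

toxic = "Пример токсичного русского предложения."
inputs = tokenizer(toxic, return_tensors="pt")
# The model rewrites the input as a neutral paraphrase with the same meaning.
outputs = model.generate(**inputs, max_new_tokens=64, num_beams=5)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```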
Brief Details: Russian language model for detecting inappropriate messages that could harm reputation; focuses on sensitive topics beyond toxicity and achieves 89% accuracy.
Brief-details: Specialized speech recognition model for Quran recitation, built on the wav2vec2 architecture. Fine-tuned on Arabic speech data for accurate Quranic verse identification.
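wav2vec2 checkpoints are usually exposed through the automatic-speech-recognition pipeline. A sketch under that assumption; the model ID and audio path are placeholders:

```python
from transformers import pipeline

# Placeholder checkpoint name for the wav2vec2 recitation model above.
asr = pipeline("automatic-speech-recognition", model="org/wav2vec2-quran-arabic")

# Expects an audio file (typically 16 kHz mono); the path is a placeholder.
result = asr("recitation_sample.wav")
print(result["text"])
```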
Brief Details: A Spanish biomedical RoBERTa model trained on 1B+ tokens of clinical text, achieving SOTA results on medical NER tasks with a 90.04% F1 score.
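For a NER fine-tune like this, the token-classification pipeline with span aggregation is the usual entry point. A sketch with a placeholder model ID and invented example sentence:

```python
from transformers import pipeline

# Placeholder checkpoint name for the Spanish clinical NER model above.
ner = pipeline("token-classification",
               model="org/roberta-es-clinical-ner",
               aggregation_strategy="simple")  # merge sub-word pieces into entity spans

text = "Paciente con diabetes mellitus tipo 2 tratado con metformina."
for entity in ner(text):
    print(entity["entity_group"], entity["word"], round(entity["score"], 3))
```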
Brief Details: Japanese language model based on GPT-J 6B, specialized in storytelling. Features 6B parameters, 28 layers, and supports both Japanese and English text generation.
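A GPT-J-style storyteller is used through ordinary causal-LM sampling. A hedged sketch with a placeholder checkpoint name and prompt; loading a 6B model this way assumes accelerate is installed and enough GPU memory is available:

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

# Placeholder checkpoint name for the 6B Japanese storytelling model above.
model_name = "org/gpt-j-6b-japanese-story"
tokenizer = AutoTokenizer.from_pretrained(model_name)
# device_map="auto" (requires accelerate) spreads the 6B weights across available devices.
model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto")

prompt = "昔々、あるところに"  # "Once upon a time, in a certain place..."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=120, do_sample=True,
                         temperature=0.8, top_p=0.95)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```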
Brief Details: RoBERTa-based depression detection model achieving 97.45% accuracy. Fine-tuned for identifying depressive content in text; MIT licensed.
Brief-details: SGPT-125M is a lightweight GPT-based sentence embedding model optimized for semantic search, featuring weighted-mean pooling and BitFit training.
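The distinctive piece here is position-weighted mean pooling over the hidden states of a GPT-style encoder. A sketch of that pooling and a toy semantic-search comparison, assuming a placeholder checkpoint name (the published model may also use query/document markers not shown here):

```python
import torch
from transformers import AutoModel, AutoTokenizer

# Placeholder checkpoint name for the SGPT-125M model above.
model_name = "org/sgpt-125m"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name)
if tokenizer.pad_token is None:          # GPT tokenizers often lack a pad token
    tokenizer.pad_token = tokenizer.eos_token

def embed(texts):
    batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**batch).last_hidden_state          # (batch, seq, dim)
    # Position-weighted mean pooling: later tokens get larger weights,
    # which suits autoregressive (left-to-right) encoders.
    mask = batch["attention_mask"].unsqueeze(-1).float()   # (batch, seq, 1)
    weights = torch.arange(1, hidden.size(1) + 1, dtype=torch.float32)
    weights = weights.view(1, -1, 1) * mask
    return (hidden * weights).sum(dim=1) / weights.sum(dim=1)

query = embed(["How do I reset my password?"])
docs = embed(["Password reset instructions", "Office opening hours"])
print(torch.nn.functional.cosine_similarity(query, docs))
```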
Brief Details: Norwegian BERT-base model (179M params) trained on 200 years of Norwegian text, supporting both Bokmål and Nynorsk variants for masked language modeling.
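Masked language modeling maps directly onto the fill-mask pipeline. A sketch with a placeholder model ID and example sentence:

```python
from transformers import pipeline

# Placeholder checkpoint name for the Norwegian BERT model above.
fill = pipeline("fill-mask", model="org/norwegian-bert-base")

# The mask token depends on the tokenizer; BERT-style models use [MASK].
for pred in fill("Oslo er hovedstaden i [MASK]."):
    print(pred["token_str"], round(pred["score"], 3))
```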
Brief Details: A multilingual translation model fine-tuned on the OPUS-100 dataset, optimized for English-to-Portuguese translation with a BLEU score of 20.61.
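A hedged usage sketch via the translation pipeline; the checkpoint name is a placeholder, and some multilingual checkpoints additionally need source/target language arguments not shown here:

```python
from transformers import pipeline

# Placeholder checkpoint name for the English-to-Portuguese model above.
translator = pipeline("translation", model="org/opus100-en-pt")

result = translator("The report will be published next week.", max_length=128)
print(result[0]["translation_text"])
```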
BRIEF-DETAILS: Hungarian GPT-2 model specialized in news generation, trained on Wikipedia and news sites. Achieves a perplexity of 22.06; MIT licensed.
Brief Details: A fine-tuned DistilRoBERTa model achieving 98.9% accuracy for stereotype detection, particularly focusing on gender bias identification in text.
BRIEF DETAILS: Multilingual question-answering model supporting English, Spanish & Basque, fine-tuned on the SQuAD dataset. Optimized for extractive QA tasks with high accuracy.
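Extractive QA models like this answer by selecting a span from a supplied context. A sketch with a placeholder model ID and an invented question/context pair:

```python
from transformers import pipeline

# Placeholder checkpoint name for the English/Spanish/Basque QA model above.
qa = pipeline("question-answering", model="org/multilingual-squad-qa")

result = qa(
    question="¿Dónde se celebró la conferencia?",
    context="La conferencia se celebró en Bilbao en octubre de 2021.",
)
print(result["answer"], round(result["score"], 3))  # answer is a span copied from the context
```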
Brief-details: Italian sentiment analysis model for text classification, fine-tuned on the FEEL-IT dataset. Achieves 0.84 accuracy on SENTIPOLC16. Built on the UmBERTo architecture.
Brief Details: M-BERT-Distil-40 is a multilingual BERT model supporting 38 languages, fine-tuned to match CLIP's embedding space. Optimized for cross-lingual text understanding and feature extraction.
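A rough illustration of the cross-lingual setup this entry describes: sentences in different languages are embedded into a shared space aligned with CLIP, so semantically equivalent captions end up close together. The model ID is a placeholder, the pooling is plain mean pooling, and the published checkpoint ships its own loading wrapper (including a learned projection into CLIP's space), so treat this only as a sketch of the idea:

```python
import torch
from transformers import AutoModel, AutoTokenizer

# Placeholder checkpoint name; the real model is normally loaded via its own wrapper.
model_name = "org/m-bert-distil-40"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name)

def text_embedding(sentence: str) -> torch.Tensor:
    batch = tokenizer(sentence, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**batch).last_hidden_state
    mask = batch["attention_mask"].unsqueeze(-1).float()
    return (hidden * mask).sum(1) / mask.sum(1)   # mean-pooled sentence vector

# Captions in different languages should land near each other in the shared space.
emb_en = text_embedding("a dog playing in the snow")
emb_de = text_embedding("ein Hund, der im Schnee spielt")
print(torch.nn.functional.cosine_similarity(emb_en, emb_de))
```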
Brief-details: A Chinese-language BigBird model optimized for 1024-token sequences, featuring Jieba tokenization and an Apache 2.0 license. Ideal for Chinese text processing and feature extraction.
Brief Details: A German-language Longformer model with 153M parameters, trained on the OSCAR corpus. Features an 8192-token sequence length and 512-token attention windows.
Brief-details: Powerful Chinese BERT model trained on a 300GB corpus, achieving SOTA on 9 NLP tasks. Features MLM, POS tagging & SOP training objectives.
Brief Details: DarijaBERT is a pioneering BERT model for Moroccan Arabic (Darija) with 209M parameters, trained on 3M sequences and specializing in dialectal understanding.
Brief Details: BERT-based Chinese POS tagger and dependency parser, pre-trained on Wikipedia texts, supporting Universal Part-Of-Speech tagging; Apache 2.0 licensed.
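Universal POS tagging with a BERT backbone is usually surfaced as token classification; dependency parsing needs the model's own head and is not shown here. A sketch with a placeholder checkpoint name:

```python
from transformers import pipeline

# Placeholder checkpoint name for the Chinese UPOS tagger above.
tagger = pipeline("token-classification", model="org/bert-chinese-upos")

# Each token is assigned a Universal POS tag such as NOUN, VERB, or PRON.
for tok in tagger("我喜欢读书。"):
    print(tok["word"], tok["entity"])
```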
Brief-details: A specialized BERT model for Persian language understanding, featuring zero-width non-joiner character handling and trained on diverse Persian corpora.