Models

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

bert-base-uncased-hatexplain

Brief-details: BERT-based hate speech classification model trained on Twitter/Gab data with human rationales. Detects hate speech, offensive content & normal text.

Text Classification

intfloat

e5-small-v2

Brief-details: E5-small-v2 is a 12-layer text embedding model with 33.4M parameters, trained via weakly-supervised contrastive learning for semantic similarity tasks.

Sentence Similarity

facebook

dino-vits16

Brief-details: A self-supervised Vision Transformer (ViT) model for image feature extraction, trained on ImageNet-1k using DINO method, with small architecture and 16x16 patch size.

Image Feature Extraction

cardiffnlp

twitter-roberta-base-irony

BRIEF DETAILS: RoBERTa-based model fine-tuned for irony detection in tweets, trained on 58M tweets. Achieves strong performance on TweetEval benchmark. Ideal for social media analysis.

Text Classification

cross-encoder

stsb-TinyBERT-L-4

BRIEF-DETAILS: Cross-encoder TinyBERT model trained on STS benchmark dataset for semantic similarity scoring, featuring efficient architecture and Apache 2.0 license.

Text Classification

facebook

deit-base-patch16-224

Brief Details: DeiT-base is a data-efficient Vision Transformer with 86M parameters achieving 81.8% ImageNet accuracy, optimized for image classification tasks

Image Classification

jinaai

jina-embeddings-v2-base-en

Brief-details: A powerful 137M parameter English embedding model supporting 8192 token sequences, using ALiBi positioning for enhanced long-text processing and RAG applications.

Feature Extraction

Vikhrmodels

Vikhr-7B-instruct_0.4

Brief Details: Vikhr-7B-instruct_0.4 is a bilingual Russian-English LLM with 7.63B parameters, featuring enhanced SFT training and improved JSON/multiturn capabilities.

Text Generation

tiiuae

falcon-40b-instruct

Brief-details: A 40B parameter instruction-tuned LLM from TII, based on Falcon-40B. Optimized for chat/instruction tasks with FlashAttention and multiquery architecture.

Text Generation

dunzhang

stella_en_1.5B_v5

Brief-details: A 1.5B parameter sentence embedding model trained on GTE-large with multi-dimensional retrieval learning, optimized for semantic search and text similarity tasks

Sentence Similarity

facebook

hubert-base-ls960

Brief Details: HuBERT base model for self-supervised speech representation learning, trained on LibriSpeech. Features 16kHz audio processing and BERT-like prediction architecture.

Feature Extraction

facebook

dinov2-giant

Brief Details: DINOv2-giant: A 1.14B parameter Vision Transformer model for self-supervised image feature extraction, developed by Facebook for robust visual understanding.

Image Feature Extraction

microsoft

deberta-v2-xlarge

BRIEF DETAILS: DeBERTa V2 XLarge: Advanced NLP model with 900M parameters, featuring disentangled attention and enhanced mask decoder. Achieves SOTA performance on NLU tasks.

Fill-Mask

dbmdz

bert-base-italian-xxl-uncased

Brief-details: Italian BERT model trained on 81GB corpus with 13B tokens. Uncased version with 111M parameters optimized for Italian language tasks.

Fill-Mask

pkshatech

GLuCoSE-base-ja-v2

Brief Details: Japanese text embedding model (133M params) optimized for retrieval tasks with SOTA performance, supports CPU inference and cosine similarity.

Sentence Similarity

MoritzLaurer

deberta-v3-large-zeroshot-v2.0

Brief-details: Efficient zero-shot text classifier based on DeBERTa-v3-large (435M params), trained for universal classification tasks with commercial-friendly data and strong performance.

Zero-Shot Classification

biu-nlp

f-coref

Brief-details: F-coref is a fast and accurate coreference resolution model achieving 78.5 F1 on OntoNotes, processing 2.8K documents in 25 seconds on V100 GPU.

Transformers

LoneStriker

Yarn-Llama-2-70b-32k-2.4bpw-h6-exl2

BRIEF-DETAILS: Advanced 70B parameter LLaMA-2 variant with extended 32k token context window, optimized for long-form content processing and generation

Text Generation

Helsinki-NLP

opus-mt-en-ru

Brief Details: A Helsinki-NLP English-to-Russian neural machine translation model with strong BLEU scores (31.1 on newstest2012), based on the Marian framework

Translation

unsloth

Llama-3.2-1B-Instruct-bnb-4bit

Brief-details: Optimized 765M parameter Llama 3.2 model with 4-bit quantization for efficient inference. Supports multilingual tasks with reduced memory footprint and faster processing.

Text Generation

deepset

deberta-v3-base-injection

Brief-details: DeBERTa-v3 model fine-tuned for prompt injection detection, achieving 99.14% accuracy. Built by deepset, it classifies requests as INJECTION or LEGIT.

Text Classification

bert-base-uncased-hatexplain

e5-small-v2

dino-vits16

twitter-roberta-base-irony

stsb-TinyBERT-L-4

deit-base-patch16-224

jina-embeddings-v2-base-en

Vikhr-7B-instruct_0.4

falcon-40b-instruct

stella_en_1.5B_v5

hubert-base-ls960

dinov2-giant

deberta-v2-xlarge

bert-base-italian-xxl-uncased

GLuCoSE-base-ja-v2

deberta-v3-large-zeroshot-v2.0

f-coref

Yarn-Llama-2-70b-32k-2.4bpw-h6-exl2

opus-mt-en-ru

Llama-3.2-1B-Instruct-bnb-4bit

deberta-v3-base-injection

The first platform built for prompt engineering