Brief-details: State-of-the-art text embedding model with 434M parameters, supporting an 8192-token context length and achieving a 65.39 average MTEB score. Built on a transformer++ architecture.
Brief Details: Multilingual sentiment analysis model based on XLM-RoBERTa, achieving 69.3% accuracy across languages on tweet sentiment classification.
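A minimal usage sketch with the transformers pipeline; the checkpoint id below is an assumption, since the entry does not name the exact repository.

```python
from transformers import pipeline

# Checkpoint id is assumed; substitute the actual repository for this entry.
sentiment = pipeline(
    "text-classification",
    model="cardiffnlp/twitter-xlm-roberta-base-sentiment",
)

print(sentiment("¡Me encanta este teléfono!"))  # e.g. [{'label': 'positive', 'score': ...}]
```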
Brief-details: SDXL 1.0 Base - Advanced text-to-image diffusion model from Stability AI. Features dual text encoders and improved generation quality over SD 1.5/2.1.
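A text-to-image sketch using the diffusers library; the checkpoint id and the CUDA device are assumptions based on the "SDXL 1.0 Base" description.

```python
import torch
from diffusers import StableDiffusionXLPipeline

# Checkpoint id assumed from the "SDXL 1.0 Base" name; adjust as needed.
pipe = StableDiffusionXLPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0",
    torch_dtype=torch.float16,
).to("cuda")  # assumes a CUDA GPU is available

image = pipe(prompt="a watercolor painting of a lighthouse at dusk").images[0]
image.save("lighthouse.png")
```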
Brief-details: CamemBERT is a powerful French language model based on the RoBERTa architecture with 110M parameters, trained on the OSCAR dataset for masked language modeling tasks.
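A minimal masked-language-modeling sketch; the checkpoint name camembert-base is an assumption based on the description.

```python
from transformers import pipeline

# "camembert-base" checkpoint name assumed; CamemBERT uses "<mask>" as its mask token.
fill_mask = pipeline("fill-mask", model="camembert-base")

for pred in fill_mask("Paris est la <mask> de la France."):
    print(pred["token_str"], round(pred["score"], 3))
```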
Brief-details: Base-sized English embedding model (109M params) optimized for retrieval and semantic search, achieving strong MTEB benchmark performance across tasks like clustering and reranking.
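A semantic-search sketch with sentence-transformers; the checkpoint id is a placeholder, since the entry does not name the exact model.

```python
from sentence_transformers import SentenceTransformer, util

# Placeholder checkpoint id; substitute the model this entry refers to.
model = SentenceTransformer("BAAI/bge-base-en-v1.5")

docs = ["The cat sat on the mat.", "Stock markets rallied on Friday."]
query = "Where did the cat sit?"

doc_emb = model.encode(docs, normalize_embeddings=True)
query_emb = model.encode(query, normalize_embeddings=True)

print(util.cos_sim(query_emb, doc_emb))  # cosine similarity of the query to each doc
```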
Brief-details: DistilRoBERTa-base is a lightweight, distilled version of RoBERTa with 82.8M parameters, running roughly twice as fast while maintaining strong language understanding capabilities.
Brief-details: DistilGPT2 is a compressed version of GPT-2 with 82M parameters, trained via knowledge distillation for faster, lighter text generation while maintaining strong performance.
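A short text-generation sketch; "distilgpt2" is the commonly used checkpoint name and is treated as an assumption here.

```python
from transformers import pipeline

# "distilgpt2" checkpoint name assumed.
generator = pipeline("text-generation", model="distilgpt2")

out = generator(
    "Once upon a time,",
    max_new_tokens=40,
    do_sample=True,
    temperature=0.8,
)
print(out[0]["generated_text"])
```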
Brief Details: InfoXLM-Large: Microsoft's cross-lingual language model built on an information-theoretic pre-training framework. 2.8M+ downloads, popular for multilingual NLP tasks.
Brief-details: State-of-the-art English language embedding model with 335M parameters, achieving top performance on MTEB benchmarks. Optimized for retrieval and similarity tasks.
Brief Details: A DETR-based transformer model with 28.8M parameters for table detection in documents, trained on the PubTables-1M dataset and released under the MIT license.
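A detection sketch assuming the Table Transformer checkpoint and a local page image; both the repo id and the input file are assumptions for illustration.

```python
import torch
from PIL import Image
from transformers import AutoImageProcessor, TableTransformerForObjectDetection

# Checkpoint id and input file are assumptions for illustration.
ckpt = "microsoft/table-transformer-detection"
processor = AutoImageProcessor.from_pretrained(ckpt)
model = TableTransformerForObjectDetection.from_pretrained(ckpt)

image = Image.open("scanned_page.png").convert("RGB")
inputs = processor(images=image, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# Convert raw logits/boxes into thresholded detections in image coordinates.
results = processor.post_process_object_detection(
    outputs, threshold=0.7, target_sizes=[image.size[::-1]]
)[0]
for score, label, box in zip(results["scores"], results["labels"], results["boxes"]):
    print(model.config.id2label[label.item()], round(score.item(), 2), box.tolist())
```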
Brief Details: Indonesian RoBERTa-based POS tagger achieving 96.25% accuracy on the IndoNLU dataset. 124M params, MIT licensed, optimized for Indonesian text.
Brief Details: DistilHuBERT - Efficient speech representation model with 23.5M params. 75% smaller than HuBERT while maintaining performance. Ideal for academic/small-scale ML.
Brief Details: SigLIP vision-language model with 878M parameters optimized for zero-shot classification. Uses a sigmoid loss and was trained on the WebLI dataset at 384x384 resolution.
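A zero-shot image-classification sketch with the transformers pipeline; the checkpoint id and image path are assumptions.

```python
from transformers import pipeline

# Checkpoint id assumed from the SigLIP description; adjust to the actual repo.
classifier = pipeline(
    "zero-shot-image-classification",
    model="google/siglip-so400m-patch14-384",
)

print(classifier("photo.jpg", candidate_labels=["a cat", "a dog", "a car"]))
```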
Brief Details: BART-Large-MNLI is a 407M-parameter model fine-tuned on NLI and widely used for zero-shot text classification, including multi-label settings.
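A zero-shot classification sketch; facebook/bart-large-mnli is the usual checkpoint name for this model and is treated as an assumption here.

```python
from transformers import pipeline

# Checkpoint id assumed; multi_label=True scores each label independently.
classifier = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")

result = classifier(
    "The new phone has an amazing camera and battery life.",
    candidate_labels=["technology", "sports", "politics"],
    multi_label=True,
)
print(list(zip(result["labels"], result["scores"])))
```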
Brief-details: BGE-M3 is a versatile multilingual embedding model supporting dense retrieval, lexical matching, and multi-vector interaction across 100+ languages with an 8192-token context.
Brief Details: A compact yet powerful ColBERT-based retrieval model with 33.4M parameters, outperforming larger models in passage retrieval tasks while maintaining efficiency.
Brief Details: Audio Spectrogram Transformer with 86.6M params, fine-tuned on AudioSet. Converts audio to spectrograms and classifies them using a ViT-style architecture.
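An audio-classification sketch; the checkpoint id and audio file are assumptions for illustration.

```python
from transformers import pipeline

# Checkpoint id assumed (an AST model fine-tuned on AudioSet); audio decoding needs ffmpeg or librosa.
classifier = pipeline(
    "audio-classification",
    model="MIT/ast-finetuned-audioset-10-10-0.4593",
)

print(classifier("dog_bark.wav", top_k=3))
```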
Brief Details: BART base model (139M params) by Facebook - A transformer-based seq2seq model for text generation and comprehension tasks, pre-trained on English text.
Brief Details: BERTimbau Base - A state-of-the-art BERT model for Brazilian Portuguese with 110M parameters, trained on the brWaC corpus for NLP tasks.
Brief-details: State-of-the-art multilingual speech recognition model with 1.54B parameters, supporting 99 languages and offering improved accuracy over previous versions.
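A transcription sketch assuming a Whisper-style checkpoint; the repo id and audio file are assumptions based on the parameter count and language coverage described.

```python
from transformers import pipeline

# Checkpoint id assumed from the description; substitute the actual repository.
asr = pipeline("automatic-speech-recognition", model="openai/whisper-large-v3")

result = asr("interview_clip.wav")  # path to a local audio file (assumed)
print(result["text"])
```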
Brief-details: Multilingual XLM-RoBERTa model (560M params) fine-tuned for token classification/NER, supporting 94 languages with strong performance on the English CoNLL-2003 dataset.
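A token-classification sketch; the checkpoint id below is a placeholder for the multilingual NER model described, and aggregation_strategy groups subword tokens into entity spans.

```python
from transformers import pipeline

# Placeholder checkpoint id; substitute the actual XLM-RoBERTa NER repository.
ner = pipeline(
    "token-classification",
    model="xlm-roberta-large-finetuned-conll03-english",
    aggregation_strategy="simple",
)

print(ner("Angela Merkel met Emmanuel Macron in Paris."))
```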