Brief-details: Optimized 1.5B parameter Llama-3 model quantized to FP8, offering ~50% memory reduction while retaining 99.8% of the original model's accuracy. Supports 8 languages.
Brief-details: A high-performance CLIP model using the ConvNeXt-Large architecture, trained on the LAION-2B dataset and achieving 76.9% ImageNet accuracy via a weight-averaged model-soup approach.
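A minimal zero-shot classification sketch with the open_clip library follows; the hub tag below is a placeholder (the exact checkpoint is not named above) and should be replaced with the one from the model card.

```python
# Zero-shot image classification with open_clip (hub tag below is a placeholder).
import torch
import open_clip
from PIL import Image

tag = "hf-hub:laion/CLIP-convnext_large-placeholder"  # replace with the real checkpoint
model, _, preprocess = open_clip.create_model_and_transforms(tag)
tokenizer = open_clip.get_tokenizer(tag)

image = preprocess(Image.open("photo.jpg")).unsqueeze(0)
text = tokenizer(["a photo of a cat", "a photo of a dog"])

with torch.no_grad():
    img_feat = model.encode_image(image)
    txt_feat = model.encode_text(text)
    img_feat /= img_feat.norm(dim=-1, keepdim=True)
    txt_feat /= txt_feat.norm(dim=-1, keepdim=True)
    probs = (100.0 * img_feat @ txt_feat.T).softmax(dim=-1)
print(probs)  # probability per candidate caption
```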
Brief-details: Turkish sentiment analysis BERT model with 163M parameters, achieving 99.72% accuracy. Fine-tuned from the TurkishBERTweet base model; MIT-licensed.
Brief-details: Google's T5-v1.1 base model - a text-to-text transfer transformer with GEGLU activation, trained on the C4 dataset. Offers key improvements over the original T5; 262K+ downloads.
Brief Details: A specialized document understanding model fine-tuned for invoice processing, combining Swin Transformer vision encoding with BART text decoding. Supports multilingual invoice analysis.
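The Swin-encoder-plus-BART-decoder combination matches the Donut family in transformers, so a hedged parsing sketch might look like the following; the repo id and task prompt are placeholders, not taken from the entry above.

```python
# Invoice parsing with a Donut-style VisionEncoderDecoder model (ids are placeholders).
from PIL import Image
from transformers import DonutProcessor, VisionEncoderDecoderModel

repo_id = "your-org/donut-invoice-parser"  # placeholder repo id
processor = DonutProcessor.from_pretrained(repo_id)
model = VisionEncoderDecoderModel.from_pretrained(repo_id)

image = Image.open("invoice.png").convert("RGB")
pixel_values = processor(image, return_tensors="pt").pixel_values

task_prompt = "<s_invoice>"  # placeholder; check the model card for the real task token
decoder_input_ids = processor.tokenizer(
    task_prompt, add_special_tokens=False, return_tensors="pt"
).input_ids

outputs = model.generate(pixel_values, decoder_input_ids=decoder_input_ids, max_length=512)
sequence = processor.batch_decode(outputs)[0]
print(processor.token2json(sequence))  # decoded fields as structured JSON
```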
Brief Details: A specialized English ASR model with 242M parameters, based on OpenAI's Whisper architecture. Optimized for English speech recognition with strong accuracy and noise resilience.
Brief-details: Fast and efficient speech recognition model supporting 99 languages, based on OpenAI's Whisper medium variant and optimized with CTranslate2 for faster inference.
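A short transcription sketch with the faster-whisper runtime follows; the model identifier and decoding options are illustrative, not taken from the entry above.

```python
# Transcription with faster-whisper (CTranslate2 backend); model id is illustrative.
from faster_whisper import WhisperModel

model = WhisperModel("medium", device="cpu", compute_type="int8")

segments, info = model.transcribe("audio.wav", beam_size=5)
print(f"Detected language: {info.language} (p={info.language_probability:.2f})")
for segment in segments:
    print(f"[{segment.start:.2f}s -> {segment.end:.2f}s] {segment.text}")
```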
BRIEF DETAILS: A cross-encoder model fine-tuned on MS Marco, achieving 71.01 NDCG@10 on TREC DL 19 with 4100 docs/sec processing speed. Optimized for passage ranking.
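A hedged re-ranking sketch with the sentence-transformers CrossEncoder class; the checkpoint name is inferred from the description and should be verified against the model card.

```python
# Passage re-ranking with a cross-encoder: each (query, passage) pair is scored jointly.
from sentence_transformers import CrossEncoder

model = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-2-v2")  # assumed checkpoint

query = "How many people live in Berlin?"
passages = [
    "Berlin had about 3.7 million registered inhabitants in 2019.",
    "Berlin is well known for its museums and vibrant art scene.",
]
scores = model.predict([(query, p) for p in passages])
for score, passage in sorted(zip(scores, passages), reverse=True):
    print(f"{score:.3f}  {passage}")
```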
Brief Details: InstructBLIP-Vicuna-7B: A 7.91B parameter vision-language model combining the BLIP-2 architecture with the Vicuna-7B LLM for advanced image-text tasks.
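A hedged image-question sketch using the InstructBLIP classes in transformers; the repo id is assumed from the model name above.

```python
# Image-grounded instruction following with InstructBLIP (repo id is an assumption).
import torch
from PIL import Image
from transformers import InstructBlipProcessor, InstructBlipForConditionalGeneration

repo_id = "Salesforce/instructblip-vicuna-7b"  # assumed checkpoint
processor = InstructBlipProcessor.from_pretrained(repo_id)
model = InstructBlipForConditionalGeneration.from_pretrained(repo_id)

device = "cuda" if torch.cuda.is_available() else "cpu"
model.to(device)

image = Image.open("scene.jpg").convert("RGB")
prompt = "What is unusual about this image?"
inputs = processor(images=image, text=prompt, return_tensors="pt").to(device)

generated = model.generate(**inputs, max_new_tokens=64)
print(processor.batch_decode(generated, skip_special_tokens=True)[0].strip())
```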
Brief-details: Large-scale Vision Transformer model trained on ImageNet-21k (14M images) and fine-tuned on ImageNet 2012, specializing in image classification at 384x384 resolution.
Brief Details: SigLIP base model (203M params) for vision-language tasks. Features sigmoid loss function, 224x224 resolution, and zero-shot capabilities.
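A hedged zero-shot classification sketch via the transformers pipeline; the SigLIP repo id is an assumption, not stated above.

```python
# Zero-shot image classification with a SigLIP checkpoint (repo id is an assumption).
from transformers import pipeline

classifier = pipeline(
    task="zero-shot-image-classification",
    model="google/siglip-base-patch16-224",  # assumed checkpoint
)
print(classifier("photo.jpg", candidate_labels=["a photo of a cat", "a photo of a dog"]))
```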
BRIEF-DETAILS: General-purpose text embedding model with 109M parameters. Optimized for LLM retrieval augmentation and diverse embedding needs, with state-of-the-art performance.
BRIEF DETAILS: Multilingual sentence embedding model (135M params) optimized for semantic similarity tasks. Maps text to 768D vectors. Built on DistilBERT.
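A hedged similarity sketch with sentence-transformers; the model name below is a placeholder for the checkpoint described above.

```python
# Multilingual semantic similarity: encode, then compare 768-d vectors by cosine.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("your-org/multilingual-distilbert-embeddings")  # placeholder

sentences = ["The weather is lovely today.", "Das Wetter ist heute schön."]
embeddings = model.encode(sentences, convert_to_tensor=True)
print(float(util.cos_sim(embeddings[0], embeddings[1])))
```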
Brief Details: A Vision Transformer model trained with DINO self-supervision on ImageNet-1k, offering robust image feature extraction using 16x16 patches.
Brief-details: Depth Anything V2 Large - A state-of-the-art depth estimation model with 335M parameters, trained on 595K synthetic + 62M real images for robust monocular depth estimation.
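A hedged depth-estimation sketch through the transformers pipeline; the repo id is an assumption based on the model name above.

```python
# Monocular depth estimation (repo id is an assumption; verify against the model card).
from PIL import Image
from transformers import pipeline

depth_estimator = pipeline(
    task="depth-estimation",
    model="depth-anything/Depth-Anything-V2-Large-hf",  # assumed checkpoint
)
result = depth_estimator(Image.open("room.jpg"))
result["depth"].save("room_depth.png")  # predicted depth rendered as a grayscale image
```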
Brief-details: A distilled version of Whisper medium.en that is 6x faster and 49% smaller (394M params) while keeping WER within 1% of the original for English ASR.
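A hedged English transcription sketch using the transformers ASR pipeline; the repo id is assumed from the description.

```python
# Long-form English transcription with a distilled Whisper model (repo id is assumed).
from transformers import pipeline

asr = pipeline(
    task="automatic-speech-recognition",
    model="distil-whisper/distil-medium.en",  # assumed checkpoint
    chunk_length_s=30,  # chunked decoding for audio longer than 30 seconds
)
print(asr("meeting.wav")["text"])
```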
Brief-details: M3E-base is a 102M parameter bilingual (Chinese-English) embedding model trained on 22M+ sentence pairs, optimized for text similarity and retrieval tasks with state-of-the-art performance.
Brief Details: RoBERTa-based toxicity classifier trained to detect toxic content while minimizing unintended bias. Achieves a score of 0.93639 on the Jigsaw dataset.
Brief-details: A fine-tuned wav2vec2-xls-r-300m model for Turkish speech recognition, achieving 8.62% WER on Common Voice 7, trained with comprehensive preprocessing and a custom language model.
BRIEF DETAILS: DeBERTa-v3 based cross-encoder for Natural Language Inference, achieving 92.38% accuracy on SNLI. Excellent for zero-shot classification and NLI tasks.
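A hedged zero-shot classification sketch with the transformers pipeline, which maps candidate labels onto NLI entailment scores; the repo id is an assumption inferred from the description.

```python
# NLI-based zero-shot text classification (repo id is an assumption).
from transformers import pipeline

classifier = pipeline(
    task="zero-shot-classification",
    model="cross-encoder/nli-deberta-v3-base",  # assumed checkpoint
)
result = classifier(
    "The new graphics card renders 4K scenes at 120 frames per second.",
    candidate_labels=["technology", "cooking", "politics"],
)
print(result["labels"][0], round(result["scores"][0], 3))
```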
Brief Details: A specialized BERT model pre-trained on 12GB of legal texts, optimized for legal NLP tasks with variants for specific legal domains like contracts and EU law.