Brief-details: A compact BERT variant that's 7.5x smaller and 9.4x faster than BERT-base, using transformer distillation for efficient NLP tasks.
Brief-details: RoBERTa-based toxicity classifier trained on Jigsaw datasets. Achieves 0.98 AUC-ROC for toxic content detection. Popular with 128K+ downloads.
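A classifier like this is typically used through the transformers text-classification pipeline. A minimal sketch, assuming a hypothetical repo ID (substitute the actual model ID):

```python
from transformers import pipeline

# "your-org/toxicity-roberta" is a placeholder; substitute the actual
# Hugging Face repo ID of the toxicity classifier described above.
classifier = pipeline("text-classification", model="your-org/toxicity-roberta")

# Each result is a dict with a predicted label (e.g. toxic / non-toxic) and a score.
for result in classifier(["You are wonderful.", "You are an idiot."]):
    print(result["label"], round(result["score"], 3))
```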
Brief-details: Quantized GGUF variant of Meraj-Mini with 7.62B parameters, offering multiple precision options (2-8 bit) for efficient text generation and conversation tasks.
Brief-details: Quantized version of the Mistral-Large-Instruct model with 123B parameters, available in multiple GGUF precision formats (2-8 bit) for efficient deployment.
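GGUF quantizations such as the two above are usually run with llama.cpp or its Python bindings. A minimal sketch using llama-cpp-python, with the file name as a placeholder for whichever precision variant was downloaded:

```python
from llama_cpp import Llama

# Path to a downloaded GGUF file; the name below is a placeholder for whichever
# precision variant (e.g. Q4_K_M) you fetched from the model repo.
llm = Llama(model_path="./model-Q4_K_M.gguf", n_ctx=4096)

# Chat-style generation; lower-bit quants trade output quality for memory and speed.
out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Summarize GGUF quantization in one sentence."}],
    max_tokens=128,
)
print(out["choices"][0]["message"]["content"])
```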
Brief-details: HuatuoGPT-Vision-7B is a 7.94B-parameter multimodal LLM specialized for medical image analysis, built on Qwen2-7B using the LLaVA architecture.
Brief-details: FLUX.1-dev-gguf is a quantized text-to-image generation model with 11.9B parameters, optimized for GGUF format and compatible with the ComfyUI framework.
Brief-details: A compact 22.6M parameter text embedding model optimized for retrieval tasks, achieving state-of-the-art retrieval quality for its size with 384-dim embeddings and an NDCG@10 of 50.15.
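For a retrieval-oriented embedding model like this, the sentence-transformers API is the usual entry point. A minimal sketch, with the model ID as a placeholder:

```python
from sentence_transformers import SentenceTransformer, util

# Placeholder repo ID; substitute the actual 384-dim embedding model described above.
model = SentenceTransformer("your-org/small-embed-384")

docs = ["GGUF is a quantized model file format.", "Paris is the capital of France."]
query = "What file format stores quantized LLM weights?"

doc_emb = model.encode(docs, normalize_embeddings=True)
query_emb = model.encode(query, normalize_embeddings=True)

# Cosine similarity ranks documents for the query (higher is more relevant).
scores = util.cos_sim(query_emb, doc_emb)[0]
print(sorted(zip(scores.tolist(), docs), reverse=True))
```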
Brief-details: Russian speech recognition model based on Whisper Large V3, fine-tuned on Russian datasets with 1.54B parameters, achieving ~10% WER without punctuation.
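A Whisper fine-tune like this is normally driven through the automatic-speech-recognition pipeline. A minimal sketch, assuming a placeholder repo ID and a local audio file:

```python
from transformers import pipeline

# Placeholder repo ID; substitute the Russian Whisper Large V3 fine-tune described above.
asr = pipeline(
    "automatic-speech-recognition",
    model="your-org/whisper-large-v3-russian",
    chunk_length_s=30,  # long-form audio is processed in 30-second chunks
)

# generate_kwargs pins the decoding language/task for a multilingual Whisper checkpoint.
result = asr("sample_ru.wav", generate_kwargs={"language": "russian", "task": "transcribe"})
print(result["text"])
```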
Brief-details: Inception-v3 model for image classification with 23.9M params, trained on ImageNet-1k. Efficient architecture for computer vision tasks at 299x299 resolution.
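The timm library ships this architecture with pretrained ImageNet-1k weights. A minimal classification sketch using the standard timm checkpoint name (adjust if the entry refers to a different variant):

```python
import timm
import torch
from PIL import Image

# Standard timm Inception-v3 checkpoint; swap in the exact variant if it differs.
model = timm.create_model("inception_v3", pretrained=True).eval()

# Build the matching preprocessing (resize/crop to 299x299, ImageNet normalization).
cfg = timm.data.resolve_data_config({}, model=model)
transform = timm.data.create_transform(**cfg)

img = Image.open("cat.jpg").convert("RGB")
with torch.no_grad():
    logits = model(transform(img).unsqueeze(0))

# Top-5 class probabilities over the 1000 ImageNet classes.
print(logits.softmax(dim=-1).topk(5))
```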
Brief-details: NVIDIA's 51B-parameter LLM optimized for efficiency, based on Llama-3.1. Uses Neural Architecture Search (NAS) for a better performance-to-cost ratio.
Brief-details: Quantized ONNX version of BGE-small for efficient text embeddings and similarity search. Optimized for production use and released under the Apache 2.0 license.
Brief-details: E2-TTS is a non-autoregressive zero-shot text-to-speech model trained on the Emilia dataset, offering efficient, high-quality speech synthesis.
Brief-details: IDM-VTON is an advanced virtual try-on AI model based on SDXL, enabling realistic, authentic clothing transfer onto person images through improved diffusion techniques.
Brief-details: BERT implementation with Flash-Attention optimization, featuring configurable attention windows, MLPs, and checkpointing for improved GPU performance.
Brief-details: Vietnamese Named Entity Recognition model based on the ELECTRA architecture. Achieves a 92.14% F1 score on VLSP 2018. Detects location, person, and organization entities.
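NER models like this one (and the RoBERTuito tweet tagger further down) plug into the token-classification pipeline. A minimal sketch with a placeholder repo ID:

```python
from transformers import pipeline

# Placeholder repo ID; substitute the actual ELECTRA-based Vietnamese NER model.
ner = pipeline(
    "token-classification",
    model="your-org/electra-vi-ner",
    aggregation_strategy="simple",  # merge word pieces into full entity spans
)

for ent in ner("Ông Nguyễn Văn A làm việc tại Hà Nội cho công ty FPT."):
    print(ent["entity_group"], ent["word"], round(ent["score"], 3))
```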
Brief-details: PEGASUS-based paraphrasing model fine-tuned for text reformulation with support for multiple output variations. Popular with 132K+ downloads.
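Multiple output variations typically come from beam search with num_return_sequences. A minimal sketch, assuming a placeholder PEGASUS paraphrase checkpoint:

```python
from transformers import PegasusForConditionalGeneration, PegasusTokenizer

# Placeholder repo ID; substitute the fine-tuned PEGASUS paraphraser described above.
name = "your-org/pegasus-paraphrase"
tokenizer = PegasusTokenizer.from_pretrained(name)
model = PegasusForConditionalGeneration.from_pretrained(name)

text = "The quick brown fox jumps over the lazy dog."
batch = tokenizer([text], truncation=True, padding=True, return_tensors="pt")

# Beam search with several returned beams yields distinct paraphrases of the input.
outputs = model.generate(**batch, max_length=60, num_beams=10, num_return_sequences=3)
for paraphrase in tokenizer.batch_decode(outputs, skip_special_tokens=True):
    print(paraphrase)
```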
Brief-details: Qwen2-VL is a 7B-parameter vision-language model optimized with GPTQ Int4 quantization, featuring dynamic resolution handling and multilingual support for image and video inputs.
Brief-details: Multilingual DeBERTa-v3 model fine-tuned on SQuAD2.0 for question answering, supporting 94 languages with 278M parameters. Achieves 84% F1 score.
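Extractive QA models like this take a question/context pair. A minimal sketch via the question-answering pipeline, with the repo ID as a placeholder:

```python
from transformers import pipeline

# Placeholder repo ID; substitute the multilingual mDeBERTa-v3 SQuAD2.0 model described above.
qa = pipeline("question-answering", model="your-org/mdeberta-v3-base-squad2")

result = qa(
    question="How many languages does the model support?",
    context="The multilingual DeBERTa-v3 model was fine-tuned on SQuAD2.0 and supports 94 languages.",
)
# SQuAD2.0-style models can also return an empty answer when the context contains none.
print(result["answer"], round(result["score"], 3))
```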
Brief-details: RobeCzech is a 126M-parameter Czech language model based on the RoBERTa architecture, trained for masked language modeling, with strong performance on Czech NLP tasks.
Brief-details: Vision Transformer model with 304M params, trained on LVD-142M dataset using DINOv2 self-supervised learning. Optimized for image feature extraction.
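Self-supervised backbones like this are typically used as frozen feature extractors. A minimal sketch with the transformers AutoModel API, using a placeholder repo ID:

```python
import torch
from PIL import Image
from transformers import AutoImageProcessor, AutoModel

# Placeholder repo ID; substitute the DINOv2 ViT-Large checkpoint described above.
name = "your-org/dinov2-large"
processor = AutoImageProcessor.from_pretrained(name)
model = AutoModel.from_pretrained(name).eval()

image = Image.open("cat.jpg").convert("RGB")
inputs = processor(images=image, return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs)

# The CLS token embedding serves as a global image descriptor for retrieval or clustering.
features = outputs.last_hidden_state[:, 0]
print(features.shape)
```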
Brief-details: Named Entity Recognition model for Spanish/English tweets, based on RoBERTuito. Achieves 68.5% accuracy on the LinCE benchmark. Popular with 135K+ downloads.