Brief-details: DETR-ResNet-101 is a 60.7M parameter transformer-based object detection model that achieves 43.5 AP on COCO, combining a CNN backbone with a transformer encoder-decoder for end-to-end detection.
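A model like this can typically be queried through the transformers object-detection pipeline; a minimal sketch, assuming the checkpoint is published as facebook/detr-resnet-101:

```python
from transformers import pipeline

# Assumed Hub id for the DETR-ResNet-101 checkpoint.
detector = pipeline("object-detection", model="facebook/detr-resnet-101")
for det in detector("street_scene.jpg"):  # local path, URL, or PIL.Image
    print(f"{det['label']}: {det['score']:.2f} at {det['box']}")
```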
Brief Details: A 3.09B parameter GGUF-formatted language model optimized for text generation, with multiple quantization options (2- to 8-bit precision).
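GGUF checkpoints are usually run with llama.cpp or its Python bindings; a minimal sketch with llama-cpp-python, using a placeholder file name for whichever quantization level is downloaded:

```python
from llama_cpp import Llama

# "model-Q4_K_M.gguf" is a placeholder; point this at the quant file you actually downloaded.
llm = Llama(model_path="model-Q4_K_M.gguf", n_ctx=4096)
out = llm("Write a haiku about autumn.", max_tokens=64)
print(out["choices"][0]["text"])
```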
Brief Details: State-of-the-art depth estimation model with 335M parameters, trained on 62M images. Uses DPT architecture with DINOv2 backbone for zero-shot depth perception.
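Zero-shot depth maps can be produced through the transformers depth-estimation pipeline; a minimal sketch, with an illustrative checkpoint id rather than a confirmed one:

```python
from transformers import pipeline

# Repo id is illustrative -- substitute the actual DPT/DINOv2 checkpoint.
depth = pipeline("depth-estimation", model="LiheYoung/depth-anything-large-hf")
result = depth("room.jpg")
result["depth"].save("room_depth.png")  # PIL image of the predicted depth map
```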
Brief Details: Vision Transformer (ViT) model with 22.1M params, pretrained on ImageNet-21k and fine-tuned on ImageNet-1k, optimized for 224x224 images.
Brief-details: RobBERT-v2-dutch-ner is a state-of-the-art Dutch language model for named entity recognition, built on the RoBERTa architecture and released under an MIT license.
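Entity tags can be pulled out with the token-classification pipeline; a minimal sketch, assuming the Hub id pdelobelle/robbert-v2-dutch-ner:

```python
from transformers import pipeline

ner = pipeline("token-classification",
               model="pdelobelle/robbert-v2-dutch-ner",  # assumed Hub id
               aggregation_strategy="simple")
# "Jan works at Philips in Eindhoven."
print(ner("Jan werkt bij Philips in Eindhoven."))
```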
Brief-details: ONNX-optimized version of all-MiniLM-L6-v2 for efficient sentence similarity and text embeddings, with an Apache 2.0 license and 215K+ downloads.
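Recent sentence-transformers releases (3.2+) can load the ONNX weights directly via the backend argument; a minimal sketch under that assumption:

```python
from sentence_transformers import SentenceTransformer

# backend="onnx" requires sentence-transformers >= 3.2 plus the onnxruntime extra.
model = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2", backend="onnx")
embeddings = model.encode(["How do I reset my password?",
                           "Password reset instructions"])
print(model.similarity(embeddings[0:1], embeddings[1:2]))  # cosine similarity matrix
```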
Brief Details: A powerful CLIP model using ConvNeXt-Large architecture, trained on LAION-2B dataset, achieving 75.9% ImageNet zero-shot accuracy with enhanced efficiency.
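Zero-shot classification with an OpenCLIP ConvNeXt checkpoint usually follows the sketch below; the hf-hub id is illustrative, not confirmed:

```python
import torch
import open_clip
from PIL import Image

HUB_ID = "hf-hub:laion/CLIP-convnext_large_d.laion2B-s26B-b102K-augreg"  # illustrative id
model, preprocess = open_clip.create_model_from_pretrained(HUB_ID)
tokenizer = open_clip.get_tokenizer(HUB_ID)

image = preprocess(Image.open("cat.jpg")).unsqueeze(0)
text = tokenizer(["a photo of a cat", "a photo of a dog"])
with torch.no_grad():
    img = model.encode_image(image)
    txt = model.encode_text(text)
    img = img / img.norm(dim=-1, keepdim=True)
    txt = txt / txt.norm(dim=-1, keepdim=True)
    print((100 * img @ txt.T).softmax(dim=-1))  # zero-shot label probabilities
```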
Brief Details: Large-scale OCR model (608M params) using a transformer architecture for printed text recognition. Developed by Microsoft, with high accuracy for document processing.
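Printed-text OCR with a TrOCR-style checkpoint can go through the image-to-text pipeline; a minimal sketch, assuming the checkpoint is microsoft/trocr-large-printed:

```python
from transformers import pipeline

# TrOCR expects a single cropped text line per image.
ocr = pipeline("image-to-text", model="microsoft/trocr-large-printed")
print(ocr("receipt_line.png")[0]["generated_text"])
```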
Brief Details: WavLM-Large is Microsoft's advanced speech processing model trained on 94k hours of audio data, optimized for speech recognition and speaker identification.
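WavLM-Large ships without a task head, so the usual pattern is to extract frame-level features and fine-tune a head on top; a minimal sketch, assuming the Hub id microsoft/wavlm-large:

```python
import numpy as np
import torch
from transformers import AutoFeatureExtractor, WavLMModel

extractor = AutoFeatureExtractor.from_pretrained("microsoft/wavlm-large")
model = WavLMModel.from_pretrained("microsoft/wavlm-large")

waveform = np.zeros(16000, dtype=np.float32)  # 1 s of 16 kHz audio as a stand-in
inputs = extractor(waveform, sampling_rate=16000, return_tensors="pt")
with torch.no_grad():
    hidden = model(**inputs).last_hidden_state  # frame-level representations (batch, frames, 1024)
print(hidden.shape)
```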
Brief Details: H2O.ai's 7B parameter LLaMA2-based chat model with a 4096-token context window, optimized for text generation and conversation tasks.
Brief Details: 4-bit quantized Llama 3.2 (3B params) instruction model optimized for multilingual dialogue, featuring 2.4x faster inference and 58% less memory usage.
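One common way to run a 4-bit variant is through transformers with bitsandbytes; a sketch under that assumption, with the base-model repo id shown as a placeholder rather than the publisher's recommended loader:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

MODEL = "meta-llama/Llama-3.2-3B-Instruct"  # placeholder; or use the pre-quantized 4-bit repo
bnb = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_compute_dtype=torch.bfloat16)

tok = AutoTokenizer.from_pretrained(MODEL)
model = AutoModelForCausalLM.from_pretrained(MODEL, quantization_config=bnb, device_map="auto")

messages = [{"role": "user", "content": "Summarize the benefits of 4-bit quantization."}]
inputs = tok.apply_chat_template(messages, add_generation_prompt=True,
                                 return_tensors="pt").to(model.device)
out = model.generate(inputs, max_new_tokens=128)
print(tok.decode(out[0][inputs.shape[-1]:], skip_special_tokens=True))
```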
Brief Details: T5-3B is a powerful 3B parameter text-to-text transformer model from Google, capable of handling multiple NLP tasks with state-of-the-art performance.
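T5 selects tasks through text prefixes; a minimal sketch with the text2text-generation pipeline, assuming the Hub id t5-3b:

```python
from transformers import pipeline

t5 = pipeline("text2text-generation", model="t5-3b")  # roughly 11 GB of fp32 weights
print(t5("translate English to German: The house is wonderful.")[0]["generated_text"])
```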
Brief-details: Zero-shot text classifier based on DeBERTa-v3 with 184M parameters, trained on 33 datasets for universal binary classification tasks.
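Because it is trained as a universal NLI-style classifier, it plugs into the zero-shot-classification pipeline; a sketch with an illustrative checkpoint id:

```python
from transformers import pipeline

# Repo id is illustrative -- substitute the actual DeBERTa-v3 zero-shot checkpoint.
clf = pipeline("zero-shot-classification",
               model="MoritzLaurer/deberta-v3-base-zeroshot-v1.1-all-33")
print(clf("The delivery arrived two weeks late.",
          candidate_labels=["shipping complaint", "product quality", "billing issue"]))
```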
Brief Details: ProtGPT2 is a 738M parameter language model specialized in protein sequence generation, trained on UniRef50 database with state-of-the-art capabilities in de novo protein design.
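Sequence generation follows the standard causal-LM recipe; a minimal sketch, assuming the Hub id nferruz/ProtGPT2 and sampling settings along the lines of the model card:

```python
from transformers import pipeline

protgpt2 = pipeline("text-generation", model="nferruz/ProtGPT2")
# "<|endoftext|>" is used as the start token for de novo sequences.
seqs = protgpt2("<|endoftext|>", max_length=100, do_sample=True,
                top_k=950, repetition_penalty=1.2, num_return_sequences=3)
for s in seqs:
    print(s["generated_text"])
```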
Brief-details: MetricX-23 QE-Large is a reference-free translation quality evaluation model, utilizing the mT5 architecture to predict translation error scores on a 0-25 scale.
Brief Details: Face parsing model built on SegFormer architecture with 84.6M parameters. Fine-tuned on CelebAMask-HQ for detailed facial feature segmentation.
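SegFormer face-parsing checkpoints work with the image-segmentation pipeline; a minimal sketch with an illustrative repo id:

```python
from transformers import pipeline

# Repo id is illustrative -- substitute the actual CelebAMask-HQ fine-tune.
parser = pipeline("image-segmentation", model="jonathandinu/face-parsing")
for region in parser("portrait.jpg"):
    print(region["label"])  # e.g. skin, nose, hair, ...
    # region["mask"] is a PIL image that can be composited or saved
```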
Brief Details: A powerful sentence embedding model producing 768-dimensional vectors, based on the MPNet architecture. Optimized for semantic similarity and NLI tasks; 109M parameters.
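Typical usage goes through sentence-transformers; a minimal sketch, assuming the checkpoint is sentence-transformers/all-mpnet-base-v2:

```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("sentence-transformers/all-mpnet-base-v2")  # assumed Hub id
emb = model.encode(["A man is eating food.", "Someone is having a meal."],
                   normalize_embeddings=True)
print(util.cos_sim(emb[0], emb[1]))  # cosine similarity of the 768-dim vectors
```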
Brief-details: Multilingual zero-shot classification model supporting 100 languages, fine-tuned on XNLI and MNLI datasets. 279M parameters, state-of-the-art performance for base-sized multilingual transformers.
Brief-details: MetricX-23-Large is a PyTorch-based translation evaluation model, trained on MQM data to assess translation quality on a 0-25 scale.
Brief-details: LTP Small - A Chinese NLP toolkit supporting 6 core tasks including word segmentation, POS tagging, and NER, with 98.4% segmentation accuracy and 43.13 sentences/second processing speed.
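With the ltp package (4.x), the toolkit exposes its tasks through a single pipeline call; a sketch under that assumption, with the Hub id LTP/small taken as illustrative:

```python
from ltp import LTP

ltp = LTP("LTP/small")  # illustrative hub id for the small checkpoint
# "He asked Tom to fetch the coat."
output = ltp.pipeline(["他叫汤姆去拿外衣。"], tasks=["cws", "pos", "ner"])
print(output.cws)  # word segmentation
print(output.pos)  # part-of-speech tags
print(output.ner)  # named entities
```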
Brief Details: Chinese RoBERTa model with Whole Word Masking, optimized for NLP tasks. BERT-based architecture, with 226K+ downloads under an Apache 2.0 license.
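As a masked LM it can be exercised through the fill-mask pipeline; a minimal sketch, assuming the checkpoint is hfl/chinese-roberta-wwm-ext:

```python
from transformers import pipeline

fill = pipeline("fill-mask", model="hfl/chinese-roberta-wwm-ext")
# "The capital of China is [MASK]jing."
print(fill("中国的首都是[MASK]京。"))
```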