Brief Details: RoBERTa-large model fine-tuned on SQuAD2.0 for extractive QA tasks. 354M params, achieves 85.17% exact match on SQuAD2.0 validation set.
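A minimal sketch of extractive QA with a checkpoint like this via the transformers question-answering pipeline; the repo ID deepset/roberta-large-squad2 is an assumption inferred from the description, not stated in the entry.

```python
from transformers import pipeline

# Assumed repo ID for the RoBERTa-large SQuAD2.0 checkpoint described above.
qa = pipeline("question-answering", model="deepset/roberta-large-squad2")

result = qa(
    question="What does an extractive QA model predict?",
    context="Extractive QA models predict an answer span inside a given context, "
            "or an empty answer for unanswerable SQuAD2.0-style questions.",
)
print(result)  # {'score': ..., 'start': ..., 'end': ..., 'answer': ...}
```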
Brief Details: BigVGAN-v2 is a universal neural vocoder for high-quality audio generation, supporting 24kHz sampling rate with 100 mel bands and 256x upsampling, optimized for audio-to-audio tasks.
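For context on the configuration above, a tiny sketch of the frame-to-sample arithmetic, assuming the 256x upsampling factor equals the mel hop length (an assumption, not stated in the entry):

```python
# Mel-spectrogram geometry implied by the entry (assumption: hop length == 256x upsampling factor).
sample_rate = 24_000   # Hz
n_mels = 100           # mel bands per frame
hop_length = 256       # waveform samples generated per mel frame

seconds = 1.0
num_samples = int(seconds * sample_rate)   # 24000 waveform samples
num_frames = num_samples // hop_length     # ~93 mel frames of 100 bands each per second
print(num_samples, num_frames)
```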
Brief Details: SaProt_650M_AF2 is a 650M-parameter protein language model for masked prediction and mutation effect analysis, with AlphaFold2 integration.
Brief-details: Russian GPT-3 small variant trained on 80B tokens, designed for text generation tasks. Based on the GPT-2 architecture; trained with a 1024-token sequence length and fine-tuned for a 2048-token context.
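A minimal generation sketch with the transformers pipeline; the repo ID ai-forever/rugpt3small_based_on_gpt2 is an assumption based on the description.

```python
from transformers import pipeline

# Assumed repo ID for the Russian GPT-3 small checkpoint described above.
generator = pipeline("text-generation", model="ai-forever/rugpt3small_based_on_gpt2")

out = generator("Александр Сергеевич Пушкин родился в ", max_new_tokens=40, do_sample=True)
print(out[0]["generated_text"])
```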
BRIEF DETAILS: A 180B-parameter language model released in GPTQ-quantized form for efficient inference, supporting multiple languages and offering several quantization variants (different bit widths and group sizes) to fit different hardware budgets.
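A loading sketch, assuming a hypothetical GPTQ repo ID (the entry does not name the checkpoint) and that the optimum/auto-gptq integration for transformers is installed; a 180B model needs multiple GPUs even when quantized.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical GPTQ repo ID; device_map="auto" shards the quantized weights across available GPUs.
model_id = "some-org/example-180B-GPTQ"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

inputs = tokenizer("The capital of France is", return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=20)[0]))
```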
BRIEF-DETAILS: Persian BERT model fine-tuned for sentiment analysis on SnappFood reviews, achieving 87.98% F1 score for binary classification of food delivery comments.
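A minimal classification sketch; the repo ID HooshvareLab/bert-fa-base-uncased-sentiment-snappfood is an assumption inferred from the description.

```python
from transformers import pipeline

# Assumed repo ID for the SnappFood sentiment checkpoint described above.
classifier = pipeline(
    "text-classification",
    model="HooshvareLab/bert-fa-base-uncased-sentiment-snappfood",
)

# Persian input: "The food was delicious and arrived quickly."
print(classifier("غذا خیلی خوشمزه بود و سریع رسید"))  # e.g. [{'label': ..., 'score': ...}]
```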
BRIEF DETAILS: BEiT base model with 87M parameters, trained on ImageNet-21k and fine-tuned on ImageNet-1k, specialized for image classification using Vision Transformer architecture.
Brief Details: Microsoft's TrOCR base model (384M params) for OCR tasks: an encoder-decoder transformer pairing an image encoder with an autoregressive text decoder, pre-trained for text extraction from images.
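A minimal OCR sketch with the TrOCR classes from transformers; microsoft/trocr-base-printed is one of the base TrOCR checkpoints and is used here as an assumed example.

```python
from PIL import Image
from transformers import TrOCRProcessor, VisionEncoderDecoderModel

# Assumed repo ID; swap for the handwritten variant if needed.
processor = TrOCRProcessor.from_pretrained("microsoft/trocr-base-printed")
model = VisionEncoderDecoderModel.from_pretrained("microsoft/trocr-base-printed")

image = Image.open("text_line.png").convert("RGB")  # a cropped single-line text image
pixel_values = processor(images=image, return_tensors="pt").pixel_values
generated_ids = model.generate(pixel_values)
print(processor.batch_decode(generated_ids, skip_special_tokens=True)[0])
```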
BRIEF-DETAILS: Sentence-T5-large: 335M parameter model for sentence embeddings, maps text to 768-dimensional vectors. Optimized for sentence similarity with FP16 weights.
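A minimal embedding sketch with the sentence-transformers library; the repo ID sentence-transformers/sentence-t5-large is an assumption based on the description.

```python
from sentence_transformers import SentenceTransformer, util

# Assumed repo ID for the Sentence-T5 large checkpoint described above.
model = SentenceTransformer("sentence-transformers/sentence-t5-large")

embeddings = model.encode(["How old are you?", "What is your age?"])  # shape (2, 768)
print(util.cos_sim(embeddings[0], embeddings[1]))
```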
Brief Details: A Czech-to-English neural machine translation model by Helsinki-NLP, achieving BLEU scores up to 58.0 on Tatoeba dataset, built on Marian framework.
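A minimal translation sketch; Helsinki-NLP/opus-mt-cs-en is the assumed repo ID for the Czech-to-English Marian model described above.

```python
from transformers import pipeline

# Assumed repo ID for the Helsinki-NLP Czech-to-English Marian model.
translator = pipeline("translation", model="Helsinki-NLP/opus-mt-cs-en")

print(translator("Dobrý den, jak se máte?")[0]["translation_text"])
```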
BRIEF-DETAILS: Vision Transformer model with registers, pretrained on LVD-142M dataset using DINOv2. Features 86.6M params and is specialized for image feature extraction.
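A feature-extraction sketch using the generic transformers AutoModel classes; the repo ID facebook/dinov2-with-registers-base is an assumption inferred from the description.

```python
import torch
from PIL import Image
from transformers import AutoImageProcessor, AutoModel

# Assumed repo ID for the DINOv2-with-registers base checkpoint described above.
model_id = "facebook/dinov2-with-registers-base"
processor = AutoImageProcessor.from_pretrained(model_id)
model = AutoModel.from_pretrained(model_id)

image = Image.open("photo.jpg").convert("RGB")
inputs = processor(images=image, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

features = outputs.last_hidden_state[:, 0]  # CLS token as a global image descriptor
print(features.shape)  # (1, 768) for the base model
```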
Brief-details: StarCoder2-15B: A 15B parameter code generation model trained on 600+ programming languages with 16K context window and GQA architecture. Achieves 46.3% pass@1 on HumanEval.
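A minimal code-completion sketch; bigcode/starcoder2-15b is the assumed repo ID for the checkpoint described above.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed repo ID for the StarCoder2-15B checkpoint described above.
model_id = "bigcode/starcoder2-15b"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16, device_map="auto")

prompt = "def fibonacci(n):"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```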
Brief-details: A unified vision-language model that processes both images and videos using dynamic visual tokens, built on the Llama 2 architecture and reporting state-of-the-art results on image and video understanding benchmarks.
Brief Details: BioClinicalMPBERT is a specialized BERT model initialized from BioBERT and trained on MIMIC clinical notes and PadChest data, optimized for medical text analysis.
Brief-details: A quantized 3.2B-parameter multilingual LLM optimized for dialogue, supporting 8 languages with a 128k context length and trained on 9T tokens.
Brief Details: A PEFT adapter for Microsoft's Phi-3-mini-4k-instruct with 20k+ downloads; includes TensorBoard and Safetensors support.
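A sketch of attaching a PEFT adapter to the stated base model with the peft library; the base ID microsoft/Phi-3-mini-4k-instruct is from the entry, while the adapter repo ID below is a hypothetical placeholder.

```python
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base_id = "microsoft/Phi-3-mini-4k-instruct"
adapter_id = "your-org/phi-3-mini-4k-instruct-peft-adapter"  # hypothetical placeholder

tokenizer = AutoTokenizer.from_pretrained(base_id)
base = AutoModelForCausalLM.from_pretrained(base_id, device_map="auto", trust_remote_code=True)
model = PeftModel.from_pretrained(base, adapter_id)  # attaches the LoRA/PEFT weights

inputs = tokenizer("Explain PEFT in one sentence.", return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=40)[0], skip_special_tokens=True))
```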
BRIEF DETAILS: 12.2B-parameter GGUF model optimized for general-purpose tasks, roleplay, and story writing. Offers multiple quantization options from 3.1GB to 10.2GB, with importance-matrix (imatrix) quantization for improved quality.
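A minimal local-inference sketch with llama-cpp-python, assuming one of the GGUF files mentioned above has been downloaded; the file name below is a hypothetical placeholder.

```python
from llama_cpp import Llama

# Hypothetical local path to one of the GGUF quantizations (e.g. a ~7GB mid-size file).
llm = Llama(model_path="model-Q4_K_M.gguf", n_ctx=4096)

out = llm("Write the opening line of a short story about a lighthouse keeper.", max_tokens=64)
print(out["choices"][0]["text"])
```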
Brief Details: A lightweight 304M-parameter vision foundation model optimized for image feature extraction, supporting dynamic resolution via 448x448 tiles with multi-tile processing.
Brief-details: DRAGON+ is a BERT-based dense retriever model specialized in feature extraction, trained on augmented MS MARCO data for improved query encoding and information retrieval.
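A dual-encoder retrieval sketch using CLS embeddings and a dot-product score; the repo IDs facebook/dragon-plus-query-encoder and facebook/dragon-plus-context-encoder are assumptions inferred from the description.

```python
from transformers import AutoModel, AutoTokenizer

# Assumed repo IDs for the DRAGON+ query and context encoders.
tokenizer = AutoTokenizer.from_pretrained("facebook/dragon-plus-query-encoder")
query_encoder = AutoModel.from_pretrained("facebook/dragon-plus-query-encoder")
context_encoder = AutoModel.from_pretrained("facebook/dragon-plus-context-encoder")

query = "Where was Marie Curie born?"
passage = "Maria Sklodowska, later known as Marie Curie, was born in Warsaw."

q_emb = query_encoder(**tokenizer(query, return_tensors="pt")).last_hidden_state[:, 0]
p_emb = context_encoder(**tokenizer(passage, return_tensors="pt")).last_hidden_state[:, 0]
print((q_emb @ p_emb.T).item())  # dot-product relevance score
```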
Brief-details: SAM2 small-scale model for segmenting anything in images/videos. Supports promptable visual segmentation with an efficient architecture. Apache 2.0 licensed.
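A point-prompted image segmentation sketch, assuming Meta's sam2 package is installed and that facebook/sam2-hiera-small is the repo ID for the small checkpoint described above.

```python
import numpy as np
import torch
from PIL import Image
from sam2.sam2_image_predictor import SAM2ImagePredictor

# Assumed repo ID for the small SAM2 checkpoint; requires the `sam2` package.
predictor = SAM2ImagePredictor.from_pretrained("facebook/sam2-hiera-small")

image = np.array(Image.open("photo.jpg").convert("RGB"))
with torch.inference_mode():
    predictor.set_image(image)
    # A single (x, y) point prompt labeled as foreground (1).
    masks, scores, _ = predictor.predict(
        point_coords=np.array([[400, 300]]), point_labels=np.array([1])
    )
print(masks.shape, scores)
```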
Brief Details: A 4-bit quantized version of Meta's Llama-3-8B model optimized for function calling, compressed by PrunaAI for improved efficiency and reduced resource usage.
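The exact loading path depends on how the 4-bit weights are serialized; a minimal sketch assuming a transformers-loadable 4-bit checkpoint and a hypothetical repo ID, with a tool description placed directly in the prompt.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical repo ID; assumes the 4-bit weights load directly via from_pretrained.
model_id = "PrunaAI/example-llama-3-8b-function-calling-4bit"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto", torch_dtype=torch.bfloat16)

prompt = "You have access to a get_weather(city) tool. User: What's the weather in Paris?"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=60)[0], skip_special_tokens=True))
```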