Brief Details: Medical Named Entity Recognition model based on DeBERTa-v3, fine-tuned to recognize 41 types of medical entities. 184M params, MIT licensed.
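A minimal usage sketch via the transformers token-classification pipeline; the repo id below is a placeholder, not the actual model path.

```python
from transformers import pipeline

# Placeholder repo id -- substitute the actual DeBERTa-v3 medical NER checkpoint.
ner = pipeline(
    "token-classification",
    model="your-org/medical-ner-deberta-v3",
    aggregation_strategy="simple",  # merge sub-word tokens into entity spans
)

text = "Patient reports severe headache and was prescribed 500 mg ibuprofen."
for entity in ner(text):
    print(entity["entity_group"], entity["word"], round(float(entity["score"]), 3))
```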
Brief-details: An image segmentation model by Facebook using the Mask2Former architecture with a Swin backbone, trained for COCO panoptic segmentation via a masked-attention transformer decoder.
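A panoptic inference sketch with transformers' Mask2Former classes, assuming one of the public facebook/mask2former-swin-*-coco-panoptic checkpoints:

```python
import requests
import torch
from PIL import Image
from transformers import AutoImageProcessor, Mask2FormerForUniversalSegmentation

ckpt = "facebook/mask2former-swin-base-coco-panoptic"  # assumed checkpoint; other Swin sizes exist
processor = AutoImageProcessor.from_pretrained(ckpt)
model = Mask2FormerForUniversalSegmentation.from_pretrained(ckpt)

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)

inputs = processor(images=image, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# Post-process into a panoptic segmentation map plus per-segment metadata.
result = processor.post_process_panoptic_segmentation(
    outputs, target_sizes=[image.size[::-1]]
)[0]
print(result["segmentation"].shape, len(result["segments_info"]))
```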
Brief-details: A 7B parameter instruction-tuned language model by TII, built on Falcon-7B. Apache 2.0 licensed, optimized for chat/instruct tasks with an inference-friendly architecture using FlashAttention and multiquery attention.
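A minimal generation sketch, assuming the entry refers to the tiiuae/falcon-7b-instruct checkpoint:

```python
import torch
from transformers import AutoTokenizer, pipeline

model_id = "tiiuae/falcon-7b-instruct"  # assumed checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)

generator = pipeline(
    "text-generation",
    model=model_id,
    tokenizer=tokenizer,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

prompt = "Explain the difference between a list and a tuple in Python."
out = generator(prompt, max_new_tokens=200, do_sample=True, top_k=10)
print(out[0]["generated_text"])
```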
Brief-details: GTE-base-en-v1.5 is a 137M parameter English text embedding model supporting sequences up to 8192 tokens, with SOTA performance on the MTEB benchmark and in long-context retrieval.
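A short embedding-and-similarity sketch, assuming the Alibaba-NLP/gte-base-en-v1.5 checkpoint and the sentence-transformers API:

```python
from sentence_transformers import SentenceTransformer

# gte-base-en-v1.5 ships custom model code, hence trust_remote_code=True.
model = SentenceTransformer("Alibaba-NLP/gte-base-en-v1.5", trust_remote_code=True)

sentences = [
    "How do I reset my password?",
    "Steps to recover a forgotten account password.",
    "The weather in Oslo is cold in January.",
]
embeddings = model.encode(sentences, normalize_embeddings=True)

# Cosine similarity = dot product of L2-normalized vectors.
print((embeddings @ embeddings.T).round(3))
```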
Brief Details: TimeSformer video classification model fine-tuned on Kinetics-600; uses divided space-time attention for video understanding and processes high-resolution inputs.
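A video-classification sketch with dummy frames, assuming the facebook/timesformer-hr-finetuned-k600 checkpoint (the high-resolution variant takes 16 frames at 448x448):

```python
import numpy as np
import torch
from transformers import AutoImageProcessor, TimesformerForVideoClassification

ckpt = "facebook/timesformer-hr-finetuned-k600"  # assumed checkpoint
processor = AutoImageProcessor.from_pretrained(ckpt)
model = TimesformerForVideoClassification.from_pretrained(ckpt)

# 16 random RGB frames standing in for a decoded video clip.
video = list(np.random.randint(0, 256, (16, 448, 448, 3), dtype=np.uint8))

inputs = processor(video, return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits

print(model.config.id2label[logits.argmax(-1).item()])
```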
Brief-details: CodeBERT MLM variant trained on programming languages, built on the RoBERTa architecture with 200K+ downloads, specialized for code-language tasks and masked token prediction.
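A fill-mask sketch using the microsoft/codebert-base-mlm checkpoint this entry appears to describe:

```python
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="microsoft/codebert-base-mlm")

# Predict the masked token in a code snippet (RoBERTa-style <mask> token).
for prediction in fill_mask("if x is not <mask>:"):
    print(prediction["token_str"], round(prediction["score"], 3))
```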
Brief-details: NVIDIA's 70B parameter Llama-3.1 variant optimized for helpful responses, achieving top scores on Arena Hard (85.0), AlpacaEval 2 LC (57.6), and MT-Bench (8.98).
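A chat-template generation sketch, assuming the nvidia/Llama-3.1-Nemotron-70B-Instruct-HF checkpoint; 70B weights need several high-memory GPUs, which device_map="auto" shards across:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "nvidia/Llama-3.1-Nemotron-70B-Instruct-HF"  # assumed checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

messages = [{"role": "user", "content": "How many r's are in 'strawberry'?"}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=256)
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```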
Brief-details: Google's T5-v1.1-XL model - an advanced text-to-text transformer with GEGLU activations, pre-trained on the C4 corpus only (no supervised task mixing), designed to be fine-tuned for downstream transfer tasks.
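Because the v1.1 checkpoints are pre-trained only on span corruption, a quick sanity check is to feed them sentinel-token inputs; the sketch below assumes the google/t5-v1_1-xl repo:

```python
from transformers import AutoTokenizer, T5ForConditionalGeneration

model_id = "google/t5-v1_1-xl"  # assumed checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = T5ForConditionalGeneration.from_pretrained(model_id)

# Span-corruption style input, matching the pre-training objective.
input_ids = tokenizer(
    "The <extra_id_0> walks in <extra_id_1> park", return_tensors="pt"
).input_ids
output_ids = model.generate(input_ids, max_new_tokens=20)
print(tokenizer.decode(output_ids[0], skip_special_tokens=False))
```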
Brief-details: Advanced text-to-image diffusion model by Playground AI, offering high aesthetic quality at 1024px resolution. Outperforms SDXL and DALL-E 3 in user studies.
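A text-to-image sketch with diffusers, assuming the playgroundai/playground-v2.5-1024px-aesthetic checkpoint and a CUDA GPU:

```python
import torch
from diffusers import DiffusionPipeline

pipe = DiffusionPipeline.from_pretrained(
    "playgroundai/playground-v2.5-1024px-aesthetic",  # assumed checkpoint
    torch_dtype=torch.float16,
    variant="fp16",
).to("cuda")

prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k"
image = pipe(prompt=prompt, num_inference_steps=50, guidance_scale=3).images[0]
image.save("astronaut.png")
```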
Brief Details: ToolACE-8B: State-of-the-art 8B parameter LLM specialized in function calling, based on LLaMA-3.1, achieving GPT-4 level performance on the Berkeley Function-Calling Leaderboard (BFCL).
Brief Details: An English-to-German translation model by Helsinki-NLP, achieving BLEU scores up to 45.2 on news test sets. Built with the Marian architecture as part of the OPUS-MT project.
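A one-call translation sketch, assuming the Helsinki-NLP/opus-mt-en-de checkpoint:

```python
from transformers import pipeline

translator = pipeline("translation", model="Helsinki-NLP/opus-mt-en-de")

result = translator("The weather report predicts rain for tomorrow.")
print(result[0]["translation_text"])  # German rendering of the input sentence
```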
Brief-details: A 4-bit quantized version of Codestral-22B using AWQ (activation-aware weight quantization), optimized for memory-efficient text generation; the listed 3.33B parameter count is an artifact of packed 4-bit weights. High download count (204K+) suggests strong community adoption.
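AWQ checkpoints load through the normal transformers API once the autoawq package is installed; the repo id below is a placeholder for whichever AWQ build of Codestral-22B this entry refers to:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "your-org/Codestral-22B-v0.1-AWQ"  # placeholder repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
# The AWQ quantization config is read from the checkpoint; requires `pip install autoawq`.
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

prompt = "Write a Python function that checks whether a string is a palindrome."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=200)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```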
Brief-details: A specialized CodeBERT model fine-tuned on Java code for 1M steps, designed for code evaluation and masked language modeling tasks.
Brief Details: DialoGPT-medium is Microsoft's state-of-the-art dialogue model trained on 147M Reddit conversations, offering human-like response generation capabilities.
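A multi-turn chat sketch following the pattern on the DialoGPT model card, using the microsoft/DialoGPT-medium checkpoint:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("microsoft/DialoGPT-medium")
model = AutoModelForCausalLM.from_pretrained("microsoft/DialoGPT-medium")

chat_history_ids = None
for user_input in ["Hello, how are you?", "What do you like to do for fun?"]:
    # Append EOS so the model knows the user turn has ended.
    new_ids = tokenizer.encode(user_input + tokenizer.eos_token, return_tensors="pt")
    bot_input_ids = (
        torch.cat([chat_history_ids, new_ids], dim=-1)
        if chat_history_ids is not None
        else new_ids
    )
    chat_history_ids = model.generate(
        bot_input_ids, max_length=1000, pad_token_id=tokenizer.eos_token_id
    )
    reply = tokenizer.decode(
        chat_history_ids[:, bot_input_ids.shape[-1]:][0], skip_special_tokens=True
    )
    print("Bot:", reply)
```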
Brief Details: A sentence embedding model that maps text to 768-dimensional vectors, based on DistilRoBERTa with 82.1M parameters. Optimized for semantic similarity tasks.
Brief Details: Vision Transformer base model with 86.9M params, pre-trained on ImageNet-21k and fine-tuned at 384x384 resolution for image classification.
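A classification sketch, assuming the entry refers to the google/vit-base-patch16-384 checkpoint:

```python
import requests
from PIL import Image
from transformers import pipeline

# Assumed checkpoint fine-tuned at 384x384 resolution.
classifier = pipeline("image-classification", model="google/vit-base-patch16-384")

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)

for prediction in classifier(image, top_k=3):
    print(prediction["label"], round(prediction["score"], 3))
```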
Brief Details: A lightweight sentence embedding model (67M params) that maps text to 768-dim vectors, optimized for semantic similarity tasks using TinyBERT architecture.
Brief-details: A 2.61B parameter Gemma-based model fine-tuned for Chinese language tasks, featuring knowledge retrieval capabilities and DPO training, achieving strong performance on various benchmarks.
Brief Details: A powerful 405B parameter LLM optimized for instruction following, supporting 8 languages and available in GGUF format for efficient local deployment.
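A local-inference sketch with llama-cpp-python; the GGUF filename is hypothetical, and even heavily quantized 405B weights require very large amounts of RAM or multi-GPU offload:

```python
from llama_cpp import Llama

llm = Llama(
    model_path="./Llama-3.1-405B-Instruct-Q4_K_M.gguf",  # hypothetical local file
    n_ctx=8192,
    n_gpu_layers=-1,  # offload as many layers as fit on the GPU
)

response = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Summarize the theory of relativity in two sentences."}]
)
print(response["choices"][0]["message"]["content"])
```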
Brief-details: CrystalClearXL is a popular text-to-image diffusion model with over 214K downloads, built on the StableDiffusionXL pipeline and offering high-quality image generation.
Brief-details: A 1.86B parameter vision-language model combining SigLIP and Phi-1.5 architectures, optimized for visual question answering with competitive performance despite its smaller size.