Brief-details: DUSt3R is a geometric 3D vision model from NAVER with 571M params, pairing a ViT-Large encoder with a ViT-Base decoder for image-to-3D tasks.
Brief-details: Specialized Italian-language variant of Llama-3 (8B params) with strong performance on Italian NLP tasks. Released in BF16 precision with extensive evaluation results.
Brief-details: Multilingual BART model (610M params) fine-tuned for English grammar correction on the FCE dataset. Popular, with 98K+ downloads.
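A minimal usage sketch with the transformers text2text pipeline; the checkpoint name below is hypothetical, since the entry does not identify the exact model.

```python
from transformers import pipeline

# Hypothetical checkpoint name -- substitute the actual grammar-correction model ID.
corrector = pipeline("text2text-generation", model="your-org/mbart-grammar-correction")
print(corrector("She are going to school yesterday.", max_new_tokens=64)[0]["generated_text"])
```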
Brief-details: AWQ-quantized version of Mistral-7B-Instruct-v0.2 at 4-bit precision, offering efficient inference with a 4.15GB footprint and 4096-token context length.
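A minimal inference sketch, assuming the entry refers to TheBloke/Mistral-7B-Instruct-v0.2-AWQ; transformers loads prequantized AWQ checkpoints directly when the autoawq package is installed.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "TheBloke/Mistral-7B-Instruct-v0.2-AWQ"  # assumed checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

prompt = "[INST] Explain AWQ quantization in one sentence. [/INST]"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```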
Brief-details: A fine-tuned DistilBERT model with 67M parameters, achieving 92.95% training accuracy. Optimized for text classification; compatible with both PyTorch and TensorFlow.
Brief-details: OLMo-1B-0724-hf is a 1.28B-parameter open language model trained on the Dolma dataset; improved dataset quality and staged training give it strong performance on language tasks.
Brief-details: Multilingual sentence-embedding model with 278M parameters that maps sentences to 768-dimensional vectors. Built on XLM-RoBERTa; supports semantic search and clustering across languages.
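A minimal sketch, assuming the entry refers to a checkpoint such as sentence-transformers/paraphrase-multilingual-mpnet-base-v2 (XLM-RoBERTa base, 768-dim output):

```python
from sentence_transformers import SentenceTransformer, util

# Assumed checkpoint matching the description (278M params, 768-dim vectors).
model = SentenceTransformer("sentence-transformers/paraphrase-multilingual-mpnet-base-v2")

sentences = ["The cat sits on the mat.", "Le chat est assis sur le tapis."]
embeddings = model.encode(sentences)  # shape: (2, 768)

# Cross-lingual semantic similarity via cosine score.
print(util.cos_sim(embeddings[0], embeddings[1]))
```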
Brief-details: State-of-the-art long-context text-embedding model with 137M parameters, optimized for retrieval tasks and supporting sequences up to 8192 tokens via RPE.
Brief-details: Cutting-edge 894M parameter multimodal LLM optimized for single-image, multi-image, and video tasks, built on Qwen2 with extensive fine-tuning.
Brief-details: Multilingual translation model supporting 100 languages across 9,900 translation directions. Built by Facebook; translates directly between language pairs (no English pivot) using a transformer architecture.
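A minimal direct-translation sketch, assuming the facebook/m2m100_418M checkpoint (the larger 1.2B variant loads the same way):

```python
from transformers import M2M100ForConditionalGeneration, M2M100Tokenizer

model = M2M100ForConditionalGeneration.from_pretrained("facebook/m2m100_418M")
tokenizer = M2M100Tokenizer.from_pretrained("facebook/m2m100_418M")

tokenizer.src_lang = "en"
encoded = tokenizer("Life is like a box of chocolates.", return_tensors="pt")

# Force the target-language token to translate English -> French directly.
generated = model.generate(**encoded, forced_bos_token_id=tokenizer.get_lang_id("fr"))
print(tokenizer.batch_decode(generated, skip_special_tokens=True))
```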
Brief-details: CogVideoX-5b-I2V is a sophisticated image-to-video generation model with 5B parameters, creating 6-second videos at 8 fps from an input image and a text prompt.
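A minimal sketch with diffusers' CogVideoXImageToVideoPipeline, assuming the THUDM/CogVideoX-5b-I2V checkpoint and a CUDA GPU:

```python
import torch
from diffusers import CogVideoXImageToVideoPipeline
from diffusers.utils import load_image, export_to_video

pipe = CogVideoXImageToVideoPipeline.from_pretrained(
    "THUDM/CogVideoX-5b-I2V", torch_dtype=torch.bfloat16
)
pipe.enable_model_cpu_offload()  # helps fit the 5B model on smaller GPUs

image = load_image("scene.jpg")  # placeholder input image
video = pipe(prompt="A slow pan across the scene", image=image, num_frames=49).frames[0]
export_to_video(video, "out.mp4", fps=8)  # 49 frames at 8 fps ≈ 6 seconds
```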
Brief-details: UniEval-fact is a pre-trained evaluator for assessing factual consistency in text generation, with over 100K downloads; introduced at EMNLP 2022.
Brief-details: DeepSeek Coder 6.7B is an advanced code-generation model trained on 2T tokens (87% code, 13% natural language) with a 16K context window and fill-in-the-blank capabilities.
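A minimal code-completion sketch, assuming the deepseek-ai/deepseek-coder-6.7b-base checkpoint:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "deepseek-ai/deepseek-coder-6.7b-base"  # assumed checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

prompt = "# Python function that computes a factorial iteratively\ndef factorial(n):"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```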
Brief-details: An advanced 938M-parameter multimodal LLM combining the InternViT-300M-448px vision encoder with the Qwen2-0.5B-Instruct language model for versatile vision-language tasks.
Brief-details: A lightweight question-answering model based on MobileBERT, optimized for mobile devices with 24.6M parameters. Achieves 75.2 exact-match (EM) on SQuAD v2.0.
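A minimal extractive-QA sketch, assuming the entry refers to a checkpoint such as csarron/mobilebert-uncased-squad-v2:

```python
from transformers import pipeline

# Assumed checkpoint matching the description (MobileBERT fine-tuned on SQuAD v2.0).
qa = pipeline("question-answering", model="csarron/mobilebert-uncased-squad-v2")
result = qa(
    question="What is MobileBERT optimized for?",
    context="MobileBERT is a compact BERT variant designed for resource-limited mobile devices.",
)
print(result["answer"], result["score"])
```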
Brief-details: Moshiko-pytorch-bf16 is a 7.69B-parameter speech-text foundation model optimized for real-time dialogue, featuring BF16 precision and 160ms latency.
Brief-details: Stability AI's image-to-video diffusion model generates 14-frame video clips from still images at 576x1024 resolution. Popular, with 103K+ downloads.
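A minimal sketch with diffusers' StableVideoDiffusionPipeline, assuming the 14-frame stabilityai/stable-video-diffusion-img2vid checkpoint and a CUDA GPU:

```python
import torch
from diffusers import StableVideoDiffusionPipeline
from diffusers.utils import load_image, export_to_video

pipe = StableVideoDiffusionPipeline.from_pretrained(
    "stabilityai/stable-video-diffusion-img2vid",
    torch_dtype=torch.float16,
    variant="fp16",
)
pipe.to("cuda")

image = load_image("input.jpg").resize((1024, 576))  # placeholder conditioning image
frames = pipe(image, decode_chunk_size=4).frames[0]  # 14 frames by default
export_to_video(frames, "clip.mp4", fps=7)
```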
Brief-details: Multilingual T5 model supporting 101 languages, pre-trained on the mC4 corpus. Requires fine-tuning for downstream tasks. Created by Google.
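A minimal fine-tuning step for google/mt5-small, illustrating why the raw checkpoint needs adaptation; it computes a supervised loss on an input/target pair:

```python
from transformers import AutoTokenizer, MT5ForConditionalGeneration

tokenizer = AutoTokenizer.from_pretrained("google/mt5-small")
model = MT5ForConditionalGeneration.from_pretrained("google/mt5-small")

# One supervised step: raw mT5 only emits sentinel tokens until fine-tuned.
inputs = tokenizer("translate English to German: The house is wonderful.", return_tensors="pt")
labels = tokenizer(text_target="Das Haus ist wunderbar.", return_tensors="pt").input_ids
loss = model(**inputs, labels=labels).loss
loss.backward()
```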
Brief-details: High-performance photorealistic image-generation model optimized for portrait and full-body shots, with enhanced resolution support up to 896x896px.
Brief-details: In-Context-LoRA enables customizable image-set generation with defined relationships, supporting 10 specialized tasks such as visual effects, design templates, and storyboarding, using FLUX as its base model.
Brief-details: Dense Prediction Transformer for monocular depth estimation: 111M params, MIT license, uses a BEiT backbone to predict depth from a single image.
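A minimal depth-map sketch with the transformers depth-estimation pipeline; the checkpoint is an assumption (Intel/dpt-beit-base-384 matches the described BEiT backbone):

```python
from PIL import Image
from transformers import pipeline

# Assumed checkpoint -- a DPT variant with a BEiT backbone.
depth = pipeline("depth-estimation", model="Intel/dpt-beit-base-384")
result = depth(Image.open("photo.jpg"))  # placeholder input photo
result["depth"].save("depth_map.png")  # PIL image of the predicted depth map
```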