Brief-details: Swin Transformer base model with 87.8M parameters for image classification, using hierarchical vision transformer architecture with shifted windows for efficient processing.
Brief-details: A 1.7B parameter text-to-video diffusion model that generates videos from English text descriptions, utilizing a UNet3D architecture and multi-stage generation process.
Brief-details: InLegalBERT - A specialized BERT model pre-trained on 5.4M Indian legal documents, featuring 110M parameters and optimized for legal domain tasks.
Brief-details: ONNX-converted RoBERTa model for toxicity detection with bias mitigation, supporting multiple classification tasks and identity-aware analysis.
Brief-details: Highly efficient text-to-image model using cascade architecture with 42x compression factor, offering faster inference and cheaper training than traditional models like Stable Diffusion.
Brief-details: Vision Transformer model trained with the DINO self-supervised method, featuring 85.8M parameters and an 8x8 patch resolution. Apache-2.0 licensed.
Brief-details: MiniCPM3-4B is a powerful 4B parameter bilingual LLM that outperforms GPT-3.5-Turbo-0125 on several benchmarks, featuring a 32k context window and function-calling capabilities.
Brief-details: LLaVA-OneVision 0.5B multimodal model based on Qwen2, supporting English and Chinese image/video interaction with 894M total parameters. Shows strong performance on visual tasks.
Brief-details: SAM2's tiny variant for image/video segmentation - offers efficient mask generation with a lightweight architecture. Apache 2.0 licensed, 29K+ downloads.
Brief-details: An 8B parameter Japanese-enhanced LLaMA 3.1 model, fine-tuned for instruction following with improved bilingual capabilities and strong performance on Japanese NLP tasks.
Brief-details: A powerful ColBERT reranking model with 335M parameters, built on mixedbread-ai's embedding architecture, optimized for search and retrieval tasks.
Brief-details: A lightweight DeBERTa-v3 model fine-tuned for Natural Language Inference tasks, achieving 91.64% accuracy on SNLI with cross-encoder architecture.
Brief-details: Microsoft's 3.8B parameter multilingual model optimized for 4-bit inference, supporting a 128K context window with strong reasoning capabilities.
Brief-details: A 7.47B parameter multimodal model capable of understanding both images and videos through unified visual representations, offering high-performance visual reasoning capabilities.
Brief-details: Qwen2.5-32B-Instruct-GGUF is a high-performance 32.8B parameter language model with multiple quantization options for efficient deployment, optimized for chat applications.
Brief-details: A ControlNet model for SDXL that uses Canny edge detection for precise control over image generation. 30K+ downloads, OpenRAIL++ license.
Brief-details: Llama-3.2-3B is Meta's latest 3.21B parameter multilingual LLM, optimized for dialogue and supporting 8+ languages with enhanced efficiency and reduced memory usage.
Brief-details: A 1.1B parameter language model trained on 41B tokens, featuring flash attention and enhanced MLP layers. Optimized for text generation and edge deployment.
Brief-details: A specialized ControlNet model for depth-aware image generation, built on Flux.1-dev. Enables precise depth map-guided image creation with 3.3K+ downloads.
Brief-details: CLIP ViT-B/16 model trained on the DataComp-1B dataset, achieving 73.5% ImageNet accuracy. Specialized for zero-shot image classification and retrieval tasks.
Brief-details: A Korean-English instruction-tuned 9B parameter LLM based on Gemma 2, optimized for conversational AI and detailed explanations, with strong performance on Korean language tasks.
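The entries above quote parameter counts in the usual shorthand (87.8M, 1.7B, 32.8B). If you want to compare or sort entries programmatically, a small helper can normalize those strings to integers — a minimal sketch, where the function name and suffix table are my own assumptions, not part of any entry above:

```python
def parse_param_count(text: str) -> int:
    """Convert a shorthand parameter count like '87.8M' or '1.7B' to an integer."""
    multipliers = {"K": 1_000, "M": 1_000_000, "B": 1_000_000_000}
    text = text.strip()
    suffix = text[-1].upper()
    if suffix in multipliers:
        # Round before truncating so e.g. 87.8 * 1e6 doesn't lose a unit to float error.
        return int(round(float(text[:-1]) * multipliers[suffix]))
    return int(round(float(text)))  # plain number with no suffix

# Examples drawn from the entries above:
print(parse_param_count("87.8M"))  # 87800000
print(parse_param_count("1.7B"))   # 1700000000
```

This makes it straightforward to, say, sort the catalog by model size or filter to models under a given parameter budget.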