Brief-details: SegFormer B1 model fine-tuned for urban scene segmentation on Cityscapes dataset. Features hierarchical Transformer encoder and MLP decode head for efficient semantic segmentation.
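A minimal usage sketch with standard `transformers` classes; the repo id below is an assumption and should be replaced with the actual checkpoint name:

```python
# Semantic segmentation with a SegFormer-B1 Cityscapes checkpoint (repo id assumed).
from PIL import Image
from transformers import SegformerImageProcessor, SegformerForSemanticSegmentation

model_id = "nvidia/segformer-b1-finetuned-cityscapes-1024-1024"  # assumed repo id
processor = SegformerImageProcessor.from_pretrained(model_id)
model = SegformerForSemanticSegmentation.from_pretrained(model_id)

image = Image.open("street_scene.jpg").convert("RGB")
inputs = processor(images=image, return_tensors="pt")
logits = model(**inputs).logits        # (1, num_classes, H/4, W/4)
pred = logits.argmax(dim=1)            # per-pixel class ids
```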
Brief-details: CodeLlama-7B-Instruct-GPTQ is a 7B parameter GPTQ-quantized code generation model optimized for instruction-following and coding tasks, offering multiple quantization options for efficient deployment.
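A minimal sketch of loading the GPTQ checkpoint with `transformers` (requires `optimum` plus a GPTQ backend such as `auto-gptq`); the repo id and prompt format are assumptions based on the usual CodeLlama-Instruct conventions:

```python
# Instruction-following code generation with a GPTQ-quantized CodeLlama checkpoint.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "TheBloke/CodeLlama-7B-Instruct-GPTQ"  # assumed repo id
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

prompt = "[INST] Write a Python function that reverses a string. [/INST]"
inputs = tok(prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=128)
print(tok.decode(out[0], skip_special_tokens=True))
```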
Brief Details: Advanced 12B parameter bilingual LLM optimized for Russian/English, featuring RAG capabilities and 128k context, built on the Mistral-Nemo architecture.
Brief Details: XCiT (Cross-Covariance Image Transformer) image classification model with 12.1M parameters, optimized for 384x384 images with distillation training on ImageNet-1k.
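A minimal classification sketch through `timm`; the model name below is an assumption (a ~12M-parameter tiny XCiT variant at 384px) and should be swapped for the actual checkpoint:

```python
# ImageNet-1k classification with a distilled XCiT checkpoint via timm (model name assumed).
import timm
import torch
from PIL import Image

model = timm.create_model("xcit_tiny_24_p16_384.fb_dist_in1k", pretrained=True)  # assumed name
model.eval()

cfg = timm.data.resolve_model_data_config(model)
transform = timm.data.create_transform(**cfg, is_training=False)

img = Image.open("cat.jpg").convert("RGB")
with torch.no_grad():
    probs = model(transform(img).unsqueeze(0)).softmax(dim=-1)
print(probs.topk(5))
```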
Brief-details: ConvNeXt-Base CLIP model trained on the LAION Aesthetic dataset, optimized for 320x320 resolution with additional augmentation and regularization (augreg), achieving 71.3% ImageNet zero-shot accuracy.
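A minimal zero-shot classification sketch with `open_clip`; the hub id is a guess at the LAION augreg checkpoint and should be verified:

```python
# Zero-shot image classification with a ConvNeXt-Base CLIP checkpoint (repo id assumed).
import torch
import open_clip
from PIL import Image

repo = "hf-hub:laion/CLIP-convnext_base_w_320-laion_aesthetic-s13B-b82K-augreg"  # assumed id
model, _, preprocess = open_clip.create_model_and_transforms(repo)
tokenizer = open_clip.get_tokenizer(repo)

image = preprocess(Image.open("photo.jpg")).unsqueeze(0)
text = tokenizer(["a photo of a cat", "a photo of a dog"])
with torch.no_grad():
    img_feat = model.encode_image(image)
    txt_feat = model.encode_text(text)
    img_feat /= img_feat.norm(dim=-1, keepdim=True)
    txt_feat /= txt_feat.norm(dim=-1, keepdim=True)
    probs = (100.0 * img_feat @ txt_feat.T).softmax(dim=-1)
```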
Brief-details: Theia is a vision foundation model for robotics that distills knowledge from multiple vision models, offering 188M parameters with F32 precision and superior performance in robot learning tasks.
Brief Details: A powerful multilingual reranker supporting 75 languages with 306M parameters, featuring SOTA performance and 8192 token context length. Optimized for fast inference.
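A minimal reranking sketch with sentence-transformers' `CrossEncoder`; `"org/multilingual-reranker"` is a placeholder for the actual 306M-parameter checkpoint:

```python
# Query-passage reranking with a multilingual cross-encoder (repo id is a placeholder).
from sentence_transformers import CrossEncoder

reranker = CrossEncoder("org/multilingual-reranker", max_length=8192)  # placeholder id

query = "What is the capital of France?"
passages = [
    "Paris is the capital and largest city of France.",
    "Berlin is the capital of Germany.",
]
scores = reranker.predict([(query, p) for p in passages])
ranked = sorted(zip(passages, scores), key=lambda x: x[1], reverse=True)
```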
Brief Details: Korean OCR model with 54.5M parameters, trained on 6M synthetic images. Uses vision-encoder-decoder architecture with DeiT and RoBERTa weights.
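A minimal OCR sketch under the assumption that the checkpoint exposes a standard `VisionEncoderDecoderModel` plus a TrOCR-style processor; `"org/ko-trocr"` is a placeholder repo id and the processor class may differ for the real model:

```python
# Korean OCR with a DeiT-encoder / RoBERTa-decoder checkpoint (repo id is a placeholder).
from PIL import Image
from transformers import TrOCRProcessor, VisionEncoderDecoderModel

model_id = "org/ko-trocr"  # placeholder
processor = TrOCRProcessor.from_pretrained(model_id)
model = VisionEncoderDecoderModel.from_pretrained(model_id)

pixel_values = processor(images=Image.open("receipt.png").convert("RGB"),
                         return_tensors="pt").pixel_values
ids = model.generate(pixel_values)
print(processor.batch_decode(ids, skip_special_tokens=True)[0])
```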
Brief Details: VILA1.5-8B is an advanced visual language model built on Llama 3, supporting multi-image reasoning and text generation with 8B parameters.
BRIEF DETAILS: DRAGON+ is a BERT-based dense retriever for efficient text search, featuring dual encoders and strong performance on MS MARCO Dev (39.0) and BEIR (47.4) benchmarks.
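A minimal dual-encoder retrieval sketch; the query/context encoder repo ids are assumed and CLS pooling is used for the embeddings:

```python
# Dense retrieval scoring with DRAGON+'s query and context encoders (repo ids assumed).
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("facebook/dragon-plus-query-encoder")
query_encoder = AutoModel.from_pretrained("facebook/dragon-plus-query-encoder")
context_encoder = AutoModel.from_pretrained("facebook/dragon-plus-context-encoder")

query = "where was marie curie born?"
contexts = ["Maria Sklodowska was born in Warsaw.", "The Eiffel Tower is in Paris."]

q_emb = query_encoder(**tokenizer(query, return_tensors="pt")).last_hidden_state[:, 0, :]
ctx_inputs = tokenizer(contexts, padding=True, truncation=True, return_tensors="pt")
c_emb = context_encoder(**ctx_inputs).last_hidden_state[:, 0, :]
scores = q_emb @ c_emb.T  # dot-product relevance scores
```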
Brief-details: Marigold is a state-of-the-art monocular depth estimation model that repurposes Stable Diffusion for zero-shot depth prediction from single images.
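A minimal depth-estimation sketch assuming a recent `diffusers` release that ships `MarigoldDepthPipeline`; the repo id is an assumption:

```python
# Zero-shot monocular depth estimation with Marigold via diffusers (repo id assumed).
import torch
import diffusers
from PIL import Image

pipe = diffusers.MarigoldDepthPipeline.from_pretrained(
    "prs-eth/marigold-depth-v1-0",  # assumed repo id
    torch_dtype=torch.float16,
).to("cuda")

depth = pipe(Image.open("room.jpg"))
vis = pipe.image_processor.visualize_depth(depth.prediction)  # colorized depth map
vis[0].save("room_depth.png")
```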
Brief Details: A powerful multilingual Mixtral-8x7B instruction-tuned model optimized for German language use, featuring DPO alignment and 46.7B parameters.
BRIEF DETAILS: A Czech poetry generation model built on GPT-2, specialized in various rhyme schemes (ABBA, ABAB, AABB, AABCCB) from different time periods. 34.9K downloads.
Brief-details: Shap-E is OpenAI's innovative text-to-3D diffusion model that generates textured meshes and neural radiance fields from text prompts, offering fast 3D asset generation.
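A minimal text-to-3D sketch using diffusers' `ShapEPipeline`; the `openai/shap-e` repo id and generation parameters are assumptions:

```python
# Text-to-3D generation with Shap-E, exported as a turntable GIF (repo id assumed).
import torch
from diffusers import ShapEPipeline
from diffusers.utils import export_to_gif

pipe = ShapEPipeline.from_pretrained("openai/shap-e", torch_dtype=torch.float16).to("cuda")

images = pipe("a red chair", guidance_scale=15.0,
              num_inference_steps=64, frame_size=256).images
export_to_gif(images[0], "chair.gif")  # rendered views of the generated 3D asset
```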
Brief Details: Korean-optimized 8B parameter LLaMA-3 instruction model, fine-tuned on 60GB+ text data with enhanced Korean language capabilities and chat functionality.
Brief Details: MetricX-23 QE-XL - Advanced reference-free translation quality evaluation model, part of Google's WMT'23 metrics submission.
BRIEF DETAILS: Arabic sentiment analysis BERT model fine-tuned on ASTD, ArSAS, and SemEval datasets. Specializes in MSA, dialectal, and classical Arabic text classification.
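A minimal classification sketch with the `transformers` pipeline; `"org/arabic-sentiment-bert"` is a placeholder for the fine-tuned checkpoint:

```python
# Arabic sentiment classification via the text-classification pipeline (repo id is a placeholder).
from transformers import pipeline

classifier = pipeline("text-classification", model="org/arabic-sentiment-bert")  # placeholder id
print(classifier("هذا المنتج رائع جدا"))  # e.g. [{'label': 'positive', 'score': ...}]
```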
Brief Details: SmolLM2-360M-Instruct is a compact 362M parameter language model optimized for instruction following, trained on 4T tokens with improved reasoning capabilities.
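A minimal chat-generation sketch using the model's chat template; the `HuggingFaceTB/SmolLM2-360M-Instruct` repo id is an assumption:

```python
# Instruction-following generation with SmolLM2-360M-Instruct (repo id assumed).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "HuggingFaceTB/SmolLM2-360M-Instruct"  # assumed repo id
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)

messages = [{"role": "user", "content": "Explain gravity in one sentence."}]
inputs = tok.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt")
out = model.generate(inputs, max_new_tokens=64)
print(tok.decode(out[0][inputs.shape[-1]:], skip_special_tokens=True))
```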
Brief Details: A powerful 33B parameter code generation model trained on 2T tokens (87% code, 13% natural language), offering state-of-the-art performance across multiple programming languages.
Brief-details: A 24.9B parameter MoE model available in multiple GGUF quantizations, optimized for English conversation, with file sizes ranging from 9.4GB to 26.6GB.
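A minimal local-inference sketch with `llama-cpp-python`; the GGUF file name is a placeholder, and the quantization (9.4GB-26.6GB) should be chosen to fit the available hardware:

```python
# Chat completion against a local GGUF quantization via llama-cpp-python (file path is a placeholder).
from llama_cpp import Llama

llm = Llama(model_path="model-Q4_K_M.gguf", n_ctx=4096, n_gpu_layers=-1)  # placeholder path

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Summarize the plot of Hamlet in two sentences."}],
    max_tokens=128,
)
print(out["choices"][0]["message"]["content"])
```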
Brief-details: A powerful 435M parameter reranking model optimized for search relevance, achieving 48.8 NDCG@10 on BEIR benchmarks with a cross-encoder architecture.