Brief Details: Microsoft's LayoutLMv2 multimodal model for document understanding, combining text, layout & image analysis with SOTA results on FUNSD, CORD & DocVQA tasks.
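A minimal inference sketch, assuming the transformers LayoutLMv2 classes and the base microsoft/layoutlmv2-base-uncased checkpoint (a FUNSD-fine-tuned head would be needed for meaningful labels; the processor additionally requires detectron2 and Tesseract installed):

```python
from PIL import Image
from transformers import LayoutLMv2Processor, LayoutLMv2ForTokenClassification

# Base checkpoint; swap in a FUNSD-fine-tuned head for real predictions.
model_id = "microsoft/layoutlmv2-base-uncased"
processor = LayoutLMv2Processor.from_pretrained(model_id)
model = LayoutLMv2ForTokenClassification.from_pretrained(model_id)

image = Image.open("form.png").convert("RGB")
encoding = processor(image, return_tensors="pt")  # runs OCR + layout encoding internally
outputs = model(**encoding)
print(outputs.logits.shape)                       # [1, seq_len, num_labels]
```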
Brief Details: A bi-encoder model that converts document screenshots into dense vectors for retrieval, supporting multiple languages and achieving 85.8 nDCG@5 on ViDoRE.
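The retrieval pattern behind such a bi-encoder can be sketched model-agnostically; everything below (vector dimensions, the ranking helper) is illustrative rather than this model's actual API:

```python
import numpy as np

# Illustrative bi-encoder retrieval: queries and document screenshots are
# embedded separately, then ranked by cosine similarity of their dense vectors.
def rank_screenshots(query_vec: np.ndarray, doc_vecs: np.ndarray) -> np.ndarray:
    q = query_vec / np.linalg.norm(query_vec)
    d = doc_vecs / np.linalg.norm(doc_vecs, axis=1, keepdims=True)
    return np.argsort(-(d @ q))  # indices of best-matching screenshots first

docs = np.random.rand(3, 768).astype(np.float32)   # pretend screenshot embeddings
query = np.random.rand(768).astype(np.float32)     # pretend query embedding
print(rank_screenshots(query, docs))               # e.g. [2 0 1]
```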
Brief Details: 4-bit quantized version of the Mistral-7B-Instruct-v0.3 LLM. Retains the base model's ~7.25B params in compressed form, Apache 2.0 license, optimized for text generation & conversation.
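A hedged loading sketch: the quantized repo id isn't given above, so this quantizes the official base checkpoint on the fly with bitsandbytes NF4, one common way such 4-bit variants are produced:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

bnb = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",               # normal-float 4-bit
    bnb_4bit_compute_dtype=torch.bfloat16,   # compute in bf16 for quality
)
model_id = "mistralai/Mistral-7B-Instruct-v0.3"  # base repo; the quantized repo id may differ
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, quantization_config=bnb, device_map="auto")

inputs = tok("Explain 4-bit quantization in one sentence.", return_tensors="pt").to(model.device)
print(tok.decode(model.generate(**inputs, max_new_tokens=60)[0], skip_special_tokens=True))
```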
Brief Details: A Chinese antique-style text-to-image model specializing in 2.5D game character generation with improved scene elements and male characters. Features enhanced face/hand quality and 1024px output.
Brief Details: Dreamshaper XL v2 Turbo is a specialized SDXL-based text-to-image model optimized for fast inference with high-quality artistic outputs.
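A short diffusers sketch of turbo-style inference; the repo id Lykon/dreamshaper-xl-v2-turbo is an assumption inferred from the model name, and the step/guidance settings are typical turbo values rather than documented ones:

```python
import torch
from diffusers import AutoPipelineForText2Image

pipe = AutoPipelineForText2Image.from_pretrained(
    "Lykon/dreamshaper-xl-v2-turbo",   # assumed repo id
    torch_dtype=torch.float16,
).to("cuda")

image = pipe(
    "a watercolor lighthouse at dawn",
    num_inference_steps=6,   # turbo models need far fewer steps than vanilla SDXL
    guidance_scale=2.0,      # turbo variants typically run with low CFG
).images[0]
image.save("lighthouse.png")
```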
Brief Details: An 8B parameter LLaMA-3 based instruction-tuned model optimized for diverse tasks including RAG, summarization, and function calling. Fine-tuned on 41 datasets.
Brief Details: Qwen2.5's 72B parameter instruction-tuned model quantized to 4-bit precision. Features 128K context length, multilingual support, and enhanced capabilities in coding and mathematics.
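A chat-style usage sketch with the standard transformers chat-template API; the GPTQ-Int4 repo id is an assumption (Qwen publishes several quantized variants alongside the full-precision weights):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen2.5-72B-Instruct-GPTQ-Int4"  # assumed 4-bit variant id
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

messages = [{"role": "user", "content": "Write a Python one-liner that reverses a string."}]
prompt = tok.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tok(prompt, return_tensors="pt").to(model.device)
print(tok.decode(model.generate(**inputs, max_new_tokens=64)[0], skip_special_tokens=True))
```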
Brief Details: An 8B parameter LLaMA3-based roleplay model specialized in handling darker themes and complex emotional scenarios, merging 20+ models with unique weighting schemes.
Brief Details: SPO-SDXL is a fine-tuned SDXL model using Step-aware Preference Optimization, trained on 4k prompts for 10 epochs to better align generated images with complex prompts.
Brief Details: A fine-tuned DistilBERT model for IMDB sentiment analysis, achieving 92.8% accuracy. Built on distilbert-base-uncased with Apache 2.0 license.
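A minimal sketch with the transformers pipeline API; the model id below is a placeholder for the fine-tuned checkpoint described above:

```python
from transformers import pipeline

# Placeholder repo id -- substitute the actual fine-tuned checkpoint.
clf = pipeline("text-classification", model="your-org/distilbert-imdb-finetuned")
print(clf("A beautifully shot film with a script that falls flat."))
# e.g. [{'label': 'NEGATIVE', 'score': 0.97}]
```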
Brief Details: CRNN-based Persian license plate OCR model, fine-tuned on a specialized dataset. Processes cropped plate images with high accuracy; 20K+ downloads.
Brief Details: VGG19 ImageNet model with 144M params, trained for classification tasks. Features torchvision weights and BSD-3-Clause license. Widely used for feature extraction.
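Feature extraction with torchvision's pretrained VGG19 looks roughly like this (the random tensor stands in for a real image):

```python
import torch
from torchvision import models

weights = models.VGG19_Weights.IMAGENET1K_V1
vgg = models.vgg19(weights=weights).eval()

preprocess = weights.transforms()                      # matching resize/crop/normalize
x = preprocess(torch.rand(3, 256, 256)).unsqueeze(0)   # stand-in for a real image
with torch.no_grad():
    feats = vgg.features(x)   # conv feature maps for downstream use
    logits = vgg(x)           # full 1000-class ImageNet logits
print(feats.shape, logits.shape)  # [1, 512, 7, 7] and [1, 1000]
```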
Brief Details: ViT model pretrained on LAION-2B, fine-tuned on ImageNet-12k/1k. 88.3M params, 448x448 input size, ideal for image classification & embeddings.
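A timm sketch for pulling embeddings; the model name below is an assumption matching the description (ViT-B/16, CLIP pretraining on LAION-2B, in12k→in1k fine-tune, 448px) and may differ from the actual registry name:

```python
import timm, torch

# Assumed registry name -- check timm.list_models("*clip*") for the real one.
model = timm.create_model(
    "vit_base_patch16_clip_448.laion2b_ft_in12k_in1k",
    pretrained=True,
    num_classes=0,   # drop the classifier head to get pooled embeddings
).eval()

x = torch.rand(1, 3, 448, 448)   # stand-in for a preprocessed image batch
with torch.no_grad():
    emb = model(x)               # pooled embedding, e.g. [1, 768] for ViT-B
print(emb.shape)
```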
Brief Details: A 4-bit quantized version of Google's Gemma 2B model optimized by Unsloth, offering 2.4x faster fine-tuning with 58% less memory usage.
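A loading sketch following Unsloth's documented pattern; the repo id and LoRA settings are assumptions for illustration:

```python
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/gemma-2b-bnb-4bit",  # assumed repo id for this variant
    max_seq_length=2048,
    load_in_4bit=True,
)
# Attach LoRA adapters before fine-tuning (rank/targets are illustrative).
model = FastLanguageModel.get_peft_model(
    model, r=16, target_modules=["q_proj", "k_proj", "v_proj", "o_proj"]
)
```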
Brief Details: Arabic sentiment analysis BERT model fine-tuned on dialectal Arabic, classifying text as positive/negative with high accuracy.
Brief Details: CLIP Vision Transformer model trained on DataComp-1B dataset, achieving 72.7% ImageNet accuracy. Optimized for zero-shot classification and retrieval tasks.
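A zero-shot classification sketch with open_clip; the pretrained tag for the DataComp-1B checkpoint is an assumption:

```python
import torch, open_clip
from PIL import Image

model, _, preprocess = open_clip.create_model_and_transforms(
    "ViT-B-32", pretrained="datacomp_xl_s13b_b90k"  # assumed DataComp-1B tag
)
tokenizer = open_clip.get_tokenizer("ViT-B-32")

image = preprocess(Image.open("photo.jpg")).unsqueeze(0)
labels = ["a photo of a cat", "a photo of a dog"]
text = tokenizer(labels)

with torch.no_grad():
    img_f = model.encode_image(image)
    txt_f = model.encode_text(text)
    img_f = img_f / img_f.norm(dim=-1, keepdim=True)
    txt_f = txt_f / txt_f.norm(dim=-1, keepdim=True)
    probs = (100.0 * img_f @ txt_f.T).softmax(dim=-1)
print(dict(zip(labels, probs[0].tolist())))
```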
Brief Details: LLM4Decompile 6.7B parameter model specialized in converting x86 assembly to C code, trained on 15B tokens with a 4,096-token context window.
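A generation sketch; both the repo id and the prompt wrapper are assumptions modeled on the project's published usage, not verified here:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "LLM4Binary/llm4decompile-6.7b-v1.5"  # assumed repo id
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

asm = open("func.s").read()  # disassembled x86 for one function
prompt = "# This is the assembly code:\n" + asm + "\n# What is the source code?\n"
inputs = tok(prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=512)
print(tok.decode(out[0][inputs.input_ids.shape[1]:], skip_special_tokens=True))
```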
Brief Details: ParsBERT NER model for Persian language understanding, supporting token classification for named entities with high F1 scores (95%+) on ARMAN/PEYMA datasets.
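A token-classification sketch via the transformers pipeline; the ParsBERT NER repo id is an assumption:

```python
from transformers import pipeline

ner = pipeline(
    "token-classification",
    model="HooshvareLab/bert-base-parsbert-ner-uncased",  # assumed repo id
    aggregation_strategy="simple",  # merge sub-tokens into whole entities
)

# "Google is located in California"
for ent in ner("شرکت گوگل در کالیفرنیا واقع شده است"):
    print(ent["entity_group"], ent["word"], round(float(ent["score"]), 3))
```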
Brief Details: Anole-7b is a 7B parameter multimodal model capable of interleaved image-text generation and understanding, built on the Chameleon architecture.
Brief Details: Russian BERT-based toxicity classifier with 178M params, trained on merged datasets from 2ch.hk and ok.ru, achieving 97% accuracy for toxic comment detection.
Brief Details: An 8B parameter LLM based on the Llama architecture, fine-tuned with DPO. Achieves 69.1% on MMLU, optimized for uncensored text generation and role-playing tasks.