Models

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

tiny-random-marian

Brief Details: A lightweight, randomized version of the Marian neural machine translation model, designed for experimental and educational purposes.

datalab-to

surya_layout

Brief Details: Surya Layout is a specialized AI model focused on document layout analysis and understanding, developed by datalab-to for efficient document structure processing.

neuralmagic

Mistral-Small-24B-Instruct-2501-FP8-Dynamic

BRIEF-DETAILS: 24B parameter Mistral model optimized with FP8 quantization, achieving 99.28% accuracy recovery while reducing model size by 50%

tezuesh

diarization

Brief-details: A speech diarization model by tezuesh for speaker identification and segmentation in audio conversations, hosted on HuggingFace.

MiniMaxAI

MiniMax-VL-01

Brief-details: A multimodal vision-language model combining ViT (303M params) with MiniMax-Text-01, featuring dynamic resolution and trained on 512B tokens

NexaAIDev

OmniAudio-2.6B

Brief-details: OmniAudio-2.6B is a fast, efficient audio-language model combining Gemma-2-2b and Whisper turbo for on-device text/audio processing at 66 tokens/sec.

CohereForAI

c4ai-command-r-plus-4bit

Brief Details: c4ai-command-r-plus-4bit is a 4-bit quantized language model from CohereForAI, optimized for command-based interactions and efficient deployment with reduced memory footprint.

deepseek-ai

DeepSeek-V2.5-1210

Brief-details: DeepSeek-V2.5-1210 is an enhanced language model with improved mathematical (82.8% MATH-500) and coding capabilities (34.38% LiveCodebench), featuring BF16 inference support and comprehensive function calling.

latentcat

latentcat-controlnet

BRIEF-DETAILS: ControlNet model specializing in brightness and illumination control for Stable Diffusion, offering precise lighting adjustments with recommended weights of 0.4-0.9

prithivMLmods

SD3.5-Turbo-Realism-2.0-LoRA

Brief-details: A specialized LoRA model for SD3.5-Turbo focusing on photorealistic image generation, featuring 64 network dimensions and trained on 27 curated images over 13 epochs.

PygmalionAI

Pygmalion-3-12B

Brief Details: A 12B parameter roleplaying AI model built on Mistral's Nemo base, fine-tuned with hundreds of millions of tokens for creative conversation and character interaction.

leafspark

Llama-3.2-11B-Vision-Instruct-GGUF

BRIEF-DETAILS: Llama 3.2 Vision (11B params) - Advanced multimodal LLM optimized for visual recognition, image reasoning, and captioning tasks

qt8833

ai-comic-factory

Brief-details: AI Comic Factory is a specialized model by qt8833 focused on comic-style image generation, hosted on HuggingSpace for creative digital artwork and illustration purposes.

Comfy-Org

stable-diffusion-3.5-fp8

Brief-details: Optimized SD3.5 checkpoint with integrated CLIP/text encoders, featuring FP8 precision for efficient deployment in ComfyUI workflows

Goekdeniz-Guelmez

Josiefied-Qwen2.5-14B-Instruct-abliterated-v4

Brief-details: Abliterated version of Qwen2.5-14B focused on unrestricted responses, featuring 14B parameters and 32K context length with YaRN scaling support

juliozhao

DocLayout-YOLO-DocStructBench

Brief Details: DocLayout-YOLO is a specialized YOLO-based model for document layout analysis, trained on DocStructBench dataset for accurate structure detection.

zetasepic

Qwen2.5-32B-Instruct-abliterated-v2

BRIEF-DETAILS: Modified version of Qwen2.5-32B-Instruct using abliteration technique to reduce safety filters while maintaining core capabilities. 32B parameters.

anthracite-org

magnum-v4-12b-gguf

Brief-details: Fine-tuned 12B parameter model based on Mistral-Nemo-Instruct-2407, designed to replicate Claude 3's prose quality, with 32k context window and GGUF quantization support.

mobiuslabsgmbh

faster-whisper-large-v3-turbo

Brief-details: Optimized Whisper large-v3 conversion for CTranslate2, offering faster speech recognition with FP16 precision and seamless integration with faster-whisper framework

paris-noah

Mantis-8M

Brief-details: Lightweight foundation model for time series classification with 8M parameters. Features easy fine-tuning, dimension reduction adapters, and scikit-learn compatibility.

UsefulSensors

moonshine

Brief-details: Moonshine is a lightweight ASR model for real-time speech recognition, featuring tiny (27M) and base (61M) variants optimized for resource-constrained platforms