Brief-details: Quantized versions of Cohere's command-a-03-2025 model, offering multiple compression levels from 118GB down to 26GB with varying quality-size tradeoffs. Retains the base model's multilingual capabilities and runs on llama.cpp.
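Running one of these quants locally is straightforward with llama-cpp-python; a minimal sketch, assuming you have already downloaded a quant file (the filename below is illustrative):

```python
# Minimal sketch: loading a GGUF quant with llama-cpp-python.
# The local filename is illustrative; substitute whichever quant
# fits your RAM (e.g. a Q4_K_M file).
from llama_cpp import Llama

llm = Llama(
    model_path="command-a-03-2025-Q4_K_M.gguf",  # hypothetical local file
    n_ctx=8192,        # context window to allocate
    n_gpu_layers=-1,   # offload all layers to GPU if available
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Summarize GGUF quantization in one sentence."}]
)
print(out["choices"][0]["message"]["content"])
```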
Brief-details: Gemma 3B Instruct GGUF - Google DeepMind's lightweight, state-of-the-art text model with multiple quantization options for various hardware setups. 32K input context.
Brief-details: Comprehensive GGUF quantization suite for the QwQ-32B-Snowdrop model, offering 27 variants from 65GB down to 9GB, optimized for different RAM/performance trade-offs.
Brief-details: GGUF quantized versions of OlympicCoder-7B, ranging from 2.78GB to 15.24GB with varying quality-size tradeoffs for different hardware setups.
Brief-details: A specialized LoRA for Wan2.1 14B I2V that creates realistic hydraulic-press crushing animations from static images, trained for 20 epochs on crushing footage.
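Applying an image-to-video LoRA like this typically goes through diffusers; a minimal sketch, assuming a recent diffusers build with Wan 2.1 support (the base-model repo id and LoRA filename are illustrative):

```python
# Minimal sketch: Wan2.1 I2V with a LoRA applied. Repo id and LoRA
# path are illustrative; substitute the actual checkpoints.
import torch
from diffusers import WanImageToVideoPipeline
from diffusers.utils import export_to_video, load_image

pipe = WanImageToVideoPipeline.from_pretrained(
    "Wan-AI/Wan2.1-I2V-14B-480P-Diffusers",  # illustrative repo id
    torch_dtype=torch.bfloat16,
)
pipe.load_lora_weights("path/to/crush_lora.safetensors")  # hypothetical LoRA file
pipe.to("cuda")

image = load_image("press_subject.png")  # the static input image
frames = pipe(
    image=image,
    prompt="a hydraulic press slowly crushes the object",
    num_frames=81,
).frames[0]
export_to_video(frames, "crush.mp4", fps=16)
```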
Brief-details: A 32B-parameter coding-specialized LLM available in multiple GGUF quantizations (Q8_0 down to IQ2_XXS), suited to code generation and technical tasks.
Brief-details: Block Diffusion Language Model trained on OpenWebText, bridging autoregressive and diffusion approaches for text generation with 16-token blocks.
Brief-details: EuroBERT-610m is a 610M-parameter multilingual encoder supporting 15 languages and sequences of up to 8,192 tokens, suitable for a range of NLP tasks.
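A minimal sketch of using it as a feature extractor with transformers; trust_remote_code is assumed to be required for its custom architecture, and the CLS-style pooling here is an illustrative choice:

```python
# Minimal sketch: sentence embeddings from EuroBERT-610m.
import torch
from transformers import AutoTokenizer, AutoModel

name = "EuroBERT/EuroBERT-610m"
tok = AutoTokenizer.from_pretrained(name, trust_remote_code=True)
model = AutoModel.from_pretrained(name, trust_remote_code=True)

batch = tok(["Hello world", "Bonjour le monde"], padding=True, return_tensors="pt")
with torch.no_grad():
    hidden = model(**batch).last_hidden_state  # (batch, seq, dim)
emb = hidden[:, 0]                             # first-token pooling (illustrative)
print(emb.shape)
```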
Brief-details: Ultra-lightweight Chinese LLM (26M-145M params) trained in 2 hours for $0.43. Includes pretraining, SFT, LoRA, and DPO implementations with minimal dependencies.
Brief-details: A tiny T5 variant built specifically for testing the TRL (Transformer Reinforcement Learning) library, with a minimal architecture meant for fast unit tests rather than real inference quality.
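This is how such tiny checkpoints are typically consumed in a test suite; the repo id below is hypothetical, so substitute the actual tiny-T5 checkpoint name:

```python
# Minimal sketch: a unit test built around a tiny T5 checkpoint -
# cheap to download and to run a forward pass through.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

def test_tiny_t5_generates():
    name = "trl-internal-testing/tiny-T5ForConditionalGeneration"  # hypothetical id
    tok = AutoTokenizer.from_pretrained(name)
    model = AutoModelForSeq2SeqLM.from_pretrained(name)
    out = model.generate(
        **tok("translate: hello", return_tensors="pt"), max_new_tokens=5
    )
    assert out.shape[0] == 1  # one sequence back; output quality is irrelevant
```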
Brief-details: ConvBERT-base is a lightweight BERT variant developed by YituTech that replaces some self-attention heads with span-based dynamic convolutions, improving efficiency while maintaining BERT-like performance.
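Since it ships as a standard transformers architecture, it works as a drop-in encoder; a minimal sketch using the published YituTech/conv-bert-base checkpoint:

```python
# Minimal sketch: ConvBERT as a drop-in BERT-style encoder.
from transformers import AutoTokenizer, ConvBertModel

tok = AutoTokenizer.from_pretrained("YituTech/conv-bert-base")
model = ConvBertModel.from_pretrained("YituTech/conv-bert-base")

inputs = tok("Dynamic convolutions replace some attention heads.", return_tensors="pt")
outputs = model(**inputs)
print(outputs.last_hidden_state.shape)  # (1, seq_len, hidden_dim)
```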
Brief-details: 4-bit quantized version of a Llama 3.2 instruction-tuned model optimized for the MLX framework, offering efficient deployment on Apple Silicon.
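On an Apple Silicon machine, loading and prompting such a model takes a few lines with the mlx-lm package; a minimal sketch, with a repo id that is illustrative of mlx-community naming:

```python
# Minimal sketch: running a 4-bit MLX checkpoint with mlx-lm.
from mlx_lm import load, generate

# Illustrative repo id; substitute the actual MLX-converted checkpoint.
model, tokenizer = load("mlx-community/Llama-3.2-3B-Instruct-4bit")
print(generate(model, tokenizer, prompt="Explain 4-bit quantization briefly.", max_tokens=100))
```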
Brief-details: VRAM-48 is a HuggingFace entry published by unslothai that appears focused on optimizing VRAM usage for deep learning applications.
Brief-details: DBRX-Base by Databricks - a mixture-of-experts foundation language model (132B total parameters, 36B active per input) aimed at enterprise applications, with privacy-conscious data handling.
Brief-details: Llama-2-7b-chat is Meta's 7B-parameter chat-optimized language model, designed for dialogue applications with enhanced instruction-following capabilities.
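A minimal sketch of chatting with it through transformers using the built-in chat template; access to the gated meta-llama repo is assumed:

```python
# Minimal sketch: dialogue with Llama-2-7b-chat via transformers.
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

name = "meta-llama/Llama-2-7b-chat-hf"  # gated repo; request access first
tok = AutoTokenizer.from_pretrained(name)
model = AutoModelForCausalLM.from_pretrained(
    name, torch_dtype=torch.float16, device_map="auto"
)

msgs = [{"role": "user", "content": "Give me three dialogue-system use cases."}]
ids = tok.apply_chat_template(msgs, return_tensors="pt").to(model.device)
out = model.generate(ids, max_new_tokens=200)
print(tok.decode(out[0][ids.shape[-1]:], skip_special_tokens=True))
```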
Brief-details: ECCO-BERT is a specialized BERT model trained on 18th-century UK documents, optimized for historical text analysis and downstream tasks on the ECCO (Eighteenth Century Collections Online) dataset.
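Probing a historical-language model like this with fill-mask gives a quick sanity check; the repo id below is hypothetical, so point it at the actual ECCO-BERT checkpoint:

```python
# Minimal sketch: fill-mask probing of a historical-text BERT.
from transformers import pipeline

fill = pipeline("fill-mask", model="TurkuNLP/eccobert-base-cased-v1")  # hypothetical id
for pred in fill("The [MASK] of London was crowded with carriages."):
    print(f"{pred['token_str']!r}: {pred['score']:.3f}")
```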
Brief-details: GGUF-formatted version of Mistral-Small-Instruct-2409, optimized for local deployment with broad client support and GPU acceleration capabilities.
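Fetching a single quant file and running it with GPU offload is a common pattern; a minimal sketch, where the repo id and filename are illustrative of typical GGUF repo layouts:

```python
# Minimal sketch: download one quant file, then run it with GPU offload.
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

path = hf_hub_download(
    repo_id="bartowski/Mistral-Small-Instruct-2409-GGUF",  # illustrative repo id
    filename="Mistral-Small-Instruct-2409-Q4_K_M.gguf",    # illustrative filename
)
llm = Llama(model_path=path, n_gpu_layers=-1, n_ctx=4096)
print(llm("Q: What is GGUF?\nA:", max_tokens=64)["choices"][0]["text"])
```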
Brief-details: A community-driven AI upscaling model from OpenModelDB, designed for image enhancement and super-resolution tasks, hosted on HuggingFace by uwg.
Brief-details: Mamba-Codestral-7B is a 7B-parameter model by Mistral AI that pairs the Mamba2 state-space architecture with code generation capabilities, optimized for programming tasks.
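A minimal sketch of code generation with it via transformers, assuming a transformers version with Mamba2 support; the mistralai repo may be gated, so acceptance of its terms is assumed:

```python
# Minimal sketch: code completion with Mamba-Codestral-7B.
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

name = "mistralai/Mamba-Codestral-7B-v0.1"
tok = AutoTokenizer.from_pretrained(name)
model = AutoModelForCausalLM.from_pretrained(
    name, torch_dtype=torch.bfloat16, device_map="auto"
)

prompt = "def fibonacci(n):"
out = model.generate(
    **tok(prompt, return_tensors="pt").to(model.device), max_new_tokens=80
)
print(tok.decode(out[0], skip_special_tokens=True))
```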
Brief-details: Iroiro-LoRA is a collection of LoRA (Low-Rank Adaptation) adapters published on HuggingFace by 2vXpSwA7, primarily for fine-tuning Stable Diffusion image models.
Brief-details: SANA 1.5 is a 4.8B-parameter efficient text-to-image model built on a linear diffusion transformer (Linear DiT) architecture, capable of 1024px image generation with 60% lower training costs.
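A minimal sketch of 1024px generation, assuming a diffusers build that ships SanaPipeline; the repo id is illustrative of the Efficient-Large-Model naming on HuggingFace:

```python
# Minimal sketch: 1024px text-to-image with a SANA checkpoint.
import torch
from diffusers import SanaPipeline

pipe = SanaPipeline.from_pretrained(
    "Efficient-Large-Model/Sana_1600M_1024px_diffusers",  # illustrative repo id
    torch_dtype=torch.bfloat16,
).to("cuda")

image = pipe(
    prompt="a watercolor lighthouse at dawn",
    height=1024,
    width=1024,
    num_inference_steps=20,
).images[0]
image.save("sana_1024.png")
```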