Brief-details: Al-Atlas-0.5B is the first LLM dedicated to Moroccan Darija, a 0.5B parameter model trained on 155M tokens of authentic Darija content, offering specialized Arabic dialect processing capabilities.
BRIEF-DETAILS: 8.9B parameter image generation model based on FLUX.1, optimized with mixed quantization for improved performance and speed.
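A minimal sketch of loading a FLUX.1-family checkpoint with Hugging Face diffusers; the repo id below is a placeholder, and a mixed-quantization variant may ship its own loading instructions on its model card.

```python
import torch
from diffusers import FluxPipeline

# Load a FLUX.1-family pipeline (repo id is a placeholder; quantized
# variants may require the custom steps described on the model card).
pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-schnell", torch_dtype=torch.bfloat16
)
pipe.enable_model_cpu_offload()  # reduce GPU memory pressure

image = pipe(
    "a watercolor painting of a lighthouse at dawn",
    num_inference_steps=4,
    guidance_scale=0.0,
).images[0]
image.save("lighthouse.png")
```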
Brief-details: AMD's 3B parameter instruction-tuned LLM, trained on 8.9B tokens. Features 36 decoder layers, 32 attention heads, and 4K context length. Strong performance on reasoning tasks.
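As a sketch of how an instruction-tuned checkpoint like this is typically queried through transformers (the repo id below is a placeholder, not the actual model name):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "org/instruct-3b"  # placeholder repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# Build the prompt through the model's own chat template.
messages = [{"role": "user", "content": "Explain chain-of-thought prompting in one sentence."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```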
BRIEF DETAILS: QwQ-32B-8bit is an 8-bit quantized version of the QwQ-32B model, optimized for the MLX framework with a reduced memory footprint while maintaining performance.
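A minimal mlx-lm sketch for running an 8-bit MLX conversion on Apple Silicon; the repo id is an assumption and may differ from the actual upload.

```python
from mlx_lm import load, generate

# Load the 8-bit MLX weights (repo id is an assumption).
model, tokenizer = load("mlx-community/QwQ-32B-8bit")

prompt = "How many prime numbers are there below 30?"
response = generate(model, tokenizer, prompt=prompt, max_tokens=256, verbose=True)
print(response)
```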
BRIEF-DETAILS: A 3B parameter Japanese instruction-tuned language model based on sarashina2.2, converted to GGUF format with imatrix-calibrated quantization.
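A sketch of loading a GGUF quantization with llama-cpp-python; the file name is a placeholder for whichever quantization level you download.

```python
from llama_cpp import Llama

# Path/filename are placeholders for the downloaded GGUF file.
llm = Llama(
    model_path="sarashina2.2-3b-instruct.Q4_K_M.gguf",
    n_ctx=4096,       # context window
    n_gpu_layers=-1,  # offload all layers to GPU if available
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "日本の首都はどこですか？"}],  # "What is the capital of Japan?"
    max_tokens=64,
)
print(out["choices"][0]["message"]["content"])
```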
BRIEF-DETAILS: Efficient speech-to-text model trained on just 10k hours of data, offering exceptional performance on speech translation and AIR-Bench tasks.
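A hedged sketch using the generic transformers speech-recognition pipeline; the checkpoint name is a placeholder, and the model may instead ship its own inference code.

```python
from transformers import pipeline

# Checkpoint name is a placeholder; consult the model card for the real id.
asr = pipeline("automatic-speech-recognition", model="org/speech-model")

result = asr("sample_audio.wav")  # local audio file
print(result["text"])
```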
Brief-details: A 0.5B parameter Japanese-English language model trained on 10T tokens, optimized for math and coding tasks, with strong performance on Japanese NLP benchmarks.
Brief-details: Uncensored variant of Microsoft's Phi-4-mini-instruct model, created using the abliteration technique to remove content restrictions. Deployable via Ollama.
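Since the card notes Ollama deployment, here is a minimal sketch using the ollama Python client; the model tag is a placeholder for whatever tag the weights are published under.

```python
import ollama  # pip install ollama; requires a running Ollama server

# Model tag is a placeholder; pull it first, e.g. `ollama pull <tag>`.
response = ollama.chat(
    model="phi4-mini-abliterated",
    messages=[{"role": "user", "content": "Summarize the plot of Hamlet in two sentences."}],
)
print(response["message"]["content"])
```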
Brief Details: A 24B parameter instruction-tuned LLM optimized for creative tasks & roleplay. Built on Mistral-24B, combining the PersonalityEngine and Redemption Wind models.
Brief-details: Kokoro-82M-bf16 is an MLX-optimized text-to-speech model with 82M parameters, converted from hexagrad/Kokoro-82M for Apple Silicon efficiency.
BRIEF-DETAILS: Powerful 83B parameter multilingual LLM supporting 25 languages that cover 90% of global speakers, with strong performance across knowledge, reasoning, and translation tasks.
BRIEF DETAILS: Bilingual French-English 7B parameter LLM focused on reasoning, built on Qwen 2.5 and trained on 2K curated samples over 5 epochs.
Brief-details: A 70B parameter "evil-tuned" variant of DeepSeek's R1 Distill of Llama 3.3, designed for uncensored and creative interactions without typical ethical constraints.
Brief-details: Specialized 70B medical LLM built on Llama 3.1, trained to emulate expert clinical reasoning patterns for advanced medical decision support and diagnostics.
BRIEF DETAILS: Image classification model for NSFW content detection, fine-tuned from siglip2-base. A binary classifier for safe/unsafe content with high accuracy and Hugging Face integration.
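A sketch of running the classifier through the transformers image-classification pipeline; the repo id is a placeholder for the actual fine-tuned checkpoint.

```python
from transformers import pipeline

# Repo id is a placeholder; replace with the real checkpoint name.
classifier = pipeline("image-classification", model="org/nsfw-siglip2-classifier")

preds = classifier("photo.jpg")  # local path or URL
for p in preds:
    print(f"{p['label']}: {p['score']:.3f}")
```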
BRIEF-DETAILS: End-to-end speech interaction model featuring audio tokenization, LLM processing, and flow-matching decoder. Supports seamless text-audio switching and high-quality speech synthesis.
Brief-details: Modern Chinese BERT variant trained on the high-quality CCI3-HQ dataset with a 4096-token context length, using 3×8 A100 GPUs. Apache 2.0 licensed.
Brief Details: An 8B parameter merged LLM combining DeepSeek and TAIDE models, built on the Llama-3.1 architecture using the SCE merge method for enhanced chat capabilities.
Brief-details: A 14B parameter LLM based on Qwen 2.5 architecture, featuring enhanced reasoning, 128K context window, and multilingual support across 29 languages.
Brief Details: A 70B parameter LLaMA-based model fine-tuned from DeepSeek R1 Distill, optimized through RL with 1M+ training entries for enhanced reasoning and safety.
BRIEF-DETAILS: Small but powerful 1.5B parameter model optimized for edge devices, featuring multi-turn function calling and reasoning capabilities that rival larger models.
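A sketch of multi-turn function calling via the transformers chat-template tools parameter; the repo id, the tool, and the exact tool-call format are assumptions and will depend on the model's own template.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "org/edge-1.5b-instruct"  # placeholder repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

def get_weather(city: str) -> str:
    """Get the current weather for a city.

    Args:
        city: Name of the city to look up.
    """
    return "sunny, 22 C"  # stub for illustration

messages = [{"role": "user", "content": "What's the weather in Rabat right now?"}]
inputs = tokenizer.apply_chat_template(
    messages,
    tools=[get_weather],          # exposed to the model via its chat template
    add_generation_prompt=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=128)
# The model is expected to emit a structured tool call, which the caller
# parses, executes, and appends as a "tool" message for the next turn.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```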