Brief-details: A Whisper-tiny speech recognition model optimized for Korean, trained with federated learning techniques.
Brief-details: A specialized video diffusion model that treats diffusion as a shader process, enabling 3D-aware video generation with advanced control capabilities and versatile editing options.
Brief-details: A specialized LoRA model for FLUX.1 focused on halftone car photography effects, optimized for 1280x832 resolution with 49 training images.
Brief-details: Audio Flamingo 2 is a 3B parameter audio-language model from NVIDIA that achieves SOTA performance in audio understanding and expert reasoning, capable of processing 5-minute audio clips.
Brief-details: Akshara-8B is an 8B parameter multilingual LLM optimized for Indian languages, supporting 8 languages including Hindi, Tamil, and English. Built by SVECTOR.
Brief-details: 12B parameter GGUF quantized model offering multiple compression variants (Q2-Q8), with optimized versions balancing quality and performance.
Brief-details: Greek-English instruction-tuned 8B parameter LLM built on Llama-3.1, featuring a 128k context window, strong bilingual capabilities, and domain expertise.
Brief-details: 12B parameter language model merging Rei-12B and Francois-Huali-12B, optimized for roleplay and creative writing using ChatML format.
Brief-details: Advanced sentence transformer model built on ModernBERT-large, optimized for semantic search with 1024D embeddings and 8192 token support
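The semantic-search workflow behind a sentence-embedding model like this can be sketched with cosine similarity over embedding vectors; the vectors below are random stand-ins for the model's 1024-dimensional sentence embeddings, and the corpus is purely illustrative.

```python
# Toy sketch of embedding-based semantic search via cosine similarity.
# Random vectors stand in for real 1024-d sentence embeddings.
import numpy as np

rng = np.random.default_rng(1)
corpus_emb = rng.standard_normal((5, 1024))                    # 5 "document" embeddings
query_emb = corpus_emb[2] + 0.01 * rng.standard_normal(1024)   # query near doc 2

def cosine_top1(query, corpus):
    """Return the index of the corpus row most similar to the query."""
    q = query / np.linalg.norm(query)
    c = corpus / np.linalg.norm(corpus, axis=1, keepdims=True)
    scores = c @ q                     # cosine similarity of each doc vs query
    return int(np.argmax(scores)), scores

best, scores = cosine_top1(query_emb, corpus_emb)
# best == 2: the query retrieves the document it was perturbed from
```

With a real model, `corpus_emb` and `query_emb` would come from encoding text; the retrieval step itself is just this normalized dot product.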
Brief-details: Arcee-Blitz (24B), a Mistral-based model distilled from DeepSeek, offering improved world knowledge, efficiency, and performance across a range of tasks.
Brief-details: Advanced multilingual translation model based on Qwen 2.5 32B, fine-tuned for 16 languages using RLHF and expert feedback. Optimized with LoRA + QLoRA.
Brief-details: 24B parameter Mistral-based model optimized for fine-tuning. Features ChatML support, creative writing capabilities, and roleplay functionality. Intentionally undercooked at loss ~8.0.
Brief-details: q3-reasoner is a fine-tuned version of Qwen2.5-Coder-3B-Instruct optimized for faster inference using Unsloth and TRL, licensed under Apache-2.0
Brief-details: Fast tissue segmentation model for H&E pathology slides using UNet with MobileNet-v3 encoder. Achieves 0.93 mIoU, processes slides in <1s on CPU.
Brief-details: QLIP-L-14-392 is NVIDIA's state-of-the-art visual tokenization model combining high-quality image reconstruction with zero-shot image understanding, achieving 79.1% accuracy.
Brief-details: Powerful merged LLM combining Phi-4 variants via the SLERP method. Ranks #3 among models up to 15B parameters with strong performance on various benchmarks.
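The SLERP merge mentioned above interpolates along the arc between two parent models' weight tensors rather than averaging them linearly. A minimal sketch, with toy 4x4 matrices standing in for real model weights:

```python
# Illustrative SLERP (spherical linear interpolation) between two weight
# tensors, as used in model merging. Shapes and values here are hypothetical.
import numpy as np

def slerp(w_a, w_b, t, eps=1e-8):
    """Spherically interpolate between flattened weight tensors at fraction t."""
    a = w_a.ravel().astype(np.float64)
    b = w_b.ravel().astype(np.float64)
    a_n = a / (np.linalg.norm(a) + eps)
    b_n = b / (np.linalg.norm(b) + eps)
    dot = np.clip(a_n @ b_n, -1.0, 1.0)
    omega = np.arccos(dot)             # angle between the two weight vectors
    if omega < eps:                    # nearly parallel: fall back to lerp
        return (1 - t) * w_a + t * w_b
    so = np.sin(omega)
    merged = (np.sin((1 - t) * omega) / so) * a + (np.sin(t * omega) / so) * b
    return merged.reshape(w_a.shape)

# Merge two toy weight matrices halfway between the parents.
wa = np.random.randn(4, 4)
wb = np.random.randn(4, 4)
merged = slerp(wa, wb, t=0.5)
```

At t=0 the merge reproduces the first parent and at t=1 the second; intermediate t values trace the arc between them, which tends to preserve weight magnitudes better than linear averaging.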
Brief-details: A merged 14B parameter LLM combining Lamarck and Qwenvergence models, ranking #1 among sub-15B models with impressive performance on reasoning tasks.
Brief-details: Quantized version of DeepSeek-R1 that maintains full accuracy while reducing size by 75%. Achieves 17.2 tokens/sec with just 5GB RAM usage.
Brief-details: 8-bit quantized version of DeepSeek-R1-Distill-Qwen-7B offering 1.6x speedup and 50% memory reduction while maintaining accuracy.
Brief-details: 8-bit quantized version of DeepSeek-R1-Distill-Qwen-32B offering 2x faster inference with 99.57% accuracy retention and 50% memory reduction.
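The 8-bit quantization behind entries like the two above can be sketched as symmetric int8 rounding with a per-tensor scale; this toy version shows where the memory savings and the bounded rounding error come from, with random weights standing in for a real layer.

```python
# Minimal sketch of symmetric int8 weight quantization: 1 byte per weight
# instead of 2 (fp16) or 4 (fp32). Values are toy, not a real checkpoint.
import numpy as np

def quantize_int8(w):
    """Map fp32 weights to int8 with a single per-tensor scale."""
    scale = np.abs(w).max() / 127.0                            # max maps to +/-127
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    return q.astype(np.float32) * scale

w = np.random.randn(256, 256).astype(np.float32)
q, scale = quantize_int8(w)
w_hat = dequantize(q, scale)

mem_ratio = q.nbytes / w.nbytes        # 0.25 vs fp32, i.e. 50% less than fp16
max_err = np.abs(w - w_hat).max()      # rounding error bounded by scale / 2
```

Production schemes (e.g. per-channel scales or outlier handling) refine this, but the size/accuracy trade-off reported for these checkpoints comes from the same basic mechanism.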
Brief-details: Boreal-HL is a specialized LoRA for Hunyuan that enhances detail, depth of field, skin textures, and lighting in both video and image generations.
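Several entries above are LoRA adapters. The core idea can be sketched in a few lines: a frozen weight matrix W gets a trainable low-rank update B·A scaled by alpha/r, so only a small fraction of parameters is trained. The dimensions and rank below are illustrative, not taken from any of the listed models.

```python
# Sketch of a LoRA-adapted linear layer: y = W x + (alpha/r) * B A x.
# Shapes and rank are hypothetical; real adapters wrap attention/MLP weights.
import numpy as np

d_out, d_in, r, alpha = 64, 64, 8, 16
rng = np.random.default_rng(0)

W = rng.standard_normal((d_out, d_in))      # frozen base weight
A = rng.standard_normal((r, d_in)) * 0.01   # trainable down-projection
B = np.zeros((d_out, r))                    # trainable up-projection, init 0

def lora_forward(x):
    """Base path plus scaled low-rank adapter path."""
    return W @ x + (alpha / r) * (B @ (A @ x))

x = rng.standard_normal(d_in)
y = lora_forward(x)
# With B initialized to zero the adapter starts as a no-op (y == W @ x),
# and only r*(d_in + d_out) extra parameters are trained, not d_out*d_in.
```

Training then updates only A and B; at inference the product (alpha/r)·B·A can be folded into W, so a merged LoRA adds no latency.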