Brief-details: Quantized version of the QwQ-32B model offering multiple compression levels (9GB-35GB) with imatrix optimization, suitable for a range of hardware configurations and performance needs.
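As a rough illustration of how such GGUF quants are typically run, here is a minimal llama-cpp-python sketch; the quant filename, context size, and prompt are assumptions for illustration, not details from the model card:

```python
# Minimal sketch: loading one of the quantized GGUF files with llama-cpp-python.
# Filename and parameters below are illustrative assumptions.
from llama_cpp import Llama

llm = Llama(
    model_path="QwQ-32B-Q4_K_M.gguf",  # hypothetical quant; pick a size that fits your RAM/VRAM
    n_ctx=8192,          # context window; raise if memory allows
    n_gpu_layers=-1,     # offload all layers to GPU, or 0 for CPU-only
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "How many r's are in 'strawberry'?"}],
    max_tokens=512,
)
print(out["choices"][0]["message"]["content"])
```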
Brief-details: Spark-TTS-0.5B is an efficient LLM-based text-to-speech model supporting bilingual synthesis and zero-shot voice cloning, built on Qwen2.5 architecture.
Brief-details: Wan2.1-I2V-14B-720P: A 14B parameter image-to-video generation model capable of producing high-quality 720P videos with state-of-the-art performance and multi-GPU support.
Brief-details: Combined and quantized builds of Wan-AI's WanVideo models, optimized for ComfyUI integration through a custom wrapper implementation.
Brief-details: Wan 2.1 repackaged for ComfyUI - the Wan 2.1 model weights adapted for ComfyUI workflows, with a focus on compatibility and integration.
Brief-details: Kokoro-82M is an open-weight, 82M-parameter TTS model supporting 8 languages with 54 voices. Apache-licensed, inexpensive to train (about $1000), and based on StyleTTS 2.
Brief-details: Magma-8B is Microsoft's multimodal AI agent foundation model, combining vision, language, and action capabilities for UI navigation, robotics, and gaming tasks.
Brief-details: Phi-4-mini-instruct: A lightweight 3.8B parameter model from Microsoft with 128K context, strong reasoning capabilities, and multilingual support, optimized for efficiency.
Brief-details: Aya-vision-32b is a 32B-parameter vision-language model from CohereForAI, designed for advanced visual understanding and processing tasks.
Brief-details: FLUX.1-dev is a text-to-image generation model by black-forest-labs, released alongside companion models for Fill, Redux, and Depth processing functionalities.
Brief-details: CogView4-6B is a high-performance text-to-image generation model with strong capabilities in composition, positioning, and attribute accuracy. Supports resolutions up to 2048x2048.
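A hedged sketch of typical usage, assuming a diffusers release that includes CogView4Pipeline; the prompt and resolution below are illustrative:

```python
# Minimal sketch: text-to-image with CogView4-6B via diffusers.
# Assumes a diffusers version that ships CogView4Pipeline.
import torch
from diffusers import CogView4Pipeline

pipe = CogView4Pipeline.from_pretrained("THUDM/CogView4-6B", torch_dtype=torch.bfloat16)
pipe.to("cuda")

image = pipe(
    prompt="A red bicycle leaning against a brick wall in morning light",
    width=1024,   # supports up to 2048x2048 per the model card
    height=1024,
).images[0]
image.save("cogview4_sample.png")
```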
Brief-details: HunyuanVideo-I2V is a powerful image-to-video generation model from Tencent, capable of creating 720p videos from static images with high fidelity and temporal consistency.
Brief-details: R1-1776 is a post-trained DeepSeek-R1 model by Perplexity AI, specifically modified to remove CCP censorship while maintaining reasoning capabilities.
Brief-details: Aya-vision-8b is an 8B parameter vision-language model from CohereForAI, designed for advanced visual understanding tasks; it is the smaller counterpart to the 32B Aya Vision model.
Brief-details: A 7B parameter OCR model fine-tuned from Qwen2-VL-7B-Instruct, specialized in document image analysis with support for efficient large-scale processing.
Brief-details: Advanced 14B parameter text-to-video model capable of generating high-quality 480P/720P videos, including rendered Chinese and English text, with state-of-the-art performance and extensive motion dynamics.
Brief-details: Phi-4-multimodal-instruct: 5.6B parameter multimodal model supporting text, vision, and speech across multiple languages, with 128K context length and flash-attention support.
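A minimal loading sketch with flash attention enabled; the trust_remote_code flag and the chat format follow the Phi family's conventions and are assumptions to check against the model card:

```python
# Minimal sketch: loading Phi-4-multimodal-instruct with flash attention.
# The chat format and remote-code requirement are assumed from Phi conventions.
from transformers import AutoModelForCausalLM, AutoProcessor

model_id = "microsoft/Phi-4-multimodal-instruct"
processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",
    device_map="auto",
    trust_remote_code=True,
    attn_implementation="flash_attention_2",  # needs the flash-attn package and a supported GPU
)

# Text-only example; image and audio inputs go through the same processor.
prompt = "<|user|>Summarize the benefits of flash attention in one sentence.<|end|><|assistant|>"
inputs = processor(text=prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=128)
print(processor.batch_decode(out[:, inputs["input_ids"].shape[-1]:], skip_special_tokens=True)[0])
```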
Brief-details: QwQ-32B is a 32.5B parameter reasoning model from Qwen with a 131K context length, designed to improve performance on complex tasks through deliberate, step-by-step reasoning.
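A minimal transformers sketch for running the model; generation settings are illustrative, not the card's recommended values:

```python
# Minimal sketch: running QwQ-32B with Hugging Face transformers.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/QwQ-32B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")

messages = [{"role": "user", "content": "How many r's are in 'strawberry'?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(inputs, max_new_tokens=1024)  # reasoning traces can run long
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```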
Brief-details: DeepSeek-R1 is a 671B parameter MoE model focused on reasoning capabilities, trained via reinforcement learning without initial supervised fine-tuning, achieving performance comparable to OpenAI-o1.
Brief-details: A powerful 72B parameter vision-language model capable of processing long videos, multilingual text, and high-resolution images with state-of-the-art performance on visual understanding benchmarks.
Brief-details: LLaMA-65B-HF is Meta AI's 65B parameter LLaMA model converted to the Hugging Face format, trained on diverse web data and released for research use, with strong reasoning capabilities.