Models

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

EimisAnimeDiffusion_1.0v

Brief Details: Specialized anime-style image generation model with strong capabilities in character and landscape creation, featuring high-quality detail rendering and versatile prompt handling.

Text-to-Image

myshell-ai

OpenVoice

BRIEF-DETAILS: Versatile instant voice cloning model enabling multi-language speech generation with style control and zero-shot capabilities. MIT licensed, supports English & Chinese.

Text-to-Speech

OpenGVLab

InternVL-Chat-V1-5

Brief-details: A powerful 25.5B parameter multimodal LLM combining InternViT-6B and InternLM2-20B, featuring dynamic high-resolution processing and strong bilingual capabilities.

Image-Text-to-Text

nitrosocke

Future-Diffusion

Brief-details: Stable Diffusion 2.0-based model specialized in futuristic Sci-Fi imagery, featuring high-quality 3D visuals triggered by "future style" token. 402 likes, 767 downloads.

Text-to-Image

gpt-omni

mini-omni

Brief-details: Mini-omni is a multimodal LLM capable of real-time speech-to-speech conversation with streaming audio output, built on Qwen2-0.5B base model for English language processing.

Text-to-Speech

Vision-CAIR

MiniGPT-4

Brief-details: MiniGPT-4 combines BLIP-2's visual encoder with Vicuna LLM for advanced vision-language understanding, trained in two stages for enhanced image comprehension and natural conversation.

Text Generation

THUDM

chatglm-6b-int4

Brief Details: ChatGLM-6B-INT4 is a quantized bilingual LLM with 6B parameters, optimized for Chinese-English dialogue, requiring only 6GB VRAM for inference.

Transformers

lmsys

vicuna-13b-delta-v1.1

BRIEF-DETAILS: Vicuna-13b-delta-v1.1 is a fine-tuned LLaMA variant trained on 70K ShareGPT conversations, offering strong chat capabilities for researchers and hobbyists

Text Generation

NousResearch

Nous-Hermes-2-Mixtral-8x7B-DPO

BRIEF DETAILS: A powerful 46.7B parameter Mixtral-based model fine-tuned with DPO, achieving state-of-the-art performance. Features ChatML format and extensive benchmark improvements.

Text Generation

dcy

AsiaFacemix

Brief Details: AsiaFacemix is a specialized AI model focused on improving Asian facial features in image generation, based on basil mix, dreamlike, and ProtoGen models. Licensed under OpenRail.

Gustavosta/Stable-Diffusion-Prompts

cognitivecomputations

dolphin-2.9-llama3-8b

Brief-details: Dolphin 2.9 is an 8B parameter LLaMA3-based model optimized for conversational AI, coding, and instruction-following with uncensored capabilities and 4k context length.

Text Generation

nitrosocke

classic-anim-diffusion

Brief-details: A Stable Diffusion model fine-tuned for classic animation-style image generation, specializing in Disney-like character rendering with 412 likes and 338 downloads.

Text-to-Image

Onodofthenorth

SD_PixelArt_SpriteSheet_Generator

BRIEF DETAILS: A specialized Stable Diffusion model for generating pixel art sprite sheets from 4 angles (front, back, left, right). Apache-2.0 licensed with 1.3K+ downloads.

Text-to-Image

TheBloke

Mixtral-8x7B-v0.1-GGUF

Brief-details: A powerful 46.7B parameter MoE model quantized for efficient deployment, supporting 5 languages with various GGUF formats for different performance/quality trade-offs

Transformers

NexaAIDev

omnivision-968M

Brief-details: Compact 968M-parameter multimodal model optimized for edge devices. Features 9x token reduction and DPO training for reliable visual-text processing.

GGUF

openbmb

MiniCPM-V-2

Brief Details: MiniCPM-V-2 is a 3.43B parameter bilingual multimodal LLM achieving GPT-4V-level performance, supporting high-res images and efficient deployment on mobile devices.

Visual Question Answering

amazon

MistralLite

Brief Details: MistralLite - A fine-tuned Mistral-7B model optimized for long context (32K tokens) with enhanced retrieval capabilities. Built by Amazon.

Text Generation

upstage

solar-pro-preview-instruct

Brief-details: Solar Pro Preview is a 22B parameter LLM optimized for single GPU deployment, offering performance comparable to 70B models with enhanced instruction-following capabilities and MMLU benchmark excellence.

Text Generation

IDEA-CCNL

Taiyi-Stable-Diffusion-1B-Chinese-v0.1

BRIEF DETAILS: First open-source Chinese Stable Diffusion model trained on 20M filtered Chinese image-text pairs. Uses CLIP-based filtering and specialized text encoder for Chinese concept alignment.

Text-to-Image

cognitivecomputations

WizardLM-7B-Uncensored

Brief-details: WizardLM-7B-Uncensored is an unfiltered variant of WizardLM, trained without alignment constraints for customizable fine-tuning. Built on PyTorch.

Text Generation

hassanblend

hassanblend1.4

BRIEF-DETAILS: A versatile text-to-image model created by hassanblend, featuring specialized diffusion techniques with 436 likes and 1,993 downloads. Licensed under CreativeML OpenRAIL-M.

Text-to-Image

EimisAnimeDiffusion_1.0v

OpenVoice

InternVL-Chat-V1-5

Future-Diffusion

mini-omni

MiniGPT-4

chatglm-6b-int4

vicuna-13b-delta-v1.1

Nous-Hermes-2-Mixtral-8x7B-DPO

AsiaFacemix

dolphin-2.9-llama3-8b

classic-anim-diffusion

SD_PixelArt_SpriteSheet_Generator

Mixtral-8x7B-v0.1-GGUF

omnivision-968M

MiniCPM-V-2

MistralLite

solar-pro-preview-instruct

Taiyi-Stable-Diffusion-1B-Chinese-v0.1

WizardLM-7B-Uncensored

hassanblend1.4

The first platform built for prompt engineering