Brief Details: A BART-based ChatGPT prompt generator fine-tuned on a dataset of ChatGPT prompts, achieving a 2.48 train loss and 2.73 validation loss after 4 epochs. Apache 2.0 licensed.
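A minimal usage sketch via the transformers pipeline API; the repo ID below is a placeholder, not the model's actual Hub path:

```python
from transformers import pipeline

# Hypothetical repo ID -- substitute the checkpoint's actual Hub path.
generator = pipeline("text2text-generation", model="your-org/bart-chatgpt-prompt-generator")

# BART is an encoder-decoder, so prompt generation runs as a text2text task:
# a short role or topic goes in, a full ChatGPT-style prompt comes out.
out = generator("Act as a travel guide", max_length=128)
print(out[0]["generated_text"])
```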
Brief Details: A Portuguese-language instruction-tuned LLaMA model using LoRA, trained on a translated Alpaca dataset and efficiently fine-tuned on an A100 GPU.
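A sketch of loading the adapter with peft, assuming the LoRA weights are published separately from the base model (both repo IDs below are placeholders):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_id = "your-org/llama-7b-hf"        # hypothetical base checkpoint
adapter_id = "your-org/alpaca-pt-lora"  # hypothetical LoRA adapter

tokenizer = AutoTokenizer.from_pretrained(base_id)
base = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype=torch.float16, device_map="auto")

# LoRA keeps the base weights frozen; only the small low-rank adapter
# matrices are loaded on top, which is what makes A100 fine-tuning cheap.
model = PeftModel.from_pretrained(base, adapter_id)

prompt = "### Instruction:\nExplain what LoRA is.\n### Response:\n"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```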
Brief Details: GODEL is a Transformer-based encoder-decoder model for goal-directed dialogs, pre-trained on 551M multi-turn dialogs and optimized for grounded response generation.
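A usage sketch following the grounded input format documented on the GODEL model card (instruction, dialog turns joined by ` EOS `, and a `[KNOWLEDGE]` segment):

```python
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

model_id = "microsoft/GODEL-v1_1-large-seq2seq"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

instruction = "Instruction: given a dialog context and related knowledge, respond helpfully."
knowledge = "[KNOWLEDGE] GODEL is an encoder-decoder model for goal-directed dialogs."
dialog = " EOS ".join([
    "What is GODEL?",
    "It is a dialog model from Microsoft.",
    "What makes it different?",
])

# GODEL conditions the response on instruction + context + grounding knowledge.
query = f"{instruction} [CONTEXT] {dialog} {knowledge}"
inputs = tokenizer(query, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```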
Brief Details: GLM-4-Voice-9B is a 9.54B parameter end-to-end voice model for Chinese and English speech understanding and generation with real-time capabilities.
Brief Details: A melancholic variant of Llama-3-8B-Instruct using orthogonalization to create an intentionally unenthusiastic conversational style. 8B parameters.
Brief Details: Emu3-Chat: An 8.49B parameter multimodal model using next-token prediction for image/text/video tasks. Outperforms SDXL in generation and LLaVA-1.6 in perception.
Brief Details: A 7B parameter AWQ-quantized Mistral model with 128k context window, optimized for long-form text generation and efficient inference at 4-bit precision.
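A loading sketch with a placeholder repo ID; AWQ checkpoints load through plain `from_pretrained` when the `autoawq` package is installed:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "your-org/mistral-7b-128k-awq"  # hypothetical repo ID

tokenizer = AutoTokenizer.from_pretrained(model_id)
# The weights are already quantized to 4-bit AWQ, so no quantization
# config is needed at load time; dequantization happens inside the kernels.
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

inputs = tokenizer("Summarize the following report:\n...", return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```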
Brief Details: Llama3-Chinese-8B-Instruct is an 8B parameter Chinese language model based on Llama3, optimized for instruction-following and dialogue tasks, with weights released in FP16 precision.
Brief Details: A 2.8B parameter Mamba-architecture model fine-tuned on the OpenHermes dataset, optimized for instruction-following and conversational AI tasks via a state-space modeling approach.
Brief Details: InternLM-XComposer2-4KHD is a vision-language model capable of processing images up to 4K resolution, built on the InternLM2 architecture with advanced visual understanding capabilities.
Brief Details: T2I-Adapter for SDXL specialized in sketch-to-image generation. 77M parameters, Apache 2.0 licensed, built on SDXL base model.
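A diffusers sketch, assuming the published TencentARC SDXL sketch adapter and an already-preprocessed line drawing:

```python
import torch
from diffusers import StableDiffusionXLAdapterPipeline, T2IAdapter
from diffusers.utils import load_image

# The ~77M parameter adapter loads separately from the frozen SDXL base.
adapter = T2IAdapter.from_pretrained(
    "TencentARC/t2i-adapter-sketch-sdxl-1.0", torch_dtype=torch.float16, variant="fp16"
)
pipe = StableDiffusionXLAdapterPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0",
    adapter=adapter,
    torch_dtype=torch.float16,
    variant="fp16",
).to("cuda")

# The adapter injects sketch features into the UNet, steering composition
# while SDXL itself handles appearance and detail.
sketch = load_image("sketch.png")  # assumed to be a preprocessed line drawing
image = pipe(
    prompt="a cozy cabin in the woods, watercolor",
    image=sketch,
    adapter_conditioning_scale=0.9,
).images[0]
image.save("cabin.png")
```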
Brief Details: RWKV's v5-Eagle-7B-HF is a 7B parameter LLM built on the attention-free RWKV-v5 (Eagle) architecture, optimized for both CPU and GPU inference and supporting multilingual text generation with Hugging Face integration.
Brief Details: A 4-bit quantized Chinese-English LLaMA 2 model optimized for bilingual conversation, trained on 10M instruction-tuning samples and released with commercial usage rights.
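If the published weights were full precision, 4-bit loading with bitsandbytes would look like the sketch below (placeholder repo ID); a pre-quantized GPTQ/AWQ export instead loads directly via `from_pretrained`:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "your-org/chinese-english-llama-2"  # hypothetical repo ID

# NF4 4-bit quantization; the matmuls still compute in fp16.
bnb = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, quantization_config=bnb, device_map="auto")

inputs = tokenizer("Write one greeting in Chinese and one in English.", return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```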
Brief Details: UltraLM-13b is a fine-tuned LLaMA-based chat model trained on the UltraChat dataset, optimized for multi-turn conversations with 13B parameters.
Brief Details: A 7B parameter medical LLM fine-tuned from LLaMA, specialized in medical Q&A with extensive training on healthcare datasets including ChatDoctor and WikiDoc.
Brief Details: A fine-tuned Whisper Large model optimized for Chinese (Mandarin) speech recognition, achieving a 9.55% CER on the Common Voice 11 test set.
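A transcription sketch via the transformers ASR pipeline (placeholder repo ID):

```python
from transformers import pipeline

asr = pipeline(
    "automatic-speech-recognition",
    model="your-org/whisper-large-zh",  # hypothetical repo ID
    chunk_length_s=30,                  # Whisper operates on 30-second windows
)

# Pin the decoder to Mandarin transcription rather than translation.
result = asr("meeting.wav", generate_kwargs={"language": "zh", "task": "transcribe"})
print(result["text"])
```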
Brief Details: A 32B parameter code-specialized LLM with a 128K context length, optimized for code generation, code reasoning, and code fixing. Built on the Qwen2.5 architecture.
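A generation sketch, assuming the instruct variant published as Qwen/Qwen2.5-Coder-32B-Instruct and its chat template:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen2.5-Coder-32B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16, device_map="auto")

messages = [
    {"role": "system", "content": "You are a helpful coding assistant."},
    {"role": "user", "content": "Write a Python function that checks whether a string is a palindrome."},
]
# apply_chat_template renders the conversation into the model's prompt format.
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
output = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```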
Brief Details: A Stable Diffusion 2.x embedding model trained on 120 photos for generating photorealistic images with enhanced color representation and photography-like qualities.
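Assuming this is a textual-inversion embedding (the usual format for SD 2.x embeddings), a diffusers sketch with a placeholder repo ID and trigger token:

```python
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "stabilityai/stable-diffusion-2-1", torch_dtype=torch.float16
).to("cuda")

# Hypothetical repo ID and trigger word -- substitute the published embedding
# and whatever token it was trained with.
pipe.load_textual_inversion("your-org/photo-style-embedding", token="<photo-style>")

image = pipe("portrait of a woman, <photo-style>, natural light").images[0]
image.save("portrait.png")
```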
Brief Details: StoryMaker is a specialized text-to-image model focusing on maintaining character consistency across multiple scenes, ideal for visual storytelling and sequential image generation.
Brief Details: Optimized speech recognition model supporting 100+ languages, built on Whisper large-v3, converted to CTranslate2 format for faster inference.
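A faster-whisper sketch; the `"large-v3"` shorthand resolves to the converted CTranslate2 checkpoint on the Hub:

```python
from faster_whisper import WhisperModel

# CTranslate2 weights load directly; compute_type picks the inference precision.
model = WhisperModel("large-v3", device="cuda", compute_type="float16")

segments, info = model.transcribe("interview.mp3", beam_size=5)
print(f"Detected language: {info.language} (p={info.language_probability:.2f})")
for segment in segments:
    print(f"[{segment.start:.1f}s -> {segment.end:.1f}s] {segment.text}")
```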
Brief Details: Large-scale 70B parameter LLaMA3-based model fine-tuned for conversational AI. Features uncensored responses, coding capabilities, and function-calling support.