Brief-details: MobileLLM-1B is an efficient 1.01B-parameter language model optimized for on-device use, featuring grouped-query attention (GQA), shared embeddings, and a 2k context window.
Brief-details: WestLake-7B-v2 is a 7B-parameter LLM specializing in role-play and text generation, achieving a 74.68% average score on key benchmarks, with strong results on HellaSwag (88.65%) and Winogrande (86.98%).
Brief-details: A coding-focused 7B-parameter LLM pre-trained on 2T tokens with a 4K context window and fine-tuned on 2B instruction tokens for code generation.
Brief-details: A specialized anime-style Stable Diffusion model focused on generating round-eyed female characters with petite proportions and detailed eye highlights.
Brief-details: A 15B-parameter SQL-generation model that outperforms GPT-3.5-turbo, converting natural language to SQL with 77.5% accuracy.
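A minimal text-to-SQL sketch using transformers, assuming a standard causal-LM checkpoint; the repo id and the schema/question prompt layout are illustrative placeholders, not the model's documented format:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# "org/sql-model-15b" is a placeholder repo id, not the model's real name.
tokenizer = AutoTokenizer.from_pretrained("org/sql-model-15b")
model = AutoModelForCausalLM.from_pretrained("org/sql-model-15b", device_map="auto")

# Text-to-SQL models are typically prompted with the table schema plus a question.
prompt = (
    "### Schema:\n"
    "CREATE TABLE orders (id INT, customer TEXT, total DECIMAL);\n"
    "### Question:\nWhat is the total revenue per customer?\n"
    "### SQL:\n"
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=128, do_sample=False)
# Decode only the newly generated tokens (the SQL), not the prompt.
print(tokenizer.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```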
Brief-details: A powerful 72B-parameter LLM based on Qwen2, fine-tuned for conversation, coding, and function calling. Features uncensored responses and a 128k context window.
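For the function-calling claim, a hedged sketch of how such a model is typically prompted with tool schemas via the tokenizer's chat template; the repo id is a placeholder, and whether this model's bundled template accepts the `tools` kwarg is an assumption (recent transformers versions support it):

```python
from transformers import AutoTokenizer

def get_weather(city: str):
    """Get the current weather for a city.

    Args:
        city: Name of the city.
    """
    ...

# Placeholder repo id, not the actual checkpoint name.
tok = AutoTokenizer.from_pretrained("org/qwen2-72b-finetune")
messages = [{"role": "user", "content": "What's the weather in Oslo?"}]
# transformers converts the type-hinted, docstring-documented function into a
# JSON schema and serializes it into the prompt per the model's template.
prompt = tok.apply_chat_template(
    messages, tools=[get_weather], add_generation_prompt=True, tokenize=False
)
print(prompt)
```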
Brief-details: Meta's latest 70B-parameter LLM, instruction-tuned for superior performance. Features grouped-query attention (GQA), training on 15T tokens, and enhanced code capabilities. Matches or exceeds GPT-3.5.
Brief-details: A powerful 22B-parameter dense language model derived by compressing a mixture-of-experts (MoE) model, featuring a 32k context length, uncensored capabilities, and strong performance in math, coding, and multi-turn conversation.
Brief-details: Redmond-Puffin-13B is a Llama-2-based model available for commercial use, fine-tuned on 3K high-quality examples and achieving SOTA performance on GPT4ALL benchmarks.
Brief-details: A 340B-parameter reward model by NVIDIA for evaluating AI responses across 5 dimensions: helpfulness, correctness, coherence, complexity, and verbosity.
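A small sketch of how a downstream ranker might fold the five per-dimension scores into one scalar; the equal weighting is an illustrative assumption, not NVIDIA's recipe:

```python
DIMS = ("helpfulness", "correctness", "coherence", "complexity", "verbosity")

def overall(scores: dict, weights: dict | None = None) -> float:
    # Weighted mean over the five reward dimensions; equal weights by default.
    w = weights or dict.fromkeys(DIMS, 1.0)
    return sum(w[d] * scores[d] for d in DIMS) / sum(w.values())

# Hypothetical per-dimension scores for one candidate response.
print(overall({"helpfulness": 4.2, "correctness": 4.0, "coherence": 4.5,
               "complexity": 2.1, "verbosity": 1.8}))
```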
Brief-details: A specialized transformer model fine-tuned to improve LoRA training performance on Flux-Dev, trained on 3M high-quality images over two epochs.
Brief-details: Pygmalion-13B is a conversational LLaMA fine-tune focused on dialogue generation; deployment requires XOR-decoding the released files against the original LLaMA weights.
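The release ships its own conversion tooling; the sketch below only illustrates the underlying idea (byte-wise XOR of the published deltas against matching base-weight files), with all file paths hypothetical:

```python
import numpy as np

def xor_decode(base_path: str, delta_path: str, out_path: str) -> None:
    # XOR is its own inverse: if delta = base ^ finetuned,
    # then finetuned = base ^ delta.
    base = np.fromfile(base_path, dtype=np.uint8)
    delta = np.fromfile(delta_path, dtype=np.uint8)
    assert base.size == delta.size, "base and delta files must match in size"
    np.bitwise_xor(base, delta).tofile(out_path)

# Hypothetical paths; the actual release documents its own shard layout.
# xor_decode("llama-13b/consolidated.00.pth",
#            "pygmalion-xor/consolidated.00.pth",
#            "out/consolidated.00.pth")
```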
Brief-details: A powerful 9.4B-parameter language model specialized in long-form content generation, capable of producing 10,000+ word texts in English and Chinese.
Brief-details: XEUS is a powerful multilingual speech encoder covering 4,000+ languages, using an E-Branchformer architecture with 577M parameters and designed for universal speech recognition.
Brief-details: A powerful 60.8B-parameter MoE model combining two 34B models, achieving strong performance across benchmarks with multilingual capabilities and an efficient architecture.
Brief-details: MistralTrix-v1 is an 8.99B-parameter model fine-tuned with DPO, achieving top performance among 7B-class LLMs. Ships in FP16 precision with English-language support.
Brief-details: A Chinese BERT-based sentence-embedding model fine-tuned on NLI data and optimized for semantic-similarity tasks, with 13.4K+ downloads and an Apache 2.0 license.
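A minimal semantic-similarity sketch with sentence-transformers; the repo id is a placeholder for the actual checkpoint:

```python
from sentence_transformers import SentenceTransformer, util

# Placeholder repo id for the Chinese NLI-tuned embedding model.
model = SentenceTransformer("org/chinese-bert-nli")
emb = model.encode(
    ["今天天气很好", "今天是晴天", "我喜欢编程"],  # "nice weather" / "sunny day" / "I like programming"
    convert_to_tensor=True,
)
print(util.cos_sim(emb[0], emb[1]))  # near-paraphrases should score high
print(util.cos_sim(emb[0], emb[2]))  # unrelated sentences should score low
```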
Brief-details: A specialized Japanese-to-Chinese translation model optimized for light novels and visual novels, built on Qwen/Baichuan architectures with extensive training on ACGN-domain (anime, comics, games, novels) content.
Brief-details: A language model with 67B parameters trained on 2T tokens, supporting English and Chinese text generation. Features grouped-query attention and a commercial-use license.
Brief-details: A specialized ControlNet model for Stable Diffusion that enables brightness control and image colorization, with recommended conditioning weights of 0.4-0.9 for optimal results.
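A hedged diffusers sketch of applying such a ControlNet for colorization; the ControlNet repo id and input filename are placeholders, and the conditioning scale of 0.6 is simply one value inside the recommended 0.4-0.9 range:

```python
import torch
from diffusers import ControlNetModel, StableDiffusionControlNetPipeline
from diffusers.utils import load_image

# Placeholder ControlNet repo id; the SD 1.5 base is the commonly used pairing.
controlnet = ControlNetModel.from_pretrained(
    "org/sd-controlnet-brightness", torch_dtype=torch.float16
)
pipe = StableDiffusionControlNetPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", controlnet=controlnet, torch_dtype=torch.float16
).to("cuda")

gray = load_image("photo_grayscale.png")  # luminance map to colorize
image = pipe(
    "a vivid color photograph",
    image=gray,
    controlnet_conditioning_scale=0.6,  # inside the recommended 0.4-0.9 range
    num_inference_steps=30,
).images[0]
image.save("colorized.png")
```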
Brief-details: A Russian-language, Llama-3-based chatbot (8B parameters) with a specialized prompt format and strong performance on language tasks. Built for conversational AI.
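Since the model expects a specialized prompt format, the usual safe route is the tokenizer's bundled chat template rather than hand-built strings; the repo id below is a placeholder:

```python
from transformers import AutoTokenizer

# Placeholder repo id for the Russian Llama-3 chat checkpoint.
tok = AutoTokenizer.from_pretrained("org/llama3-8b-russian-chat")
messages = [
    {"role": "system", "content": "Ты — полезный ассистент."},  # "You are a helpful assistant."
    {"role": "user", "content": "Привет! Расскажи анекдот."},   # "Hi! Tell me a joke."
]
# The template renders the model's own special tokens and turn markers.
prompt = tok.apply_chat_template(messages, add_generation_prompt=True, tokenize=False)
print(prompt)
```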