Brief-details: Gemma 2B variant with 10M context length using recurrent local attention, optimized for <32GB memory usage. MIT-licensed with F32 tensor support.
Brief-details: AuraSR-v2 is a GAN-based super-resolution model for 4x upscaling of generated images, featuring 618M parameters and Apache 2.0 license
Brief-details: StarChat-Alpha is a 15.5B parameter coding assistant model fine-tuned from StarCoder, optimized for programming tasks with FP16 precision
Brief-details: CodeGeeX4-ALL-9B is a powerful multilingual code generation model with 9.4B parameters, built on GLM-4-9B, offering superior code completion and generation capabilities.
Brief-details: Multilingual instruction-following LLM based on LLaMA 7B, supporting EN/ZH/JA/DE with enhanced role-playing and context handling capabilities.
Brief-details: OPT-13B-Erebus is a specialized text generation model based on the OPT-13B architecture, trained on six distinct adult-themed datasets.
Brief-details: DeciLM-6b is a 5.7B parameter LLM optimized for efficiency, featuring variable Grouped-Query Attention (GQA) and a 4096-token context window. Up to 15x faster than Llama 2 7B.
Brief-details: WizardLM-70B-V1.0 is a powerful large language model built on Llama 2, achieving impressive scores on MT-Bench (7.78) and AlpacaEval (92.91%), specialized in following complex instructions.
Brief-details: ChatGLM2-6B-INT4 is a quantized bilingual LLM offering 42% faster inference than its predecessor, with 8K context length support and improved performance across multiple benchmarks.
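To see why grouped-query attention helps at inference time, here is a minimal sketch of the KV-cache arithmetic: sharing key/value heads across query heads shrinks the cache proportionally. The layer/head numbers below are illustrative assumptions, not DeciLM's actual configuration.

```python
def kv_cache_bytes(n_layers, n_kv_heads, head_dim, seq_len, bytes_per_elem=2):
    """KV-cache size: 2 tensors (K and V) per layer, per KV head, per position."""
    return 2 * n_layers * n_kv_heads * head_dim * seq_len * bytes_per_elem

# Illustrative config: 32 layers, 128-dim heads, 4096-token context, FP16.
mha = kv_cache_bytes(32, 32, 128, 4096)  # full multi-head attention: 32 KV heads
gqa = kv_cache_bytes(32, 4, 128, 4096)   # grouped-query attention: 4 KV heads
ratio = mha / gqa                        # 8x smaller KV cache
```

With 4 KV-head groups instead of 32 KV heads, the cache shrinks 8x, which is one reason GQA models sustain higher serving throughput at the same context length.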
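INT4 quantization like ChatGLM2-6B-INT4's works by mapping floating-point weights onto 16 integer levels plus a scale factor. A minimal sketch of symmetric per-tensor 4-bit quantization (a simplified illustration, not ChatGLM's actual quantization scheme):

```python
def quantize_int4(xs):
    """Symmetric per-tensor quantization to 4-bit integers in [-8, 7]."""
    scale = max(abs(x) for x in xs) / 7 or 1.0  # avoid zero scale for all-zero input
    q = [max(-8, min(7, round(x / scale))) for x in xs]
    return q, scale

def dequantize(q, scale):
    """Recover approximate floats: each value is off by at most scale / 2."""
    return [v * scale for v in q]

weights = [0.7, -1.4, 0.05, 2.1]
q, s = quantize_int4(weights)
approx = dequantize(q, s)
```

Storing 4-bit codes plus one scale cuts weight memory roughly 4x versus FP16, at the cost of the rounding error bounded by half a quantization step.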
Brief-details: Allegro is an advanced open-source text-to-video generation model with 2.8B parameters, capable of creating 6-second HD videos at 15 FPS from text prompts.
Brief-details: BitNet b1.58 3B is an efficient 3.32B parameter model trained on the RedPajama dataset, achieving performance comparable to FP16 models with ternary {-1, 0, 1} weights (~1.58 bits each).
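The "1.58-bit" name comes from log2(3): each weight takes one of three values. A minimal sketch of absmean ternary quantization in the style of BitNet b1.58 (scale by the mean absolute weight, then round and clip):

```python
def ternarize(ws):
    """Absmean quantization: scale weights by their mean |w|,
    then round and clip to the ternary set {-1, 0, 1}."""
    gamma = sum(abs(w) for w in ws) / len(ws) or 1.0  # avoid zero scale
    return [max(-1, min(1, round(w / gamma))) for w in ws], gamma

q, gamma = ternarize([0.9, -0.1, 0.4, -1.2])
# q contains only values from {-1, 0, 1}
```

Because weights are ternary, matrix multiplication reduces to additions and subtractions (zeros are skipped), which is where most of the efficiency gain comes from.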
Brief-details: ChatGLM3-6B-32K: Advanced 6B parameter LLM optimized for long contexts up to 32K tokens, with enhanced position encoding and specialized long-text training capabilities.
Brief-details: A specialized text-to-image model for generating D&D character art, supporting 29 species and 15 classes with detailed fantasy styling.
Brief-details: Multimodal 8.46B parameter model combining vision and language capabilities, specialized in OCR and document understanding, with Apache 2.0 license.
Brief-details: SillyTavern-Presets offers customizable roleplay configurations for LLM interactions, featuring optimized sampling parameters and character card templates.
Brief-details: Zero-shot text-to-speech model supporting 6 languages (en, zh, ko, ja, fr, de) with non-autoregressive architecture and masked generative codec transformer technology
Brief-details: JetMoE-8B is a cost-efficient 8.52B parameter MoE model achieving LLaMA2-7B performance with only 2.2B active parameters, trained for under $0.1M.
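The gap between 8.52B total and 2.2B active parameters comes from sparse routing: each token is sent to only the top-k experts. A minimal sketch of standard top-k MoE gating (generic illustration, not JetMoE's exact router):

```python
import math

def top_k_route(logits, k=2):
    """Pick the k highest-scoring experts for one token and
    softmax-normalize their gate weights over just those k."""
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]
    exps = [math.exp(logits[i]) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]

# 8 experts, only 2 active per token: compute (and "active parameters")
# scales with k, not with the total expert count.
routes = top_k_route([0.1, 2.0, -0.5, 1.2, 0.0, 0.3, -1.0, 0.8], k=2)
```

Only the selected experts run a forward pass for that token, so inference cost tracks the active-parameter count rather than the full model size.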
Brief-details: Optimized 7B parameter Mistral model fine-tuned on OpenOrca dataset, offering strong performance with various quantization options and ChatML format support
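ChatML, the prompt format this OpenOrca fine-tune expects, wraps each turn in `<|im_start|>role ... <|im_end|>` markers. A small sketch of rendering a conversation into that format (hand-rolled for illustration; in practice the model's tokenizer chat template handles this):

```python
def to_chatml(messages):
    """Render {role, content} messages in ChatML and open a final
    assistant turn for the model to complete."""
    parts = [f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>" for m in messages]
    parts.append("<|im_start|>assistant\n")
    return "\n".join(parts)

prompt = to_chatml([
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "What is grouped-query attention?"},
])
```

The trailing open `assistant` turn cues the model to generate its reply, with `<|im_end|>` serving as the stop token.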
Brief-details: 8B parameter LLaMA-based model optimized for creative writing and assistant tasks, featuring improved narrative capabilities and instruction following
Brief-details: High-quality anime-style text-to-image model optimized for 768x768px generation, featuring enhanced photorealistic capabilities and specialized anime aesthetics.
Brief-details: DeciCoder-1b is a 1.1B parameter code generation model optimized for Python, Java, and JavaScript, featuring Grouped Query Attention and 2048-token context window.