Brief-details: Skywork-13B-base is a powerful bilingual LLM with 13B parameters, trained on 3.2T tokens of Chinese/English data, achieving SOTA performance on various benchmarks.
Brief-details: Qwen-7B-Chat-Int4 is a 4-bit quantized version of the 7B-parameter Qwen-7B-Chat model, offering efficient inference with a much smaller memory footprint (the packed checkpoint reports ~2.11B stored parameters) while maintaining strong performance across multiple languages and tasks.
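A minimal loading sketch, assuming the Hugging Face repo id Qwen/Qwen-7B-Chat-Int4 and the chat helper that Qwen checkpoints expose via trust_remote_code; extra requirements (e.g. auto-gptq) are per the model card:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# assumed repo id; Qwen checkpoints need trust_remote_code for their custom code
model_id = "Qwen/Qwen-7B-Chat-Int4"
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",        # place the 4-bit weights on the available GPU
    trust_remote_code=True,
).eval()

# Qwen's remote code provides a chat() helper that threads conversation history
response, history = model.chat(tokenizer, "Give me a short introduction to LLMs.", history=None)
print(response)
```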
Brief-details: A 2.7B parameter chat model specialized in function calling, built on replit-code-v1-3b, offering GPT-3.5/4-like function-calling capabilities for API integration.
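Function-calling chat models of this kind are typically driven by a JSON tool schema in the system prompt and a parseable JSON reply; a minimal, model-agnostic sketch in which the get_weather tool and the model reply are both hypothetical:

```python
import json

# hypothetical tool schema included in the model's system prompt
tools = [{
    "name": "get_weather",
    "description": "Get the current weather for a city",
    "parameters": {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
}]
system_prompt = "Call a function by replying with JSON. Available tools:\n" + json.dumps(tools)

# in practice this string comes from the chat model; shown inline for illustration
model_reply = '{"function": "get_weather", "arguments": {"city": "Paris"}}'

call = json.loads(model_reply)
if call["function"] == "get_weather":
    # dispatch to the real API here; stubbed result for the sketch
    tool_result = {"city": call["arguments"]["city"], "temp_c": 21}
    print(tool_result)
```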
Brief-details: A 7B parameter uncensored Llama2 chat model quantized to 4-bit precision, offering multiple GPTQ variants for efficient GPU inference and deployment
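GPTQ repos of this kind usually publish each quantization variant (group size, act-order) as a git branch; a hedged loading sketch with an illustrative repo id, assuming a transformers build with GPTQ support (optimum and auto-gptq installed):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "TheBloke/Llama-2-7B-Chat-GPTQ"  # illustrative repo id, not named in the summary
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",   # GPTQ inference runs on GPU
    revision="main",     # select a quantization variant via its branch name
)

inputs = tokenizer("Hello, who are you?", return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```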
Brief-details: A 7B parameter language model fine-tuned by Open-Assistant on human demonstrations, based on StableLM, optimized for assistant-style conversations.
Brief-details: Core ML port of the Stable Diffusion XL base model, optimized for macOS GPUs via the ORIGINAL attention implementation. Created by Apple; supports text-to-image generation.
Brief-details: An artistic AI model focusing on balanced scene and character generation, featuring strong light/shadow control and improved tag reading in v5.0. Supports both landscape and portrait modes.
Brief-details: Japanese art-focused diffusion model built on Defacta base, optimized for anime/manga-style images with multiple versions (V1-V9) and specialized VAE requirements
Brief-details: StreetCLIP is a powerful zero-shot image geolocalization model trained on 1.1M street-level images, achieving SOTA performance on geographic classification tasks.
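Zero-shot geolocalization here follows the standard CLIP classification pattern: score one image against candidate place names. A sketch assuming the geolocal/StreetCLIP repo id (the image URL is a placeholder):

```python
import requests
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("geolocal/StreetCLIP")
processor = CLIPProcessor.from_pretrained("geolocal/StreetCLIP")

url = "https://example.com/street_scene.jpg"  # placeholder image URL
image = Image.open(requests.get(url, stream=True).raw)

candidates = ["France", "Japan", "Brazil", "United States", "Kenya"]
inputs = processor(text=candidates, images=image, return_tensors="pt", padding=True)

logits = model(**inputs).logits_per_image   # image-text similarity scores
probs = logits.softmax(dim=1)               # probability over candidate locations
print(dict(zip(candidates, probs[0].tolist())))
```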
Brief-details: A comprehensive collection of mixed anime/weeb Stable Diffusion models, featuring 9 different model combinations optimized for high-quality anime-style image generation.
Brief-details: A compact multilingual translation model supporting 101 languages with 333M parameters, achieving performance comparable to M2M-100 while being 3.6x smaller.
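This description matches the SMaLL-100 checkpoint, which rides on the M2M-100 architecture but ships a custom tokenizer where only the target language is set. A sketch under that assumption, using the alirezamsh/small100 repo id:

```python
from transformers import M2M100ForConditionalGeneration
# tokenization_small100.py must first be downloaded from the model repo
from tokenization_small100 import SMALL100Tokenizer

model = M2M100ForConditionalGeneration.from_pretrained("alirezamsh/small100")
tokenizer = SMALL100Tokenizer.from_pretrained("alirezamsh/small100")

tokenizer.tgt_lang = "fr"  # SMaLL-100 only needs the target language, not the source
encoded = tokenizer("Life is like a box of chocolates.", return_tensors="pt")
generated = model.generate(**encoded)
print(tokenizer.batch_decode(generated, skip_special_tokens=True))
```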
Brief-details: Moebius-style art concept for Stable Diffusion, an MIT-licensed textual inversion embedding that enables generation in Jean Giraud's distinctive sci-fi art style.
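Textual inversion concepts load as an extra token embedding on top of a base Stable Diffusion checkpoint; a diffusers sketch in which the repo ids and trigger token are illustrative:

```python
import torch
from diffusers import StableDiffusionPipeline

# base checkpoint id is illustrative
pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

# attach the learned concept embedding under a trigger token (names illustrative)
pipe.load_textual_inversion("sd-concepts-library/moebius", token="<moebius>")

image = pipe("a desert city at dawn in <moebius> style").images[0]
image.save("moebius_city.png")
```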
Brief-details: CyberAgent's 22B parameter bilingual (Japanese/English) chat model with 16K context window, trained on 2T tokens and fine-tuned for dialogue use cases.
Brief-details: A powerful 7B parameter LLM with a 1M token context window, excelling in math reasoning and long-text comprehension. Reports state-of-the-art benchmark results and advanced tool-use capabilities.
Brief-details: CogVLM2 is a powerful 19.5B parameter vision-language model supporting 8K text length and up to 1344x1344 image resolution, with Chinese/English capabilities.
Brief-details: TURNA is a 1.14B parameter Turkish language model based on the UL2 framework, optimized for text generation and understanding tasks, with 36 layers and 16 attention heads.
Brief-details: 72B parameter LLM trained with DPO on a Qwen base model. Uses LoRA fine-tuning, is MIT-licensed, and is compatible with AMD MI250 GPUs. Focused on text generation tasks.
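A hedged sketch of the DPO-plus-LoRA recipe using TRL and PEFT; the dataset rows, hyperparameters, and base-model repo id are illustrative, and the trainer's keyword names vary across TRL versions:

```python
from datasets import Dataset
from peft import LoraConfig
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

base = "Qwen/Qwen-72B"  # illustrative base-model id; a 72B model needs multi-GPU sharding
model = AutoModelForCausalLM.from_pretrained(base, device_map="auto", trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained(base, trust_remote_code=True)

# preference pairs: a prompt plus a preferred and a rejected completion (toy data)
train = Dataset.from_list([
    {"prompt": "Explain DPO briefly.",
     "chosen": "DPO optimizes a policy directly from preference pairs...",
     "rejected": "DPO is a kind of database."},
])

peft_config = LoraConfig(r=16, lora_alpha=32, task_type="CAUSAL_LM")  # adapters, not full updates
trainer = DPOTrainer(
    model=model,
    args=DPOConfig(output_dir="dpo-out", beta=0.1),  # beta scales the implicit reward margin
    train_dataset=train,
    processing_class=tokenizer,  # named `tokenizer=` in older TRL releases
    peft_config=peft_config,
)
trainer.train()
```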
Brief-details: Crystal is a 7B parameter LLM trained on SlimPajama and StarCoder data, excelling in both natural language and coding tasks with competitive benchmark performance.
Brief-details: A powerful 33B parameter code generation model trained on 2T tokens (87% code, 13% natural language), supporting multiple programming languages with a 16K context window.
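This profile matches DeepSeek-Coder-33B; a generic completion sketch under that assumption (the repo id is inferred, not stated in the summary):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "deepseek-ai/deepseek-coder-33b-base"  # assumed checkpoint for this description
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

prompt = "# Python\ndef quicksort(arr):"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```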
Brief-details: 7B parameter LLaMA-2-based model fine-tuned on 300k+ instructions, optimized for long-form responses and reduced hallucination; MIT licensed.
Brief-details: Korean-optimized LLaMA-2 chat model (7B params) fine-tuned on the KULLM-v2 dataset, offering improved Korean language capabilities.