Brief-details: TinyStories-33M is a 33M-parameter GPT-Neo-based language model trained on the TinyStories dataset of simple synthetic children's stories, designed to generate short, coherent narratives.
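A minimal generation sketch using the transformers library; the model card pairs the checkpoint with the GPT-Neo tokenizer, which is assumed here:

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

# Tokenizer pairing with GPT-Neo follows the model card (assumption
# in case the repo does not ship its own tokenizer files).
tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-neo-125M")
model = AutoModelForCausalLM.from_pretrained("roneneldan/TinyStories-33M")

prompt = "Once upon a time there was"
input_ids = tokenizer(prompt, return_tensors="pt").input_ids
output = model.generate(input_ids, max_new_tokens=60, do_sample=True, top_p=0.9)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```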
Brief-details: A 2.8B parameter language model trained on the RedPajama-1T dataset, offering base text generation capabilities with efficient GPU/CPU inference options.
Brief-details: MPT-1b-RedPajama-200b is a 1.3B parameter decoder-only transformer trained on the RedPajama dataset for 200B tokens, using features such as FlashAttention and ALiBi.
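A hedged loading sketch: MPT checkpoints define custom model code in the repo, so trust_remote_code=True is required; the GPT-NeoX-20B tokenizer pairing follows the MPT card family and is an assumption here:

```python
import transformers

# Custom MosaicML model class lives in the repo, hence trust_remote_code.
model = transformers.AutoModelForCausalLM.from_pretrained(
    "mosaicml/mpt-1b-redpajama-200b", trust_remote_code=True
)
# Tokenizer pairing per the MPT card family (assumption).
tokenizer = transformers.AutoTokenizer.from_pretrained("EleutherAI/gpt-neox-20b")

ids = tokenizer("The capital of France is", return_tensors="pt").input_ids
out = model.generate(ids, max_new_tokens=20)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```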
Brief-details: A quantized 65B parameter Alpaca model available in multiple GGML formats (2-8 bit) for CPU+GPU inference, optimized for efficient local deployment using llama.cpp
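A local-inference sketch via llama-cpp-python; note that recent llama.cpp builds expect GGUF, so GGML files need an older build or conversion. The file name and GPU layer count below are placeholders:

```python
from llama_cpp import Llama

# Model path and n_gpu_layers are placeholders; GGML support requires
# an older llama-cpp-python build (newer releases load GGUF only).
llm = Llama(model_path="./alpaca-65B.ggmlv3.q4_0.bin", n_ctx=2048, n_gpu_layers=40)

prompt = (
    "Below is an instruction that describes a task.\n\n"
    "### Instruction:\nName three primary colors.\n\n### Response:\n"
)
out = llm(prompt, max_tokens=128, stop=["###"])
print(out["choices"][0]["text"])
```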
Brief-details: A Stable Diffusion model fine-tuned on Fred Herzog's photography, optimized for 768px resolution and his signature urban/street photography aesthetic.
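A diffusers sketch at the model's native 768px; the repo id and trigger token below are placeholders, so check the model card for the actual values:

```python
import torch
from diffusers import StableDiffusionPipeline

# Repo id and trigger token are placeholders -- see the model card.
pipe = StableDiffusionPipeline.from_pretrained(
    "your-namespace/fred-herzog-photography", torch_dtype=torch.float16
).to("cuda")

image = pipe(
    "hrrzg style photo of a rainy neon-lit street, Kodachrome colors",
    height=768, width=768, num_inference_steps=30,
).images[0]
image.save("street.png")
```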
Brief-details: AnimeScreencap is a Textual Inversion Embedding model for Stable Diffusion 2.x, specialized in warm, movie-stylized anime environments at 768x768 resolution
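Textual Inversion embeddings can be attached to an SD 2.x pipeline with diffusers' load_textual_inversion; the embedding file name and token below are placeholders:

```python
import torch
from diffusers import StableDiffusionPipeline

# SD 2.1 is the 768px 2.x base; embedding file and token are placeholders.
pipe = StableDiffusionPipeline.from_pretrained(
    "stabilityai/stable-diffusion-2-1", torch_dtype=torch.float16
).to("cuda")
pipe.load_textual_inversion("./AnimeScreencap.pt", token="AnimeScreencap")

image = pipe(
    "AnimeScreencap, a warm sunset over a small coastal town",
    height=768, width=768,
).images[0]
image.save("anime_town.png")
```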
Brief-details: Quantized version of Meta's Llama 3 8B Instruct model, offered in multiple quantization levels (Q2-Q8) for different RAM/performance tradeoffs.
Brief-details: A fine-tuned Stable Diffusion model specialized in generating photorealistic images of Shah Rukh Khan, trained with the DreamBooth technique using the instance prompt "a photo of srkay man".
Brief-details: A comprehensive collection of large language models (70B-120B parameters) quantized to 2-bit precision, sharply reducing storage and memory needs while retaining much of the original models' performance.
Brief-details: Chinese CLIP model using a ViT-B/16 image encoder and a RoBERTa-wwm-base text encoder, trained on 200M Chinese image-text pairs for multimodal understanding and zero-shot classification.
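A zero-shot classification sketch with transformers' ChineseCLIP classes; the repo id matches the ViT-B/16 + RoBERTa-wwm-base variant described above, while the image path and labels are placeholders:

```python
from PIL import Image
from transformers import ChineseCLIPModel, ChineseCLIPProcessor

model = ChineseCLIPModel.from_pretrained("OFA-Sys/chinese-clip-vit-base-patch16")
processor = ChineseCLIPProcessor.from_pretrained("OFA-Sys/chinese-clip-vit-base-patch16")

image = Image.open("photo.jpg")  # placeholder image
labels = ["一只猫", "一只狗", "一辆汽车"]  # "a cat", "a dog", "a car"

inputs = processor(text=labels, images=image, return_tensors="pt", padding=True)
probs = model(**inputs).logits_per_image.softmax(dim=1)
print(dict(zip(labels, probs[0].tolist())))
```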
Brief-details: A powerful 7B parameter code-specialized LLM supporting 92 programming languages with 64K context length, built on the Qwen1.5 architecture for superior code generation and understanding.
Brief-details: IDEFICS2 8B chatty variant - an advanced multimodal model from Hugging Face that processes interleaved image-text sequences for chat-style interactions.
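An interleaved image-text chat sketch following the IDEFICS2 card pattern (AutoProcessor plus AutoModelForVision2Seq); the image source is a placeholder:

```python
import torch
from PIL import Image
from transformers import AutoProcessor, AutoModelForVision2Seq

repo = "HuggingFaceM4/idefics2-8b-chatty"
processor = AutoProcessor.from_pretrained(repo)
model = AutoModelForVision2Seq.from_pretrained(repo, torch_dtype=torch.float16).to("cuda")

image = Image.open("photo.jpg")  # placeholder image
messages = [{
    "role": "user",
    "content": [{"type": "image"}, {"type": "text", "text": "Describe this image."}],
}]
prompt = processor.apply_chat_template(messages, add_generation_prompt=True)
inputs = processor(text=prompt, images=[image], return_tensors="pt").to("cuda")

out = model.generate(**inputs, max_new_tokens=128)
print(processor.batch_decode(out, skip_special_tokens=True)[0])
```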
Brief-details: Microsoft's SpeechT5 voice conversion model - a Transformer-based unified speech/text framework for converting speech between voices, trained on the CMU ARCTIC dataset.
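A voice-conversion sketch with transformers' SpeechT5 classes; the silent waveform and random speaker embedding are stand-ins, since real usage feeds 16 kHz source speech and a 512-dim x-vector for the target voice:

```python
import numpy as np
import torch
from transformers import SpeechT5Processor, SpeechT5ForSpeechToSpeech, SpeechT5HifiGan

processor = SpeechT5Processor.from_pretrained("microsoft/speecht5_vc")
model = SpeechT5ForSpeechToSpeech.from_pretrained("microsoft/speecht5_vc")
vocoder = SpeechT5HifiGan.from_pretrained("microsoft/speecht5_hifigan")

# Placeholder: 1 second of silence; real input is 16 kHz mono source speech.
waveform = np.zeros(16000, dtype=np.float32)
inputs = processor(audio=waveform, sampling_rate=16000, return_tensors="pt")

# Stand-in target-voice embedding; real usage loads a 512-dim x-vector.
speaker_embeddings = torch.randn((1, 512))

speech = model.generate_speech(
    inputs["input_values"], speaker_embeddings, vocoder=vocoder
)
```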
Brief-details: SomethingV2 is an anime-focused text-to-image diffusion model tuned for vibrant yet soft anime-style images, featuring a built-in VAE and recommended settings for high-quality outputs.
Brief-details: A specialized 11B parameter LLaMA-based model fine-tuned for creative text generation, optimized for long-form narrative content with enhanced vocabulary and genre diversity.
Brief-details: Qwen1.5-110B: advanced 111B parameter language model with 32K context length, part of the Qwen2 beta (Qwen1.5) series. Supports multilingual tasks and improved chat capabilities.
Brief-details: Medical-focused 8B parameter Llama 3-based model fine-tuned for healthcare Q&A. Optimized for clinical discussions, with BF16 precision and medical domain expertise.
Brief-details: Mamba-2.8b-hf is a 2.77B parameter language model built on the Mamba selective state-space architecture, designed for efficient text generation and inference.
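A generation sketch; native Mamba support landed in transformers 4.39, so AutoModelForCausalLM resolves the architecture directly:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Requires transformers >= 4.39 for native Mamba support.
tokenizer = AutoTokenizer.from_pretrained("state-spaces/mamba-2.8b-hf")
model = AutoModelForCausalLM.from_pretrained("state-spaces/mamba-2.8b-hf")

ids = tokenizer("The key advantage of state-space models is", return_tensors="pt").input_ids
out = model.generate(ids, max_new_tokens=30)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```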
Brief-details: Bonito-v1 is a specialized text-to-text generation model based on Mistral-7B, designed for creating synthetic instruction tuning datasets from unannotated text.
Brief-details: MADLAD-400-10B-MT: a powerful 10.7B parameter multilingual translation model supporting 419 languages, based on the T5 architecture with state-of-the-art performance.
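A translation sketch following the MADLAD card convention of a <2xx> target-language prefix on the source text:

```python
from transformers import T5ForConditionalGeneration, T5Tokenizer

model = T5ForConditionalGeneration.from_pretrained(
    "google/madlad400-10b-mt", device_map="auto"
)
tokenizer = T5Tokenizer.from_pretrained("google/madlad400-10b-mt")

# The target language is selected with a <2xx> prefix, e.g. <2de> for German.
ids = tokenizer("<2de> How are you today?", return_tensors="pt").input_ids.to(model.device)
out = model.generate(ids, max_new_tokens=40)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```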
Brief-details: Gorilla OpenFunctions v1 is an advanced LLM that converts natural language into executable API calls, supporting parallel functions and multiple function selection.
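A hedged sketch of the call pattern: function schemas are passed as JSON alongside the user query, and the model emits an executable call. The prompt tags below follow the general shape shown for OpenFunctions but should be verified against the model card:

```python
import json
from transformers import pipeline

# Prompt tags are an assumption based on the OpenFunctions card's
# general shape; verify against the card before relying on them.
def get_prompt(query: str, functions: list) -> str:
    return (
        f"USER: <<question>> {query} "
        f"<<function>> {json.dumps(functions)}\nASSISTANT: "
    )

functions = [{
    "name": "get_current_weather",
    "parameters": {"location": {"type": "string"}},
}]
generator = pipeline("text-generation", model="gorilla-llm/gorilla-openfunctions-v1")
print(generator(get_prompt("What's the weather in Boston?", functions), max_new_tokens=64))
```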