Brief-details: M3E-large is a 340M-parameter Chinese-English text embedding model trained on 22M+ sentence pairs, optimized for text similarity and retrieval tasks. It reports state-of-the-art performance on Chinese NLP benchmarks.
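Since M3E is published as a sentence-transformers model, a minimal similarity sketch looks like this, assuming the usual moka-ai/m3e-large repo id:

```python
# Minimal sketch of text-similarity scoring with M3E-large via
# sentence-transformers, assuming the moka-ai/m3e-large repo id.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("moka-ai/m3e-large")
sentences = [
    "今天天气真好",            # "The weather is great today"
    "The weather is great today",
    "股票市场大幅下跌",        # "The stock market fell sharply"
]
embeddings = model.encode(sentences, normalize_embeddings=True)

# Cosine similarity of the first sentence against the other two.
print(util.cos_sim(embeddings[0], embeddings[1:]))
```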
Brief-details: Vicuna-7B v1.1 is a fine-tuned LLaMA variant trained on 70K ShareGPT conversations, offering strong chat capabilities for research use.
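A minimal generation sketch, assuming the lmsys/vicuna-7b-v1.1 repo id; v1.1 expects the plain USER/ASSISTANT prompt template shown below:

```python
# Minimal sketch for chatting with Vicuna-7B v1.1 via transformers,
# assuming the lmsys/vicuna-7b-v1.1 repo id.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "lmsys/vicuna-7b-v1.1"
tokenizer = AutoTokenizer.from_pretrained(repo)
model = AutoModelForCausalLM.from_pretrained(repo, torch_dtype=torch.float16, device_map="auto")

# Vicuna v1.1 was trained with a plain USER:/ASSISTANT: template.
prompt = (
    "A chat between a curious user and an artificial intelligence assistant. "
    "USER: What are Vicuna-style models typically used for? ASSISTANT:"
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```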
Brief-details: MPT-30B-Chat is a 30B-parameter chatbot model by MosaicML, fine-tuned on diverse datasets with an 8K-token context window and FlashAttention support.
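MPT ships custom modeling code, so loading requires trust_remote_code; a minimal sketch using the ChatML format the chat variant was tuned on:

```python
# Minimal sketch for MPT-30B-Chat; MPT uses custom modeling code, so
# trust_remote_code=True is required.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "mosaicml/mpt-30b-chat"
tokenizer = AutoTokenizer.from_pretrained(repo)
model = AutoModelForCausalLM.from_pretrained(
    repo, torch_dtype=torch.bfloat16, trust_remote_code=True, device_map="auto"
)

# The chat variant was fine-tuned on ChatML-formatted conversations.
prompt = (
    "<|im_start|>user\nSummarize FlashAttention in one sentence.<|im_end|>\n"
    "<|im_start|>assistant\n"
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=80)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```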
Brief-details: CodeLlama-70b-Instruct-hf is Meta's 70B-parameter instruction-tuned code generation model, optimized for code completion and chat interactions.
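The 70B instruct variant uses its own chat format, which differs from the smaller CodeLlama models; the repo's bundled chat template handles it:

```python
# Minimal sketch for codellama/CodeLlama-70b-Instruct-hf; apply_chat_template
# builds the 70B-specific instruction format.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "codellama/CodeLlama-70b-Instruct-hf"
tokenizer = AutoTokenizer.from_pretrained(repo)
model = AutoModelForCausalLM.from_pretrained(repo, torch_dtype=torch.float16, device_map="auto")

messages = [
    {"role": "user", "content": "Write a Python function that checks whether a string is a palindrome."}
]
inputs = tokenizer.apply_chat_template(messages, return_tensors="pt").to(model.device)
output = model.generate(inputs, max_new_tokens=128)
# Decode only the newly generated tokens.
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```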
Brief-details: A specialized 8B-parameter Llama 3 model fine-tuned for web navigation, achieving 18% better performance than GPT-4V on WebLINX benchmark tasks.
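Assuming this refers to McGill-NLP/Llama-3-8B-Web (the description matches that model), a minimal load sketch; real use feeds WebLINX-style browsing context (DOM snapshot, action history) into the prompt:

```python
# Minimal sketch, assuming the McGill-NLP/Llama-3-8B-Web repo id; the prompt
# below is only illustrative -- the model expects WebLINX-formatted context.
from transformers import pipeline

agent = pipeline("text-generation", model="McGill-NLP/Llama-3-8B-Web", device_map="auto")
out = agent("Instruction: click the search button.\nAction:", max_new_tokens=32)
print(out[0]["generated_text"])
```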
Brief-details: Geneformer is a transformer-based model for genomics with 38M parameters, trained on 30M+ single-cell transcriptomes for biological network analysis and gene prediction.
Brief-details: A 10.7B-parameter LLM built on the SOLAR architecture, fine-tuned on 1M primarily GPT-4-generated entries and optimized for instruction following and chat. Strong benchmark performance and ChatML prompt-format support.
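For reference, ChatML wraps each turn in <|im_start|>/<|im_end|> markers; a small sketch of building such a prompt (the helper name is ours, not part of any model repo):

```python
# Minimal sketch of the ChatML prompt layout; chatml() is a hypothetical
# helper, not part of the model's code.
def chatml(system: str, user: str) -> str:
    return (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        f"<|im_start|>user\n{user}<|im_end|>\n"
        "<|im_start|>assistant\n"
    )

print(chatml("You are a helpful assistant.", "Explain LoRA fine-tuning in two sentences."))
```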
Brief-details: An advanced multimodal image generation model combining the FLUX architecture with Qwen2VL's vision-language capabilities for high-quality image generation and manipulation across multiple modes.
Brief-details: A specialized Stable Diffusion model for creating knolling-style technical diagrams and isometric displays with OLED interface aesthetics.
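A minimal diffusers sketch; the repo id and the "knolling" trigger word below are placeholders, so substitute the actual values from the model card:

```python
# Minimal diffusers sketch; "path/to/knolling-model" and the "knolling"
# trigger word are hypothetical placeholders.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "path/to/knolling-model", torch_dtype=torch.float16
).to("cuda")

image = pipe("knolling, disassembled mechanical keyboard, isometric, OLED interface").images[0]
image.save("knolling.png")
```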
Brief-details: A high-quality multilingual text-to-speech model supporting various English accents (US, UK, Indian, Australian) with real-time CPU inference capabilities.
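The accent list matches MeloTTS; assuming that is the model meant here, a minimal sketch using its published API:

```python
# Minimal sketch, assuming this refers to MeloTTS and its documented API;
# speaker keys such as "EN-US" come from the model's speaker table.
from melo.api import TTS

model = TTS(language="EN", device="cpu")  # real-time inference on CPU
speaker_ids = model.hps.data.spk2id
model.tts_to_file("Hello from a CPU-friendly TTS model.", speaker_ids["EN-US"], "en-us.wav", speed=1.0)
```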
Brief-details: A compact 3B-parameter multimodal LLM combining Phi-2 with the SigLIP visual encoder, achieving performance comparable to 7B models on visual tasks.
Brief-details: Phi-1 is Microsoft's 1.3B-parameter specialized Python coding model. Trained on curated code datasets, it achieves 50%+ HumanEval accuracy. MIT licensed.
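Phi-1 is a plain completion model, so the usual pattern is to prompt it with a signature and docstring; a minimal sketch using the microsoft/phi-1 repo:

```python
# Minimal sketch for code completion with microsoft/phi-1.
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "microsoft/phi-1"
tokenizer = AutoTokenizer.from_pretrained(repo)
model = AutoModelForCausalLM.from_pretrained(repo)

# Phi-1 completes code: give it a function signature plus a docstring.
prompt = 'def is_prime(n: int) -> bool:\n    """Return True if n is a prime number."""\n'
inputs = tokenizer(prompt, return_tensors="pt")
output = model.generate(**inputs, max_new_tokens=96)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```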
Brief-details: A Japanese language model with 6.8B parameters, built by CyberAgent. Specialized in Japanese text generation with robust performance (perplexity 8.2).
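The description matches CyberAgent's OpenCALM-7B; assuming that repo id, a minimal generation sketch:

```python
# Minimal sketch, assuming the cyberagent/open-calm-7b repo id (6.8B params).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "cyberagent/open-calm-7b"
tokenizer = AutoTokenizer.from_pretrained(repo)
model = AutoModelForCausalLM.from_pretrained(repo, torch_dtype=torch.float16, device_map="auto")

inputs = tokenizer("日本の首都は", return_tensors="pt").to(model.device)  # "The capital of Japan is"
output = model.generate(**inputs, max_new_tokens=32, do_sample=True, temperature=0.7)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```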
Brief-details: An RVC voice-conversion pack for Genshin Impact characters, containing 62 Japanese voice models. MIT licensed, specialized for audio-to-audio conversion.
Brief-details: VisualGLM-6B is a multimodal model supporting Chinese/English vision-language tasks, combining the 6.2B-parameter ChatGLM-6B with a BLIP2-Qformer visual bridge.
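The model card exposes a simple chat API over images; a minimal sketch (the image path is a placeholder):

```python
# Minimal sketch of VisualGLM-6B's chat API; the model uses custom code,
# so trust_remote_code=True is required. "example.jpg" is a placeholder.
from transformers import AutoModel, AutoTokenizer

repo = "THUDM/visualglm-6b"
tokenizer = AutoTokenizer.from_pretrained(repo, trust_remote_code=True)
model = AutoModel.from_pretrained(repo, trust_remote_code=True).half().cuda()

# chat() takes an image path plus a text query and returns the reply.
response, history = model.chat(tokenizer, "example.jpg", "Describe this image.", history=[])
print(response)
```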
Brief-details: A 4-bit quantized version of Vicuna-13B v1.1, optimized for efficient deployment while maintaining high performance in conversational AI tasks.
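A minimal AutoGPTQ loading sketch; the repo id below is an assumption (TheBloke's GPTQ repackaging is the common source for this model), so substitute the actual id:

```python
# Minimal sketch, assuming a TheBloke-style GPTQ repo id for Vicuna-13B v1.1;
# requires the auto-gptq package.
from auto_gptq import AutoGPTQForCausalLM
from transformers import AutoTokenizer

repo = "TheBloke/vicuna-13B-1.1-GPTQ-4bit-128g"  # assumed repo id
tokenizer = AutoTokenizer.from_pretrained(repo)
model = AutoGPTQForCausalLM.from_quantized(repo, device="cuda:0")

prompt = "USER: Give me three uses for a paperclip. ASSISTANT:"
inputs = tokenizer(prompt, return_tensors="pt").to("cuda:0")
output = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```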
Brief-details: FLUX.1-dev-Controlnet-Inpainting-Alpha is an advanced inpainting model built on FLUX.1-dev, optimized for 768x768 resolution with strong controllability; released under a non-commercial license.
Brief-details: A 13B-parameter uncensored language model based on Wizard-Vicuna, provided in float16 format for efficient GPU inference and deployment.
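Loading a float16 checkpoint is the standard transformers pattern; the repo id below is an assumption based on the description:

```python
# Minimal float16 loading sketch; the repo id is an assumption -- the
# description matches TheBloke/Wizard-Vicuna-13B-Uncensored-HF.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "TheBloke/Wizard-Vicuna-13B-Uncensored-HF"
tokenizer = AutoTokenizer.from_pretrained(repo)
model = AutoModelForCausalLM.from_pretrained(repo, torch_dtype=torch.float16, device_map="auto")

inputs = tokenizer("USER: Explain float16 inference briefly. ASSISTANT:", return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=64)[0], skip_special_tokens=True))
```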
Brief-details: GPT4all-lora is an autoregressive transformer built on the LLaMA architecture, fine-tuned for 4 epochs on data curated with Atlas.
Brief-details: A powerful 7B-parameter language model built on Mistral-7B-v0.2, featuring enhanced instruction following, coding capabilities, and a 32k context window. Uncensored and Apache 2.0 licensed.
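A minimal chat sketch; the repo id below is an assumption (the description matches cognitivecomputations/dolphin-2.8-mistral-7b-v02), so substitute the actual id:

```python
# Minimal sketch; the repo id is an assumption based on the description.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "cognitivecomputations/dolphin-2.8-mistral-7b-v02"
tokenizer = AutoTokenizer.from_pretrained(repo)
model = AutoModelForCausalLM.from_pretrained(repo, torch_dtype=torch.bfloat16, device_map="auto")

messages = [{"role": "user", "content": "Write a bash one-liner that counts unique IPs in access.log."}]
inputs = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt").to(model.device)
output = model.generate(inputs, max_new_tokens=96)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```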
Brief-details: RWKV-4 World is a multilingual large language model supporting 12 languages, trained on diverse datasets including the Pile and RedPajama. Features specialized tokenization and flexible deployment options.
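World-series checkpoints use a dedicated world vocabulary rather than a GPT-style BPE; a minimal sketch with the rwkv pip package (the checkpoint filename is a placeholder for a file downloaded from the repo):

```python
# Minimal sketch with the `rwkv` pip package; "RWKV-4-World-7B.pth" is a
# placeholder filename -- download a World checkpoint from the repo first.
from rwkv.model import RWKV
from rwkv.utils import PIPELINE

model = RWKV(model="RWKV-4-World-7B.pth", strategy="cpu fp32")
# World models ship a dedicated tokenizer vocabulary.
pipeline = PIPELINE(model, "rwkv_vocab_v20230424")
print(pipeline.generate("The capital of France is", token_count=32))
```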