Brief-details: A test implementation of Llama-3-8B fine-tuned on code tasks, achieving 63% pass@1 on HumanEval. Features low-VRAM training using Unsloth + QLoRA + GaLore optimizations.
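As background for a pass@1 figure like the one above, here is a small sketch of the unbiased pass@k estimator commonly used for HumanEval scoring; the counts passed in are illustrative, not this model's actual generation logs.

```python
from math import comb

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k: probability that at least one of k samples,
    drawn without replacement from n generations of which c pass the
    unit tests, is correct."""
    if n - c < k:  # every possible draw contains a passing sample
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)

# With k=1 this reduces to the plain pass rate c/n,
# e.g. 63 passing samples out of 100 gives pass@1 ~ 0.63.
print(pass_at_k(100, 63, 1))
```

Benchmarks usually report the mean of this estimator across all tasks in the suite.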
Brief-details: Laserxtral is a 24.2B parameter MoE model combining 4x7B models with laser denoising, offering Mixtral-level performance at half the size.
Brief-details: ALMA-13B-R: Advanced 13B parameter language model fine-tuned with Contrastive Preference Optimization for state-of-the-art machine translation performance.
Brief-details: A fine-tuned Mistral-7B model achieving impressive TruthfulQA benchmark scores, trained on just 100 data points in 3 minutes using the QLoRA technique.
Brief-details: Stable Diffusion-based text-to-image model trained for high-quality generation with simple prompts. Features 1.07B parameters and "estilovintedois" style prefix.
Brief-details: A 1.3B parameter decoder-only transformer pre-trained on the RedPajama dataset and fine-tuned on Databricks Dolly, optimized with FlashAttention and ALiBi.
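The ALiBi position bias mentioned above can be sketched in a few lines: instead of positional embeddings, each attention head adds a fixed linear distance penalty to its causal logits. This is a minimal illustration assuming a power-of-two head count, not this model's actual implementation.

```python
def alibi_slopes(num_heads: int) -> list[float]:
    """Per-head slopes 2^(-8/n), 2^(-16/n), ...; assumes num_heads is a power of two."""
    start = 2.0 ** (-8.0 / num_heads)
    return [start ** (i + 1) for i in range(num_heads)]

def alibi_bias(seq_len: int, slope: float) -> list[list[float]]:
    """Causal bias matrix: query i penalises key j by slope * (i - j);
    future positions are masked to -inf."""
    return [[-slope * (i - j) if j <= i else float("-inf")
             for j in range(seq_len)]
            for i in range(seq_len)]
```

The bias is simply added to the attention logits before softmax, which is why ALiBi extrapolates to sequence lengths longer than those seen in training.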
Brief-details: Core ML optimized version of Stable Diffusion v2 for Apple Silicon, offering efficient text-to-image generation with both Swift and Python inference options.
Brief-details: A specialized LoRA model collection focused on Genshin Impact and anime characters, offering high-quality character generation with multiple clothing variations and detailed style control.
Brief-details: An experimental AI model combining detailed backgrounds with anime-style characters through U-Net hierarchical merging. Optimized for dual-style generation.
Brief-details: Anime-style diffusion model trained on Hitokomoru's artwork, featuring 20k training steps on 255 images with specialized aspect ratio bucketing for Japanese-style art generation.
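The aspect ratio bucketing mentioned above can be sketched as follows: each training image is batched at the bucket resolution whose aspect ratio is closest to its own, minimising cropping. The bucket list here is hypothetical; real trainers derive buckets from a target pixel budget.

```python
# Hypothetical (width, height) buckets around a 512px-class pixel budget.
BUCKETS = [(512, 768), (576, 704), (640, 640), (704, 576), (768, 512)]

def nearest_bucket(width: int, height: int,
                   buckets: list[tuple[int, int]] = BUCKETS) -> tuple[int, int]:
    """Pick the bucket whose aspect ratio best matches the source image."""
    ar = width / height
    return min(buckets, key=lambda wh: abs(wh[0] / wh[1] - ar))
```

For example, a 600x900 portrait photo lands in the 512x768 bucket rather than being center-cropped to a square.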
Brief-details: 12B parameter Mistral-based conversational AI model focused on creative text generation with NSFW capabilities. Built on Mistral-Nemo-Instruct-2407.
Brief-details: Japanese GPT-2 medium-sized language model (361M params) trained on CC-100 and Wikipedia, optimized for Japanese text generation and language modeling.
Brief-details: 70B parameter LLM merging Hermes 2 Pro and Llama-3, featuring enhanced function calling, JSON outputs, and ChatML support. Strong benchmark scores.
Brief-details: Advanced text-to-image transformer model capable of generating high-resolution images up to 4K. Features transformer-based latent diffusion and supports multiple image sizes.
Brief-details: PCM_Weights is a specialized LoRA weight package for Stable Diffusion XL, enabling fast text-to-image generation with phased consistency and supporting multiple inference-step settings.
Brief-details: T5-based multilingual translation model (851M params) supporting bidirectional translation between English, Russian, and Chinese with Apache 2.0 license.
Brief-details: InternLM-XComposer2-VL-7B is a vision-language model built on InternLM2, enabling advanced text-image comprehension and generation with PyTorch integration.
Brief-details: An 8-expert Mixture of Experts model based on Gemma, combining 8 separately fine-tuned models with 2 experts active per token for enhanced text generation capabilities.
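The 2-experts-per-token routing described above can be sketched as a softmax over per-expert gate logits, keeping the top two and renormalising their weights. This is a toy illustration in pure Python, not the model's actual gating code.

```python
import math

def top2_route(gate_logits: list[float]) -> list[tuple[int, float]]:
    """Return the indices and renormalised weights of the two experts
    with the highest gate probability for one token."""
    m = max(gate_logits)
    exps = [math.exp(x - m) for x in gate_logits]  # stable softmax
    total = sum(exps)
    probs = [e / total for e in exps]
    top2 = sorted(range(len(probs)), key=probs.__getitem__, reverse=True)[:2]
    z = probs[top2[0]] + probs[top2[1]]
    return [(i, probs[i] / z) for i in top2]
```

The token's output is then the weighted sum of just those two experts' outputs, so only 2 of the 8 expert FFNs run per token.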
Brief-details: Fast single-view 3D reconstruction model using hybrid Triplane-Gaussian representation, processes images in seconds with transformer architecture. Apache 2.0 licensed.
Brief-details: Dobb-E is a robotics-focused vision model with 21.3M parameters, trained on home environments for robot navigation and interaction. MIT licensed.
Brief-details: A 12.2B parameter multilingual chat model fine-tuned on Mistral-Nemo-Base, optimized for Claude 3-like prose quality with support for 9 languages.