Brief-details: A minimal LLaMA-based causal language model built specifically for TRL library unit testing, kept deliberately lightweight.
Brief-details: A minimal OPT-based causal language model built specifically for TRL library testing, with a stripped-down architecture suited to fast test runs.
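Tiny test models like the two above are typically just the standard architecture instantiated with a very small config. A minimal sketch (assuming `transformers` is installed; the specific config values are illustrative, not taken from either model card):

```python
from transformers import LlamaConfig, LlamaForCausalLM

# Hypothetical tiny config: small enough to instantiate in milliseconds,
# which is the whole point of a unit-testing model.
config = LlamaConfig(
    vocab_size=1024,
    hidden_size=16,
    intermediate_size=32,
    num_hidden_layers=2,
    num_attention_heads=4,
    num_key_value_heads=2,
)
model = LlamaForCausalLM(config)
n_params = sum(p.numel() for p in model.parameters())
print(f"{n_params} parameters")  # well under a million
```

The same pattern works for OPT or Cohere variants by swapping in the matching config and model classes.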
Brief-details: Unsloth's 4-bit quantized build of the Llama 3.2 1B Instruct model, offering 2.4× faster performance and 58% lower memory use via dynamic quantization.
Brief-details: A minimalist CohereForCausalLM model built specifically for TRL library testing, providing lightweight, basic causal language modeling.
Brief-details: A FLAN-T5-large model fine-tuned on the WIQA procedural-reasoning dataset to identify the final step of a described process.
Brief-details: NVIDIA's 14B-parameter text-to-world generation model, leveraging diffusion techniques for advanced world modeling and generation.
Brief-details: Ya3_xt is an experimental model by digiplay combining the Ya3 architecture with xtremixUltimateMerge_v1.5; currently in its testing phase.
Brief-details: BioCLIP is a CLIP-based foundation model for biological classification, trained on 450K+ taxa and achieving a 16–17% improvement over baselines in species identification.
Brief-details: A 2.8B-parameter Mamba-architecture model fine-tuned with DPO on UltraFeedback data, achieving strong preference alignment (78.57% accuracy).
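The DPO objective used in fine-tuning like this can be written in a few lines. A minimal sketch in plain PyTorch (the function name and toy values are illustrative; real training, e.g. with TRL's `DPOTrainer`, computes per-sequence log-probs from the policy and a frozen reference model):

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             ref_chosen_logps, ref_rejected_logps, beta=0.1):
    # DPO pushes the policy to prefer the chosen response over the
    # rejected one, measured relative to a frozen reference model.
    logits = beta * ((policy_chosen_logps - policy_rejected_logps)
                     - (ref_chosen_logps - ref_rejected_logps))
    return -F.logsigmoid(logits).mean()

# Toy log-probs: the policy slightly prefers the chosen response,
# the reference is neutral, so the loss dips below log(2).
loss = dpo_loss(torch.tensor([-1.0]), torch.tensor([-2.0]),
                torch.tensor([-1.5]), torch.tensor([-1.5]))
```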
Brief-details: A high-performance vision embedding model that shares an embedding space with nomic-embed-text-v1.5, scoring 71.0% on zero-shot ImageNet and 56.8% on Datacomp; optimized for multimodal applications.
Brief-details: A 4-bit quantized version of the Pygmalion-13B language model, quantized with GPTQ at group size 128. NSFW-capable; not suitable for minors.
Brief-details: A specialized so-vits-svc-4.0 voice conversion model trained on MLP:FiM audio clips, focused on preserving and recreating pony character voices.
Brief-details: OpenChat 3.6 8B is a state-of-the-art open-source 8B-parameter LLM that outperforms Llama-3-8B-Instruct, optimized for coding and general tasks with an 8K context window.
Brief-details: A Japanese language model fine-tuned from LLM-JP 13B using the Unsloth and TRL libraries, optimized for faster training, with a context length of 888 tokens.
Brief-details: ColPali is a PaliGemma-3B-based visual retrieval model that applies a ColBERT-style late-interaction strategy for efficient document indexing, combining SigLIP vision features with language-model capabilities.
Brief-details: An uncensored 32B-parameter variant of the QwQ model, created with the abliteration technique to remove refusal behaviors. Available via Ollama.
Brief-details: A LoRA model fine-tuned for meme image generation on the FLUX.1-dev base model, optimized for 768×1024 resolution with specialized meme-style outputs.
Brief-details: SVDQuant-based INT4-quantized image generation model achieving 3.6× memory reduction and 8.7× speedup over 16-bit models, optimized for NVIDIA GPUs.
Brief-details: A multilingual Indian language model (3B parameters) supporting 11 Indic languages, optimized for translation, summarization, and conversational AI. Released under a non-commercial license.
Brief-details: WhisperKit Pro is the commercial version of WhisperKit, offering advanced speech recognition capabilities. Early access available through waitlist registration.
Brief-details: A LoRA model trained on Indo-realistic images for the FLUX.1-dev base model, with network dimension 64 and alpha 32, specialized for portrait generation.
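"Network dimension 64, alpha 32" refers to the rank and scaling factor of the low-rank update LoRA adds to each adapted weight. A minimal sketch of one LoRA-adapted linear layer in PyTorch (the class and its initialization are illustrative, not the actual adapter implementation used by this model):

```python
import torch

class LoRALinear(torch.nn.Module):
    # Frozen base weight plus a trainable low-rank update, scaled by
    # alpha / rank (here 32 / 64 = 0.5, matching the card above).
    def __init__(self, in_features, out_features, rank=64, alpha=32):
        super().__init__()
        self.base = torch.nn.Linear(in_features, out_features, bias=False)
        self.base.weight.requires_grad_(False)  # base stays frozen
        self.lora_a = torch.nn.Parameter(torch.randn(rank, in_features) * 0.01)
        self.lora_b = torch.nn.Parameter(torch.zeros(out_features, rank))
        self.scale = alpha / rank

    def forward(self, x):
        # B starts at zero, so the adapter initially leaves the base
        # model's behavior unchanged.
        return self.base(x) + self.scale * (x @ self.lora_a.T @ self.lora_b.T)

layer = LoRALinear(128, 128)
x = torch.randn(2, 128)
```

Only `lora_a` and `lora_b` are trained, which is why LoRA checkpoints are a small fraction of the base model's size.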