Brief-details: Advanced multimodal LLM series with models ranging from 1B to 78B parameters, combining vision and language capabilities with state-of-the-art pre-training strategies
Brief-details: A sophisticated 8B-parameter Llama-3.1-based merged model combining 14 specialized models, optimized for diverse tasks using the model stock merge method.
Brief Details: Sentence transformer model based on facebook/drama-base with 768-dim output, optimized for semantic similarity and search tasks; fine-tuned on the STS-B dataset.
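A minimal usage sketch for the entry above with the sentence-transformers library; the repo ID below is a placeholder, since the entry does not name the exact fine-tuned checkpoint:

```python
from sentence_transformers import SentenceTransformer, util

# Placeholder repo ID -- substitute the actual drama-base STS-B fine-tune.
model = SentenceTransformer("your-namespace/drama-base-stsb")

sentences = [
    "A man is playing a guitar.",
    "Someone is strumming an instrument.",
]
embeddings = model.encode(sentences)               # shape: (2, 768)
print(util.cos_sim(embeddings[0], embeddings[1]))  # cosine similarity score
```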
BRIEF-DETAILS: 8B-parameter LLaMA-based model with various GGUF quantization options (2.1GB-6.7GB), balancing efficiency and output quality
Brief-details: A quantized version of Babel-9B-Chat offering multiple GGUF variants for efficient deployment, with file sizes ranging from 3.6GB to 18.1GB and recommended options for balanced performance.
BRIEF-DETAILS: 8B parameter GGUF-quantized LLaMA model with multiple compression variants optimized for efficiency (3.3GB-16.2GB), featuring Q2 to Q8 quantization options
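As a rough sketch of how GGUF quants like the ones above are typically run locally with llama-cpp-python (the file name and settings are placeholders, assuming a downloaded Q4_K_M variant):

```python
from llama_cpp import Llama

llm = Llama(
    model_path="./model-Q4_K_M.gguf",  # placeholder path to the downloaded quant file
    n_ctx=4096,                        # context window size
    n_gpu_layers=-1,                   # offload all layers to the GPU if one is available
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Explain GGUF quantization in one sentence."}]
)
print(out["choices"][0]["message"]["content"])
```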
Brief-details: A LoRA adapter for Llama-3.1-70B-Instruct, trained on specific datasets across 8xA100 GPUs with FSDP, using BF16 mixed precision and a 4e-4 learning rate.
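A minimal sketch of attaching such an adapter to its base model with PEFT; the adapter repo ID is a placeholder, since the entry does not name it:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_id = "meta-llama/Llama-3.1-70B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(base_id)
base = AutoModelForCausalLM.from_pretrained(
    base_id, torch_dtype=torch.bfloat16, device_map="auto"
)

# Load the LoRA weights on top of the frozen base model (placeholder adapter ID).
model = PeftModel.from_pretrained(base, "your-namespace/llama-3.1-70b-instruct-lora")
```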
BRIEF DETAILS: Salesforce's research-focused mixture-of-experts (MoE) model with a small parameter footprint, designed for academic purposes and released with ethical usage guidelines.
Brief-details: llama.cpp imatrix-quantized versions of the DeepScaleR-1.5B model at various compression levels (0.77GB-7.11GB), suited to different hardware configurations
Brief Details: T5-small model fine-tuned on the CommonGen dataset to generate coherent sentences from sets of concepts, optimized for natural language generation tasks.
Brief-details: Meta's 8B-parameter Llama 3.1 model optimized with Unsloth's 4-bit quantization for efficient inference. Supports multilingual text generation and tool use with a 128K-token context window.
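A minimal loading sketch with Unsloth's FastLanguageModel; the repo ID follows Unsloth's usual naming for its 4-bit builds and should be verified on the Hub:

```python
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit",  # assumed repo ID
    max_seq_length=8192,   # can be raised toward the 128K context limit
    load_in_4bit=True,
)
FastLanguageModel.for_inference(model)  # enable Unsloth's faster inference path
```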
BRIEF-DETAILS: Korean-optimized sentence embedding model based on E5-large, offering 1024-dimensional vectors for semantic analysis with strong performance on Korean language tasks
BRIEF-DETAILS: Polish-to-Spanish neural machine translation model based on the transformer architecture, achieving a BLEU score of 46.9 and chrF of 0.654.
Brief Details: A GGUF-quantized version of TinyR1-32B offering various compression levels from 12.4GB to 34.9GB, with the Q4_K variants recommended for the best balance of size and quality.
Brief Details: Emotion classification model built on DistilBERT, fine-tuned for 32 emotion classes from the EmpatheticDialogues dataset, extending earlier work on the GoEmotions dataset.
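A minimal sketch of running such a classifier through the transformers pipeline API; the repo ID is a placeholder for the fine-tuned checkpoint:

```python
from transformers import pipeline

classifier = pipeline(
    "text-classification",
    model="your-namespace/distilbert-empathetic-emotions",  # placeholder repo ID
    top_k=3,  # return the three highest-scoring of the 32 emotion labels
)
print(classifier("I can't believe I finally got the job!"))
```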
Brief-details: Helsinki-NLP's English-to-Afrikaans neural MT model achieving a BLEU score of 56.1, based on the transformer-align architecture and trained on the OPUS dataset.
Brief Details: YuE-s2-1B-general is a 1B parameter music generation model capable of transforming lyrics into complete songs with vocals and accompaniment, supporting multiple languages and genres.
Brief-details: Polish-to-French neural machine translation model by Helsinki-NLP, achieving a 49.0 BLEU score on the Tatoeba test set with a transformer architecture.
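A minimal translation sketch for the entry above via the transformers pipeline; the repo ID is assumed from Helsinki-NLP's opus-mt naming convention and should be verified on the Hub:

```python
from transformers import pipeline

# Assumed repo ID based on the Helsinki-NLP opus-mt naming scheme.
translator = pipeline("translation", model="Helsinki-NLP/opus-mt-pl-fr")
print(translator("Dziękuję za pomoc.")[0]["translation_text"])
```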
Brief-details: Vicuna-7B is an open-source LLaMA-based chatbot trained on 70K ShareGPT conversations, released for research purposes without ethical filtering.
BRIEF-DETAILS: Lora by naonovn: A specialized LoRA adapter designed to extend Stable Diffusion, associated with the ChilloutMix ecosystem.
Brief Details: AIChan_Model is a freely shared AI model by GIMG, hosted on Hugging Face. Documentation is limited, but the model is accessible for community use.