Brief-details: Vicuna 13B unfiltered language model trained on the V2023.05.02v0 dataset, featuring unrestricted responses and multiple quantization formats for deployment flexibility.
Brief-details: Vistral-7B-Chat is a 7B-parameter Vietnamese chat model from Viet-Mistral, aligned for safe and ethical interactions, including explicit guidance to refuse topics such as harmful human experimentation.
Brief-details: Function-calling enabled Llama 2 7B model optimized for structured API interactions. Supports multiple function calls and argument types with improved v2 syntax.
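A minimal sketch of driving a function-calling checkpoint like this: the function schema is serialized into the prompt and the model is expected to answer with a JSON call. The repo id, the `<FUNCTIONS>` delimiter, and the schema keys below are assumptions for illustration, not the model's documented v2 template.

```python
import json
from transformers import AutoModelForCausalLM, AutoTokenizer

# Repo id is an assumption; point this at the actual function-calling checkpoint.
MODEL_ID = "Trelis/Llama-2-7b-chat-hf-function-calling-v2"

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID, device_map="auto")

# Assumed schema layout and delimiter; the model's real v2 syntax may differ.
functions = [{
    "name": "get_weather",
    "description": "Look up current weather for a city",
    "parameters": {"city": {"type": "string"}},
}]

prompt = (
    f"<FUNCTIONS>{json.dumps(functions)}</FUNCTIONS>\n\n"
    "[INST] What is the weather in Hanoi? [/INST]"
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=128)

# The model is expected to emit a JSON object naming the function and its
# arguments, which the caller parses and dispatches to a real API.
print(tokenizer.decode(output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```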
Brief-details: Alpaca LoRA 30B GGML, an optimized build of the Alpaca-fine-tuned LLaMA for CPU inference, compatible with alpaca.cpp and related frameworks.
Brief-details: YaLM-100B is Yandex's 100B-parameter GPT-like model trained on 1.7TB of multilingual data, optimized for English and Russian text generation and processing.
Brief-details: Pyannote embedding model for speaker diarization and voice processing tasks. Supports academic research and commercial applications, with a focus on machine listening.
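For context, pulling a single speaker embedding with pyannote.audio typically follows the pattern below; the access token and audio path are placeholders.

```python
from pyannote.audio import Model, Inference

# Gated checkpoint: requires accepting the model's terms and a HF access token.
model = Model.from_pretrained("pyannote/embedding", use_auth_token="hf_...")

# window="whole" pools a single embedding over the entire file.
inference = Inference(model, window="whole")
embedding = inference("speaker1.wav")  # (1, D) numpy array

# Compare embeddings with cosine distance for verification or diarization clustering.
```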
Brief-details: A GGUF-optimized version of Allen AI's OLMoE-1B-7B-0125-Instruct, combining a mixture-of-experts architecture with instruction tuning.
Brief-details: BERT model fine-tuned for financial transaction categorization across 25 categories, optimized for English-language processing and classification tasks.
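Serving a classifier like this is a one-liner with the transformers pipeline; the repo id below is a hypothetical placeholder for the actual fine-tuned checkpoint.

```python
from transformers import pipeline

# Hypothetical repo id -- substitute the real transaction-categorization model.
classifier = pipeline("text-classification", model="your-org/bert-txn-categorizer")

print(classifier("POS PURCHASE STARBUCKS #1234 SEATTLE WA"))
# Example output shape: [{'label': '<one of the 25 categories>', 'score': 0.97}]
```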
Brief-details: Qwen 2.5 7B-based model fine-tuned on the GammaCorpus v2-100k dataset, featuring 7.61B parameters and trained for 60 epochs on a T4 GPU. Optimized for chat.
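Chat-tuned Qwen derivatives are normally driven through the tokenizer's chat template, roughly as sketched below; the repo id is a hypothetical stand-in.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "your-org/qwen2.5-7b-gammacorpus-v2"  # hypothetical repo id

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID, device_map="auto")

messages = [{"role": "user", "content": "Explain GGUF quantization in two sentences."}]
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=200)

# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```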
Brief-details: FLUX.1-Depth-dev-onnx is an ONNX export of black-forest-labs' FLUX.1-Depth-dev, an image generation model that conditions on depth maps for structural control, released for non-commercial use.
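Loading one of the exported graphs follows the standard onnxruntime pattern; the file name below is a placeholder, and a full FLUX pipeline chains several sessions (text encoders, transformer, VAE decoder).

```python
import onnxruntime as ort

# Placeholder file name; the repo ships multiple ONNX graphs.
session = ort.InferenceSession(
    "flux1-depth-dev/transformer.onnx",
    providers=["CUDAExecutionProvider", "CPUExecutionProvider"],
)

# Inspect expected tensors before wiring the graph into a pipeline.
for inp in session.get_inputs():
    print(inp.name, inp.shape, inp.type)
```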
Brief-details: Compact 3B-parameter vision-language model optimized for document understanding, featuring strong performance on chart/table analysis and general VQA tasks. Built on Granite LLM.
Brief-details: Quantized version of Selene-1-Mini-Llama (8B params) offering multiple GGUF compression variants for different size/quality tradeoffs; Q4_K_M is recommended for balanced performance.
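Fetching the recommended Q4_K_M file directly from the Hub with llama-cpp-python looks roughly like this; the repo id and filename glob are assumptions about how the quants are listed.

```python
from llama_cpp import Llama

# Repo id and filename pattern are assumptions; match them to the actual GGUF listing.
llm = Llama.from_pretrained(
    repo_id="bartowski/Selene-1-Mini-Llama-3.1-8B-GGUF",
    filename="*Q4_K_M.gguf",  # the recommended size/quality balance
    n_ctx=4096,
)

out = llm("Q: Why pick Q4_K_M over Q2_K?\nA:", max_tokens=128)
print(out["choices"][0]["text"])
```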
Brief-details: DeepSeek-R1 quantized to symmetric INT4 in GGUF format, optimized with Intel's auto-round algorithm for efficient inference while maintaining performance.
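For reference, the auto-round quantization flow has this general shape (the pattern follows Intel's auto-round README; the stand-in model id and the save format are assumptions, and quantizing full DeepSeek-R1 needs far more hardware than this sketch implies).

```python
from auto_round import AutoRound
from transformers import AutoModelForCausalLM, AutoTokenizer

# Stand-in model id for illustration; full DeepSeek-R1 is vastly larger.
MODEL_ID = "deepseek-ai/DeepSeek-R1-Distill-Qwen-7B"

model = AutoModelForCausalLM.from_pretrained(MODEL_ID)
tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)

# Symmetric INT4 tuned with the auto-round algorithm.
autoround = AutoRound(model, tokenizer, bits=4, group_size=128, sym=True)
autoround.quantize()
autoround.save_quantized("./r1-int4", format="auto_round")  # save format is an assumption
```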
Brief-details: Blossom-V6-14B is an open-source conversational LLM based on Qwen2.5-14B, featuring an innovative data-synthesis workflow and cross-model evaluation for enhanced performance.
Brief-details: A GGUF-formatted Japanese language model converted from CyberAgent's DeepSeek-R1-Distill-Qwen-14B, optimized for Japanese text generation and processing.
Brief-details: 2B-parameter multimodal model fine-tuned on an 8k-sample curated dataset using GRPO, supporting English/Chinese vision-language tasks with efficient processing.
Brief-details: A 1B parameter LLaMA model fine-tuned on GSM8K dataset for mathematical reasoning, trained over 132 steps by NickyNicky. Available on HuggingFace.
Brief-details: RouWei-0.7 is a large-scale anime art model fine-tuned from Illustrious on roughly 7M unique images, with enhanced prompt following and superior anatomy rendering.
Brief-details: A 70B-parameter LLaMA-based model combining EVA-LLAMA storytelling, EURYALE scene descriptions, and DeepSeek-R1 reasoning, optimized for creative dialogue and detailed narratives.
Brief-details: A comprehensive GGUF-quantized variant of the sororicide-12B model, offering multiple compression levels from 3.1GB to 10.2GB with imatrix optimizations.
Brief-details: DeepSeek-R1-Distill-Qwen-14B quantized model with multiple compression options (Q2-Q8), optimized for efficient deployment and reduced size.