Brief-details: DeepSeek-R1-Distill-Qwen-7B-6bit is a 6-bit quantized, MLX-format model derived from DeepSeek-R1-Distill-Qwen-7B, optimized for efficient deployment.
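A minimal loading sketch with the mlx-lm package; the repo id below is an assumption (an mlx-community-style name), so take the exact id from the model page:

```python
# Sketch: run the 6-bit MLX quant with mlx-lm on Apple Silicon.
# Repo id is assumed -- substitute the actual model id from the model page.
from mlx_lm import load, generate

model, tokenizer = load("mlx-community/DeepSeek-R1-Distill-Qwen-7B-6bit")
reply = generate(model, tokenizer, prompt="Explain overfitting in one sentence.", max_tokens=128)
print(reply)
```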
Brief-details: A specialized Stable Diffusion image generation model focused on specific anatomical views, utilizing trigger words for consistent results.
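A hedged diffusers sketch of how a trigger word is usually placed in the prompt; the checkpoint path and the trigger word itself are placeholders, not the model's documented values:

```python
import torch
from diffusers import StableDiffusionPipeline

# Placeholder checkpoint and trigger word -- check the model card for the real values.
pipe = StableDiffusionPipeline.from_pretrained(
    "path/to/checkpoint", torch_dtype=torch.float16
).to("cuda")

prompt = "trigger_word, detailed anatomical study, soft lighting, high quality"
image = pipe(prompt, num_inference_steps=30, guidance_scale=7.0).images[0]
image.save("out.png")
```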
Brief-details: 24B parameter Mistral-based model optimized for reasoning, with multiple GGUF quantization options (6.5GB-94GB). Features special prompt format & reasoning system prompt.
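A llama-cpp-python sketch of pairing one of the GGUF quants with a reasoning-style system prompt; the file name and prompt text are illustrative, and the model card's documented prompt format should take precedence:

```python
from llama_cpp import Llama

# Illustrative quant file; choose the size that fits your hardware.
llm = Llama(model_path="model-Q4_K_M.gguf", n_ctx=8192)

out = llm.create_chat_completion(
    messages=[
        {"role": "system", "content": "Reason step by step before giving a final answer."},
        {"role": "user", "content": "What is 17 * 24?"},
    ],
    max_tokens=512,
)
print(out["choices"][0]["message"]["content"])
```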
Brief-details: Sentence embedding model that converts text to 384-dim vectors, optimized for semantic search & clustering. Based on MiniLM architecture with strong performance on MS MARCO dataset.
Brief-details: Efficient sentence embedding model that maps text to 384-dimensional vectors. Fast, lightweight implementation of MiniLM architecture optimized for semantic search and clustering.
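For either of the two MiniLM-style encoders above, usage follows the standard sentence-transformers pattern; the repo id below (all-MiniLM-L6-v2) is one widely used 384-dim example, not necessarily the exact model described:

```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2")

corpus = ["GGUF quantization shrinks model files.", "Paris is the capital of France."]
query = "How do I make a model smaller?"

corpus_emb = model.encode(corpus, convert_to_tensor=True)   # shape (2, 384)
query_emb = model.encode(query, convert_to_tensor=True)     # shape (384,)

# Retrieve the corpus sentence most similar to the query.
hits = util.semantic_search(query_emb, corpus_emb, top_k=1)
print(corpus[hits[0][0]["corpus_id"]])
```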
Brief-details: Russian-specific sentence encoder that maps text to 768D vectors. Built on DeBERTa-v1-base with 85M params. Optimized for semantic search & clustering.
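A clustering sketch for a 768-dim sentence encoder like the one above; the repo id is a placeholder for the actual Russian DeBERTa-based model:

```python
from sentence_transformers import SentenceTransformer
from sklearn.cluster import KMeans

# Placeholder id -- substitute the actual Russian encoder's repo id.
model = SentenceTransformer("your-org/russian-deberta-sentence-encoder")

sentences = [
    "Кошка сидит на окне.",      # "The cat sits on the window."
    "Собака бежит по парку.",    # "The dog runs through the park."
    "Курс рубля вырос.",         # "The ruble exchange rate rose."
]
embeddings = model.encode(sentences)                      # shape (3, 768)
labels = KMeans(n_clusters=2, n_init=10).fit_predict(embeddings)
print(labels)
```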
Brief-details: Compact version of the Gemini model extracted from the Chrome browser and loadable via the MediaPipe framework. Offers basic instruction-following capabilities.
Brief-details: German-optimized DistilBERT model with cased tokenization, offering efficient NLP capabilities for German-language tasks through knowledge distillation.
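A fill-mask sketch; distilbert-base-german-cased is the id this entry most likely refers to, but verify it against the model page:

```python
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="distilbert-base-german-cased")

# Predict the masked token in a German sentence.
for pred in fill_mask("Berlin ist die [MASK] von Deutschland."):
    print(pred["token_str"], round(pred["score"], 3))
```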
Brief-details: CTRL - Salesforce's 1.63B-parameter conditional transformer language model, trained on 140GB of text, with controllable text generation via domain-specific control codes. Supports creative writing and NLP research.
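A sketch of control-code-conditioned generation with the Hugging Face CTRL classes; "Books" is one of CTRL's documented control codes, and the checkpoint id Salesforce/ctrl is assumed here:

```python
from transformers import CTRLTokenizer, CTRLLMHeadModel

tokenizer = CTRLTokenizer.from_pretrained("Salesforce/ctrl")
model = CTRLLMHeadModel.from_pretrained("Salesforce/ctrl")

# The leading control code ("Books") steers the style/domain of the continuation.
inputs = tokenizer("Books In a quiet village by the sea,", return_tensors="pt")
output = model.generate(**inputs, max_new_tokens=40, do_sample=True, temperature=0.7)
print(tokenizer.decode(output[0]))
```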
Brief-details: BERT large cased model with whole word masking, 336M parameters, 24 layers, trained on BookCorpus and Wikipedia. Optimized for bidirectional language understanding.
Brief-details: German BERT model (uncased) trained by the DBMDZ team, optimized for German language processing tasks with a German-specific WordPiece vocabulary.
Brief-details: An 8B parameter AI model focused on card-based projections and analysis, developed by AlexBefest. Available on HuggingFace for preview access.
Brief-details: Lightweight 70M parameter chat model trained on Discord data, featuring Llama-3 architecture and customizable chat styles through usernames. Optimized for casual conversations.
Brief-details: A Llama-3.1-8B variant fine-tuned with DPO for enhanced emoji expression and friendly interactions, optimized for engaging responses.
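A generic transformers chat sketch for a Llama-3.1-8B-based model; the repo id is a placeholder, and the tokenizer's built-in chat template is assumed to handle the Llama-3.1 prompt format:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "your-org/llama-3.1-8b-emoji-dpo"   # placeholder repo id
tokenizer = AutoTokenizer.from_pretrained(repo)
model = AutoModelForCausalLM.from_pretrained(repo, device_map="auto")

messages = [{"role": "user", "content": "Welcome a new community member."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(inputs, max_new_tokens=64)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```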
Brief-details: A 32B parameter merged LLM combining QwQ, DeepSeek-R1, and Light-R1 models, achieving superior performance on math reasoning and AIME benchmarks
Brief-details: Spanish-language, instruction-tuned 1B parameter LLaMA model fine-tuned on a UNAL academic Q&A dataset using LoRA adaptation, optimized for academic text generation.
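A PEFT sketch for applying a LoRA adapter like this one on top of a 1B LLaMA base; both ids below are placeholders inferred from the description, not verified repo names:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_id = "meta-llama/Llama-3.2-1B-Instruct"     # placeholder 1B base model
adapter_id = "your-org/unal-academic-qa-lora"    # placeholder LoRA adapter

tokenizer = AutoTokenizer.from_pretrained(base_id)
base = AutoModelForCausalLM.from_pretrained(base_id)
model = PeftModel.from_pretrained(base, adapter_id)  # wraps the base model with the adapter

prompt = "¿Qué requisitos tiene la inscripción de asignaturas en la UNAL?"
inputs = tokenizer(prompt, return_tensors="pt")
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=80)[0], skip_special_tokens=True))
```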
Brief-details: Enhanced 35B parameter model combining QwQ-32B with TinyR1 and DeepSeek capabilities. Optimized for reasoning, creative generation, and instruction following. Requires ChatML template.
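Since this entry notes a ChatML template requirement, here is a minimal ChatML formatting sketch; the system prompt is illustrative, and any model-specific tokens or stop sequences should come from the model card:

```python
def chatml(system: str, user: str) -> str:
    """Build a standard ChatML prompt string."""
    return (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        f"<|im_start|>user\n{user}<|im_end|>\n"
        f"<|im_start|>assistant\n"
    )

print(chatml("You are a helpful assistant.", "Outline a plan for a short story."))
```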
Brief-details: Glowing-Forest-12B-i1-GGUF is a quantized version of the original Glowing-Forest-12B model, offering various compression options from 3.1GB to 10.2GB with different quality-performance tradeoffs.
Brief-details: MN-Sappho-n3-12B-GGUF is a quantized version of MN-Sappho-n3-12B offering various compression levels, with sizes ranging from 4.9GB to 13.1GB, optimized for different performance/quality trade-offs.
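For GGUF repos like the two above, a common pattern is to download one quant file and load it with llama-cpp-python; the repo id and filename below are assumptions, so check the repo's file list for the real names:

```python
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

# Assumed repo/filename -- pick the quant whose size fits your RAM/VRAM.
path = hf_hub_download(
    repo_id="mradermacher/MN-Sappho-n3-12B-GGUF",
    filename="MN-Sappho-n3-12B.Q4_K_M.gguf",
)
llm = Llama(model_path=path, n_ctx=4096)
print(llm("Write a haiku about autumn.", max_tokens=64)["choices"][0]["text"])
```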
Brief-details: A specialized LoRA model focused on CFG distillation for Stable Diffusion, created by spacepxl for enhanced image generation control and quality optimization.
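A diffusers sketch of attaching a LoRA like this to a Stable Diffusion pipeline; the checkpoint path and LoRA repo id are placeholders, and the low guidance_scale reflects the usual goal of CFG distillation rather than documented settings for this specific LoRA:

```python
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "path/to/sd-base-checkpoint", torch_dtype=torch.float16   # placeholder base model
).to("cuda")
pipe.load_lora_weights("spacepxl/cfg-distill-lora")            # placeholder LoRA repo id

# A CFG-distilled LoRA is typically run with little or no classifier-free guidance.
image = pipe("a misty forest at dawn", guidance_scale=1.0, num_inference_steps=25).images[0]
image.save("forest.png")
```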
Brief-details: 12B parameter quantized language model with multiple GGUF variants (3.1GB-10.2GB), optimized for different performance/quality tradeoffs using imatrix quantization.