Brief-details: An 8.03B-parameter LLaMA model optimized for mesh generation, converted to GGUF format with Q6_K quantization for efficient deployment via llama.cpp.
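A minimal sketch of loading such a Q6_K GGUF checkpoint through the llama-cpp-python bindings; the local file name and prompt are hypothetical placeholders, not taken from the model card:

```python
# Sketch: running a Q6_K GGUF checkpoint with llama-cpp-python.
# The file name below is a hypothetical placeholder.
from llama_cpp import Llama

llm = Llama(
    model_path="./llama-mesh-8b.Q6_K.gguf",  # hypothetical local file
    n_ctx=4096,        # context window size
    n_gpu_layers=-1,   # offload all layers to GPU if one is available
)

out = llm("Generate an OBJ mesh for a simple cube:", max_tokens=256)
print(out["choices"][0]["text"])
```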
Brief-details: A specialized 8B-parameter LLaMA-based translator supporting mutual translation among Korean, Vietnamese, Indonesian, Khmer, and Thai, with strong BLEU/ROUGE scores.
Brief-details: Qwen2.5-Coder-14B-Instruct-F16-GGUF is a 14.8B-parameter coding-specialized language model distributed in GGUF format at 16-bit (F16) precision, compatible with llama.cpp.
Brief-details: GENIE_zh_7b is a specialized 7.62B-parameter Chinese language model for structuring electronic health records (EHRs), built on Qwen2.5-7B-Instruct.
Brief-details: Advanced sentence embedding model based on NV-Embed-v2, fine-tuned for medical text similarity, with 7.85B parameters and 4096-dimensional outputs.
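A hedged sketch of computing medical-text similarity with such an embedding model via sentence-transformers; the model ID is a placeholder, and NV-Embed-v2 derivatives typically require `trust_remote_code=True`:

```python
# Sketch: medical-text similarity with a sentence-embedding model.
# The model ID is a hypothetical placeholder.
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("org/medical-nv-embed", trust_remote_code=True)
emb = model.encode(
    ["Patient presents with acute chest pain.",
     "Individual reports sudden thoracic discomfort."],
    normalize_embeddings=True,  # unit-length vectors
)
print(emb.shape)        # (2, 4096) per the card's 4096-dim outputs
print(emb[0] @ emb[1])  # dot product of normalized vectors = cosine similarity
```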
Brief-details: A 7.8B-parameter instruction-tuned language model supporting English and Korean, quantized to 4-bit with AWQ for efficient deployment with minimal loss of quality.
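AWQ checkpoints load through the standard transformers API once the autoawq package is installed; a minimal sketch, with a hypothetical model ID and chat content:

```python
# Sketch: loading an AWQ 4-bit checkpoint with transformers.
# Requires the autoawq package; the model ID is a hypothetical placeholder.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "org/instruct-7.8b-awq"  # hypothetical
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # AWQ kernels run in fp16
    device_map="auto",
)

msgs = [{"role": "user", "content": "Summarize this in Korean: The meeting is at 3pm."}]
inputs = tok.apply_chat_template(
    msgs, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
print(tok.decode(model.generate(inputs, max_new_tokens=64)[0]))
```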
Brief-details: 8B-parameter storytelling-focused LLM merged with the DARE-TIES method, combining the Hathor, LexiMaid, and Chara models. Optimized for roleplay and narrative generation.
Brief-details: Lightweight 1.24B-parameter GGUF-quantized Orca model optimized for agent-based instruction following, offering multiple quantization options from 0.7 GB to 2.6 GB with varying quality/size trade-offs.
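When a repo ships several quantization variants like this, you normally download just the one file that fits your hardware; a sketch using huggingface_hub, with hypothetical repo and file names:

```python
# Sketch: fetching one quantization variant from a multi-quant GGUF repo.
# Repo and file names are hypothetical placeholders.
from huggingface_hub import hf_hub_download

path = hf_hub_download(
    repo_id="org/orca-agent-1.2b-gguf",      # hypothetical repo
    filename="orca-agent-1.2b.Q4_K_M.gguf",  # a mid-range quality/size choice
)
print(path)  # local cache path, ready to pass to llama.cpp
```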
Brief-details: A massive 17B-parameter self-merge of the FLUX.1-dev text-to-image model, requiring 35-40 GB of VRAM for inference.
Brief-details: A fine-tuned text-to-image model based on XenoGASM-MK2, specializing in artistic and versatile image generation with anime influences.
Brief-details: A specialized LoRA model for cartoon-style image generation, built on FLUX.1-dev. Uses a network dimension of 64 and was trained on 22 images with a constant learning-rate schedule and the AdamW optimizer.
Brief-details: A specialized LoRA model for high-quality poster generation built on FLUX.1-dev, with a network dimension of 64 and optimized for creating posters at 768x1024 resolution.
Brief-details: A specialized LoRA model for generating motivational quote stickers, built on FLUX.1-dev. Uses a network dimension of 64 and is optimized for 768x1024 resolution.
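The three FLUX.1-dev LoRA entries above share the same loading pattern in diffusers; a minimal sketch, where the LoRA repo ID and prompt are hypothetical placeholders:

```python
# Sketch: applying a FLUX.1-dev LoRA with diffusers.
# The LoRA repo ID and prompt are hypothetical placeholders.
import torch
from diffusers import FluxPipeline

pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16
).to("cuda")
pipe.load_lora_weights("org/flux-poster-lora")  # hypothetical LoRA repo

image = pipe(
    "a motivational poster, bold typography, sunrise over mountains",
    height=1024, width=768,  # matches the 768x1024 training resolution
    guidance_scale=3.5,
).images[0]
image.save("poster.png")
```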
Brief-details: A 70B-parameter LLaMA-based model optimized for roleplaying conversations, trained on 13,000 conversation pairs with rich character interactions and emotional expressions.
Brief-details: Qwen2.5 72B model with multiple GGUF quantizations (25-77 GB), optimized for different hardware setups and RAM constraints. Uses importance-matrix (imatrix) calibration for improved quantization quality.
Brief-details: A fine-tuned RoBERTa-based model for binary off-topic classification, achieving 0.99 ROC-AUC, aimed at guarding enterprise LLM applications.
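A hedged sketch of using such a classifier through the transformers pipeline; the model ID is a placeholder, and the sentence-pair input format (system prompt plus user message) is an assumption about how these guard models are typically queried, not something the card confirms:

```python
# Sketch: binary off-topic classification with a fine-tuned RoBERTa model.
# Model ID and input format are assumptions, not confirmed by the card.
from transformers import pipeline

clf = pipeline("text-classification", model="org/roberta-off-topic")  # hypothetical
result = clf({
    "text": "You are a banking assistant. Answer account questions only.",
    "text_pair": "What's a good recipe for lasagna?",
})
print(result)  # e.g. {'label': 'off-topic', 'score': 0.99}
```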
Brief-details: Bulgarian-English language model based on Gemma-2-2b with 2.6B parameters, optimized for instruction following and conversation. It enhances Bulgarian capabilities while maintaining English performance.
Brief-details: Fine-tuned Jina Embeddings model for off-topic classification, achieving a 0.99 ROC-AUC score and supporting a 1024-token context length for enterprise use.
Brief-details: Quantized INT4 version of Meta's Llama-3.2-3B-Instruct model optimized for NVIDIA GPUs, offering efficient inference with a reduced memory footprint and ONNX Runtime support.
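A minimal sketch of opening such an INT4 ONNX export with ONNX Runtime on an NVIDIA GPU; the file path is a placeholder, and note that full LLM generation loops are usually driven by a higher-level helper such as onnxruntime-genai rather than a raw session:

```python
# Sketch: inspecting an INT4 ONNX export with ONNX Runtime on CUDA.
# The file path is a hypothetical placeholder.
import onnxruntime as ort

sess = ort.InferenceSession(
    "llama-3.2-3b-instruct-int4.onnx",
    providers=["CUDAExecutionProvider", "CPUExecutionProvider"],
)
print([i.name for i in sess.get_inputs()])  # expected input tensors
```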
Brief-details: ACE-0.6B-1024px is a unified visual generation model that supports multi-modal inputs and long-context processing for image editing and generation tasks at up to 1024px resolution.
Brief-details: AstroSage-8B: Specialized 8B-parameter LLM for astronomy/astrophysics, outperforming GPT-4o on domain tasks. Built on Llama 3.1.