BRIEF DETAILS: GGUF-quantized builds of the Rin-9B model, offered in multiple compression variants from 3.9GB to 18.6GB that trade file size against output quality for efficient local inference.
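As a rough illustration of running one of these GGUF variants locally, here is a minimal llama-cpp-python sketch; the file name is a placeholder for whichever quantization you download.

```python
# Minimal sketch: load a downloaded GGUF quant with llama-cpp-python.
# The file name below is illustrative; pick the variant (3.9GB-18.6GB)
# that fits your memory budget.
from llama_cpp import Llama

llm = Llama(model_path="rin-9b.Q4_K_M.gguf", n_ctx=4096)  # hypothetical file name

out = llm("Write a short haiku about autumn.", max_tokens=64)
print(out["choices"][0]["text"])
```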
BRIEF-DETAILS: 14B parameter language model optimized for roleplay & Russian language support. Merged from Qwen 2.5 finetunes. Strong creative capabilities with stable performance.
BRIEF-DETAILS: A Qwen2.5-3B merged model that combines multiple Qwen variants via the model stock method, built on a Qwen2.5-3B-Instruct base to pool their capabilities.
Brief-details: FinBERT model fine-tuned for financial sentiment analysis with 88% accuracy. Classifies text as positive, negative, or neutral. Built on BERT architecture.
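A minimal usage sketch with the transformers pipeline follows; ProsusAI/finbert is one widely used FinBERT checkpoint and may differ from the exact model described above.

```python
# Sketch: financial sentiment classification with a FinBERT checkpoint.
from transformers import pipeline

clf = pipeline("text-classification", model="ProsusAI/finbert")  # assumed checkpoint
print(clf("Quarterly revenue beat expectations, but full-year guidance was cut."))
# -> a list of {'label': 'positive' | 'negative' | 'neutral', 'score': ...}
```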
Brief Details: A specialized 300M parameter model fine-tuned for burn scar segmentation in satellite imagery, achieving 87.52% IoU for burned-area detection on HLS (Harmonized Landsat and Sentinel-2) data.
Brief-details: A 14B parameter merged LLM that combines several source models via the SCE merge method, built on the Qwen2.5 architecture.
Brief Details: A quantized T5-small model fine-tuned for grammar correction, achieving a 0.88 BLEU score, with FP16 optimization for efficient inference.
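A hedged sketch of FP16 inference with a grammar-correction T5 fine-tune; the repo id and the "grammar:" task prefix are assumptions, not the published checkpoint's actual names.

```python
# Sketch: grammar correction with a T5-small fine-tune loaded in FP16.
import torch
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

repo = "your-namespace/t5-small-grammar-correction"  # hypothetical repo id
tok = AutoTokenizer.from_pretrained(repo)
model = AutoModelForSeq2SeqLM.from_pretrained(repo, torch_dtype=torch.float16).to("cuda")

inputs = tok("grammar: she go to school yesterday", return_tensors="pt").to("cuda")
ids = model.generate(**inputs, max_new_tokens=32)
print(tok.decode(ids[0], skip_special_tokens=True))
```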
Brief-details: A 7B parameter language model created through SLERP merging of the pre-cursa-o1-v1.2 and post-cursa-o1 models, with tuned interpolation weights for the attention and MLP layers.
Brief-details: An 8B parameter merged LLM combining the Dolermed and Smarteaz variants of Llama 3.1, built on a Dobby-Mini-Unhinged base using the model stock merge method.
Brief-details: An 8B parameter Llama 3.1-based merged model combining the Smarteaz and Hermedive variants with a Dobby-Mini-Unhinged base, using the model_stock merge method.
BRIEF-DETAILS: An 8B parameter Llama 3.1-based merged model combining medical knowledge (MedIT-SUN) with DeepHermes capabilities, built using the model_stock method.
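The model_stock merges in the three entries above are typically produced with mergekit. The sketch below shows the general config shape under assumed, hypothetical repo ids; swapping merge_method to slerp or sce yields the other merge styles mentioned in this list.

```python
# Sketch: a mergekit model_stock merge config, written and run from Python.
# All repo ids are placeholders, not the actual source checkpoints.
import subprocess, textwrap

config = textwrap.dedent("""\
    merge_method: model_stock
    base_model: your-namespace/Dobby-Mini-Unhinged-Llama-3.1-8B   # hypothetical
    models:
      - model: your-namespace/Llama-3.1-8B-Smarteaz               # hypothetical
      - model: your-namespace/Llama-3.1-8B-Hermedive              # hypothetical
    dtype: bfloat16
""")

with open("model_stock.yaml", "w") as f:
    f.write(config)

# mergekit's CLI: merge according to the config into ./merged
subprocess.run(["mergekit-yaml", "model_stock.yaml", "./merged"], check=True)
```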
BRIEF-DETAILS: Ling-plus is a 290B parameter MoE LLM with 28.8B activated parameters and a 64K context window, released with an open-source architecture designed for scalability and adaptability.
Brief Details: Ling-plus-base is a 290B parameter MoE LLM with 28.8B activated parameters, featuring a 64K context length and an MIT license. Developed by InclusionAI.
Brief-details: Hindi sentence similarity model using SBERT architecture. Maps Hindi text to 768-dimensional vectors for semantic comparison and search.
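A short sentence-transformers sketch of how such an encoder is typically used; the repo id is a placeholder for the model described above.

```python
# Sketch: Hindi semantic similarity with an SBERT-style encoder.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("your-namespace/hindi-sbert")     # hypothetical repo id
sentences = ["मुझे किताबें पढ़ना पसंद है", "पढ़ना मेरा शौक है"]
embeddings = model.encode(sentences, convert_to_tensor=True)  # 768-dim vectors
print(util.cos_sim(embeddings[0], embeddings[1]))             # cosine similarity
```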
Brief Details: An ABSA classifier model, version 0.2, available on Hugging Face and designed for classification tasks with REST API compatibility.
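For the REST side, here is a minimal sketch using the Hugging Face Inference API convention (POST to /models/&lt;repo&gt; with an "inputs" payload); the repo id and token are placeholders.

```python
# Sketch: calling a hosted classifier over REST via the HF Inference API.
import requests

API_URL = "https://api-inference.huggingface.co/models/your-namespace/absa-classifier-v0.2"  # hypothetical
headers = {"Authorization": "Bearer hf_your_token_here"}

resp = requests.post(
    API_URL,
    headers=headers,
    json={"inputs": "The battery life is great but the screen is dim."},
)
print(resp.json())  # label/score pairs returned by the classifier
```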
Brief-details: Advanced PDF layout segmentation model built on LayoutLMv3, specialized for document structure analysis and block-level segmentation.
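A hedged sketch of block-level layout tagging with a LayoutLMv3 token-classification head; the fine-tuned repo id is a placeholder, and OCR via the processor assumes pytesseract is installed.

```python
# Sketch: per-token layout labeling of a rendered PDF page with LayoutLMv3.
from PIL import Image
from transformers import LayoutLMv3Processor, LayoutLMv3ForTokenClassification

repo = "your-namespace/layoutlmv3-pdf-layout"          # hypothetical repo id
processor = LayoutLMv3Processor.from_pretrained(repo)  # runs OCR by default (apply_ocr=True)
model = LayoutLMv3ForTokenClassification.from_pretrained(repo)

page = Image.open("page.png").convert("RGB")
inputs = processor(page, return_tensors="pt")          # words, boxes, pixel values
logits = model(**inputs).logits                        # one layout label per token
print(logits.argmax(-1))
```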
BRIEF-DETAILS: GLM-Edge-V-2B is a 2B parameter vision-language model from THUDM, capable of image understanding and text generation with bfloat16 support
Brief-details: Tiny-mixtral is a minimal test version of Mixtral, designed for CI/CD testing purposes. Not trained for production use. Created by TitanML.
Brief-details: Meta's Llama 3.3 70B instruction-tuned model optimized for 4-bit quantization, offering multilingual capabilities across 8 languages with 128k context window.
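A minimal sketch of 4-bit loading with transformers and bitsandbytes; the base meta-llama repo id is used here for illustration (it is gated), and the NF4 settings are the usual defaults rather than anything specific to this release.

```python
# Sketch: load an instruction-tuned Llama checkpoint in 4-bit NF4 and chat with it.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

repo = "meta-llama/Llama-3.3-70B-Instruct"  # gated repo; accept the license first
bnb = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

tok = AutoTokenizer.from_pretrained(repo)
model = AutoModelForCausalLM.from_pretrained(repo, quantization_config=bnb, device_map="auto")

msgs = [{"role": "user", "content": "Summarize the trade-offs of 4-bit quantization."}]
ids = tok.apply_chat_template(msgs, add_generation_prompt=True, return_tensors="pt").to(model.device)
print(tok.decode(model.generate(ids, max_new_tokens=128)[0], skip_special_tokens=True))
```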
BRIEF DETAILS: Microsoft's WavLM-Base: a pre-trained speech model for full-stack speech processing, trained on 960 hours of Librispeech data and expecting 16kHz audio input.
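A short sketch of extracting frame-level features with the microsoft/wavlm-base checkpoint; input must be 16kHz mono audio, as noted above.

```python
# Sketch: frame-level speech features from WavLM-Base on 16kHz audio.
import torch
from transformers import AutoFeatureExtractor, WavLMModel

extractor = AutoFeatureExtractor.from_pretrained("microsoft/wavlm-base")
model = WavLMModel.from_pretrained("microsoft/wavlm-base")

waveform = torch.randn(16000)  # 1 second of dummy 16kHz audio; replace with real samples
inputs = extractor(waveform.numpy(), sampling_rate=16000, return_tensors="pt")
with torch.no_grad():
    features = model(**inputs).last_hidden_state  # (batch, frames, hidden_size)
print(features.shape)
```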
BRIEF-DETAILS: A Japanese-optimized DeepSeek-V3 variant with selectively retained MoE layer experts, focused on Japanese language processing and distributed in GGUF format for efficient deployment.