Brief Details: 8B parameter instruction-tuned model with multiple GGUF quantized versions, optimized for efficient deployment; file sizes range from 3.2GB to 16.4GB
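A minimal sketch of running one of these GGUF quantizations locally with llama-cpp-python; the file name, context size, and prompt below are placeholders rather than details taken from the entry above.

```python
from llama_cpp import Llama

# Path to whichever quantization variant was downloaded (placeholder name).
llm = Llama(
    model_path="./model-Q4_K_M.gguf",
    n_ctx=4096,        # context window; lower it to save RAM
    n_gpu_layers=-1,   # offload all layers to GPU when one is available
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Summarize GGUF quantization in one sentence."}]
)
print(out["choices"][0]["message"]["content"])
```

Smaller variants (e.g. a ~3.2GB quant) trade answer quality for lower RAM use; the largest files approach full-precision quality.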
Brief Details: PipaT1-500M is a 500M parameter language model fine-tuned from Qwen2-0.5B-Instruct, developed by ksanjeeb under the Apache-2.0 license.
BRIEF DETAILS: 12B parameter GGUF-quantized model with multiple compression variants (3.1GB-10.2GB), optimized for efficient deployment while maintaining performance.
BRIEF-DETAILS: 24B parameter Mistral model with multiple GGUF quantization options (7-25GB), optimized for different RAM/VRAM constraints and performance needs. Features Q2-Q8 quantization variants.
Brief Details: 4-bit quantized version of Qwen2-Audio-7B-Instruct for audio-text processing, offering reduced memory usage while maintaining core capabilities
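A minimal sketch of loading an audio-text model of this kind in 4-bit through transformers and bitsandbytes. The repo id, audio path, and prompt format are assumptions (the instruct variant normally uses a chat template; consult the actual model card), and the pre-quantized checkpoint may already ship its own quantization config.

```python
import librosa
from transformers import AutoProcessor, BitsAndBytesConfig, Qwen2AudioForConditionalGeneration

model_id = "Qwen/Qwen2-Audio-7B-Instruct"  # or the pre-quantized repo described above
processor = AutoProcessor.from_pretrained(model_id)
model = Qwen2AudioForConditionalGeneration.from_pretrained(
    model_id,
    quantization_config=BitsAndBytesConfig(load_in_4bit=True),  # 4-bit weights to cut memory
    device_map="auto",
)

# Simple single-turn prompt with one audio clip (path is a placeholder).
audio, _ = librosa.load("sample.wav", sr=processor.feature_extractor.sampling_rate)
prompt = "<|audio_bos|><|AUDIO|><|audio_eos|>Describe this audio clip:"
inputs = processor(text=prompt, audios=audio, return_tensors="pt").to(model.device)

generated = model.generate(**inputs, max_new_tokens=128)
generated = generated[:, inputs.input_ids.size(1):]  # drop the echoed prompt tokens
print(processor.batch_decode(generated, skip_special_tokens=True)[0])
```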
Brief-details: Neural machine translation model for Polish to Norwegian translation. Transformer-align architecture with SentencePiece tokenization, achieving 27.5 BLEU score.
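A minimal sketch of the usual OPUS-MT usage pattern with transformers; the repo id below is an assumption (the entry does not give one) and should be replaced with the actual model id.

```python
from transformers import MarianMTModel, MarianTokenizer

model_id = "Helsinki-NLP/opus-mt-pl-no"  # assumed id for the Polish->Norwegian model
tokenizer = MarianTokenizer.from_pretrained(model_id)
model = MarianMTModel.from_pretrained(model_id)

batch = tokenizer(["Dzień dobry, jak się masz?"], return_tensors="pt", padding=True)
translated = model.generate(**batch)
print(tokenizer.batch_decode(translated, skip_special_tokens=True))
```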
Brief-details: A specialized classifier model built on a transformer architecture to distinguish question queries from statement queries, primarily designed for neural search applications and Haystack integration.
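A minimal sketch of routing queries with such a classifier through the transformers text-classification pipeline; the repo id is a placeholder, not the actual checkpoint name.

```python
from transformers import pipeline

# Placeholder repo id; substitute the actual question-vs-statement checkpoint.
classifier = pipeline("text-classification", model="your-org/question-vs-statement-classifier")

for query in ["who invented the telephone", "the telephone was invented in 1876"]:
    result = classifier(query)[0]
    print(f"{query!r} -> {result['label']} ({result['score']:.3f})")
```

In a neural search setup, the predicted label can decide whether a query is sent to a question-answering reader or handled as a keyword/statement lookup.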
BRIEF DETAILS: 1B parameter Llama2-based embedding model specialized for Malaysian text, with 8k training context length and 32k inference scaling capability.
BRIEF-DETAILS: Neural machine translation model for Polish to German translation, using transformer architecture with SentencePiece preprocessing. BLEU score: 47.8
Brief-details: German to Czech neural machine translation model trained on OPUS data, achieving BLEU scores of 20-42 across various test sets
BRIEF-DETAILS: TinyStories-1M: Language model trained on the TinyStories dataset of simple stories, designed to generate coherent short narratives. Uses the GPT-Neo tokenizer.
Brief-details: A tiny randomly initialized model created by katuni4ka, related to other tiny-random variants. Primarily intended for experimental development and testing, with minimal architecture details available.
Brief-details: Chinese punctuation restoration model that supports 6 punctuation marks (, 、 。 ? ! ;). Uses a transformer architecture to automatically insert the correct punctuation into unpunctuated Chinese text.
Brief-details: Large-scale Norwegian ASR model (1B parameters) for the Nynorsk written standard, achieving 11.32% WER. Built on XLS-R and trained on the NPSC dataset.
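A minimal sketch, assuming the checkpoint is a standard XLS-R/wav2vec2 CTC model usable with the transformers ASR pipeline; the repo id and audio file are placeholders.

```python
from transformers import pipeline

# Placeholder repo id for the 1B XLS-R Nynorsk checkpoint trained on NPSC.
asr = pipeline("automatic-speech-recognition", model="your-org/xls-r-1b-nynorsk-npsc")
print(asr("opptak.wav")["text"])  # transcribes a local recording to Nynorsk text
```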
Brief-details: A Transformers-based model by conjuncts hosted on the Hugging Face Hub. Documentation is limited; it appears to be an experimental model implementation.
Brief-details: Personal model collection curated by lllyasviel, featuring ControlNet-related models and optimization tools. Focused on private use and development.
BRIEF-DETAILS: Alpaca-13B is a natively fine-tuned 13B-parameter version of the Alpaca model, offering instruction-following capabilities without relying on LoRA adapters.
BRIEF DETAILS: RecurrentGemma-2B-IT: Google's instruction-tuned variant of RecurrentGemma-2B, requiring license acceptance on Hugging Face for access. Part of the Gemma family, built on the recurrent Griffin architecture.
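A minimal sketch of loading the gated checkpoint after accepting the license on the model page and authenticating (e.g. `huggingface-cli login`); requires a transformers release with RecurrentGemma support.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "google/recurrentgemma-2b-it"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16, device_map="auto")

chat = [{"role": "user", "content": "Explain what makes a recurrent LLM different from a transformer."}]
input_ids = tokenizer.apply_chat_template(chat, add_generation_prompt=True, return_tensors="pt").to(model.device)
output = model.generate(input_ids, max_new_tokens=128)
print(tokenizer.decode(output[0], skip_special_tokens=True))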
Brief-details: GGML-optimized version of LLaVA-1.5-7B for efficient local inference using llama.cpp, enabling end-to-end multimodal capabilities without extra dependencies
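A minimal sketch of multimodal inference with llama-cpp-python, assuming GGUF/GGML-converted LLaVA weights plus the separate CLIP projector file; both file names and the image path are placeholders.

```python
from llama_cpp import Llama
from llama_cpp.llama_chat_format import Llava15ChatHandler

# Placeholder file names: the language-model weights and the CLIP/projector weights.
handler = Llava15ChatHandler(clip_model_path="./mmproj-model-f16.gguf")
llm = Llama(model_path="./llava-v1.5-7b-q4_k.gguf", chat_handler=handler, n_ctx=4096)

out = llm.create_chat_completion(messages=[
    {"role": "user", "content": [
        {"type": "image_url", "image_url": {"url": "file:///path/to/photo.jpg"}},
        {"type": "text", "text": "What is shown in this image?"},
    ]}
])
print(out["choices"][0]["message"]["content"])
```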
BRIEF DETAILS: Astra-v1-12B quantized model with multiple GGUF variants optimized for different size/performance tradeoffs. Features importance-matrix (imatrix) quantization for better quality at small file sizes.
Brief Details: Astra-v1-12B-GGUF is a quantized version of the Astra language model, offering multiple compression variants from 4.9GB to 13.1GB with varying quality-size tradeoffs.