Brief-details: BERT-based classifier from bucketresearch that labels text as left-, center-, or right-leaning, intended for automated political bias detection.
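A minimal usage sketch with the transformers text-classification pipeline; the repo ID bucketresearch/politicalBiasBERT and the exact label names are assumptions to verify against the model card.

```python
from transformers import pipeline

# Assumed repo ID; check the model card for the exact name and label set.
classifier = pipeline("text-classification", model="bucketresearch/politicalBiasBERT")

result = classifier("The government should expand public healthcare coverage.")
print(result)  # e.g. [{'label': 'LEFT', 'score': 0.87}] -- labels depend on the model's config
```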
Brief-details: Specialized image-generation model for producing Inugami Korone (Hololive VTuber) artwork, fine-tuned from Waifu Diffusion 1.3 with 22 training images and 3,300 regularization images.
Brief-details: 4-bit quantized version of Qwen2.5-3B-Instruct from Unsloth, optimized for faster fine-tuning and lower memory usage; its dynamic quantization keeps accuracy-sensitive layers in higher precision.
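A loading sketch using Unsloth's FastLanguageModel; the repo name unsloth/Qwen2.5-3B-Instruct-bnb-4bit and the 2048-token max_seq_length are assumptions to confirm against the model card.

```python
from unsloth import FastLanguageModel

# Assumed repo ID; Unsloth publishes its 4-bit checkpoints under the "unsloth/" namespace.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/Qwen2.5-3B-Instruct-bnb-4bit",
    max_seq_length=2048,   # adjust to your context needs
    load_in_4bit=True,     # keep weights in 4-bit to cut memory use
)

FastLanguageModel.for_inference(model)  # switch from training mode to inference mode
inputs = tokenizer("Explain what dynamic quantization is.", return_tensors="pt").to("cuda")
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=64)[0]))
```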
Brief-details: Qwen2.5-VL-7B-Instruct-AWQ is an AWQ-quantized vision-language model from the Qwen2.5-VL family, supporting image understanding, video analysis, and structured outputs at a reduced memory footprint.
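A minimal image-question sketch following the usual Qwen2.5-VL loading pattern; the qwen_vl_utils helper, the placeholder image URL, and the exact processor arguments are assumptions to check against the model card.

```python
from transformers import AutoProcessor, Qwen2_5_VLForConditionalGeneration
from qwen_vl_utils import process_vision_info  # helper package published by the Qwen team

model_id = "Qwen/Qwen2.5-VL-7B-Instruct-AWQ"
model = Qwen2_5_VLForConditionalGeneration.from_pretrained(model_id, torch_dtype="auto", device_map="auto")
processor = AutoProcessor.from_pretrained(model_id)

messages = [{"role": "user", "content": [
    {"type": "image", "image": "https://example.com/chart.png"},  # placeholder URL
    {"type": "text", "text": "Describe this image."},
]}]
text = processor.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
image_inputs, video_inputs = process_vision_info(messages)
inputs = processor(text=[text], images=image_inputs, videos=video_inputs,
                   padding=True, return_tensors="pt").to(model.device)

out = model.generate(**inputs, max_new_tokens=128)
# Strip the prompt tokens before decoding the generated answer.
print(processor.batch_decode(out[:, inputs.input_ids.shape[1]:], skip_special_tokens=True)[0])
```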
Brief-details: Fast, lightweight Russian BERT model for sentence embeddings: 312-dimensional vectors and a 2048-token context, optimized for speed while retaining good accuracy.
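A sketch of producing sentence embeddings with plain transformers; the repo ID cointegrated/rubert-tiny2 and the CLS-pooling choice are assumptions matching the 312-dim description, to confirm against the model card.

```python
import torch
from transformers import AutoTokenizer, AutoModel

model_id = "cointegrated/rubert-tiny2"  # assumed repo ID
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModel.from_pretrained(model_id)

texts = ["привет мир", "здравствуй, мир"]
batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
with torch.no_grad():
    out = model(**batch)

# CLS pooling + L2 normalization yields 312-dim sentence vectors (assumed pooling strategy).
emb = torch.nn.functional.normalize(out.last_hidden_state[:, 0], dim=-1)
print(emb.shape)       # torch.Size([2, 312])
print(emb[0] @ emb[1])  # cosine similarity of the two sentences
```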
Brief-details: GraphCodeBERT is a 12-layer Transformer for code understanding that augments source-code token sequences with data-flow graphs, pre-trained on 2.3M functions across 6 programming languages.
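A sketch that extracts a code embedding from the public microsoft/graphcodebert-base checkpoint; note that full GraphCodeBERT pre-training also feeds data-flow edges, which this plain-token sketch omits.

```python
import torch
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("microsoft/graphcodebert-base")
model = AutoModel.from_pretrained("microsoft/graphcodebert-base")

code = "def add(a, b):\n    return a + b"
inputs = tokenizer(code, return_tensors="pt", truncation=True)
with torch.no_grad():
    hidden = model(**inputs).last_hidden_state  # (1, seq_len, 768)

# Use the [CLS] vector (or mean pooling) as a simple whole-function embedding.
code_vector = hidden[:, 0]
print(code_vector.shape)  # torch.Size([1, 768])
```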
Brief-details: ConvNeXt tiny model (28.6M params) pretrained on ImageNet-12k and fine-tuned on ImageNet-1k, achieving 84.2% top-1 accuracy at 224px
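An inference sketch through timm; the model string convnext_tiny.in12k_ft_in1k is an assumption matching the described ImageNet-12k pretrain / ImageNet-1k fine-tune recipe.

```python
import timm
import torch
from PIL import Image

# Assumed timm model name for the ImageNet-12k -> ImageNet-1k fine-tuned ConvNeXt-Tiny.
model = timm.create_model("convnext_tiny.in12k_ft_in1k", pretrained=True).eval()

# Build the preprocessing pipeline the checkpoint expects (resize, crop, normalize).
config = timm.data.resolve_model_data_config(model)
transform = timm.data.create_transform(**config, is_training=False)

img = Image.open("example.jpg").convert("RGB")
with torch.no_grad():
    logits = model(transform(img).unsqueeze(0))  # (1, 1000) ImageNet-1k logits
print(logits.softmax(-1).topk(5))
```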
Brief-details: BERT large cased model with whole word masking, 336M parameters, fine-tuned on the SQuAD dataset for extractive question answering.
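A question-answering sketch with the transformers pipeline; the repo ID bert-large-cased-whole-word-masking-finetuned-squad matches the description but should be verified.

```python
from transformers import pipeline

qa = pipeline("question-answering",
              model="bert-large-cased-whole-word-masking-finetuned-squad")

result = qa(
    question="What does whole word masking change during pre-training?",
    context=("Whole word masking masks all subword tokens of a word at once, "
             "so the model must predict the entire word rather than fragments."),
)
print(result["answer"], result["score"])
```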
Brief-details: A 16B-parameter code-centric LLM built on DeepSeek-V2-Lite-Base, reporting state-of-the-art results on code generation and mathematical reasoning benchmarks.
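A minimal chat-style generation sketch; the repo ID deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct, trust_remote_code, and bfloat16 settings are assumptions drawn from the usual DeepSeek loading pattern.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct"  # assumed repo ID
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto", trust_remote_code=True
)

messages = [{"role": "user", "content": "Write a Python function that checks if a number is prime."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

out = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(out[0, inputs.shape[1]:], skip_special_tokens=True))
```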
Brief-details: DistilBERT model fine-tuned for Named Entity Recognition (NER) on the CoNLL-2003 dataset; uncased (case-insensitive) variant aimed at English text analysis.
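An NER sketch using the token-classification pipeline; the repo ID elastic/distilbert-base-uncased-finetuned-conll03-english is an assumption consistent with the description.

```python
from transformers import pipeline

ner = pipeline(
    "token-classification",
    model="elastic/distilbert-base-uncased-finetuned-conll03-english",  # assumed repo ID
    aggregation_strategy="simple",  # merge word pieces into whole-entity spans
)

for entity in ner("Ada Lovelace worked with Charles Babbage in London."):
    print(entity["entity_group"], entity["word"], round(float(entity["score"]), 3))
```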
Brief-details: ChemBERTa-77M-MLM is a BERT-style model pre-trained with masked language modeling on 77M SMILES molecular representations, intended for chemical property prediction and analysis.
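A sketch that embeds a SMILES string with the DeepChem/ChemBERTa-77M-MLM checkpoint (assumed repo ID); downstream property prediction would add a task head on top of these features.

```python
import torch
from transformers import AutoTokenizer, AutoModel

model_id = "DeepChem/ChemBERTa-77M-MLM"  # assumed repo ID
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModel.from_pretrained(model_id)

smiles = "CC(=O)Oc1ccccc1C(=O)O"  # aspirin
inputs = tokenizer(smiles, return_tensors="pt")
with torch.no_grad():
    hidden = model(**inputs).last_hidden_state

# Mean-pool token states into a single molecule embedding for a property-prediction head.
mol_embedding = hidden.mean(dim=1)
print(mol_embedding.shape)
```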
Brief-details: Meta Llama 3.1 8B Instruct model quantized to 4 bits for the MLX framework, offering efficient inference with a reduced memory footprint.
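A generation sketch with the mlx-lm package on Apple silicon; the repo ID mlx-community/Meta-Llama-3.1-8B-Instruct-4bit is assumed, and the generate signature may differ slightly between mlx-lm versions.

```python
# Requires Apple silicon and `pip install mlx-lm`.
from mlx_lm import load, generate

# Assumed repo ID for the 4-bit MLX conversion of Llama 3.1 8B Instruct.
model, tokenizer = load("mlx-community/Meta-Llama-3.1-8B-Instruct-4bit")

prompt = tokenizer.apply_chat_template(
    [{"role": "user", "content": "Summarize what 4-bit quantization trades off."}],
    add_generation_prompt=True, tokenize=False,
)
print(generate(model, tokenizer, prompt=prompt, max_tokens=128))
```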
Brief-details: A 15M-parameter MoE model with 4 experts based on TinyLlama, specialized for storytelling; includes a Shakespeare LoRA adapter for creative text generation.
Brief-details: BiomedParse is Microsoft's foundation model for biomedical image analysis across 9 imaging modalities, enabling joint segmentation, detection, and recognition with a transformer-based architecture.
Brief-details: A minimal BART implementation built specifically for TRL library testing, used to validate core transformer architecture code paths.
Brief-details: InternLM3's 8B-parameter instruction model, optimized for reasoning; trained on 4T tokens at a reported 75%+ lower cost than models of similar scale, with strong results on math and knowledge tasks.
Brief-details: BERT-based model specialized for mental health analysis, trained on Reddit data using 4 Tesla V100 GPUs for 624k iterations over 8 days
Brief-details: MetricX-24 XXL hybrid model for machine-translation evaluation; a single checkpoint supports both reference-based and reference-free (quality-estimation) scoring with state-of-the-art performance.
Brief-details: French NER model based on CamemBERT, specialized in entity and date recognition; trained on the wikiner-fr dataset and scoring ~83% F1 on mixed chat/email data.
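A token-classification sketch; the repo ID Jean-Baptiste/camembert-ner-with-dates is an assumption matching the entity-and-date description.

```python
from transformers import pipeline

ner = pipeline(
    "token-classification",
    model="Jean-Baptiste/camembert-ner-with-dates",  # assumed repo ID
    aggregation_strategy="simple",
)

texte = "Apple a été fondée le 1er avril 1976 à Cupertino par Steve Jobs."
for ent in ner(texte):
    print(ent["entity_group"], ent["word"], round(float(ent["score"]), 3))
```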
Brief-details: Compact 1.1B-parameter LLM based on TinyLlama, fine-tuned on the WizardVicuna dataset; available in multiple GGUF quantizations ranging from 482MB to 1.17GB.
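A llama-cpp-python sketch for running one of the GGUF quantizations locally; the file name is a placeholder for whichever quant you download, and the Human/Assistant prompt format is a guess to check against the model card.

```python
from llama_cpp import Llama

# Point model_path at the downloaded GGUF file (placeholder name below).
llm = Llama(model_path="tinyllama-1.1b-wizardvicuna.Q4_K_M.gguf", n_ctx=2048)

out = llm(
    "### Human: Give me three facts about llamas.\n### Assistant:",
    max_tokens=128,
    stop=["### Human:"],
)
print(out["choices"][0]["text"])
```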
Brief-details: Llama-3.2-3B-Instruct-8bit is an MLX-optimized, 8-bit quantized version of Meta's Llama 3.2 3B instruction model, offering efficient local deployment.