BRIEF-DETAILS: bert2bert shared-weight model specialized in article summarization, developed by HooshvareLab to generate concise summaries of articles from the Persian pn_summary dataset.
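A minimal usage sketch with the transformers summarization pipeline; the repo id below is a placeholder, since the exact checkpoint path is not stated here.

```python
# Hedged sketch: summarizing a pn_summary-style article with a bert2bert checkpoint.
# "HooshvareLab/bert2bert-shared-summarization" is a placeholder repo id, not a confirmed path.
from transformers import pipeline

summarizer = pipeline("summarization", model="HooshvareLab/bert2bert-shared-summarization")
article = "..."  # a Persian news article, e.g. one drawn from pn_summary
print(summarizer(article, max_length=80, min_length=20)[0]["summary_text"])
```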
BRIEF-DETAILS: GPT-2 model fine-tuned on the IMDB movie reviews dataset for sentiment analysis and review generation. Built by lvwerra with single-epoch training.
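A quick way to sample IMDB-style continuations, assuming the checkpoint is the Hub repo "lvwerra/gpt2-imdb" implied by the author and model named above.

```python
# Sketch: sampling a review continuation from the IMDB-tuned GPT-2.
# The repo id "lvwerra/gpt2-imdb" is inferred from the description above.
from transformers import pipeline

generator = pipeline("text-generation", model="lvwerra/gpt2-imdb")
print(generator("This movie was", max_new_tokens=40, do_sample=True)[0]["generated_text"])
```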
Brief Details: XLS-R-300M model fine-tuned for Uzbek speech recognition, achieving 38.52% WER. Trained on Common Voice 8.0 with a KenLM language model.
Brief Details: Fine-tuned wav2vec2-large-xlsr-53 model for Kinyarwanda ASR, trained on 25% of the Common Voice data and specialized in apostrophe prediction, achieving 39.92% WER.
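Both ASR checkpoints above follow the standard transformers automatic-speech-recognition workflow; a minimal sketch with a placeholder repo id:

```python
# Hedged sketch: transcribing a 16 kHz audio file with a fine-tuned XLS-R / wav2vec2 model.
# "username/wav2vec2-large-xlsr-uzbek" is a placeholder repo id, not an actual checkpoint.
from transformers import pipeline

asr = pipeline("automatic-speech-recognition", model="username/wav2vec2-large-xlsr-uzbek")
print(asr("sample_16khz.wav")["text"])
```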
BRIEF-DETAILS: A compact emotion recognition model based on MobileBERT architecture, optimized for mobile devices. Classifies text into 4 emotion categories with high efficiency and low latency.
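A hedged usage sketch with a placeholder repo id, since the actual checkpoint path is not given; the emotion labels come from the model's own config.

```python
# Sketch: 4-way emotion classification via the transformers text-classification pipeline.
# "username/mobilebert-emotion" is a placeholder repo id.
from transformers import pipeline

classifier = pipeline("text-classification", model="username/mobilebert-emotion")
print(classifier("I can't believe we finally shipped the release!"))
```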
BRIEF DETAILS: DialoGPT-small-c3po is a conversational AI model fine-tuned to mimic C-3PO's distinctive communication style from Star Wars, based on the DialoGPT architecture.
Brief-details: Greek-language GPT-2 model (117M params) fine-tuned on a 23.4 GB Greek text corpus. Achieves a perplexity of 39.12. Developed by the Hellenic Army Academy & TUC.
Brief Details: Experimental NeMo framework implementation of Mistral architecture, designed for testing and development purposes. Created by katuni4ka.
Brief Details: A lightweight GLM-based edge model by katuni4ka, optimized for edge computing scenarios with minimal resource requirements.
Brief-details: FastViT T8 is a lightweight vision transformer (4M params) optimized for speed using structural reparameterization, trained on ImageNet-1k with distillation.
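A short loading sketch via timm; the model name "fastvit_t8.apple_dist_in1k" is an assumption for the distilled ImageNet-1k weights.

```python
# Sketch: running FastViT-T8 on a dummy image through timm.
# "fastvit_t8.apple_dist_in1k" is an assumed timm model name for the distilled weights.
import timm
import torch

model = timm.create_model("fastvit_t8.apple_dist_in1k", pretrained=True).eval()
with torch.no_grad():
    logits = model(torch.randn(1, 3, 256, 256))  # FastViT-T8 expects 256x256 inputs
print(logits.shape)  # torch.Size([1, 1000])
```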
BRIEF-DETAILS: BERT-Tiny is a lightweight version of BERT with 2 layers, 128 hidden units, and 2 attention heads - ideal for resource-constrained applications.
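A minimal feature-extraction sketch, assuming the 2-layer / 128-hidden / 2-head checkpoint is "google/bert_uncased_L-2_H-128_A-2".

```python
# Sketch: encoding a sentence with BERT-Tiny.
# "google/bert_uncased_L-2_H-128_A-2" is assumed to match the 2-layer, 128-hidden, 2-head config.
from transformers import AutoModel, AutoTokenizer

repo = "google/bert_uncased_L-2_H-128_A-2"
tokenizer = AutoTokenizer.from_pretrained(repo)
model = AutoModel.from_pretrained(repo)
inputs = tokenizer("BERT-Tiny fits on very small devices.", return_tensors="pt")
print(model(**inputs).last_hidden_state.shape)  # (1, seq_len, 128)
```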
Brief-details: HyenaDNA medium model (160k sequence length) - A genomic foundation model using Hyena operators for long-range DNA sequence analysis at single-nucleotide resolution.
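A hedged embedding sketch; the repo id "LongSafari/hyenadna-medium-160k-seqlen-hf" and the need for trust_remote_code are assumptions.

```python
# Sketch: per-nucleotide embeddings from HyenaDNA.
# Repo id and trust_remote_code requirement are assumptions, not confirmed here.
from transformers import AutoModel, AutoTokenizer

repo = "LongSafari/hyenadna-medium-160k-seqlen-hf"
tokenizer = AutoTokenizer.from_pretrained(repo, trust_remote_code=True)
model = AutoModel.from_pretrained(repo, trust_remote_code=True)
input_ids = tokenizer("ACGTACGTACGT", return_tensors="pt")["input_ids"]
print(model(input_ids).last_hidden_state.shape)  # one embedding per nucleotide
```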
Brief Details: Meta's 70B parameter instruction-tuned LLM with multilingual support, 128k context, FP8 quantization, and enhanced safety features. December 2023 knowledge cutoff.
Brief-details: Deep Compression Autoencoder for efficient high-resolution diffusion models, achieving up to 128x spatial compression while maintaining quality. By MIT-Han Lab, optimized for ImageNet.
Brief-details: INT8-quantized version of Meta-Llama-3-8B-Instruct, optimized for efficiency with a 50% reduced memory footprint while maintaining a 68.66 OpenLLM benchmark score.
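One possible way to run the INT8 checkpoint is through vLLM; the repo id below is an assumption based on the description.

```python
# Hedged sketch: serving the INT8-quantized Llama 3 8B Instruct with vLLM.
# "neuralmagic/Meta-Llama-3-8B-Instruct-quantized.w8a8" is an assumed repo id.
from vllm import LLM, SamplingParams

llm = LLM(model="neuralmagic/Meta-Llama-3-8B-Instruct-quantized.w8a8")
params = SamplingParams(temperature=0.7, max_tokens=128)
outputs = llm.generate(["Explain INT8 weight quantization in one paragraph."], params)
print(outputs[0].outputs[0].text)
```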
BRIEF DETAILS: OLMo-7B-0724-Instruct: 7B parameter instruct-tuned LLM from Allen AI. Features 4096 context length, trained on Tulu/UltraFeedback datasets. Apache 2.0 licensed.
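A minimal chat sketch with transformers; the repo id "allenai/OLMo-7B-0724-Instruct-hf" (the HF-format weights) is an assumption.

```python
# Sketch: one-turn chat with OLMo-7B-0724-Instruct.
# The repo id "allenai/OLMo-7B-0724-Instruct-hf" is assumed for the HF-format weights.
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "allenai/OLMo-7B-0724-Instruct-hf"
tokenizer = AutoTokenizer.from_pretrained(repo)
model = AutoModelForCausalLM.from_pretrained(repo, device_map="auto")
messages = [{"role": "user", "content": "What is the Tulu dataset used for?"}]
input_ids = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt").to(model.device)
output = model.generate(input_ids, max_new_tokens=128)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```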
Brief-details: DeiT-III small variant vision transformer (22.2M params) pretrained on ImageNet-22k and fine-tuned on ImageNet-1k, optimized for 384x384 images
Brief Details: A 300M parameter stereo music generation model capable of creating high-quality stereo audio from text descriptions, part of Facebook's MusicGen family.
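A short generation sketch, assuming the checkpoint is "facebook/musicgen-stereo-small" and follows the standard MusicGen API in transformers.

```python
# Hedged sketch: generating ~5 seconds of stereo audio from a text prompt.
# "facebook/musicgen-stereo-small" is the assumed repo id for this checkpoint.
from transformers import AutoProcessor, MusicgenForConditionalGeneration
import scipy.io.wavfile

repo = "facebook/musicgen-stereo-small"
processor = AutoProcessor.from_pretrained(repo)
model = MusicgenForConditionalGeneration.from_pretrained(repo)
inputs = processor(text=["lo-fi hip hop beat with warm bass"], padding=True, return_tensors="pt")
audio = model.generate(**inputs, max_new_tokens=256)  # (batch, channels, samples)
rate = model.config.audio_encoder.sampling_rate
scipy.io.wavfile.write("musicgen_out.wav", rate=rate, data=audio[0].T.numpy())
```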
Brief Details: Text encoder and tokenizer component of LLaVA-LLaMA 3 8B model, designed for efficient text processing in multimodal AI systems
Brief Details: 10B parameter emotionally intelligent LLM achieving a score of 98.13 on emotional intelligence (EI) tests. Optimized for empathetic conversations and mental health support. Trained on a dataset of 152.5M emotional dialogues.
Brief Details: Spanish QA-focused sentence transformer model based on RoBERTa-BNE, maps text to 768D vectors, fine-tuned on MS-MARCO dataset for semantic search.
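A hedged semantic-search sketch with sentence-transformers; the repo id is a placeholder, since the exact checkpoint path is not given.

```python
# Sketch: Spanish question-passage matching with 768-dim embeddings.
# "username/roberta-bne-sentence-qa-es" is a placeholder repo id.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("username/roberta-bne-sentence-qa-es")
query = model.encode("¿Cuál es la capital de Francia?", convert_to_tensor=True)
passages = model.encode(
    ["París es la capital de Francia.", "Madrid es la capital de España."],
    convert_to_tensor=True,
)
print(util.cos_sim(query, passages))  # higher cosine score = better match
```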