Brief-details: A TinyBERT model fine-tuned on the MS MARCO dataset, optimized for efficient passage ranking and information retrieval tasks.
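A minimal reranking sketch with sentence-transformers, assuming the checkpoint is published as a cross-encoder; the model ID below is illustrative, not taken from the entry above:

    from sentence_transformers import CrossEncoder

    # Illustrative model ID for a TinyBERT cross-encoder fine-tuned on MS MARCO
    model = CrossEncoder("cross-encoder/ms-marco-TinyBERT-L-2-v2")

    query = "what is the capital of finland"
    passages = [
        "Helsinki is the capital and most populous city of Finland.",
        "The sauna is an important part of Finnish culture.",
    ]

    # Score each (query, passage) pair; higher score = more relevant
    scores = model.predict([(query, p) for p in passages])
    ranked = sorted(zip(passages, scores), key=lambda x: x[1], reverse=True)
    print(ranked[0][0])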
Brief-details: English to Tagalog (Filipino) neural machine translation model based on the transformer architecture, achieving a 26.6 BLEU score on the Tatoeba test set.
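A quick usage sketch with the transformers translation pipeline; the model ID is an assumption (Helsinki-NLP/opus-mt-en-tl fits this description but is not named above):

    from transformers import pipeline

    # Assumed checkpoint ID; substitute the actual English-to-Tagalog model
    translator = pipeline("translation", model="Helsinki-NLP/opus-mt-en-tl")
    print(translator("How are you today?")[0]["translation_text"])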
Brief-details: Quantized versions of stable-code-instruct-3b optimized for different performance/size tradeoffs, ranging from 1.08GB to 2.97GB with varying quality levels.
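Assuming the quantized files are in GGUF format (which the listed sizes suggest), they can be run locally via llama.cpp; a minimal llama-cpp-python sketch with an illustrative file name:

    from llama_cpp import Llama

    # Illustrative path to a downloaded quant; pick the size/quality tradeoff you need
    llm = Llama(model_path="stable-code-instruct-3b.Q4_K_M.gguf", n_ctx=4096)

    # Chat-style completion; formatting relies on the chat template shipped in the GGUF metadata
    out = llm.create_chat_completion(
        messages=[{"role": "user", "content": "Write a Python function that reverses a string."}],
        max_tokens=128,
    )
    print(out["choices"][0]["message"]["content"])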
Brief-details: GLaMM-GranD-Pretrained: Advanced multimodal model trained on 7.5M concepts across 810M regions for detailed visual understanding and segmentation.
Brief-details: Specialized Chinese text emotion classifier based on mDeBERTa-v3, capable of identifying 8 distinct emotional tones; this lightweight (small) version is optimized for efficient deployment.
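A hedged usage sketch with the transformers text-classification pipeline; the model ID below is a placeholder, not the actual repository name:

    from transformers import pipeline

    # Placeholder model ID; replace with the real mDeBERTa-v3 emotion-classifier checkpoint
    classifier = pipeline(
        "text-classification",
        model="your-org/chinese-emotion-mdeberta-v3-small",
        top_k=None,  # return scores for all 8 emotional tones
    )

    # "I received my acceptance letter today, so happy!"
    print(classifier("今天收到了录取通知书，太开心了！"))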
Brief-details: Japanese sentence embedding model based on the LUKE architecture, optimized for semantic similarity tasks with performance comparable to or better than BERT-based alternatives.
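A minimal similarity sketch with sentence-transformers, assuming the LUKE-based checkpoint loads as a SentenceTransformer; the model ID is a placeholder:

    from sentence_transformers import SentenceTransformer, util

    # Placeholder model ID; replace with the actual Japanese LUKE sentence encoder
    model = SentenceTransformer("your-org/japanese-luke-sentence-embeddings")

    # Two paraphrases plus one unrelated sentence
    sentences = ["今日は良い天気です。", "本日は晴天なり。", "電車が遅れています。"]
    embeddings = model.encode(sentences, convert_to_tensor=True)

    # Cosine similarity of the first sentence against the other two
    print(util.cos_sim(embeddings[0], embeddings[1:]))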
Brief-details: A PEFT-optimized tiny OPT model using LoRA adaptation, designed for testing and development purposes. Built with the PEFT 0.4.0.dev0 framework.
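A small sketch of attaching a LoRA adapter with PEFT; both the base-model and adapter IDs below are illustrative:

    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import PeftModel

    # Illustrative IDs: a tiny OPT base plus the LoRA adapter repository
    base_id, adapter_id = "facebook/opt-125m", "your-org/tiny-opt-lora-adapter"

    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base = AutoModelForCausalLM.from_pretrained(base_id)
    model = PeftModel.from_pretrained(base, adapter_id)  # wraps the base model with the LoRA weights

    inputs = tokenizer("Hello, world", return_tensors="pt")
    print(tokenizer.decode(model.generate(**inputs, max_new_tokens=10)[0]))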
Brief-details: A specialized token classifier model developed by Danswer, combining intent classification and token identification capabilities for enhanced natural language understanding tasks.
Brief-details: MedEmbed-large-v0.1 is a specialized embedding model fine-tuned for medical and clinical information retrieval, offering superior performance on healthcare NLP tasks compared to general-purpose models.
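A retrieval-style sketch with sentence-transformers; the model ID is assumed to match the MedEmbed release (verify the actual repository name):

    from sentence_transformers import SentenceTransformer, util

    # Assumed model ID for the MedEmbed checkpoint
    model = SentenceTransformer("abhinand/MedEmbed-large-v0.1")

    query = "first-line treatment for type 2 diabetes"
    passages = [
        "Metformin is commonly recommended as initial pharmacologic therapy for type 2 diabetes.",
        "Hypertension is managed with lifestyle changes and antihypertensive drugs.",
    ]

    # Embed query and passages, then compare them by cosine similarity
    q_emb = model.encode(query, convert_to_tensor=True)
    p_emb = model.encode(passages, convert_to_tensor=True)
    print(util.cos_sim(q_emb, p_emb))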
Brief-details: A fine-tuned wav2vec2-xls-r-300m model for Finnish ASR, trained on 275.6 hours of Finnish speech and achieving 17.92% WER without a language model and 8.16% with one.
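A minimal transcription sketch with the transformers ASR pipeline; the model ID is assumed (a Finnish-NLP checkpoint fits this description), and the with-LM WER additionally requires n-gram decoding that this plain call does not add:

    from transformers import pipeline

    # Assumed checkpoint ID; replace with the actual fine-tuned Finnish model
    asr = pipeline("automatic-speech-recognition", model="Finnish-NLP/wav2vec2-xls-r-300m-finnish")

    # Transcribe a local audio file (16 kHz mono works best for wav2vec2 models)
    print(asr("finnish_sample.wav")["text"])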
Brief-details: ESPnet speaker recognition model trained on VoxCeleb, achieving 0.739% EER. Uses RawNet3 architecture with self-supervised front-ends for speaker embeddings.
Brief-details: State-of-the-art text-to-video model with 30B parameters. Features 16x16 spatial and 8x temporal compression, generating up to 204-frame videos with DPO optimization.
Brief-details: Highly accurate English ASR model trained on 200K hours of human-transcribed speech, featuring adjustable verbatimicity and multiple decoding options.
Brief-details: A Mistral-based 7B-parameter model produced by SLERP-merging leading Mistral models, designed primarily as a strong base for downstream fine-tuning.
Brief-details: T5-large-lm-adapt is Google's T5 v1.1 large checkpoint with GEGLU activation and improved pre-training on C4, further trained with a language-modeling objective to make it better suited to prompt tuning.
Brief-details: T5-efficient-tiny-nl32 is a deep-narrow variant of T5 with 67M parameters, optimized for efficiency through increased depth (32 layers) while maintaining a narrow architecture.
Brief-details: T5-efficient-small is a deep-narrow variant of T5 with 60.52M parameters, optimized for efficiency through increased depth rather than width.
Brief-details: T5-efficient-base is a deep-narrow variant of T5 with 223M parameters, optimized for efficient scaling and downstream performance through increased model depth.
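The T5-efficient checkpoints above (and T5-large-lm-adapt) are released as pre-trained checkpoints intended to be fine-tuned or prompt-tuned before downstream use; a minimal loading sketch, assuming the standard google/ Hub names:

    from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

    model_id = "google/t5-efficient-tiny-nl32"  # or google/t5-efficient-small, google/t5-large-lm-adapt, ...
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

    # One seq2seq training step's worth of inputs and labels
    inputs = tokenizer("summarize: The quick brown fox jumps over the lazy dog.", return_tensors="pt")
    labels = tokenizer("A fox jumps over a dog.", return_tensors="pt").input_ids
    loss = model(**inputs, labels=labels).loss
    print(loss.item())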
Brief-details: RemBERT is Google's multilingual BERT variant trained on 110 languages with decoupled input and output embeddings, intended to be fine-tuned on downstream tasks such as classification.
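RemBERT is meant to be fine-tuned; a minimal sketch using the google/rembert Hub checkpoint, where the classification head is freshly initialized and the label count (3) is arbitrary:

    from transformers import AutoTokenizer, AutoModelForSequenceClassification

    tokenizer = AutoTokenizer.from_pretrained("google/rembert")
    model = AutoModelForSequenceClassification.from_pretrained("google/rembert", num_labels=3)

    # Forward pass on a small multilingual batch; real labels would come from your dataset
    batch = tokenizer(["Das ist großartig.", "これはひどい。"], padding=True, return_tensors="pt")
    print(model(**batch).logits.shape)  # torch.Size([2, 3])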
Brief-details: BERT-based multilingual model pre-trained on 17 Indian languages, optimized for both native and transliterated text processing, achieving strong cross-lingual performance.
Brief-details: FNet-large is Google's transformer variant that replaces self-attention with Fourier transforms; it has 24 layers and a hidden size of 1024 and was pre-trained on C4 with MLM and NSP objectives.
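A quick masked-language-modeling sketch with the fill-mask pipeline, assuming the standard google/fnet-large Hub checkpoint:

    from transformers import pipeline

    # FNet was pre-trained with MLM, so the fill-mask pipeline applies directly
    fill = pipeline("fill-mask", model="google/fnet-large")
    print(fill("The capital of France is [MASK].")[:3])  # top-3 predictions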