BRIEF DETAILS: Multilingual translation model supporting 100 languages with 9,900 translation directions. 12B parameters, MIT licensed, developed by Facebook for many-to-many translation tasks.
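As a minimal usage sketch, translation in any of the 9,900 directions works by setting the source language on the tokenizer and forcing the target-language BOS token at generation time. The checkpoint below (facebook/m2m100_418M, a smaller sibling) is an assumed stand-in for the 12B model; the API is the same.

```python
from transformers import M2M100ForConditionalGeneration, M2M100Tokenizer

# Assumption: the 418M checkpoint stands in for the 12B model; the calls are identical.
model_name = "facebook/m2m100_418M"
tokenizer = M2M100Tokenizer.from_pretrained(model_name)
model = M2M100ForConditionalGeneration.from_pretrained(model_name)

# French -> English: set the source language, then force the English BOS token.
tokenizer.src_lang = "fr"
encoded = tokenizer("La vie est belle.", return_tensors="pt")
generated = model.generate(**encoded, forced_bos_token_id=tokenizer.get_lang_id("en"))
print(tokenizer.batch_decode(generated, skip_special_tokens=True))
```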
Brief-details: ResNet-152: A powerful 60.3M-parameter image classification model from Microsoft, trained on ImageNet-1k. Features residual learning for deep networks.
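A quick way to try an ImageNet-1k classifier like this is the image-classification pipeline; the image URL below is only an illustrative placeholder.

```python
from transformers import pipeline

# Image-classification pipeline around the ImageNet-1k ResNet-152 checkpoint.
classifier = pipeline("image-classification", model="microsoft/resnet-152")

# Placeholder image URL; any local path or URL works.
preds = classifier("http://images.cocodataset.org/val2017/000000039769.jpg")
print(preds[:3])  # top ImageNet labels with scores
```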
Brief-details: T5-based model for correcting capitalization and punctuation in text, trained on DialogSum dataset with 115k records. Apache 2.0 licensed.
Brief Details: French sentence embedding model based on CamemBERT, achieving 82.36% Pearson correlation on STS benchmark. 111M params, Apache 2.0 licensed.
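A minimal embedding sketch using sentence-transformers; the checkpoint id dangvantuan/sentence-camembert-base is an assumption, so substitute the actual CamemBERT embedding model.

```python
from sentence_transformers import SentenceTransformer, util

# Assumption: checkpoint id; swap in the actual CamemBERT sentence-embedding model.
model = SentenceTransformer("dangvantuan/sentence-camembert-base")

sentences = ["Le chat dort sur le canapé.", "Un chat fait la sieste sur le sofa."]
embeddings = model.encode(sentences, convert_to_tensor=True)

# Cosine similarity between the two French sentences.
print(util.cos_sim(embeddings[0], embeddings[1]))
```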
Brief-details: Decision Transformer model for Walker2d environment, trained on expert trajectories. Specializes in continuous control tasks using transformer architecture.
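A rough sketch of a forward pass with transformers' DecisionTransformerModel; the checkpoint id and the Walker2d dimensions (17-dim observations, 6-dim actions) are assumptions to verify against the model card.

```python
import torch
from transformers import DecisionTransformerModel

# Assumption: checkpoint id for the Walker2d expert model.
model = DecisionTransformerModel.from_pretrained("edbeeching/decision-transformer-gym-walker2d-expert")
model.eval()

# Walker2d: 17-dim observations, 6-dim actions (assumed); 20-step context window.
state_dim, act_dim, seq_len = 17, 6, 20
states = torch.randn(1, seq_len, state_dim)
actions = torch.zeros(1, seq_len, act_dim)
returns_to_go = torch.ones(1, seq_len, 1)      # desired return used for conditioning
timesteps = torch.arange(seq_len).unsqueeze(0)
attention_mask = torch.ones(1, seq_len)

with torch.no_grad():
    outputs = model(
        states=states,
        actions=actions,
        returns_to_go=returns_to_go,
        timesteps=timesteps,
        attention_mask=attention_mask,
    )
print(outputs.action_preds.shape)  # (1, seq_len, act_dim): predicted next actions
```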
Brief Details: DETR-based model (41.6M params) specialized in detecting bordered & borderless tables in documents. Built on ResNet-50, Apache 2.0 licensed.
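Table detection of this kind can be driven through the object-detection pipeline; the checkpoint id and the input file name below are assumptions.

```python
from transformers import pipeline

# Assumption: checkpoint id for the DETR table-detection model.
detector = pipeline("object-detection", model="TahaDouaji/detr-doc-table-detection")

# Placeholder path to a scanned document page.
results = detector("document_page.png")
for r in results:
    print(r["label"], round(r["score"], 3), r["box"])  # bordered vs. borderless table boxes
```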
Brief Details: BioLinkBERT-large: State-of-the-art biomedical NLP model pretrained on PubMed abstracts together with their citation links, achieving top results on biomedical benchmarks such as BLURB and MedQA-USMLE.
Brief Details: Document Image Transformer (DiT) fine-tuned on RVL-CDIP dataset for document classification, based on BEiT architecture with 16 classes
Brief Details: PathologyBERT is a specialized BERT model trained on breast pathology specimen reports, optimized for medical terminology and pathology-specific language tasks.
Brief-details: LegalBERT - A specialized BERT model trained on 3.4M legal decisions (37GB), fine-tuned for legal text analysis and classification tasks.
Brief-details: Financial domain RoBERTa model specialized in hypernym identification, fine-tuned on FIBO ontology data. Achieves 73% accuracy in financial term classification.
Brief-details: RoBERTa-Large model fine-tuned on multiple NLI datasets (SNLI, MNLI, FEVER, ANLI), specialized for natural language inference tasks.
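A minimal inference sketch for premise-hypothesis classification; the checkpoint id is an assumption, and the label order should be read from the config rather than hard-coded.

```python
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

# Assumption: checkpoint id for the RoBERTa-Large NLI model.
model_name = "ynie/roberta-large-snli_mnli_fever_anli_R1_R2_R3-nli"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)

premise = "A soccer game with multiple males playing."
hypothesis = "Some men are playing a sport."

inputs = tokenizer(premise, hypothesis, return_tensors="pt", truncation=True)
with torch.no_grad():
    probs = torch.softmax(model(**inputs).logits, dim=-1)

# Map probabilities back to the checkpoint's own label names.
print({model.config.id2label[i]: round(p.item(), 3) for i, p in enumerate(probs[0])})
```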
Brief Details: GPT2-Large Dutch language model with 812M params, trained on cleaned mC4 dataset. Achieves 15.1 perplexity, specialized for Dutch text generation.
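A short generation sketch; the checkpoint id is an assumption and the sampling settings are illustrative only.

```python
from transformers import pipeline

# Assumption: checkpoint id for the Dutch GPT2-Large model.
generator = pipeline("text-generation", model="yhavinga/gpt2-large-dutch")

# "Het was een mooie zomerdag" = "It was a beautiful summer day".
out = generator("Het was een mooie zomerdag", max_new_tokens=40, do_sample=True, top_p=0.95)
print(out[0]["generated_text"])
```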
Brief Details: A Chinese ALBERT base model for masked language modeling, featuring 10.7M parameters. Requires BertTokenizer rather than AutoTokenizer for proper operation. Popular with 1,029+ downloads.
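Because AutoTokenizer can pick the wrong tokenizer class for Chinese ALBERT checkpoints, a common pattern is to pair BertTokenizer with AlbertForMaskedLM; the checkpoint id below is an assumption.

```python
import torch
from transformers import BertTokenizer, AlbertForMaskedLM

# Assumption: checkpoint id. Chinese ALBERT models ship a BERT-style vocab,
# so BertTokenizer is used explicitly instead of AutoTokenizer.
model_name = "voidful/albert_chinese_base"
tokenizer = BertTokenizer.from_pretrained(model_name)
model = AlbertForMaskedLM.from_pretrained(model_name)

text = "中国的首都是[MASK]京。"  # "The capital of China is [MASK]jing."
inputs = tokenizer(text, return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits

# Decode the top prediction at the masked position.
mask_pos = (inputs["input_ids"][0] == tokenizer.mask_token_id).nonzero(as_tuple=True)[0]
print(tokenizer.decode(logits[0, mask_pos].argmax(dim=-1)))
```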
Brief-details: ColBERT-based ranking model optimized for efficient passage search, featuring 22.3M parameters and achieving 0.364 MRR@10 on the MS MARCO dev set.
BRIEF-DETAILS: T5-based question generation model that creates questions from highlighted text spans. Popular with 5.8K downloads, MIT licensed, and built for answer-aware QG tasks.
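Answer-aware QG models of this kind typically expect the answer span wrapped in <hl> markers inside the context; the checkpoint id and input format below are assumptions modeled on common T5 QG checkpoints.

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# Assumption: checkpoint id and <hl>-highlight input format.
model_name = "valhalla/t5-base-qg-hl"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

# The answer ("42") is highlighted inside the context.
text = "generate question: <hl> 42 <hl> is the answer to life, the universe and everything."
inputs = tokenizer(text, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```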
BRIEF DETAILS: Chinese T5 v1.1 small model trained on CLUECorpusSmall dataset, optimized for text-to-text generation tasks with GEGLU activation and improved architecture (8 layers, 512 hidden size).
Brief Details: T5-based English-Portuguese translation model trained on diverse corpora including biomedical texts. 9.4K+ downloads, optimized for modest hardware.
Brief Details: Fine-tuned RoBERTa-large model with 355M parameters, specialized for math education NLP tasks, trained on 3M math discussions.
Brief Details: MT5-based multilingual reranker fine-tuned on the mMARCO dataset, supporting 9 languages including Portuguese. MIT licensed; framed as a text-to-text model that scores query-passage relevance by generation.
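Rerankers in this family usually read a "Query: ... Document: ... Relevant:" prompt and compare the probabilities of a positive and a negative target token. The checkpoint id and the "yes"/"no" tokens below are assumptions; the actual relevance tokens should be taken from the model card.

```python
import torch
from transformers import AutoTokenizer, MT5ForConditionalGeneration

# Assumptions: checkpoint id and the "yes"/"no" relevance tokens.
model_name = "unicamp-dl/mt5-base-mmarco-v2"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = MT5ForConditionalGeneration.from_pretrained(model_name)

query = "qual é a capital do Brasil?"
passage = "Brasília é a capital federal do Brasil desde 1960."
prompt = f"Query: {query} Document: {passage} Relevant:"

inputs = tokenizer(prompt, return_tensors="pt")
start = torch.full((1, 1), model.config.decoder_start_token_id, dtype=torch.long)
with torch.no_grad():
    logits = model(**inputs, decoder_input_ids=start).logits[0, -1]

# Relevance score = softmax over the positive/negative token logits.
yes_id = tokenizer.encode("yes", add_special_tokens=False)[0]
no_id = tokenizer.encode("no", add_special_tokens=False)[0]
score = torch.softmax(logits[[yes_id, no_id]], dim=0)[0].item()
print(f"relevance score: {score:.3f}")
```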
BRIEF DETAILS: GPT2-based Chinese poem generator trained on 800k poems. Pre-trained with the UER-py framework, it supports ancient Chinese poetry generation with a specialized vocabulary and transformer architecture.
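UER-converted GPT2 checkpoints typically pair GPT2LMHeadModel with BertTokenizer and expect a leading [CLS] token in the prompt; the checkpoint id below is an assumption.

```python
from transformers import BertTokenizer, GPT2LMHeadModel, TextGenerationPipeline

# Assumption: checkpoint id for the UER-py Chinese poem model.
model_name = "uer/gpt2-chinese-poem"
tokenizer = BertTokenizer.from_pretrained(model_name)
model = GPT2LMHeadModel.from_pretrained(model_name)

generator = TextGenerationPipeline(model, tokenizer)
# Prompt is the opening line of a classical poem; [CLS] marks the start of generation.
print(generator("[CLS]梅 山 如 积 翠 ，", max_length=50, do_sample=True)[0]["generated_text"])
```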