Brief-details: BERT base model for Japanese with whole word masking, pretrained on Japanese Wikipedia. 12-layer architecture; text is tokenized into words with MeCab and the IPA dictionary, then into subwords, with a 32K vocabulary.
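A minimal fill-mask sketch, assuming this entry describes the cl-tohoku/bert-base-japanese-whole-word-masking checkpoint; the Japanese tokenizer additionally needs `pip install fugashi ipadic`.

```python
from transformers import pipeline

# Assumed checkpoint id; requires fugashi + ipadic for MeCab tokenization.
fill_mask = pipeline(
    "fill-mask",
    model="cl-tohoku/bert-base-japanese-whole-word-masking",
)

# Predict the masked word in "I went to [MASK] in Tokyo."
for pred in fill_mask("東京で[MASK]に行きました。"):
    print(pred["token_str"], round(pred["score"], 3))
```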
Brief-details: DistilBERT base cased - a compact, faster distillation of BERT with 65.8M parameters, pretrained on BookCorpus and English Wikipedia. Usable for masked language modeling out of the box and well suited to fine-tuning for sequence classification.
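A quick sketch of both advertised uses: masked-LM inference with the pretrained weights, and loading a fresh (untrained) classification head for fine-tuning.

```python
from transformers import pipeline, AutoModelForSequenceClassification

# Masked language modeling works directly with the pretrained weights.
fill_mask = pipeline("fill-mask", model="distilbert-base-cased")
print(fill_mask("The capital of France is [MASK].")[0]["token_str"])

# For sequence classification, the head below is randomly initialized
# and still needs fine-tuning on labeled data.
clf = AutoModelForSequenceClassification.from_pretrained(
    "distilbert-base-cased", num_labels=2
)
```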
Brief-details: A powerful CLIP model trained on 5B filtered image-text pairs, achieving 84.2% zero-shot top-1 accuracy on ImageNet-1K and strong zero-shot classification across benchmarks.
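A zero-shot classification sketch with the open_clip library; the checkpoint id is an assumption (the figures above match Apple's DFN5B CLIP release).

```python
import torch
import open_clip
from PIL import Image

repo = "hf-hub:apple/DFN5B-CLIP-ViT-H-14"  # assumed checkpoint id
model, preprocess = open_clip.create_model_from_pretrained(repo)
tokenizer = open_clip.get_tokenizer(repo)

image = preprocess(Image.open("cat.jpg")).unsqueeze(0)
labels = ["a photo of a cat", "a photo of a dog"]
text = tokenizer(labels)

with torch.no_grad():
    img = model.encode_image(image)
    txt = model.encode_text(text)
    # Normalize embeddings, then softmax over image-text similarities.
    img = img / img.norm(dim=-1, keepdim=True)
    txt = txt / txt.norm(dim=-1, keepdim=True)
    probs = (100.0 * img @ txt.T).softmax(dim=-1)

print(dict(zip(labels, probs[0].tolist())))
```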
Brief-details: RoBERTa-based NER model fine-tuned on OntoNotes5, achieving 90.86% F1 score. Specialized in recognizing 18 entity types with high precision and recall.
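An entity-extraction sketch; the checkpoint id is an assumption (a public OntoNotes5 RoBERTa matching the F1 quoted above).

```python
from transformers import pipeline

ner = pipeline(
    "token-classification",
    model="tner/roberta-large-ontonotes5",  # assumed checkpoint id
    aggregation_strategy="simple",          # merge sub-tokens into entity spans
)
for ent in ner("Tim Cook announced the iPhone in Cupertino in September."):
    print(ent["entity_group"], ent["word"], round(float(ent["score"]), 2))
```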
Brief-details: ELECTRA small discriminator model by Google - an efficient pre-trained transformer encoder trained with replaced-token detection, a GAN-like objective in which the discriminator spots tokens swapped in by a small generator; a lightweight backbone for downstream fine-tuning.
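The discriminator can also be used as-is to score which tokens look replaced; a minimal sketch of that replaced-token-detection objective:

```python
import torch
from transformers import ElectraForPreTraining, ElectraTokenizerFast

name = "google/electra-small-discriminator"
tok = ElectraTokenizerFast.from_pretrained(name)
model = ElectraForPreTraining.from_pretrained(name)

# "ate" plausibly replaces an original token such as "cooked";
# positive logits mean the discriminator thinks a token was replaced.
inputs = tok("The chef ate the meal.", return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits
tokens = tok.convert_ids_to_tokens(inputs.input_ids[0])
print([(t, round(s, 2)) for t, s in zip(tokens, logits[0].tolist())])
```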
Brief-details: Portuguese BERT model for financial sentiment analysis, trained on 1.4M texts and reporting high accuracy on market sentiment classification.
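A classification sketch; the checkpoint id is an assumption (a public Brazilian-Portuguese financial BERT matching this description).

```python
from transformers import pipeline

clf = pipeline("text-classification", model="lucas-leme/FinBERT-PT-BR")  # assumed id
# "The market reacted well to the company's earnings report."
print(clf("O mercado reagiu bem ao balanço da empresa."))
```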
Brief-details: A state-of-the-art monocular depth estimation model trained on 595K labeled synthetic images plus 62M unlabeled real images, delivering efficient depth predictions at only 24.8M parameters.
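A depth-estimation sketch, assuming the small Depth Anything V2 checkpoint ported for transformers (the 24.8M-parameter variant):

```python
from transformers import pipeline
from PIL import Image

depth = pipeline(
    "depth-estimation",
    model="depth-anything/Depth-Anything-V2-Small-hf",  # assumed checkpoint id
)
result = depth(Image.open("room.jpg"))
result["depth"].save("room_depth.png")  # per-pixel depth map as a PIL image
```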
Brief-details: Japanese CLOOB model for image-text understanding, pairing a ViT-B/16 image encoder with a Japanese text encoder (197M parameters in total); trained on CC12M for Japanese text-image alignment.
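A retrieval sketch using rinna's japanese-clip helper library (`pip install japanese-clip`); the function names follow that project's README and are assumptions here:

```python
import torch
import japanese_clip as ja_clip
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = ja_clip.load("rinna/japanese-cloob-vit-b-16", device=device)
tokenizer = ja_clip.load_tokenizer()

image = preprocess(Image.open("dog.jpg")).unsqueeze(0).to(device)
# Candidate captions: "dog", "cat", "elephant"
encodings = ja_clip.tokenize(texts=["犬", "猫", "象"], device=device, tokenizer=tokenizer)

with torch.no_grad():
    image_features = model.get_image_features(image)
    text_features = model.get_text_features(**encodings)
    probs = (100.0 * image_features @ text_features.T).softmax(dim=-1)
print(probs)
```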
Brief-details: LLaVA-v1.5-13B is a powerful multimodal chatbot combining vision and language capabilities, built on Vicuna (a LLaMA derivative) and trained on 558K+ vision-language pairs.
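A vision-chat sketch, assuming the llava-hf transformers port of the 13B checkpoint (fp16 weights need roughly 26 GB of GPU memory):

```python
import torch
from transformers import AutoProcessor, LlavaForConditionalGeneration
from PIL import Image

mid = "llava-hf/llava-1.5-13b-hf"  # assumed checkpoint id
processor = AutoProcessor.from_pretrained(mid)
model = LlavaForConditionalGeneration.from_pretrained(
    mid, torch_dtype=torch.float16, device_map="auto"
)

prompt = "USER: <image>\nWhat is unusual about this picture? ASSISTANT:"
inputs = processor(
    images=Image.open("photo.jpg"), text=prompt, return_tensors="pt"
).to(model.device)
out = model.generate(**inputs, max_new_tokens=100)
print(processor.decode(out[0], skip_special_tokens=True))
```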
Brief-details: TabPFNMix - a 39M-parameter transformer-based tabular classifier, pre-trained on synthetic datasets and classifying new tables via in-context learning.
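A heavily hedged training sketch: recent AutoGluon releases expose TabPFNMix through a "TABPFNMIX" model key, but the exact hyperparameter names below are assumptions and may vary by version.

```python
from autogluon.tabular import TabularDataset, TabularPredictor

train = TabularDataset("train.csv")  # hypothetical dataset with a "class" column
predictor = TabularPredictor(label="class").fit(
    train,
    hyperparameters={
        # Assumed key/values, following the model card's AutoGluon example.
        "TABPFNMIX": {"model_path_classifier": "autogluon/tabpfn-mix-1.0-classifier"},
    },
)
print(predictor.predict(TabularDataset("test.csv")))
```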
Brief-details: Doc2query T5-based model for document expansion and query generation; generates relevant queries from text passages to improve search relevance and to generate training data.
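A query-generation sketch; the checkpoint id is an assumption (one of the public MS MARCO doc2query T5 models). Sampling several outputs per passage is the usual recipe for document expansion:

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

mid = "doc2query/msmarco-t5-base-v1"  # assumed checkpoint id
tok = AutoTokenizer.from_pretrained(mid)
model = AutoModelForSeq2SeqLM.from_pretrained(mid)

passage = "Python is a high-level programming language created by Guido van Rossum."
ids = tok(passage, return_tensors="pt").input_ids
# Sample diverse queries to append to the document before indexing.
outs = model.generate(ids, max_length=64, do_sample=True, top_k=10, num_return_sequences=3)
for o in outs:
    print(tok.decode(o, skip_special_tokens=True))
```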
Brief-details: A LoRA-based photorealism enhancer for FLUX.1-dev, improving realistic image generation; 392K+ downloads, non-commercial license.
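A diffusers loading sketch; both repo ids are assumptions (FLUX.1-dev is gated, and whether a given LoRA loads directly depends on the diffusers version):

```python
import torch
from diffusers import FluxPipeline

pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16  # gated base model
)
pipe.load_lora_weights("XLabs-AI/flux-RealismLora")  # assumed LoRA repo id
pipe.to("cuda")

image = pipe("candid street photo, golden hour", num_inference_steps=28).images[0]
image.save("realism.png")
```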
Brief-details: A powerful NSFW image classifier built on FocalNet, offering 3-category content moderation (Safe/Questionable/Unsafe) with 95%+ accuracy and 87.1M parameters.
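A moderation sketch; the checkpoint id is an assumption (a public FocalNet-based classifier with these three labels):

```python
from transformers import pipeline
from PIL import Image

moderate = pipeline(
    "image-classification",
    model="TostAI/nsfw-image-detection-large",  # assumed checkpoint id
)
for pred in moderate(Image.open("upload.jpg")):
    print(pred["label"], round(pred["score"], 3))
```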
Brief-details: 4-bit quantized Meta Llama 3.1 8B model optimized for multilingual dialogue, supports 8 languages, requires 4GB VRAM, community-driven AWQ quantization
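A generation sketch, assuming the hugging-quants AWQ-INT4 build of Llama 3.1 8B Instruct; loading it requires `pip install autoawq` and a CUDA GPU:

```python
from transformers import pipeline

chat = pipeline(
    "text-generation",
    model="hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4",  # assumed id
    device_map="auto",
)
print(chat("Explain AWQ quantization in one sentence.", max_new_tokens=64)[0]["generated_text"])
```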
Brief-details: LaBSE (Language-agnostic BERT Sentence Embedding) - a powerful multilingual sentence embedding model supporting 110 languages with strong cross-lingual capabilities.
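A cross-lingual embedding sketch via sentence-transformers; parallel sentences land near each other regardless of language:

```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("sentence-transformers/LaBSE")
emb = model.encode(["Hello, world!", "Hallo Welt!", "Bonjour le monde !"])
print(util.cos_sim(emb, emb))  # high pairwise scores for these parallel phrases
```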
Brief-details: BERT-mini model fine-tuned for query classification, distinguishing between keyword queries and questions/statements. 11.2M params, 99% validation accuracy.
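A routing sketch; the checkpoint id is an assumption (a public BERT-mini keyword-vs-question classifier):

```python
from transformers import pipeline

clf = pipeline(
    "text-classification",
    model="shahrukhx01/bert-mini-finetune-question-detection",  # assumed id
)
print(clf("best pizza nyc"))                      # keyword-style query
print(clf("Where can I get good pizza in NYC?"))  # natural-language question
```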
Brief-details: Cutting-edge anime-style text-to-image model built on SDXL, featuring improved hand anatomy, enhanced concept understanding, and refined aesthetic generation capabilities.
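A text-to-image sketch with diffusers; the checkpoint id is an assumption (a widely used SDXL anime model matching this description):

```python
import torch
from diffusers import StableDiffusionXLPipeline

pipe = StableDiffusionXLPipeline.from_pretrained(
    "cagliostrolab/animagine-xl-3.1", torch_dtype=torch.float16  # assumed id
).to("cuda")

image = pipe(
    "1girl, cherry blossoms, masterpiece, best quality",
    num_inference_steps=28,
).images[0]
image.save("anime.png")
```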
Brief-details: Lightweight hybrid vision model combining CNN and transformer components, with 5.6M parameters achieving 78.4% ImageNet top-1 accuracy; ideal for mobile applications.
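A classification sketch, assuming this entry describes the apple/mobilevit-small checkpoint (whose parameter count and ImageNet score match):

```python
from transformers import pipeline
from PIL import Image

clf = pipeline("image-classification", model="apple/mobilevit-small")  # assumed id
print(clf(Image.open("dog.jpg"))[0])  # top-1 ImageNet label with its score
```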
Brief-details: Efficient speech recognition model supporting 99 languages, running on the CTranslate2 inference engine under an MIT license. Popular, with 400K+ downloads, and optimized for fast inference.
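A transcription sketch with the faster-whisper library, which wraps Whisper checkpoints converted to CTranslate2:

```python
from faster_whisper import WhisperModel

model = WhisperModel("large-v3", device="cuda", compute_type="float16")
segments, info = model.transcribe("audio.mp3", beam_size=5)

print(info.language, round(info.language_probability, 2))
for seg in segments:  # segments is a generator; iterating runs the transcription
    print(f"[{seg.start:.1f}s -> {seg.end:.1f}s] {seg.text}")
```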
Brief-details: LaBSE-en-ru is a truncated bilingual version of LaBSE for English-Russian sentence embeddings, keeping only English and Russian tokens and shrinking to 27% of the original size while maintaining embedding quality.
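A bilingual-similarity sketch, assuming the cointegrated/LaBSE-en-ru checkpoint, which (per its card) uses the pooler output as the sentence embedding:

```python
import torch
from transformers import AutoTokenizer, AutoModel

mid = "cointegrated/LaBSE-en-ru"  # assumed checkpoint id
tok = AutoTokenizer.from_pretrained(mid)
model = AutoModel.from_pretrained(mid)

batch = tok(["Hello, world!", "Привет, мир!"], padding=True, return_tensors="pt")
with torch.no_grad():
    emb = model(**batch).pooler_output
emb = torch.nn.functional.normalize(emb, dim=-1)
print((emb[0] @ emb[1]).item())  # cosine similarity of the EN/RU pair
```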
Brief-details: A gibberish detection model with 67M parameters, achieving 97.36% accuracy. Built on DistilBERT, it classifies text into four levels (from clean to noise) for content quality assessment.
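A filtering sketch; the checkpoint id is an assumption (a public DistilBERT gibberish detector with four labels, clean through noise):

```python
from transformers import pipeline

clf = pipeline(
    "text-classification",
    model="madhurjindal/autonlp-Gibberish-Detector-492513457",  # assumed id
)
print(clf("I love machine learning"))  # expected: clean
print(clf("asdkf jkladf klsd"))        # expected: noise / gibberish
```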