Brief-details: Hermes-3 is a 405B parameter LLM built on Llama-3.1, offering advanced capabilities in reasoning, roleplaying, and function calling with ChatML format support.
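A minimal usage sketch via transformers, assuming the NousResearch/Hermes-3-Llama-3.1-405B repo id; the messages are illustrative, and a 405B checkpoint needs multi-GPU or quantized serving in practice.

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

# Assumed repo id for illustration; 405B weights require multi-GPU or quantized serving.
model_id = "NousResearch/Hermes-3-Llama-3.1-405B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto", torch_dtype="auto")

# ChatML-style turns; apply_chat_template renders them with <|im_start|>/<|im_end|> markers.
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Explain function calling in one sentence."},
]
input_ids = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt").to(model.device)
output = model.generate(input_ids, max_new_tokens=128)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```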
Brief-details: A tiny variant of Swin Transformer V2 optimized for 256x256 images, featuring hierarchical feature extraction and efficient shifted-window self-attention for computer vision tasks.
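A short classification sketch with transformers; the microsoft/swinv2-tiny-patch4-window8-256 repo id and the sample image URL are assumptions for illustration.

```python
import torch
import requests
from PIL import Image
from transformers import AutoImageProcessor, Swinv2ForImageClassification

# Assumed repo id; any SwinV2-tiny 256px checkpoint with a classification head loads the same way.
model_id = "microsoft/swinv2-tiny-patch4-window8-256"
processor = AutoImageProcessor.from_pretrained(model_id)
model = Swinv2ForImageClassification.from_pretrained(model_id)

url = "http://images.cocodataset.org/val2017/000000039769.jpg"  # sample image
image = Image.open(requests.get(url, stream=True).raw)
inputs = processor(images=image, return_tensors="pt")  # resizes and normalizes to 256x256
with torch.no_grad():
    logits = model(**inputs).logits
print(model.config.id2label[logits.argmax(-1).item()])
```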
Brief Details: A lightweight ALBERT-based sentence embedding model with 11.7M parameters, optimized for semantic similarity and paraphrase detection. Maps text to 768-dimensional vectors.
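A minimal sentence-transformers sketch, assuming the sentence-transformers/paraphrase-albert-small-v2 repo id for the model described above.

```python
from sentence_transformers import SentenceTransformer, util

# Assumed repo id for the ALBERT-small paraphrase model.
model = SentenceTransformer("sentence-transformers/paraphrase-albert-small-v2")

sentences = ["A man is eating food.", "Someone is having a meal."]
embeddings = model.encode(sentences, convert_to_tensor=True)  # shape (2, 768)
print(util.cos_sim(embeddings[0], embeddings[1]).item())      # high score for paraphrases
```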
Brief-details: ControlNet model trained on canny edge detection, enabling precise control over Stable Diffusion image generation through edge maps. Trained on 3M edge-image-caption pairs over 600 GPU-hours on an A100.
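A hedged diffusers sketch: the lllyasviel/sd-controlnet-canny repo id, the SD 1.5 base checkpoint it is paired with, and the file paths are assumptions for illustration.

```python
import cv2
import numpy as np
import torch
from PIL import Image
from diffusers import ControlNetModel, StableDiffusionControlNetPipeline

# Assumed repo ids: the canny ControlNet plus a standard SD 1.5 base checkpoint.
controlnet = ControlNetModel.from_pretrained("lllyasviel/sd-controlnet-canny", torch_dtype=torch.float16)
pipe = StableDiffusionControlNetPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", controlnet=controlnet, torch_dtype=torch.float16
).to("cuda")

# Build the conditioning image: Canny edges of an input photo (path is a placeholder).
source = np.array(Image.open("input.png").convert("RGB"))
edges = cv2.Canny(source, 100, 200)
canny_image = Image.fromarray(np.stack([edges] * 3, axis=-1))

result = pipe("a futuristic city at dusk", image=canny_image, num_inference_steps=30).images[0]
result.save("controlled.png")
```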
Brief-details: A CTranslate2-optimized version of OpenAI's Whisper-small for efficient speech recognition, supporting 99 languages with MIT license and float16 precision.
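A minimal faster-whisper sketch; the audio path is a placeholder and a GPU with float16 support is assumed.

```python
from faster_whisper import WhisperModel

# CTranslate2 backend; "small" resolves to the converted Whisper-small weights.
model = WhisperModel("small", device="cuda", compute_type="float16")

# audio.wav is a placeholder path; the language is auto-detected when not specified.
segments, info = model.transcribe("audio.wav", beam_size=5)
print(f"Detected language: {info.language} (p={info.language_probability:.2f})")
for segment in segments:
    print(f"[{segment.start:.2f}s -> {segment.end:.2f}s] {segment.text}")
```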
Brief-details: A lightweight Phi-3 variant with 2.07M parameters, featuring 2 hidden layers and 4 attention heads, designed for experimental text generation tasks.
Brief-details: A powerful 14B parameter multilingual LLM excelling in English, Chinese, Japanese and Korean. Features strong reasoning, long context support (320k tokens) and multiple specialized variants.
Brief Details: MedCPT-Query-Encoder is a 109M parameter biomedical text embedding model trained on 255M PubMed query-article pairs for semantic search and retrieval.
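A short retrieval-embedding sketch with transformers, assuming the ncbi/MedCPT-Query-Encoder repo id; the queries are illustrative.

```python
import torch
from transformers import AutoTokenizer, AutoModel

# Assumed repo id for the query-side encoder of MedCPT.
model_id = "ncbi/MedCPT-Query-Encoder"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModel.from_pretrained(model_id)

queries = ["diabetes treatment", "covid vaccine efficacy in older adults"]
inputs = tokenizer(queries, truncation=True, padding=True, max_length=64, return_tensors="pt")
with torch.no_grad():
    # The [CLS] vector serves as the query embedding for dense retrieval.
    embeddings = model(**inputs).last_hidden_state[:, 0, :]
print(embeddings.shape)  # (2, 768)
```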
Brief-details: A Turkish BERT-based sentence transformer model that maps text to 768-dimensional vectors, trained on NLI and STS-B datasets with strong semantic similarity performance (0.83+ correlation scores).
Brief-details: SDXL-based text-to-image diffusion model with a high download count (326k+), optimized for artistic generation and run through the standard Stable Diffusion XL pipeline.
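The specific checkpoint is not named above, so this sketch loads the reference SDXL base weights (stabilityai/stable-diffusion-xl-base-1.0) as a stand-in; any SDXL-architecture repo loads the same way.

```python
import torch
from diffusers import StableDiffusionXLPipeline

# Stand-in repo id: swap in the SDXL checkpoint you actually want.
pipe = StableDiffusionXLPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0", torch_dtype=torch.float16, variant="fp16"
).to("cuda")

image = pipe(
    "an impressionist oil painting of a harbor at sunrise",
    num_inference_steps=30,
    guidance_scale=7.0,
).images[0]
image.save("sdxl_art.png")
```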
Brief Details: YOLOS-tiny: Lightweight Vision Transformer (6.49M params) for object detection, achieving 28.7 AP on COCO. Apache 2.0 licensed, ideal for efficient deployment.
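A detection sketch with transformers, assuming the hustvl/yolos-tiny repo id; the image path is a placeholder.

```python
import torch
from PIL import Image
from transformers import AutoImageProcessor, YolosForObjectDetection

# Assumed repo id for the tiny YOLOS checkpoint.
model_id = "hustvl/yolos-tiny"
processor = AutoImageProcessor.from_pretrained(model_id)
model = YolosForObjectDetection.from_pretrained(model_id)

image = Image.open("street.jpg").convert("RGB")  # placeholder path
inputs = processor(images=image, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# Convert raw predictions into COCO-style boxes above a confidence threshold.
target_sizes = torch.tensor([image.size[::-1]])
results = processor.post_process_object_detection(outputs, threshold=0.9, target_sizes=target_sizes)[0]
for score, label, box in zip(results["scores"], results["labels"], results["boxes"]):
    print(model.config.id2label[label.item()], round(score.item(), 3), [round(v, 1) for v in box.tolist()])
```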
Brief-details: A powerful BERT-based QA model with 335M parameters, achieving 80.88% exact match on SQuAD 2.0. Specializes in extractive question answering.
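A question-answering pipeline sketch; the deepset/bert-large-uncased-whole-word-masking-squad2 repo id and the toy context are assumptions.

```python
from transformers import pipeline

# Assumed repo id; any SQuAD 2.0-style extractive QA checkpoint plugs into the same pipeline.
qa = pipeline("question-answering", model="deepset/bert-large-uncased-whole-word-masking-squad2")

result = qa(
    question="How many parameters does the model have?",
    context="The extractive QA model is based on BERT-large and has 335 million parameters.",
)
print(result["answer"], result["score"])  # answer span extracted from the context, with confidence
```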
Brief Details: BERT base Japanese model trained on CC-100 and Wikipedia, featuring word-level tokenization with the Unidic 2.1.2 dictionary (followed by WordPiece subword splitting) and whole-word masking during pretraining.
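A fill-mask sketch, assuming the tohoku-nlp/bert-base-japanese-v3 repo id; the tokenizer needs the fugashi and unidic-lite packages for MeCab/Unidic segmentation.

```python
from transformers import pipeline

# Assumed repo id; install fugashi and unidic-lite for the word-level tokenizer.
fill_mask = pipeline("fill-mask", model="tohoku-nlp/bert-base-japanese-v3")

# Predict the masked word in "Tokyo is the [MASK] of Japan."
for candidate in fill_mask("東京は日本の[MASK]です。")[:3]:
    print(candidate["token_str"], round(candidate["score"], 3))
```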
Brief Details: A fine-tuned XLSR-53 large model for Polish speech recognition, achieving 14.21% WER on Common Voice, with 339K+ downloads and Apache 2.0 license.
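An ASR pipeline sketch, assuming the jonatasgrosman/wav2vec2-large-xlsr-53-polish repo id; the audio path is a placeholder.

```python
from transformers import pipeline

# Assumed repo id for the Polish XLSR-53 fine-tune.
asr = pipeline("automatic-speech-recognition", model="jonatasgrosman/wav2vec2-large-xlsr-53-polish")

# Placeholder path; the pipeline decodes the file to the model's expected 16 kHz rate.
print(asr("polish_sample.wav")["text"])
```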
Brief-details: Mask2Former model (216M params) for semantic segmentation with a Swin backbone, fine-tuned on the Cityscapes dataset and built on the masked-attention Transformer architecture.
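A semantic-segmentation sketch with transformers, assuming the facebook/mask2former-swin-large-cityscapes-semantic repo id; the image path is a placeholder.

```python
import torch
from PIL import Image
from transformers import AutoImageProcessor, Mask2FormerForUniversalSegmentation

# Assumed repo id for the Swin-large Cityscapes semantic checkpoint.
model_id = "facebook/mask2former-swin-large-cityscapes-semantic"
processor = AutoImageProcessor.from_pretrained(model_id)
model = Mask2FormerForUniversalSegmentation.from_pretrained(model_id)

image = Image.open("street_scene.png").convert("RGB")  # placeholder path
inputs = processor(images=image, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# Merge per-query mask and class predictions into one label map at the original resolution.
segmentation = processor.post_process_semantic_segmentation(outputs, target_sizes=[image.size[::-1]])[0]
print(segmentation.shape)  # (H, W) tensor of Cityscapes class ids
```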
Brief Details: UniXcoder-base is a unified cross-modal pre-trained model for code representation, built on RoBERTa and trained on multimodal data (source code, code comments, and ASTs).
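A code-embedding sketch using plain transformers, assuming the microsoft/unixcoder-base repo id; mean pooling over token states is an illustrative choice (the official UniXcoder examples ship their own thin wrapper class).

```python
import torch
from transformers import AutoTokenizer, AutoModel

# Assumed repo id; the checkpoint loads as a RoBERTa encoder.
model_id = "microsoft/unixcoder-base"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModel.from_pretrained(model_id)

snippet = "def add(a, b):\n    return a + b"
inputs = tokenizer(snippet, return_tensors="pt", truncation=True, max_length=256)
with torch.no_grad():
    hidden = model(**inputs).last_hidden_state  # (1, seq_len, 768)
embedding = hidden.mean(dim=1)  # simple mean-pooled code representation, (1, 768)
print(embedding.shape)
```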
Brief Details: ESM-2 protein language model with 650M parameters. Trained on masked language modeling for protein sequences. Mid-tier model balancing performance and efficiency.
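A masked-residue prediction sketch, assuming the facebook/esm2_t33_650M_UR50D repo id; the protein sequence is a toy example.

```python
import torch
from transformers import AutoTokenizer, EsmForMaskedLM

# Assumed repo id for the 650M ESM-2 checkpoint.
model_id = "facebook/esm2_t33_650M_UR50D"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = EsmForMaskedLM.from_pretrained(model_id)

sequence = "MKTAYIAKQR<mask>ISFVKSHFSRQLEERLGLIEVQ"  # toy sequence with one masked residue
inputs = tokenizer(sequence, return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits

# Rank amino acids for the masked position.
mask_pos = (inputs.input_ids == tokenizer.mask_token_id).nonzero(as_tuple=True)[1]
top = logits[0, mask_pos].softmax(dim=-1).topk(3)
for score, token_id in zip(top.values[0], top.indices[0]):
    print(tokenizer.convert_ids_to_tokens(token_id.item()), round(score.item(), 3))
```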
Brief-details: ViTMatte is a Vision Transformer-based image matting model with 25.8M parameters, implemented in PyTorch and producing high-quality alpha mattes for foreground estimation.
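An image-matting sketch with transformers, assuming the hustvl/vitmatte-small-composition-1k repo id; ViTMatte needs both an RGB image and a trimap, and the file paths are placeholders.

```python
import torch
from PIL import Image
from transformers import VitMatteImageProcessor, VitMatteForImageMatting

# Assumed repo id; image and trimap paths are placeholders.
model_id = "hustvl/vitmatte-small-composition-1k"
processor = VitMatteImageProcessor.from_pretrained(model_id)
model = VitMatteForImageMatting.from_pretrained(model_id)

image = Image.open("photo.png").convert("RGB")
trimap = Image.open("trimap.png").convert("L")  # gray marks the unknown region to resolve

inputs = processor(images=image, trimaps=trimap, return_tensors="pt")
with torch.no_grad():
    alphas = model(**inputs).alphas  # (1, 1, H, W) alpha matte in [0, 1]
print(alphas.shape)
```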
Brief-details: A powerful English NER model built with Flair, achieving 93.06% F1-score on CoNLL-03. Identifies PER, LOC, ORG, and MISC entities using LSTM-CRF architecture.
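A Flair tagging sketch, assuming the flair/ner-english model id for the 4-class tagger described above.

```python
from flair.data import Sentence
from flair.models import SequenceTagger

# Assumed model id for the standard 4-class (PER/LOC/ORG/MISC) English tagger.
tagger = SequenceTagger.load("flair/ner-english")

sentence = Sentence("George Washington went to Washington.")
tagger.predict(sentence)

for entity in sentence.get_spans("ner"):
    print(entity.text, entity.tag, round(entity.score, 3))
```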
Brief Details: LLM2Vec-Mistral: A powerful text encoder built by converting a decoder-only LLM into an embedding model via bidirectional attention and masked next-token prediction (MNTP).
Brief Details: PhoBERT-base-v2 is a state-of-the-art Vietnamese language model with 135M parameters, pre-trained on 140GB of Vietnamese text and serving as a strong base for downstream NLP tasks.
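A feature-extraction sketch, assuming the vinai/phobert-base-v2 repo id; PhoBERT expects word-segmented Vietnamese input, so the example sentence is pre-segmented with underscores.

```python
import torch
from transformers import AutoTokenizer, AutoModel

# Assumed repo id; input should be word-segmented (e.g. with VnCoreNLP) before tokenization.
model_id = "vinai/phobert-base-v2"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModel.from_pretrained(model_id)

sentence = "Chúng_tôi là những nghiên_cứu_viên ."  # pre-segmented example sentence
inputs = tokenizer(sentence, return_tensors="pt")
with torch.no_grad():
    features = model(**inputs).last_hidden_state  # (1, seq_len, 768) contextual features
print(features.shape)
```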