Brief-details: A powerful English text processing model that restores punctuation, capitalization, and sentence boundaries in a single pass. Handles acronyms and complex capitalization patterns.
Brief-details: Advanced speaker segmentation model for voice activity detection and overlapped speech detection, based on pyannote.audio 2.0 framework with MIT license
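A minimal sketch of overlapped speech detection built on this segmentation model, assuming the checkpoint is pyannote/segmentation (a gated repo requiring a Hugging Face token); the threshold values are illustrative, not tuned:

```python
# Sketch: overlapped speech detection with pyannote.audio, assuming the
# checkpoint is pyannote/segmentation (gated; needs a Hugging Face token).
from pyannote.audio import Model
from pyannote.audio.pipelines import OverlappedSpeechDetection

model = Model.from_pretrained("pyannote/segmentation", use_auth_token="HF_TOKEN")
pipeline = OverlappedSpeechDetection(segmentation=model)
# Illustrative hyperparameters; tune them on a development set.
pipeline.instantiate({"onset": 0.5, "offset": 0.5,
                      "min_duration_on": 0.1, "min_duration_off": 0.1})

overlap = pipeline("conversation.wav")
for segment in overlap.get_timeline().support():
    print(f"overlapped speech: {segment.start:.1f}s - {segment.end:.1f}s")
```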
Brief-details: Voice Activity Detection (VAD) model powered by pyannote.audio 2.1, offering precise speech detection in audio files with MIT license and 286K+ downloads.
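As a usage sketch, assuming the checkpoint is pyannote/voice-activity-detection and that you have a Hugging Face access token for its gated repo:

```python
# Sketch: running the pyannote voice-activity-detection pipeline on a file.
from pyannote.audio import Pipeline

pipeline = Pipeline.from_pretrained(
    "pyannote/voice-activity-detection",
    use_auth_token="HF_TOKEN",  # replace with your token
)

vad = pipeline("audio.wav")  # returns an Annotation of speech regions
for segment in vad.get_timeline().support():
    print(f"speech from {segment.start:.1f}s to {segment.end:.1f}s")
```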
Brief-details: HHEM-2.1-Open: a 110M-parameter hallucination detection model for evaluating LLM outputs, outperforming GPT-3.5/4 at this task while using far fewer compute resources
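A rough scoring sketch, assuming the checkpoint is vectara/hallucination_evaluation_model and that its custom modeling code exposes a predict() helper for premise/hypothesis pairs:

```python
# Sketch: factual-consistency scoring with HHEM; the checkpoint name and the
# predict() helper (shipped via the repo's custom code) are assumptions.
from transformers import AutoModelForSequenceClassification

model = AutoModelForSequenceClassification.from_pretrained(
    "vectara/hallucination_evaluation_model", trust_remote_code=True
)
pairs = [
    ("The capital of France is Paris.", "Paris is the capital of France."),
    ("The capital of France is Paris.", "The capital of France is Lyon."),
]
scores = model.predict(pairs)  # ~1.0 = consistent, ~0.0 = likely hallucinated
print(scores)
```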
Brief-details: Instruction-tuned text embedding model that generates task-specific embeddings via natural language prompts, achieving SOTA on 70+ embedding tasks.
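A sketch of prompt-based embedding, assuming this is the hkunlp/instructor family (the model id below is an assumption):

```python
# Sketch: task-specific embeddings via natural-language instructions
# (pip install InstructorEmbedding sentence-transformers).
from InstructorEmbedding import INSTRUCTOR

model = INSTRUCTOR("hkunlp/instructor-large")  # model id is an assumption
pairs = [
    ["Represent the scientific title for retrieval:", "Attention Is All You Need"],
    ["Represent the question for retrieving supporting documents:", "What is self-attention?"],
]
embeddings = model.encode(pairs)  # one vector per [instruction, text] pair
print(embeddings.shape)
```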
Brief-details: Stable Diffusion v2 depth-aware model that enables depth-controlled image generation and modification, building on SD2-base with MiDaS integration
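A minimal depth-to-image sketch with diffusers, assuming the checkpoint is stabilityai/stable-diffusion-2-depth and a CUDA GPU is available:

```python
# Sketch: depth-conditioned image generation with diffusers.
import torch
from diffusers import StableDiffusionDepth2ImgPipeline
from PIL import Image

pipe = StableDiffusionDepth2ImgPipeline.from_pretrained(
    "stabilityai/stable-diffusion-2-depth", torch_dtype=torch.float16
).to("cuda")

init_image = Image.open("room.jpg")
result = pipe(prompt="a cozy cabin interior", image=init_image, strength=0.7).images[0]
result.save("depth_guided.png")
```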
Brief-details: BART-based model fine-tuned for keyphrase generation across scientific and news domains, with 291K+ downloads and support for multiple datasets
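The exact checkpoint is not named here, so the model id below is a placeholder; a BART-based keyphrase generator is typically driven through the text2text-generation pipeline:

```python
# Sketch: keyphrase generation with a seq2seq pipeline; the model id is a
# placeholder for whichever BART keyphrase checkpoint is being described.
from transformers import pipeline

generator = pipeline("text2text-generation", model="your-org/bart-keyphrase-generation")
abstract = ("Transformer language models have become the dominant approach "
            "to natural language processing across scientific and news text.")
print(generator(abstract, max_new_tokens=32)[0]["generated_text"])
```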
Brief-details: Large-scale Portuguese BERT model (335M params) by neuralmind, optimized for Brazilian Portuguese NLP tasks with state-of-the-art performance
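A quick masked-language-model check, assuming the checkpoint is neuralmind/bert-large-portuguese-cased:

```python
# Sketch: fill-mask with BERTimbau Large; the checkpoint id is an assumption.
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="neuralmind/bert-large-portuguese-cased")
for pred in fill_mask("Tinha uma [MASK] no meio do caminho."):
    print(pred["token_str"], round(pred["score"], 3))
```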
Brief-details: Efficient 33M parameter embedding model supporting 8K sequence length, built on BERT with ALiBi. Optimized for English text embeddings and RAG applications.
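An encoding sketch, assuming the checkpoint is jinaai/jina-embeddings-v2-small-en, whose repo ships custom modeling code that exposes an encode() helper:

```python
# Sketch: long-context English embeddings; the checkpoint name and the
# encode() helper (from the repo's custom code) are assumptions.
from transformers import AutoModel

model = AutoModel.from_pretrained("jinaai/jina-embeddings-v2-small-en",
                                  trust_remote_code=True)
embeddings = model.encode([
    "A query about retrieval-augmented generation.",
    "A passage describing RAG pipelines.",
])
print(embeddings.shape)  # (2, embedding_dim)
```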
Brief-details: BLIP-2 vision-language model with 7.75B parameters, combining a CLIP-like image encoder and the OPT-6.7b LLM for image captioning and VQA tasks. MIT licensed.
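A captioning sketch via transformers, assuming the checkpoint is Salesforce/blip2-opt-6.7b:

```python
# Sketch: image captioning with BLIP-2 (checkpoint id is an assumption).
import torch
from transformers import Blip2Processor, Blip2ForConditionalGeneration
from PIL import Image

processor = Blip2Processor.from_pretrained("Salesforce/blip2-opt-6.7b")
model = Blip2ForConditionalGeneration.from_pretrained(
    "Salesforce/blip2-opt-6.7b", torch_dtype=torch.float16, device_map="auto"
)

image = Image.open("photo.jpg")
inputs = processor(images=image, return_tensors="pt").to(model.device, torch.float16)
out = model.generate(**inputs, max_new_tokens=30)
print(processor.decode(out[0], skip_special_tokens=True))
```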
Brief-details: English-to-Spanish translation model with a BLEU score of 54.9, trained on OPUS data and built on a transformer architecture.
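A translation sketch, assuming the checkpoint is Helsinki-NLP/opus-mt-en-es (a Marian model):

```python
# Sketch: English-to-Spanish translation with the transformers pipeline.
from transformers import pipeline

translator = pipeline("translation", model="Helsinki-NLP/opus-mt-en-es")
print(translator("The weather is nice today.")[0]["translation_text"])
```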
Brief-details: A fine-tuned XLSR-53 large model for Persian speech recognition, achieving 30.12% WER and 7.37% CER on the Common Voice dataset. Supports 16kHz audio input.
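A CTC decoding sketch; the model id below is a placeholder for the Persian XLSR-53 checkpoint, and the audio must be 16 kHz mono:

```python
# Sketch: transcription with a fine-tuned XLSR-53 checkpoint (placeholder id).
import torch
import librosa
from transformers import Wav2Vec2Processor, Wav2Vec2ForCTC

model_id = "your-org/wav2vec2-large-xlsr-53-persian"  # placeholder
processor = Wav2Vec2Processor.from_pretrained(model_id)
model = Wav2Vec2ForCTC.from_pretrained(model_id)

speech, _ = librosa.load("sample.wav", sr=16_000)  # resample to 16 kHz
inputs = processor(speech, sampling_rate=16_000, return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits
pred_ids = torch.argmax(logits, dim=-1)
print(processor.batch_decode(pred_ids)[0])
```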
Brief-details: A powerful 66.4M parameter DistilBERT model trained on 215M question-answer pairs, optimized for semantic search and sentence similarity tasks.
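A semantic search sketch with sentence-transformers; the checkpoint name is an assumption (multi-qa-distilbert-cos-v1 matches the description):

```python
# Sketch: ranking passages against a query with cosine similarity.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("sentence-transformers/multi-qa-distilbert-cos-v1")
query_emb = model.encode("How do I reset my password?")
doc_embs = model.encode([
    "Click 'Forgot password' on the login page.",
    "Our office is open Monday to Friday.",
])
scores = util.cos_sim(query_emb, doc_embs)
print(scores)  # higher score = more relevant passage
```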
Brief-details: Qwen1.5-0.5B is a 620M-parameter transformer language model offering a 32K context length and enhanced multilingual capabilities.
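A minimal generation sketch with transformers, using the Qwen/Qwen1.5-0.5B checkpoint:

```python
# Sketch: text generation with Qwen1.5-0.5B (requires a recent transformers).
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen1.5-0.5B")
model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen1.5-0.5B")

inputs = tokenizer("The capital of France is", return_tensors="pt")
output = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```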
Brief-details: GPT-2 XL: 1.5B-parameter transformer language model by OpenAI and the largest GPT-2 variant, offering strong open-ended text generation after extensive pre-training on web text.
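A one-liner generation sketch using the gpt2-xl checkpoint via the pipeline API:

```python
# Sketch: open-ended text generation with GPT-2 XL.
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2-xl")
print(generator("Once upon a time,", max_new_tokens=40)[0]["generated_text"])
```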
Brief-details: A powerful English speech recognition model with 764M parameters, trained on 680k hours of data, achieving 4.12% WER on LibriSpeech clean test set.
Brief-details: Pre-trained Spanish BERT model using Whole Word Masking, achieving SOTA results on Spanish NLP tasks with 309K+ downloads and strong benchmark performance.
Brief-details: SAM-ViT-Large is a powerful vision segmentation model with 312M parameters, capable of generating high-quality object masks from input prompts such as points or boxes, with strong zero-shot performance.
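A prompted-mask sketch via transformers, assuming the checkpoint is facebook/sam-vit-large and using a single 2D point as the prompt:

```python
# Sketch: point-prompted mask generation with SAM.
import torch
from transformers import SamModel, SamProcessor
from PIL import Image

processor = SamProcessor.from_pretrained("facebook/sam-vit-large")
model = SamModel.from_pretrained("facebook/sam-vit-large")

image = Image.open("scene.jpg").convert("RGB")
input_points = [[[450, 600]]]  # one (x, y) point on the object of interest
inputs = processor(image, input_points=input_points, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)
masks = processor.image_processor.post_process_masks(
    outputs.pred_masks, inputs["original_sizes"], inputs["reshaped_input_sizes"]
)
print(masks[0].shape)  # predicted binary masks for the prompted object
```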
Brief-details: A powerful latent diffusion model for 4x image super-resolution, developed by CompVis. Specializes in high-quality upscaling while being computationally efficient.
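An upscaling sketch with diffusers, assuming the checkpoint is CompVis/ldm-super-resolution-4x-openimages:

```python
# Sketch: 4x super-resolution with the latent diffusion upscaler.
from diffusers import LDMSuperResolutionPipeline
from PIL import Image

pipe = LDMSuperResolutionPipeline.from_pretrained(
    "CompVis/ldm-super-resolution-4x-openimages"
)
low_res = Image.open("low_res.png").convert("RGB").resize((128, 128))
upscaled = pipe(low_res, num_inference_steps=100, eta=1.0).images[0]
upscaled.save("upscaled_4x.png")
```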
Brief-details: 4-bit quantized version of Meta's Llama 3.1 8B model optimized for efficiency, featuring multilingual capabilities and 128k context window
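One way to obtain a 4-bit model is to quantize the base checkpoint at load time with bitsandbytes; the base model id below is an assumption and the repo is gated (requires accepting Meta's license), with a GPU needed for 4-bit inference:

```python
# Sketch: loading Llama 3.1 8B in 4-bit via bitsandbytes (base id is an assumption).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

bnb = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_compute_dtype=torch.bfloat16)
model_id = "meta-llama/Llama-3.1-8B-Instruct"  # assumption; gated repo

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, quantization_config=bnb, device_map="auto"
)

inputs = tokenizer("Explain why 4-bit quantization saves memory:",
                   return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```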
Brief-details: RAD-DINO is an 86.6M parameter vision transformer model specialized in chest X-ray encoding, trained using self-supervised DINOv2 methodology across 882,775 medical images
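A feature-extraction sketch, assuming the checkpoint is microsoft/rad-dino and that it loads through the standard AutoModel/AutoImageProcessor classes:

```python
# Sketch: encoding a chest X-ray with RAD-DINO (checkpoint id is an assumption).
import torch
from transformers import AutoImageProcessor, AutoModel
from PIL import Image

processor = AutoImageProcessor.from_pretrained("microsoft/rad-dino")
model = AutoModel.from_pretrained("microsoft/rad-dino")

image = Image.open("chest_xray.png").convert("RGB")
inputs = processor(images=image, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)
cls_embedding = outputs.pooler_output  # one vector summarizing the image
print(cls_embedding.shape)
```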