Brief Details: Multilingual phone recognition model using the Wav2Vec2 architecture, supporting 6 languages with a low average phone error rate (9.2% PER). Optimized for pathological speech analysis.
Brief Details: Quantized version of Microsoft's Phi-3 medium model with 128k context window, offering multiple compression options from 3.7GB to 55.8GB with varying quality-size tradeoffs.
Brief Details: LLaVA-OneVision is an 8.03B parameter multimodal LLM that combines a Qwen2 language model with a vision encoder for single-image, multi-image, and video tasks.
Brief Details: Text-to-image model focused on photorealistic generation, with 43.7K+ downloads. Built on Stable Diffusion and supporting styles ranging from abstract to realistic.
Brief Details: ControlNet inpainting model for Stable Diffusion v1.5, enabling precise editing of masked regions with conditional control over generation.
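A minimal diffusers sketch of how such an inpainting ControlNet is typically wired up. The checkpoint IDs (lllyasviel/control_v11p_sd15_inpaint, runwayml/stable-diffusion-v1-5), the placeholder images, and the make_inpaint_condition helper are illustrative assumptions, not details taken from this entry.

```python
import numpy as np
import torch
from PIL import Image
from diffusers import ControlNetModel, StableDiffusionControlNetInpaintPipeline

# Checkpoint IDs are assumptions for a typical SD-1.5 inpaint ControlNet setup.
controlnet = ControlNetModel.from_pretrained(
    "lllyasviel/control_v11p_sd15_inpaint", torch_dtype=torch.float16
)
pipe = StableDiffusionControlNetInpaintPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", controlnet=controlnet, torch_dtype=torch.float16
).to("cuda")  # assumes a CUDA GPU is available

# Placeholder inputs; in practice load a real photo and a mask marking the region to repaint.
image = Image.new("RGB", (512, 512), "gray")
mask = Image.new("L", (512, 512), 0)  # white (255) pixels are the ones to regenerate

def make_inpaint_condition(img, mask_img):
    # Masked pixels are set to -1 so the ControlNet sees which region it should fill.
    arr = np.array(img.convert("RGB")).astype(np.float32) / 255.0
    m = np.array(mask_img.convert("L")).astype(np.float32) / 255.0
    arr[m > 0.5] = -1.0
    return torch.from_numpy(arr).permute(2, 0, 1).unsqueeze(0)

result = pipe(
    "a red leather sofa in a bright living room",
    image=image,
    mask_image=mask,
    control_image=make_inpaint_condition(image, mask),
    num_inference_steps=30,
).images[0]
result.save("inpainted.png")
```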
Brief Details: Large-scale BERT model (335M params) for dense retrieval, pretrained on BEIR document corpora and fine-tuned on MS MARCO, with a focus on handling distribution shift.
Brief Details: A 10B parameter Gemma-based creative writing model with enhanced prose generation capabilities through Brainstorm 5x technology. Optimized for fiction, storytelling, and vivid descriptions.
Brief Details: CodeBERTa-small-v1 is a 6-layer, 84M-parameter RoBERTa-like model trained on CodeSearchNet, optimized for code understanding across 6 programming languages.
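A short sketch of masked-token prediction on code with this model, assuming the checkpoint is published as huggingface/CodeBERTa-small-v1 and uses the RoBERTa-style <mask> token.

```python
from transformers import pipeline

# Checkpoint ID and mask token are assumptions based on the description above.
fill_mask = pipeline("fill-mask", model="huggingface/CodeBERTa-small-v1")

# The model should rank plausible tokens (e.g. "return") for the masked position.
for pred in fill_mask("def add(a, b): <mask> a + b"):
    print(pred["token_str"], round(pred["score"], 3))
```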
Brief Details: Erlangshen-Roberta-330M-Sentiment is a fine-tuned Chinese RoBERTa model with 326M parameters, specialized in sentiment analysis and trained on 8 datasets totaling 227,347 samples.
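A minimal usage sketch via the transformers text-classification pipeline, assuming the checkpoint is published as IDEA-CCNL/Erlangshen-Roberta-330M-Sentiment; the exact label names returned depend on the model's config.

```python
from transformers import pipeline

# Checkpoint ID is an assumption; label names come from the model's own config.
classifier = pipeline("text-classification",
                      model="IDEA-CCNL/Erlangshen-Roberta-330M-Sentiment")

print(classifier("今天的服务太棒了，非常满意！"))    # "The service today was great, very satisfied!"
print(classifier("等了两个小时还没发货，很失望。"))  # "Waited two hours and it still hasn't shipped, very disappointed."
```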
Brief Details: OpenAI GPT-1: pioneering 120M-parameter transformer model for language understanding. The first model in OpenAI's GPT series, released under the MIT license and demonstrating early zero-shot capabilities.
Brief Details: MedCPT-Article-Encoder is a 109M-parameter transformer model for generating biomedical text embeddings, trained on 255M PubMed query-article pairs.
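A sketch of generating article embeddings with this encoder, assuming the checkpoint is ncbi/MedCPT-Article-Encoder and that the CLS-token vector is used as the article representation (the usual bi-encoder recipe).

```python
import torch
from transformers import AutoTokenizer, AutoModel

# Checkpoint ID and CLS pooling are assumptions based on the description above.
model_id = "ncbi/MedCPT-Article-Encoder"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModel.from_pretrained(model_id).eval()

articles = [
    "Remdesivir for the treatment of Covid-19: final report of a randomized trial.",
    "Efficacy of metformin in type 2 diabetes: a systematic review and meta-analysis.",
]
inputs = tokenizer(articles, padding=True, truncation=True, max_length=512, return_tensors="pt")
with torch.no_grad():
    embeddings = model(**inputs).last_hidden_state[:, 0, :]  # CLS-token embeddings
print(embeddings.shape)  # (2, hidden_size)
```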
Brief Details: A state-of-the-art depth estimation model with 345M parameters, combining relative and metric depth estimation using the DPT framework. MIT licensed.
Brief Details: Japanese DeBERTa V2 base model (122M params) pre-trained on Wikipedia, CC-100, and OSCAR. Features character-level tokenization and whole word masking for advanced NLP tasks.
Brief Details: InstructPLM protein design model with 6.57B parameters, combining ProGen2 and ProteinMPNN architectures for accurate protein sequence generation conditioned on backbone structures.
Brief Details: Lightweight text-to-speech model with 878M parameters, capable of generating natural speech with controllable features like gender, speed, and pitch. Apache 2.0 licensed.
Brief Details: DistilBERT model fine-tuned on the MNLI dataset for zero-shot classification. 67M parameters, English-only, achieving 82% accuracy on MNLI.
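A typical zero-shot classification call with such a checkpoint. The model ID below (typeform/distilbert-base-uncased-mnli) is an assumption; any DistilBERT model fine-tuned on MNLI slots in the same way.

```python
from transformers import pipeline

# Checkpoint ID is an assumption; swap in the MNLI-fine-tuned DistilBERT described above.
classifier = pipeline("zero-shot-classification",
                      model="typeform/distilbert-base-uncased-mnli")

result = classifier(
    "The new graphics driver cut rendering time in half.",
    candidate_labels=["technology", "sports", "politics"],
)
print(result["labels"][0], round(result["scores"][0], 3))
```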
Brief Details: Pegasus-large: Google's abstractive summarization transformer, trained with the mixed & stochastic recipe on the C4 and HugeNews datasets and achieving state-of-the-art summarization results.
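A minimal summarization call, assuming the checkpoint is published as google/pegasus-large.

```python
from transformers import pipeline

# Checkpoint ID assumed to be google/pegasus-large.
summarizer = pipeline("summarization", model="google/pegasus-large")

article = ("The city council met on Tuesday to review the proposed transit budget. "
           "Members debated infrastructure spending for several hours before agreeing "
           "to postpone the final vote until next month.")
print(summarizer(article, max_length=32, min_length=8)[0]["summary_text"])
```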
Brief Details: Neural machine translation model for Finnish-to-English translation with a strong BLEU score (53.4 on the Tatoeba test set), built by Helsinki-NLP on the transformer architecture.
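A usage sketch with the transformers translation pipeline, assuming the checkpoint is Helsinki-NLP/opus-mt-fi-en; the Thai-to-English model in the next entry is used the same way with its own checkpoint ID.

```python
from transformers import pipeline

# Checkpoint ID assumed to be Helsinki-NLP/opus-mt-fi-en.
translator = pipeline("translation", model="Helsinki-NLP/opus-mt-fi-en")

# Finnish input: "Good morning, how are you?"
print(translator("Hyvää huomenta, mitä sinulle kuuluu?")[0]["translation_text"])
```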
Brief Details: A Thai-to-English translation model by Helsinki-NLP using the transformer-align architecture, achieving a 48.1 BLEU score on the Tatoeba test set.
Brief Details: Universal Sentence Encoder for Russian (USER-bge-m3) - 359M parameter model for Russian text embeddings with 1024-dimensional vectors, based on BGE-M3.
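A sentence-transformers sketch for producing the 1024-dimensional Russian embeddings, assuming the checkpoint is published as deepvk/USER-bge-m3.

```python
from sentence_transformers import SentenceTransformer

# Checkpoint ID is an assumption based on the description above.
model = SentenceTransformer("deepvk/USER-bge-m3")

# Russian inputs: "Moscow is the capital of Russia." / "The weather is sunny today."
sentences = ["Москва - столица России.", "Сегодня солнечная погода."]
embeddings = model.encode(sentences, normalize_embeddings=True)
print(embeddings.shape)  # expected (2, 1024) given the 1024-dimensional vectors noted above
```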
Brief Details: PhotoMaker is a text-to-image model that creates customized photos of a subject from reference face images and text prompts, featuring stacked ID embeddings and SDXL compatibility.