Brief Details: BLIP visual question-answering model with 385M parameters, applying joint vision-language understanding to image-based Q&A tasks. Built by Salesforce with PyTorch support.
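A minimal VQA usage sketch with the transformers pipeline; the repo id Salesforce/blip-vqa-base and the image path are assumptions for illustration, since the summary above does not pin an exact checkpoint.

```python
# Hedged sketch: repo id and image file are assumed, not confirmed by the summary above.
from transformers import pipeline

vqa = pipeline("visual-question-answering", model="Salesforce/blip-vqa-base")

# Ask a free-form question about a local image file.
answers = vqa(image="street_scene.jpg", question="How many people are in the picture?")
print(answers)  # list of {"answer": ..., "score": ...} candidates
```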
Brief Details: Distil-large-v3 is a 756M parameter English speech recognition model, 6.3x faster than Whisper large-v3 with comparable accuracy, and optimized for long-form transcription.
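A minimal long-form transcription sketch; the repo id distil-whisper/distil-large-v3 and the audio path are assumptions for illustration.

```python
# Hedged sketch: repo id and audio file are assumed for illustration.
from transformers import pipeline

asr = pipeline("automatic-speech-recognition", model="distil-whisper/distil-large-v3")

# chunk_length_s turns on chunked long-form transcription in the pipeline.
result = asr("meeting_recording.wav", chunk_length_s=25)
print(result["text"])
```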
Brief Details: DPT-Hybrid (MiDaS 3.0) is Intel's state-of-the-art monocular depth estimation model using Vision Transformers, trained on 1.4M images with impressive zero-shot capabilities.
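A minimal depth-estimation sketch; the repo id Intel/dpt-hybrid-midas and the image path are assumptions for illustration.

```python
# Hedged sketch: repo id and input image are assumed for illustration.
from transformers import pipeline

depth = pipeline("depth-estimation", model="Intel/dpt-hybrid-midas")

result = depth("room.jpg")                 # any RGB image path or URL
result["depth"].save("room_depth.png")     # PIL image of the predicted depth map
print(result["predicted_depth"].shape)     # raw depth tensor
```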
Brief Details: Creative-writing control vectors for fine-grained control of text generation, supporting multiple LLMs with GGUF format compatibility and steering along defined control axes.
Brief Details: Qwen1.5-1.8B is a 1.84B parameter transformer-based language model, part of Qwen2's beta release, featuring 32K context length and improved multilingual capabilities.
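A minimal text-generation sketch for the base (non-chat) checkpoint; the repo id Qwen/Qwen1.5-1.8B is an assumption for illustration.

```python
# Hedged sketch: repo id assumed; this is plain continuation, not chat formatting.
from transformers import pipeline

generate = pipeline("text-generation", model="Qwen/Qwen1.5-1.8B")

prompt = "Small language models are useful because"
print(generate(prompt, max_new_tokens=64, do_sample=False)[0]["generated_text"])
```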
Brief Details: CodeT5+ 110M embedding model for code understanding - generates 256-dim code embeddings with strong performance on code retrieval tasks.
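A minimal embedding sketch following the usual CodeT5+ pattern; the repo id Salesforce/codet5p-110m-embedding is an assumption for illustration, and the custom model class needs trust_remote_code=True.

```python
# Hedged sketch: repo id assumed; the model's forward pass returns the embedding directly.
import torch
from transformers import AutoModel, AutoTokenizer

checkpoint = "Salesforce/codet5p-110m-embedding"
tokenizer = AutoTokenizer.from_pretrained(checkpoint, trust_remote_code=True)
model = AutoModel.from_pretrained(checkpoint, trust_remote_code=True)

code = "def add(a, b):\n    return a + b"
inputs = tokenizer.encode(code, return_tensors="pt")
with torch.no_grad():
    embedding = model(inputs)[0]   # 256-dimensional vector for the snippet
print(embedding.shape)             # torch.Size([256])
```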
Brief Details: Powerful multilingual speech encoder (580M params) supporting 96 languages, pre-trained on 4.5M hours of audio data. Ideal for feature extraction and ASR tasks.
Brief Details: Efficient vision-language model (1.87B params) for edge devices, capable of VQA tasks with strong benchmark performance and an Apache 2.0 license.
Brief Details: A powerful 6B parameter language model trained on The Pile dataset, offering strong performance in text generation and NLP tasks; the weights are publicly available.
Brief Details: A specialized BERT model for chest X-ray radiology, achieving SOTA results in radiology NLI tasks with improved vocabulary and multi-modal capabilities.
Brief Details: A Chinese BERT model utilizing Whole Word Masking, developed by the HFL team. Features an improved masking strategy for Chinese text; 256K+ downloads. Apache 2.0 licensed.
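A minimal masked-language-model sketch; the repo id hfl/chinese-bert-wwm-ext is an assumption for illustration (HFL publishes several WWM variants).

```python
# Hedged sketch: repo id assumed; whole-word masking only affects pre-training,
# so inference uses the ordinary [MASK] token.
from transformers import pipeline

fill = pipeline("fill-mask", model="hfl/chinese-bert-wwm-ext")

for pred in fill("今天天气很[MASK]。"):
    print(pred["token_str"], round(pred["score"], 3))
```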
Brief Details: A lightweight 160M parameter LLaMA-like model trained on Wikipedia and C4 datasets, designed for speculative inference acceleration research.
Brief Details: ATTACK-BERT is a specialized cybersecurity sentence-transformer model for analyzing and comparing attack-related text, with a strong focus on semantic similarity.
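A minimal semantic-similarity sketch with sentence-transformers; the repo id basel/ATTACK-BERT is an assumption for illustration.

```python
# Hedged sketch: repo id assumed; cosine similarity compares two attack descriptions.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("basel/ATTACK-BERT")

sentences = [
    "The malware exfiltrates credentials over an encrypted channel.",
    "The attacker steals passwords and sends them to a remote server.",
]
emb = model.encode(sentences, convert_to_tensor=True)
print(util.cos_sim(emb[0], emb[1]).item())
```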
Brief Details: GLM-4V-9B is a powerful 13.9B parameter multimodal LLM with high-resolution image understanding capabilities and superior performance in Chinese/English tasks.
Brief Details: A traditional Chinese BERT model specialized in word segmentation tasks, part of CKIP Lab's NLP toolkit. Features GPL-3.0 license and extensive downloads (257k+).
Brief Details: GPTQ-quantized version of StarCoder2-15B optimized for code generation; achieves 77.4% pass@1 on HumanEval-Python and uses the Alpaca instruction format.
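A hedged sketch of Alpaca-style prompting against a GPTQ checkpoint; the repo id below is a placeholder (the summary does not name one), and loading GPTQ weights requires the optimum/auto-gptq extras plus a GPU.

```python
# Hedged sketch: repo id is a placeholder; the Alpaca prompt template matches the summary above.
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "example-org/starcoder2-15b-instruct-GPTQ"  # placeholder, not a confirmed repo
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id, device_map="auto")

prompt = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\nWrite a Python function that reverses a string.\n\n"
    "### Response:\n"
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```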
Brief Details: A 2.7B parameter GPT-style language model trained on The Pile dataset, capable of sophisticated text generation with strong performance on various NLP tasks.
Brief Details: German BERT model for sentiment analysis with 109M params. Achieves 96.39% F1 score across datasets. Supports positive/negative/neutral classification.
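A minimal sentiment-classification sketch; the repo id oliverguhr/german-sentiment-bert is an assumption for illustration (the summary above does not name the checkpoint).

```python
# Hedged sketch: repo id assumed; the model predicts positive/negative/neutral labels.
from transformers import pipeline

classify = pipeline("text-classification", model="oliverguhr/german-sentiment-bert")
print(classify("Das Essen war hervorragend!"))
```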
Brief Details: BERT-large model fine-tuned for Named Entity Recognition, achieving 91.7% F1 score on CoNLL-2003. Identifies LOC, ORG, PER, MISC entities.
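A minimal NER sketch; the repo id dslim/bert-large-NER is an assumption for illustration (the summary above does not name the checkpoint).

```python
# Hedged sketch: repo id assumed; aggregation merges word pieces into whole entities.
from transformers import pipeline

ner = pipeline("token-classification", model="dslim/bert-large-NER", aggregation_strategy="simple")
for entity in ner("Angela Merkel visited the Siemens plant in Munich."):
    print(entity["entity_group"], entity["word"], round(entity["score"], 3))
```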
Brief Details: Turkish NER model using BERT, trained on MilliyetNER dataset. Achieves 96% F1-score for entity recognition. Supports person, location, and organization detection.
Brief Details: Multilingual text embedding model supporting 94+ languages, fine-tuned with instructions. 560M parameters, optimized for retrieval and classification tasks.
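A hedged retrieval sketch; the repo id intfloat/multilingual-e5-large-instruct and the "Instruct: ... Query: ..." prefix convention are assumptions for illustration (the summary above does not name the checkpoint).

```python
# Hedged sketch: repo id and instruction prefix assumed; queries get the instruction, passages do not.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("intfloat/multilingual-e5-large-instruct")

query = "Instruct: Given a web search query, retrieve relevant passages\nQuery: how do solar panels work"
passages = [
    "Solar panels convert sunlight into electricity using photovoltaic cells.",
    "The Eiffel Tower was completed in 1889.",
]
scores = util.cos_sim(model.encode([query]), model.encode(passages))
print(scores)
```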