Brief-details: An 8.4B parameter multimodal AI model that processes interleaved image-text sequences, featuring enhanced OCR capabilities and native-resolution image handling.
Brief-details: Qwen-Audio-Chat is an 8.4B parameter multimodal AI model that processes audio (speech, music, sounds) and text inputs, enabling natural conversations and audio analysis through multi-turn dialogues.
Brief-details: A 7B parameter LLM that unifies text representation (embeddings) and text generation, built on the Mistral architecture with SOTA performance on both embedding and generation tasks.
Brief-details: FP8-quantized 90B parameter multimodal LLM supporting text+image input across 8 languages; quantization roughly halves the memory footprint while maintaining performance.
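The memory saving above follows directly from bit-width arithmetic: going from 16-bit to 8-bit weights halves the bytes needed to store them. A minimal back-of-envelope sketch (illustrative only; real deployments add overhead for activations, KV cache, and unquantized layers):

```python
# Illustrative weight-storage arithmetic for FP8 vs. BF16 quantization.
# Real memory use also includes activations, KV cache, and any layers
# kept in higher precision, so treat these as lower bounds.
def weight_memory_gb(n_params: float, bits_per_param: int) -> float:
    """Approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return n_params * bits_per_param / 8 / 1e9

bf16 = weight_memory_gb(90e9, 16)  # 180.0 GB
fp8 = weight_memory_gb(90e9, 8)    # 90.0 GB
print(f"BF16: {bf16:.0f} GB, FP8: {fp8:.0f} GB, saving {1 - fp8 / bf16:.0%}")
```

This is why FP8 checkpoints of 90B-class models advertise roughly a 50% smaller footprint.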
Brief-details: LLaVA-OneVision 0.5B, a multimodal model built on Qwen2 (894M total parameters) that processes both images and videos. Supports English/Chinese interaction.
Brief-details: A visual retrieval model based on Qwen2-VL-2B-Instruct that uses the ColBERT strategy for efficient document indexing and retrieval.
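The "ColBERT strategy" mentioned above is late interaction: every query token embedding is matched against its best document token embedding, and the per-token maxima are summed (MaxSim). A minimal pure-Python sketch with toy 2-D vectors (a real retriever would use model-produced, normalized embeddings):

```python
# ColBERT-style late-interaction (MaxSim) scoring sketch.
# Embeddings here are toy hard-coded 2-D lists, not model outputs.
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def maxsim_score(query_embs, doc_embs):
    """Sum over query tokens of the max dot product with any doc token."""
    return sum(max(dot(q, d) for d in doc_embs) for q in query_embs)

query = [[1.0, 0.0], [0.0, 1.0]]
doc_a = [[0.9, 0.1], [0.2, 0.8]]   # matches both query tokens well
doc_b = [[0.5, 0.5]]               # weaker match for each token
scores = {name: maxsim_score(query, d) for name, d in [("a", doc_a), ("b", doc_b)]}
print(scores)  # doc_a outscores doc_b
```

Because scoring decomposes per token, document token embeddings can be indexed once and reused across queries, which is what makes this style of retrieval efficient.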
Brief-details: ParsBERT - A Persian BERT model trained on 2M+ documents, achieving SOTA performance in sentiment analysis, text classification, and NER tasks.
Brief-details: A Wav2Vec2 configuration for Habana's Gaudi HPU processors that streamlines speech recognition model deployment, with mixed-precision training support and custom operator implementations.
Brief-details: Optimized 400M parameter language model with a fused Matryoshka layer for efficient embedding generation while maintaining strong performance on MTEB benchmarks.
Brief-details: ResNet-18 computer vision model featuring 11.7M parameters, ReLU activations, and 7x7 convolutions. Trained on ImageNet-1k with 69.76% top-1 accuracy.
Brief-details: Lightweight text-to-speech model with 647M parameters, capable of generating natural speech with controllable features like gender, pitch, and speed.
Brief-details: A versatile text-to-image model merging Freedom and MangledMerge3, optimized for both anime and general art on the SD2.1 architecture.
Brief-details: Multilingual sentence embedding model supporting 109 languages, based on LaBSE architecture with 472M parameters. Ideal for cross-lingual sentence similarity tasks.
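Cross-lingual similarity with embeddings like LaBSE's comes down to cosine similarity between sentence vectors. A minimal sketch with toy hypothetical 3-D vectors standing in for model outputs (a real pipeline would embed the sentences with the model first):

```python
import math

# Cosine-similarity sketch for comparing sentence embeddings.
# The vectors below are hypothetical stand-ins for encoder outputs.
def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

en = [0.2, 0.9, 0.1]     # toy embedding of an English sentence
fr = [0.25, 0.85, 0.05]  # toy embedding of its French translation
print(round(cosine(en, fr), 3))  # close to 1.0 for translation pairs
```

Well-aligned multilingual encoders place a sentence and its translation near each other, so translation pairs score close to 1.0 while unrelated sentences score much lower.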
Brief-details: NSFW-XL is a specialized LoRA model built on Stable Diffusion XL Base 1.0, focusing on artistic photo-realistic content generation with film photography aesthetics.
Brief-details: A 500M parameter DNA language model trained on the genomes of 850 diverse species, specializing in molecular phenotype prediction and DNA sequence analysis.
Brief-details: Zephyr-7B-Alpha is a 7B parameter LLM fine-tuned from Mistral-7B, with supervised fine-tuning on UltraChat and DPO on UltraFeedback for enhanced chat capabilities.
Brief-details: CRNN-based Persian OCR model optimized for printed text, supporting up to 96 characters with enhanced capabilities for handling mixed LTR/RTL text and special characters.
Brief-details: Clinical BERT model for classifying medical assertions as PRESENT, ABSENT, or POSSIBLE in clinical notes. Fine-tuned on i2b2 challenge data.
Brief-details: Zero123-XL Diffusers is an MIT-licensed generative AI model for converting single images to 3D objects, focused on research applications with emphasis on safety and ethical use.
Brief-details: 13B parameter uncensored LLaMA-based model with multiple GGUF quantizations, optimized for unrestricted responses and creative freedom. Supports CPU/GPU inference.
Brief-details: A multilingual prompt compression model (559M params) that efficiently distills text while preserving meaning, based on the XLM-RoBERTa architecture.
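Extractive prompt compression of this kind can be pictured as token pruning: a compressor scores each token with a keep-probability and drops tokens below a threshold. A minimal sketch with hard-coded scores standing in for model outputs:

```python
# Token-pruning sketch of extractive prompt compression (illustrative).
# The keep-probabilities are hard-coded stand-ins for the per-token
# scores a compressor model would produce.
def compress(tokens, keep_probs, threshold=0.5):
    """Drop tokens whose keep-probability falls below the threshold."""
    return [t for t, p in zip(tokens, keep_probs) if p >= threshold]

tokens = ["Please", "kindly", "summarize", "the", "attached", "report"]
probs = [0.3, 0.2, 0.95, 0.4, 0.7, 0.9]
print(" ".join(compress(tokens, probs)))  # "summarize attached report"
```

Raising the threshold compresses more aggressively; the trade-off is between prompt length (and thus cost) and how much meaning survives.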