Models

SFR-Embedding-Mistral

Brief Details: A 7.11B-parameter text embedding model from Salesforce, built on Mistral-7B and optimized for retrieval tasks, with results reported on the MTEB benchmark.

Feature Extraction

ruslandev

llama-3-8b-gpt-4o-ru1.0-gguf

Brief Details: An 8B-parameter LLaMA-3 model fine-tuned for Russian, achieving GPT-3.5-turbo-level performance. Available in GGUF format, with 73K+ downloads.

GGUF

dunzhang

stella-mrl-large-zh-v3.5-1792d

Brief Details: A Chinese-language sentence transformer with variable-dimension embeddings (128-1792d) and 326M parameters, optimized for retrieval and semantic tasks. Achieves strong CMTEB benchmark scores.

Sentence Similarity

llava-hf

llava-v1.6-vicuna-7b-hf

Brief Details: LLaVA-NeXT 7B, a multimodal vision-language model with improved OCR and reasoning capabilities. 7.06B parameters, FP16 precision.

Image-Text-to-Text

google

bigbird-roberta-base

Brief Details: BigBird-RoBERTa base model, a Transformer-based architecture that supports sequences of up to 4096 tokens using block sparse attention. Apache 2.0 licensed.

Transformers

stabilityai

sd-turbo

Brief Details: SD-Turbo is a fast text-to-image model by StabilityAI that generates high-quality images in a single step, distilled from Stable Diffusion 2.1 using Adversarial Diffusion Distillation.

Text-to-Image

charactr

vocos-encodec-24khz

Brief Details: A PyTorch-based neural vocoder for high-quality audio synthesis at 24 kHz, converting acoustic features to waveforms using a GAN-based approach and Fourier transforms.

PyTorch

google

owlv2-large-patch14-ensemble

Brief Details: OWLv2 is a 438M-parameter zero-shot object detection model that uses a CLIP backbone with a ViT-L/14 architecture, enabling text-conditioned object detection.

Zero-Shot Object Detection

jinaai

jina-colbert-v2

Brief Details: A multilingual late-interaction retriever with 559M parameters, supporting 94 languages. Features Matryoshka embeddings and better retrieval performance than v1.

ONNX

wangfuyun

AnimateLCM

Brief Details: AnimateLCM is a computation-efficient text-to-video generation model capable of creating high-quality animated content in just 4 steps, offering fast inference with personalized styling.

Text-to-Video

ByteDance

AnimateDiff-Lightning

Brief Details: A fast text-to-video generation model by ByteDance that runs 10x faster than the original AnimateDiff, with 1-8 step options.

Text-to-Video

sociocom

MedNER-CR-JA

Brief Details: A Japanese medical NER model (110M parameters) for identifying medical entities in clinical text. Supports disease, medication, and temporal annotations.

Token Classification

facebook

wav2vec2-xls-r-1b

Brief Details: Powerful multilingual speech model with 1B parameters, supporting 128 languages. Pre-trained on 436K hours of speech data, ideal for ASR tasks.

Transformers

CAMeL-Lab

bert-base-arabic-camelbert-mix-ner

Brief Details: An Arabic named entity recognition model built on CAMeLBERT-Mix and fine-tuned on the ANERcorp dataset for accurate entity detection in Arabic text.

Token Classification

michellejieli

NSFW_text_classifier

Brief Details: A fine-tuned DistilRoBERTa model for NSFW text classification, trained on 14,317 Reddit posts to detect inappropriate content with binary classification (NSFW/SFW).

Text Classification

facebook

esm2_t30_150M_UR50D

Brief Details: An ESM-2 protein language model with 150M parameters and 30 layers, optimized for masked language modeling of protein sequences. MIT licensed.

Fill-Mask

Yntec

epiCPhotoGasm

Brief Details: A photorealistic text-to-image model optimized for generating high-quality images, especially of people. Features 840K VAE integration and a CreativeML license.

Text-to-Image

facebook

timesformer-base-finetuned-k400

Brief Details: TimeSformer base model fine-tuned on the Kinetics-400 dataset for video classification, implementing space-time attention mechanisms with a transformer architecture.

Video Classification

timm

nfnet_l0.ra2_in1k

Brief Details: NFNet-L0 is a lightweight, normalization-free neural network with 35.1M parameters, optimized for ImageNet classification using scaled weight standardization.

Image Classification

cross-encoder

ms-marco-TinyBERT-L-2

Brief Details: An efficient cross-encoder model for MS MARCO passage ranking, achieving an NDCG@10 of 67.43 on TREC DL 19 and processing 9000 docs/sec on a V100 GPU.

Text Classification

KoboldAI

LLaMA2-13B-Tiefighter-GGUF

Brief Details: A 13B-parameter LLaMA2-based creative writing model optimized for storytelling, chatbots, and adventures, featuring merged capabilities from multiple specialized LoRAs.

GGUF
