Brief Details: German NER model achieving 92.31% F1-score on CoNLL-03, built with FLERT architecture and XLM-R embeddings. Identifies PER, LOC, ORG, MISC entities.
Brief Details: A powerful 560M parameter bilingual (French-English) embedding model based on XLM-RoBERTa, optimized for semantic search and similarity tasks.
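Embedding models like this are typically used by ranking documents by cosine similarity to a query embedding. A minimal, library-free sketch of that ranking step (the toy 3-d vectors below stand in for the model's real high-dimensional outputs, which is an assumption for illustration):

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def rank_by_similarity(query_vec, doc_vecs):
    """Return (index, score) pairs sorted by descending similarity."""
    scores = [(i, cosine_similarity(query_vec, v)) for i, v in enumerate(doc_vecs)]
    return sorted(scores, key=lambda s: s[1], reverse=True)

# Toy 3-d vectors standing in for real embedding-model outputs.
query = [1.0, 0.0, 0.0]
docs = [[0.9, 0.1, 0.0], [0.0, 1.0, 0.0], [0.7, 0.7, 0.0]]
ranking = rank_by_similarity(query, docs)
```

In practice the same ranking runs over vectors produced by the model's encoder; only the similarity logic is shown here.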
Brief Details: Qwen2.5-3B-Instruct is a 3.09B parameter multilingual LLM with 32K context length, optimized for instruction-following and structured outputs.
Brief Details: High-performance diffusion-model acceleration technique supporting 1-8 step inference, compatible with FLUX, SD3, SDXL, and SD1.5 via LoRA.
Brief Details: Vision Transformer (ViT) model with 103M params, trained on ImageNet-21k. Optimized with augmentation & regularization, ideal for image classification.
Brief Details: FLAN-T5-XL is a 2.85B parameter instruction-tuned language model built on T5, capable of multilingual text generation and excelling at zero/few-shot learning tasks.
Brief Details: Swin Transformer vision model with 88.1M params, pre-trained on ImageNet-22k and fine-tuned on ImageNet-1k. Excellent for hierarchical feature extraction and classification.
Brief Details: OpenSora-STDiT-v3 is a 1.21B parameter transformer-based model for AI video generation, part of the Open-Sora project, with an Apache 2.0 license and F32 tensor support.
Brief Details: InstructPix2Pix is a powerful image-to-image editing model (MIT license) that performs precise edits from natural-language instructions. Popular, with 187K+ downloads.
Brief Details: Neural machine translation model for Italian-to-English translation, achieving BLEU scores up to 70.9 on the Tatoeba dataset; built by the Helsinki-NLP team.
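BLEU, the metric quoted above, scores a candidate translation by its clipped n-gram overlap with a reference. A simplified sentence-level sketch in pure Python (real evaluations typically use 4-grams with smoothing, e.g. via sacrebleu; this 2-gram version is for illustration only and assumes a non-empty candidate):

```python
import math
from collections import Counter

def ngrams(tokens, n):
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def bleu(candidate, reference, max_n=2):
    """Simplified sentence-level BLEU: clipped n-gram precisions up to
    max_n, combined by geometric mean, times the brevity penalty."""
    precisions = []
    for n in range(1, max_n + 1):
        cand = Counter(ngrams(candidate, n))
        ref = Counter(ngrams(reference, n))
        overlap = sum((cand & ref).values())  # clipped matches
        total = max(sum(cand.values()), 1)
        precisions.append(overlap / total)
    if min(precisions) == 0:
        return 0.0
    geo_mean = math.exp(sum(math.log(p) for p in precisions) / max_n)
    # Brevity penalty: penalize candidates shorter than the reference.
    bp = 1.0 if len(candidate) > len(reference) else math.exp(1 - len(reference) / len(candidate))
    return bp * geo_mean

score = bleu("the cat is on the mat".split(), "the cat is on the mat".split())
```

A perfect match scores 1.0; any candidate with zero n-gram overlap scores 0.0.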
Brief Details: High-performing vision embedding model with 92.9M parameters, sharing an embedding space with nomic-embed-text-v1. Achieves 70.7% zero-shot accuracy on ImageNet and 62.39% on MTEB.
Brief Details: InstantMesh is a groundbreaking AI model for generating 3D meshes from single images in under 10 seconds, using sparse-view reconstruction and LRM architecture.
Brief Details: Phi-3-small-8k-instruct is a 7B parameter LLM optimized for reasoning and instruction following, featuring an 8K context window and multilingual capabilities.
Brief Details: E5-mistral-7b-instruct is a powerful 7.11B parameter instruction-tuned embedding model built on Mistral-7B, optimized for text embeddings with multilingual capability.
Brief Details: GPT-Neo 1.3B is EleutherAI's 1.37B parameter language model trained on The Pile dataset, offering strong text-generation performance under an MIT license.
Brief Details: Zero-shot named entity recognition model built on the GLiNER architecture, outperforming previous zero-shot models by 3.1% F1-score. MIT licensed.
Brief Details: MobileNet V1 (depth multiplier 0.75, 192x192 input) - Efficient CNN for mobile vision tasks, pre-trained on ImageNet-1k. Optimized for low latency and power consumption.
Brief Details: Vision Transformer-based ethnicity classification model with 79.6% accuracy, trained with AutoTrain; training emitted just 6.02 g of CO2.
Brief Details: High-performance neural audio codec by Meta AI with 59M params, designed for real-time audio compression at 32kHz. Part of MusicGen project.
Brief Details: SigLIP vision model with 877M params, optimized for zero-shot image classification. Uses a sigmoid loss for improved image-text pair processing and batch scaling.
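The sigmoid loss mentioned here treats every image-text pair in a batch as an independent binary classification (matched vs. unmatched), rather than the batch-wide softmax of CLIP-style contrastive training. A toy numerical sketch in pure Python, assuming precomputed logits (the learned temperature and bias terms of the real loss are folded into the logits here):

```python
import math

def sigmoid_loss(logits):
    """SigLIP-style pairwise sigmoid loss on an image-text logit matrix.
    Diagonal entries are positive (matched) pairs, off-diagonal entries
    negative. Because each pair is scored independently, the loss
    decomposes over the batch, which is what enables large batch scaling."""
    n = len(logits)
    total = 0.0
    for i in range(n):
        for j in range(n):
            label = 1.0 if i == j else -1.0
            z = label * logits[i][j]
            total += -math.log(1.0 / (1.0 + math.exp(-z)))  # -log sigmoid(z)
    return total / n

aligned = [[10.0, -10.0], [-10.0, 10.0]]     # matched pairs score high
misaligned = [[-10.0, 10.0], [10.0, -10.0]]  # matched pairs score low
```

Aligned image-text pairs should yield a much smaller loss than misaligned ones.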
Brief Details: ResNet-50 with Group Normalization, trained on ImageNet-1k using the A1 recipe. 25.6M parameters, optimized for image classification with 81.22% top-1 accuracy.
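Group Normalization, which this ResNet-50 variant uses in place of BatchNorm, splits the channels into groups and normalizes each group independently, using no batch statistics. A minimal sketch on a flat activation vector (a real layer operates on 4-D tensors and applies a learned scale and shift, both omitted here):

```python
import math

def group_norm(x, num_groups, eps=1e-5):
    """Group Normalization over a flat list of channel activations.
    Channels are split into num_groups contiguous groups; each group is
    normalized to zero mean and unit variance independently. No batch
    statistics are used, so behavior is identical at train and test time."""
    assert len(x) % num_groups == 0
    size = len(x) // num_groups
    out = []
    for g in range(num_groups):
        group = x[g * size:(g + 1) * size]
        mean = sum(group) / size
        var = sum((v - mean) ** 2 for v in group) / size
        out.extend((v - mean) / math.sqrt(var + eps) for v in group)
    return out

normed = group_norm([1.0, 2.0, 3.0, 4.0], num_groups=2)
```

Each group of two channels is normalized separately, so both groups come out centered at zero regardless of their original scale.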