Brief Details: Vision Transformer (ViT) model pretrained with MAE on ImageNet-1k. 85.8M params, 224x224 input; self-supervised pretraining yields robust image features.
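A minimal sketch of feature extraction with a ViT-MAE encoder via Hugging Face transformers; the facebook/vit-mae-base checkpoint id and the sample image URL are assumptions, not given in the entry above.

```python
# Sketch: image feature extraction with a ViT-MAE encoder.
# The checkpoint id below is an assumption for illustration.
from transformers import AutoImageProcessor, ViTMAEModel
from PIL import Image
import requests

image = Image.open(requests.get(
    "http://images.cocodataset.org/val2017/000000039769.jpg", stream=True).raw)

processor = AutoImageProcessor.from_pretrained("facebook/vit-mae-base")
# mask_ratio=0.0 disables MAE's random patch masking so every patch is encoded.
model = ViTMAEModel.from_pretrained("facebook/vit-mae-base", mask_ratio=0.0)

inputs = processor(images=image, return_tensors="pt")
features = model(**inputs).last_hidden_state  # (1, 197, 768) at 224x224
```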
Brief Details: A distilled version of bge-base-en-v1.5 that produces fast static text embeddings, offering significant speed improvements while maintaining embedding quality.
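Static embedding models distilled this way are commonly packaged for the model2vec library; the sketch below assumes that format, and the checkpoint id is a placeholder.

```python
# Sketch, assuming a model2vec-style static embedding model;
# the checkpoint id is a placeholder, not taken from the entry above.
from model2vec import StaticModel

model = StaticModel.from_pretrained("minishlab/M2V_base_output")  # placeholder
embeddings = model.encode(["The quick brown fox", "jumps over the lazy dog"])
print(embeddings.shape)  # (2, embedding_dim)
```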
Brief Details: A collection of quantized GGUF variants of Magnum v4 12B, covering multiple quantization levels optimized for different hardware and memory constraints, including high-quality compression options.
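A hedged sketch of running one of these GGUF files with llama-cpp-python; the quant file name is illustrative, not an actual file listed above.

```python
# Sketch: loading a GGUF quant with llama-cpp-python (file name illustrative).
from llama_cpp import Llama

llm = Llama(model_path="magnum-v4-12b-Q4_K_M.gguf", n_ctx=4096)
out = llm("Write a haiku about compression.", max_tokens=64)
print(out["choices"][0]["text"])
```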
Brief Details: 8B-parameter Llama-3 model quantized to INT8, optimized for efficiency while retaining 99.8% of the original model's performance. Suited to commercial and research assistant-style chat applications.
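A sketch of assistant-style chat with a Llama-3-family instruct checkpoint via transformers; the model id is a placeholder, and an INT8 build may instead require a dedicated serving stack such as vLLM.

```python
# Sketch: chat-style generation. The model id is a placeholder; an INT8
# checkpoint may need a runtime with INT8 kernel support (e.g. vLLM).
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "meta-llama/Meta-Llama-3-8B-Instruct"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

messages = [{"role": "user", "content": "Explain INT8 quantization in one line."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt").to(model.device)
output = model.generate(input_ids, max_new_tokens=64)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```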
Brief Details: Quantized version of the LLaVA-1.5-7B multimodal model in various GGUF formats, optimized for efficiency while maintaining performance.
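A sketch of multimodal chat with a LLaVA GGUF via llama-cpp-python; the model and CLIP-projector file names are placeholders, and a matching mmproj file is required alongside the language model.

```python
# Sketch: LLaVA-1.5 chat with llama-cpp-python (file names are placeholders).
from llama_cpp import Llama
from llama_cpp.llama_chat_format import Llava15ChatHandler

llm = Llama(
    model_path="llava-1.5-7b.Q4_K_M.gguf",  # placeholder
    chat_handler=Llava15ChatHandler(clip_model_path="mmproj-f16.gguf"),
    n_ctx=2048,
)
resp = llm.create_chat_completion(messages=[
    {"role": "user", "content": [
        {"type": "image_url", "image_url": {"url": "https://example.com/cat.jpg"}},
        {"type": "text", "text": "Describe this image."},
    ]},
])
print(resp["choices"][0]["message"]["content"])
```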
Brief Details: Speech recognition model fine-tuned for Malayalam from Wav2Vec2-Large-XLSR-53, achieving 28.43% WER on a combined test set drawn from multiple Malayalam speech corpora.
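A minimal sketch of CTC transcription with a fine-tuned XLSR model; the checkpoint id and audio path are placeholders.

```python
# Sketch: Wav2Vec2 CTC inference (checkpoint id and audio path are placeholders).
import torch
import librosa
from transformers import Wav2Vec2Processor, Wav2Vec2ForCTC

model_id = "<malayalam-xlsr-checkpoint>"  # placeholder
processor = Wav2Vec2Processor.from_pretrained(model_id)
model = Wav2Vec2ForCTC.from_pretrained(model_id)

speech, _ = librosa.load("sample.wav", sr=16_000)  # XLSR expects 16 kHz mono audio
inputs = processor(speech, sampling_rate=16_000, return_tensors="pt")
with torch.no_grad():
    logits = model(inputs.input_values).logits
print(processor.batch_decode(torch.argmax(logits, dim=-1))[0])
```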
Brief Details: A specialized image-generation model for Studio Ghibli-style anime artwork, created by IShallRiseAgain with an explicit focus on non-NFT creative use.
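A sketch of generating an image with a Stable Diffusion-style fine-tune via diffusers; the checkpoint id, the underlying architecture, and the prompt phrasing are all assumptions, since the entry names none of them.

```python
# Sketch: text-to-image with diffusers (checkpoint id is a placeholder).
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "<studio-ghibli-style-checkpoint>",  # placeholder
    torch_dtype=torch.float16,
).to("cuda")
image = pipe("a quiet seaside town, studio ghibli style").images[0]
image.save("ghibli_town.png")
```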
Brief Details: Perceiver IO language model that processes raw UTF-8 bytes via cross-attention with a latent array, achieving an 81.8 GLUE score. Combines efficient processing with flexible output generation.
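A sketch of byte-level masked-language prediction with Perceiver IO, following the transformers documentation example; the deepmind/language-perceiver checkpoint id is an assumption.

```python
# Sketch: byte-level masked LM with Perceiver IO (checkpoint id assumed).
import torch
from transformers import PerceiverTokenizer, PerceiverForMaskedLM

tokenizer = PerceiverTokenizer.from_pretrained("deepmind/language-perceiver")
model = PerceiverForMaskedLM.from_pretrained("deepmind/language-perceiver")

text = "This is an incomplete sentence where some words are missing."
inputs = tokenizer(text, padding="max_length", return_tensors="pt")
# Mask " missing." at the byte level (positions 52..60 of the padded input).
inputs.input_ids[0, 52:61] = tokenizer.mask_token_id
with torch.no_grad():
    logits = model(**inputs).logits
print(tokenizer.decode(logits[0, 52:61].argmax(dim=-1)))  # " missing."
```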
Brief Details: Compact BERT model (11.55M params) trained on historical multilingual texts from Europeana and the British Library, supporting German, French, English, Finnish, and Swedish.
Brief Details: BERT-large model fine-tuned on the CoNLL-03 dataset for Named Entity Recognition (NER), specialized in identifying person names, organizations, locations, and miscellaneous entities in English text.
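A minimal sketch using the transformers NER pipeline; the checkpoint id is an assumption (a standard CoNLL-03 BERT-large), not confirmed by the entry.

```python
# Sketch: NER with the transformers pipeline (checkpoint id assumed).
from transformers import pipeline

ner = pipeline(
    "ner",
    model="dbmdz/bert-large-cased-finetuned-conll03-english",  # assumed id
    aggregation_strategy="simple",  # merge word pieces into whole entities
)
print(ner("Wolfgang lives in Berlin and works for Siemens."))
# [{'entity_group': 'PER', 'word': 'Wolfgang', ...}, ...]
```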
Brief Details: BERT base model for Italian language processing, trained on a 13GB corpus of 2B tokens. Uncased version, suited to general NLP tasks.
Brief Details: Historical multilingual BERT model trained on 130GB of historical texts in five languages (German, French, English, Finnish, Swedish), optimized for NER tasks.
Brief Details: German BERT model trained on 51GB of Europeana newspaper data (8B tokens). Specialized for historical German text processing; uncased version.
Brief Details: German BERT model trained on the Europeana newspapers corpus (51GB, 8B tokens), specialized for historical German text analysis and NLP tasks.
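The historical BERT entries above are all used as standard masked-language models; a sketch with the fill-mask pipeline, where the checkpoint id follows the dbmdz Europeana naming and is an assumption.

```python
# Sketch: fill-mask with a historical German BERT (checkpoint id assumed).
from transformers import pipeline

fill = pipeline("fill-mask", model="dbmdz/bert-base-german-europeana-uncased")
# Uncased model: lowercase input; [MASK] marks the blank to predict.
for pred in fill("im jahre 1871 wurde das deutsche [MASK] gegründet."):
    print(f"{pred['score']:.3f}  {pred['token_str']}")
```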
Brief Details: BERT-based adapter for book genre classification, built on bert-base-cased. Enables efficient text classification through the adapter-transformers library.
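A sketch of attaching a pretrained adapter to bert-base-cased; it uses the adapters library (the successor to adapter-transformers), and the adapter repo id is a placeholder.

```python
# Sketch with the `adapters` library (successor of adapter-transformers);
# the adapter repo id below is a placeholder, not the one from this entry.
from adapters import AutoAdapterModel

model = AutoAdapterModel.from_pretrained("bert-base-cased")
adapter_name = model.load_adapter("<book-genre-classification-adapter>")  # placeholder
model.set_active_adapters(adapter_name)  # route the forward pass through the adapter
```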
Brief Details: A Vision-and-Language Transformer model for visual question answering, trained on the GCC+SBU+COCO+VG datasets without convolution or region supervision.
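A sketch of VQA inference with ViLT via transformers, following the library's documented usage; the dandelin/vilt-b32-finetuned-vqa checkpoint id and sample image are assumptions.

```python
# Sketch: visual question answering with ViLT (checkpoint id assumed).
from transformers import ViltProcessor, ViltForQuestionAnswering
from PIL import Image
import requests

image = Image.open(requests.get(
    "http://images.cocodataset.org/val2017/000000039769.jpg", stream=True).raw)

processor = ViltProcessor.from_pretrained("dandelin/vilt-b32-finetuned-vqa")
model = ViltForQuestionAnswering.from_pretrained("dandelin/vilt-b32-finetuned-vqa")

inputs = processor(image, "How many cats are there?", return_tensors="pt")
logits = model(**inputs).logits
print(model.config.id2label[logits.argmax(-1).item()])  # e.g. "2"
```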
Brief Details: Vision-and-Language Transformer model fine-tuned on the Flickr30k dataset, specialized in image-text retrieval without requiring region supervision.
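A sketch of scoring image-text pairs with the ViLT retrieval head; the dandelin/vilt-b32-finetuned-flickr30k checkpoint id and sample inputs are assumptions.

```python
# Sketch: image-text matching with ViLT (checkpoint id assumed).
from transformers import ViltProcessor, ViltForImageAndTextRetrieval
from PIL import Image
import requests

image = Image.open(requests.get(
    "http://images.cocodataset.org/val2017/000000039769.jpg", stream=True).raw)

repo = "dandelin/vilt-b32-finetuned-flickr30k"
processor = ViltProcessor.from_pretrained(repo)
model = ViltForImageAndTextRetrieval.from_pretrained(repo)

for text in ["two cats sleeping on a couch", "a plane on a runway"]:
    inputs = processor(image, text, return_tensors="pt")
    score = model(**inputs).logits[0, 0].item()  # higher means a better match
    print(f"{score:8.2f}  {text}")
```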
Brief Details: BanglaBERT, a state-of-the-art ELECTRA-based model for Bengali NLP, scoring 77.09 on the Bangla Language Understanding Benchmark (BLUB). Excels at sentiment analysis, NER, and QA tasks.
Brief Details: DanTagGen-delta-rev2 is a specialized model by KBlueLeaf hosted on Hugging Face, focused on tag generation and text analysis.
Brief Details: BERT-based sentence embedding model (768-dim vectors) for semantic tasks. DEPRECATED and not recommended due to low-quality embeddings.
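For reference, sentence embeddings of this kind are typically produced with sentence-transformers; the model id below is a placeholder, and per the entry above a current, non-deprecated checkpoint should be preferred.

```python
# Sketch: sentence embeddings with sentence-transformers (model id is a
# placeholder; the entry above marks this checkpoint as deprecated).
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("<deprecated-bert-embedding-model>")  # placeholder
vectors = model.encode(["An example sentence.", "Another one."])
print(vectors.shape)  # (2, 768)
```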
Brief Details: A 1B-parameter SFT model based on the Rho architecture, fine-tuned on the GSM8K dataset for mathematical reasoning. Implements techniques from the referenced arXiv paper.
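A sketch of prompting such a model on a GSM8K-style problem with transformers; the checkpoint id, prompt format, and question are all assumptions.

```python
# Sketch: math-reasoning generation (checkpoint id and prompt format assumed).
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "<rho-1b-sft-gsm8k>"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

prompt = ("Question: A train travels 60 miles per hour for 2.5 hours. "
          "How far does it go?\nAnswer:")
inputs = tokenizer(prompt, return_tensors="pt")
output = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```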