Brief-details: Spanish Named Entity Recognition model using XLM-RoBERTa architecture. Achieves 90.54% F1-score on CoNLL-03 Spanish, identifying PER, LOC, ORG, and MISC entities.
Brief-details: Dutch Named Entity Recognition model using BERT and LSTM-CRF architecture. Achieves 92.58% F1-score on CoNLL-03 Dutch, detects PER/LOC/ORG/MISC entities.
Brief-details: Large-scale Dutch Named Entity Recognition model using FLERT architecture. Achieves 95.25% F1-score on CoNLL-03 Dutch. Identifies PER, LOC, ORG, and MISC entities.
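The NER models above all tag PER, LOC, ORG, and MISC entities, typically as per-token BIO labels that must be collapsed into entity spans. A minimal pure-Python sketch of that decoding step (the tag scheme is the standard BIO one; the example tokens are invented for illustration):

```python
def bio_to_spans(tokens, tags):
    """Collapse per-token BIO tags into (entity_type, text) spans."""
    spans, current, ctype = [], [], None
    for tok, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            if current:
                spans.append((ctype, " ".join(current)))
            current, ctype = [tok], tag[2:]
        elif tag.startswith("I-") and current and tag[2:] == ctype:
            current.append(tok)
        else:  # "O" or an inconsistent I- tag closes any open span
            if current:
                spans.append((ctype, " ".join(current)))
            current, ctype = [], None
    if current:
        spans.append((ctype, " ".join(current)))
    return spans

tokens = ["Jan", "de", "Vries", "woont", "in", "Amsterdam"]
tags = ["B-PER", "I-PER", "I-PER", "O", "O", "B-LOC"]
print(bio_to_spans(tokens, tags))
# [('PER', 'Jan de Vries'), ('LOC', 'Amsterdam')]
```

The reported F1-scores are computed over exactly these recovered spans, so decoding errors count against the model just like tagging errors.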
Brief-details: RoBERTa-based targeted sentiment classifier for news articles, specialized in analyzing sentiment for specific entities within text contexts
Brief Details: T5-small model fine-tuned on XSum dataset for abstractive text summarization, optimized for generating concise news summaries.
BRIEF DETAILS: Qwen2.5-3B optimized with Unsloth's Dynamic 4-bit quantization. Offers 2x faster performance, 60% less memory usage, and specialized quantization for improved accuracy.
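As a rough illustration of where 4-bit quantization savings come from (back-of-envelope arithmetic only, not Unsloth's exact figures; the 10% metadata overhead for scales/zero-points is an assumption):

```python
def model_memory_gib(n_params, bits_per_weight, overhead_frac=0.0):
    """Approximate weight-storage footprint in GiB.

    overhead_frac models extra space for quantization scales/zero-points.
    """
    total_bytes = n_params * bits_per_weight / 8 * (1 + overhead_frac)
    return total_bytes / 2**30

fp16 = model_memory_gib(3e9, 16)        # Qwen2.5-3B weights in fp16
q4 = model_memory_gib(3e9, 4, 0.10)     # 4-bit weights + ~10% metadata
print(f"fp16: {fp16:.2f} GiB, 4-bit: {q4:.2f} GiB, saved: {1 - q4 / fp16:.0%}")
```

The same arithmetic explains why the savings percentage is roughly constant across model sizes: it depends only on bits-per-weight, not on parameter count.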
Brief Details: Ai-Thalli is a fine-tuned LLaMA model optimized for multilingual text generation tasks, featuring easy integration with the Transformers library
Brief-details: A specialized sentence transformer model fine-tuned for course recommendation, based on BGE-base-en-v1.5, outputting 768-dimensional vectors for semantic search and similarity tasks
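Recommenders built on sentence embeddings like these typically rank items by cosine similarity between the query vector and each candidate vector. A minimal pure-Python sketch (toy 3-dimensional vectors and course names stand in for the real 768-dimensional embeddings):

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

query = [0.2, 0.9, 0.1]  # embedding of the user's query (toy values)
courses = {
    "intro-ml": [0.1, 0.8, 0.0],
    "baking-101": [0.9, 0.0, 0.4],
}
ranked = sorted(courses, key=lambda c: cosine(query, courses[c]), reverse=True)
print(ranked)  # ['intro-ml', 'baking-101']
```

In practice the course embeddings are precomputed once and the query is embedded at request time, so ranking reduces to one matrix-vector product.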
Brief Details: 72B parameter Qwen2.5-based model optimized for generalist and roleplay tasks, featuring FP8 dynamic quantization and ChatML format support.
BRIEF-DETAILS: Facebook's MMS-300M is a multilingual speech model pretrained on 500K hours across 1400+ languages, ideal for ASR tasks after fine-tuning.
Brief Details: Llama-3.2-1B by Meta, optimized with Unsloth's Dynamic 4-bit quantization. Offers multilingual capabilities with 70% reduced memory footprint.
Brief-details: A high-resolution (768x768) image generation model focused on realistic visuals, known for creative compositions and detailed human portraits. Final version before composition restrictions.
Brief-details: ClimateBERT model for classifying climate-related text specificity, fine-tuned on paragraph-level data for identifying specific vs non-specific climate discussions
Brief-details: OCR-free document understanding transformer model combining Swin Transformer vision encoder with BART decoder, fine-tuned on RVL-CDIP dataset for document classification tasks.
Brief Details: A compact question-answering model derived from DistilBERT, fine-tuned on SQuAD, offering efficient performance for Q&A tasks while maintaining reasonable accuracy.
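Extractive QA models like this DistilBERT variant score every token as a potential answer start and answer end; the predicted answer is the highest-scoring valid (start, end) pair. A stripped-down sketch of that span-selection step with made-up logits:

```python
def best_span(start_logits, end_logits, max_len=15):
    """Pick the (start, end) pair maximizing start+end score, with end >= start."""
    best, best_score = (0, 0), float("-inf")
    for s, s_score in enumerate(start_logits):
        for e in range(s, min(s + max_len, len(end_logits))):
            score = s_score + end_logits[e]
            if score > best_score:
                best, best_score = (s, e), score
    return best

tokens = ["The", "Eiffel", "Tower", "is", "in", "Paris"]
start_logits = [0.1, 0.2, 0.1, 0.0, 0.3, 2.5]  # toy per-token scores
end_logits = [0.0, 0.1, 0.4, 0.0, 0.2, 2.8]
s, e = best_span(start_logits, end_logits)
print(" ".join(tokens[s:e + 1]))  # Paris
```

The `max_len` cap is the usual guard against degenerate long spans; real pipelines also exclude spans that fall inside the question portion of the input.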
Brief-details: Moirai-1.1-R-base is Salesforce's upgraded time series forecasting model with ~20% improvement for low-frequency data prediction, specifically optimized for yearly and quarterly forecasting tasks.
BRIEF-DETAILS: Meta's largest Llama-2 variant (70B parameters) fine-tuned for chat. Advanced language model with strong dialogue capabilities, distributed under Meta's Llama 2 community license.
BRIEF-DETAILS: TinyLlama 1.1B model variant implementing compressed tensors and optimized KV cache scheme for improved memory efficiency
BRIEF DETAILS: Advanced multilingual TTS model trained on 150k hours of audio data covering English, Chinese & Japanese. Released under the CC-BY-NC-SA-4.0 license.
Brief-details: Saiga Mistral 7B GGUF is a lightweight, Llama.cpp-compatible language model requiring only 10GB RAM, optimized for efficient deployment and inference
Brief-details: Nemotron-3-8B is NVIDIA's 8 billion parameter foundation model with 4K context window, requiring NVIDIA AI Foundation Models license for usage.