WD 1.4 SwinV2 Tagger V2

Property	Value
Parameter Count	96.2M
License	Apache-2.0
Framework Support	TF-Keras, ONNX, Safetensors
Tensor Type	F32

What is wd-v1-4-swinv2-tagger-v2?

The WD 1.4 SwinV2 Tagger V2 is an image tagging model developed by SmilingWolf. It is trained on images from the Danbooru dataset and predicts three kinds of tags: ratings, characters, and general tags.

Implementation Details

The model was trained with the SW-CV-ModelZoo framework on TPUs provided by the TRC program. The dataset is split by Danbooru image ID: images with IDs modulo 0000-0899 are used for training, and images with IDs modulo 0950-0999 are used for validation. On the validation set the model achieves an F1 score of 0.6854 at a threshold of 0.3771.

Images with fewer than 10 general tags are filtered out of the dataset
Tags that appear on fewer than 600 images are filtered out
Compatible with ONNXRuntime >= 1.17.0
Supports batch inference
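
The dataset rules above can be sketched in a few lines of Python. This is only an illustration, not the author's training code: the function names are hypothetical, "modulo" is assumed to mean the image ID modulo 1000, and the order of the two filters (images first, then tags) is a guess, since the text does not specify it.

```python
from collections import Counter
from typing import Dict, List, Optional, Set, Tuple


def split_for(image_id: int) -> Optional[str]:
    """Assign an image to a split from its ID (assumes 'modulo' means ID % 1000)."""
    bucket = image_id % 1000
    if bucket <= 899:
        return "train"
    if 950 <= bucket <= 999:
        return "val"
    return None  # buckets 900-949 are not mentioned in the text


def filter_dataset(
    general_tags_by_id: Dict[int, List[str]],
    min_tags_per_image: int = 10,
    min_images_per_tag: int = 600,
) -> Tuple[Dict[int, List[str]], Set[str]]:
    """Drop sparsely tagged images, then drop rare tags (filter order is an assumption)."""
    # Keep only images that carry enough distinct general tags.
    kept = {
        image_id: set(tags)
        for image_id, tags in general_tags_by_id.items()
        if len(set(tags)) >= min_tags_per_image
    }
    # Count how many of the kept images each tag appears on.
    counts = Counter(tag for tags in kept.values() for tag in tags)
    vocab = {tag for tag, n in counts.items() if n >= min_images_per_tag}
    # Restrict every kept image to the surviving vocabulary.
    filtered = {image_id: sorted(tags & vocab) for image_id, tags in kept.items()}
    return filtered, vocab
```

Both thresholds are parameters so the sketch can be tried on a small toy dataset; the defaults match the values stated in the list above.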

Core Capabilities

Multi-framework support (TF-Keras, ONNX, Safetensors)
Flexible batch processing
Tag coverage across ratings, characters, and general tags
Tag prediction with an F1 score of 0.6854 at a threshold of 0.3771
Weights stored as F32 (32-bit floating-point) tensors
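
As a rough illustration of batch inference with the ONNX export, the sketch below runs one forward pass with ONNX Runtime and keeps the tags whose score reaches the reported threshold. Several things here are assumptions rather than facts from the model card: the helper names `select_tags` and `tag_batch`, the model path, the list of tag names, the model having a single input and a first output of shape (batch, number of tags) with per-tag scores, and the choice of `>=` for the comparison. The caller is assumed to supply an already preprocessed NumPy batch in whatever layout the model expects.

```python
from typing import Dict, List, Sequence

THRESHOLD = 0.3771  # threshold reported together with the F1 score


def select_tags(
    scores: Sequence[float],
    tag_names: Sequence[str],
    threshold: float = THRESHOLD,
) -> Dict[str, float]:
    """Return {tag: score} for scores at or above the threshold, highest first."""
    if len(scores) != len(tag_names):
        raise ValueError("scores and tag_names must have the same length")
    picked = [
        (name, float(score))
        for name, score in zip(tag_names, scores)
        if score >= threshold
    ]
    picked.sort(key=lambda pair: pair[1], reverse=True)
    return dict(picked)


def tag_batch(
    model_path: str,
    batch,  # preprocessed NumPy array, one row per image (layout is an assumption)
    tag_names: Sequence[str],
    threshold: float = THRESHOLD,
) -> List[Dict[str, float]]:
    """Run one ONNX Runtime forward pass over a batch and threshold each row."""
    # Imported lazily so select_tags stays usable without ONNX Runtime installed.
    import onnxruntime as ort

    session = ort.InferenceSession(model_path, providers=["CPUExecutionProvider"])
    input_name = session.get_inputs()[0].name
    # Assumes the first output holds per-tag scores with shape (batch, num_tags).
    scores = session.run(None, {input_name: batch})[0]
    return [select_tags(row, tag_names, threshold) for row in scores]
```

Before building a batch, `session.get_inputs()[0].shape` can be inspected to check the input layout the exported model actually expects, rather than relying on the assumptions above.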

Frequently Asked Questions

Q: What makes this model unique?

This model covers ratings, characters, and general tags, which makes it suitable for large-scale image tagging tasks. It is built on the SwinV2 architecture and, at 96.2M parameters, keeps computational requirements reasonable.

Q: What are the recommended use cases?

The model is ideal for automated image tagging systems, content organization, and image database management. It's particularly well-suited for applications requiring detailed image classification and tagging with multiple categories.
