SDXL Detector

Property	Value
Parameter Count	86.8M
License	CC-BY-NC-3.0
Architecture	Transformer-based (Swin)
Validation Metrics	F1: 0.973, Accuracy: 0.981

What is sdxl-detector?

The SDXL Detector is a specialized image classification model designed to distinguish between authentic images and those generated by Stable Diffusion XL (SDXL). Built upon the foundation of the umm-maybe AI art detector, this model has been fine-tuned on a carefully curated dataset of Wikimedia-SDXL image pairs, where SDXL-generated images are created using BLIP-generated captions from Wikimedia images.

Implementation Details

This model leverages the Swin Transformer architecture and has been optimized using AutoTrain technology. It demonstrates exceptional performance with a 97.3% F1 score and 98.1% accuracy on validation data. The model supports both INT64 and FP32 tensor types, making it versatile for different deployment scenarios.

Improved detection capability for recent diffusion models
Enhanced performance on non-artistic imagery
Optimized for SDXL-generated content
Supports ONNX runtime for efficient inference

Core Capabilities

High-precision detection (99.45% precision rate)
Robust recall performance (95.29%)
Exceptional AUC score of 0.998
Specialized in detecting SDXL-generated images

Frequently Asked Questions

Q: What makes this model unique?

This model specifically excels at detecting images generated by newer diffusion models, particularly SDXL, while also performing well on non-artistic imagery due to its diverse training dataset from Wikimedia.

Q: What are the recommended use cases?

The model is ideal for non-commercial applications in education and personal use, particularly for detecting SDXL-generated images. However, it may have lower performance on images generated by older AI models like VQGAN+CLIP.

sdxl-detector