StableSR

Iceclear

StableSR is a diffusion-based image super-resolution model that leverages Stable Diffusion for high-quality upscaling, featuring time-aware encoding and controllable feature wrapping.

Property	Value
Developer	Jianyi Wang
License	S-Lab License 1.0
Paper	Research Paper
Model Type	Diffusion-based Image Super-Resolution

What is StableSR?

StableSR is an advanced image super-resolution model that builds upon Stable Diffusion technology. It introduces innovative features like time-aware encoding and controllable feature wrapping (CFW) to achieve high-quality image upscaling. The model has been specifically designed to handle real-world image super-resolution tasks with remarkable effectiveness.

Implementation Details

The model architecture combines multiple sophisticated components: a fixed autoencoder that transforms images into latent representations (with an 8x downsampling factor), a time-aware encoder for guidance, and a CFW module trained on synthetic paired data. The model offers multiple checkpoints, including variants optimized for different resolutions and a turbo version capable of 4-step sampling.

Trained on DF2K and OST datasets
Uses fixed autoencoder with 8x downsampling
Implements time-aware encoding for improved guidance
Features controllable feature wrapping module

Core Capabilities

High-quality image upscaling for real-world scenarios
Multiple resolution support (512-base and 768v variants)
Fast processing with Turbo version (4-step sampling)
Balanced approach between fidelity and detail generation

Frequently Asked Questions

Q: What makes this model unique?

StableSR stands out through its combination of diffusion-based processing with time-aware encoding and CFW module, offering superior quality in real-world image super-resolution compared to traditional GAN-based approaches.

Q: What are the recommended use cases?

The model is ideal for high-quality image upscaling tasks, particularly when dealing with real-world images. It's especially effective for images up to 512 or 768 pixels, though processing speed may decrease for larger images.