StableSR

StableSR

Iceclear

StableSR is a diffusion-based image super-resolution model that leverages Stable Diffusion for high-quality upscaling, featuring time-aware encoding and controllable feature wrapping.

PropertyValue
DeveloperJianyi Wang
LicenseS-Lab License 1.0
PaperResearch Paper
Model TypeDiffusion-based Image Super-Resolution

What is StableSR?

StableSR is an advanced image super-resolution model that builds upon Stable Diffusion technology. It introduces innovative features like time-aware encoding and controllable feature wrapping (CFW) to achieve high-quality image upscaling. The model has been specifically designed to handle real-world image super-resolution tasks with remarkable effectiveness.

Implementation Details

The model architecture combines multiple sophisticated components: a fixed autoencoder that transforms images into latent representations (with an 8x downsampling factor), a time-aware encoder for guidance, and a CFW module trained on synthetic paired data. The model offers multiple checkpoints, including variants optimized for different resolutions and a turbo version capable of 4-step sampling.

  • Trained on DF2K and OST datasets
  • Uses fixed autoencoder with 8x downsampling
  • Implements time-aware encoding for improved guidance
  • Features controllable feature wrapping module

Core Capabilities

  • High-quality image upscaling for real-world scenarios
  • Multiple resolution support (512-base and 768v variants)
  • Fast processing with Turbo version (4-step sampling)
  • Balanced approach between fidelity and detail generation

Frequently Asked Questions

Q: What makes this model unique?

StableSR stands out through its combination of diffusion-based processing with time-aware encoding and CFW module, offering superior quality in real-world image super-resolution compared to traditional GAN-based approaches.

Q: What are the recommended use cases?

The model is ideal for high-quality image upscaling tasks, particularly when dealing with real-world images. It's especially effective for images up to 512 or 768 pixels, though processing speed may decrease for larger images.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026