sdxs-512-0.9

sdxs-512-0.9

IDKiro

SDXS-512-0.9 is a real-time one-step latent diffusion model for high-resolution image generation, based on score distillation and feature matching techniques.

PropertyValue
LicenseOpenRail++
Research PaperSDXS: Real-Time One-Step Latent Diffusion Models
AuthorIDKiro
FrameworkDiffusers

What is sdxs-512-0.9?

SDXS-512-0.9 is an innovative text-to-image generation model designed for real-time high-resolution image creation. It represents an older version of the SDXS architecture, utilizing both SD Turbo as its teacher DM and SD v2.1 base as its offline DM, along with TAESD for image encoding/decoding.

Implementation Details

The model implements a sophisticated architecture combining score distillation and feature matching techniques. It utilizes TAESD for VAE operations and employs a modified attention mechanism where self-attention is replaced with cross-attention in the highest resolution stages.

  • Single-step inference with guidance scale set to 0
  • Compatible with both float32 and float16 weight types
  • Optimized for 512x512 image generation

Core Capabilities

  • Real-time high-resolution image generation
  • One-step inference process
  • Text-to-image conversion with stable diffusion foundation
  • Efficient memory usage through modified attention mechanisms

Frequently Asked Questions

Q: What makes this model unique?

This model stands out for its ability to generate high-quality images in real-time using just one inference step, making it significantly faster than traditional multi-step diffusion models.

Q: What are the recommended use cases?

The model is ideal for applications requiring real-time image generation, such as interactive design tools, rapid prototyping, and scenarios where processing speed is crucial while maintaining reasonable image quality.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026