xlsd32-alpha1

opendiffusionai

Experimental SD1.5 model paired with the SDXL VAE, trained on LAION2B-derived datasets in fp32 precision. Currently in alpha, with 10 epochs of training completed.

Property	Value
Author	opendiffusionai
Model Type	Stable Diffusion 1.5 with SDXL VAE
Training Precision	FP32
Model URL	Hugging Face

What is xlsd32-alpha1?

xlsd32-alpha1 is an experimental, work-in-progress model that pairs the Stable Diffusion 1.5 base with the SDXL VAE. Because the SD1.5 base was originally trained against its own VAE, the model is being retrained so that the base works with the SDXL VAE. At epoch 10, it has reportedly reached parity with existing SD base models for generating images of people.

Implementation Details

The model was trained in full fp32 precision on a single NVIDIA RTX 4090 GPU. The training data combines several LAION2B subsets, covering square images at 1120px and at 1024px/1536px, with captions from both moondream and wd14.

Training data: LAION2B subsets at 1120px, 1024px, and 1536px resolutions
Captions generated with both moondream and wd14
Trained in full FP32 precision rather than a reduced-precision format
Pairs the SDXL VAE with the SD1.5 base model
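
To illustrate why the SD1.5 base needs retraining rather than a simple VAE swap, the sketch below compares the latent geometry of the two VAEs. The channel counts, the 8x downsampling factor, and the scaling factors come from the commonly published VAE configurations, not from this model card, so treat them as assumptions to verify. `VaeSpec`, `latent_shape`, and `same_latent_shape` are hypothetical helper names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VaeSpec:
    """Minimal description of a latent-diffusion VAE (illustrative only)."""
    name: str
    latent_channels: int
    downsample: int        # spatial reduction factor from pixels to latents
    scaling_factor: float  # multiplier applied to latents before the UNet


# Assumed values, taken from the commonly published configs for each VAE.
SD15_VAE = VaeSpec("SD1.5 VAE", latent_channels=4, downsample=8, scaling_factor=0.18215)
SDXL_VAE = VaeSpec("SDXL VAE", latent_channels=4, downsample=8, scaling_factor=0.13025)


def latent_shape(spec: VaeSpec, width: int, height: int) -> tuple[int, int, int]:
    """Return the (channels, height, width) of the latent for a given image size."""
    if width % spec.downsample or height % spec.downsample:
        raise ValueError(f"width and height must be multiples of {spec.downsample}")
    return (spec.latent_channels, height // spec.downsample, width // spec.downsample)


def same_latent_shape(a: VaeSpec, b: VaeSpec, width: int, height: int) -> bool:
    """True when both VAEs produce latents of identical shape for this image size."""
    return latent_shape(a, width, height) == latent_shape(b, width, height)


if __name__ == "__main__":
    for spec in (SD15_VAE, SDXL_VAE):
        print(spec.name, latent_shape(spec, 512, 512), spec.scaling_factor)
    print("shapes match:", same_latent_shape(SD15_VAE, SDXL_VAE, 512, 512))
```

Under these assumptions the two VAEs produce latents of the same shape, which is what lets the SDXL VAE slot into an SD1.5 pipeline at all. The differing scaling factors suggest the latent values are distributed differently, and that difference is what retraining the base has to absorb.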

Core Capabilities

Compatible with standard SD1.5 workflows and pipelines
Optimized for human image generation
Supports high-resolution image generation
Combines the SD1.5 architecture with the SDXL VAE
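
As a rough illustration of using the model in a standard SD1.5 pipeline, the sketch below loads it with the `diffusers` library. The repository id, the assumption that the weights are published in diffusers format, and the default settings (512px, 30 steps, guidance scale 7.0) are illustrative guesses rather than values from the model's authors. `generation_settings` and `generate` are hypothetical helpers.

```python
# Assumption: repository id guessed from the author and model names.
REPO_ID = "opendiffusionai/xlsd32-alpha1"


def generation_settings(width: int = 512, height: int = 512,
                        steps: int = 30, guidance_scale: float = 7.0) -> dict:
    """Build keyword arguments for a StableDiffusionPipeline call.

    The defaults are illustrative, not recommendations from the model authors.
    """
    # The pipeline rejects sizes that are not multiples of 8.
    if width % 8 or height % 8:
        raise ValueError("width and height must be multiples of 8")
    return {
        "width": width,
        "height": height,
        "num_inference_steps": steps,
        "guidance_scale": guidance_scale,
    }


def generate(prompt: str, **overrides):
    """Load the model as an ordinary SD1.5 pipeline and render one image."""
    # Imported lazily so the settings helper works without torch/diffusers installed.
    import torch
    from diffusers import StableDiffusionPipeline

    # Assumes the repository ships diffusers-format weights, including the VAE.
    pipe = StableDiffusionPipeline.from_pretrained(REPO_ID, torch_dtype=torch.float32)
    pipe = pipe.to("cuda" if torch.cuda.is_available() else "cpu")
    return pipe(prompt, **generation_settings(**overrides)).images[0]


if __name__ == "__main__":
    image = generate("portrait photo of a woman, natural light")
    image.save("xlsd32_sample.png")
```

If the weights are instead distributed as a single checkpoint file, the loading call would need to change accordingly; the rest of the sketch should carry over.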

Frequently Asked Questions

Q: What makes this model unique?

This model pairs SD1.5's base architecture with SDXL's VAE. The hybrid approach keeps compatibility with SD1.5 workflows while potentially improving image quality through the SDXL VAE.

Q: What are the recommended use cases?

The model is designed to be used like any other SD1.5 model, with particular strength in human image generation. It's currently in alpha stage, making it suitable for experimental use and testing rather than production environments.