UNO

UNO

bytedance-research

UNO is a multi-image conditioned subject-to-image model using diffusion transformers, featuring high-consistency data synthesis and universal rotary position embedding.

PropertyValue
AuthorByteDance Research
LicenseCC BY-NC 4.0
PaperarXiv:2504.02160
RepositoryHugging Face

What is UNO?

UNO is an innovative AI model that addresses the challenge of consistent multi-subject image generation through a sophisticated data synthesis pipeline. Developed by ByteDance Research, it leverages diffusion transformers' in-context generation capabilities to produce highly consistent paired data across multiple subjects.

Implementation Details

The model is built on a progressive cross-modal alignment architecture and incorporates universal rotary position embedding. It's trained iteratively from a text-to-image model to achieve multi-image conditioned subject-to-image generation capabilities.

  • Python environment requirements: >= 3.10 <= 3.12
  • Built on FLUX.1-dev base model
  • Implements both training and inference pipelines
  • Supports multi-subject driven generation

Core Capabilities

  • High-consistency multi-subject paired data generation
  • Progressive cross-modal alignment
  • Universal rotary position embedding
  • Controllable single-subject and multi-subject generation
  • In-context generation capabilities

Frequently Asked Questions

Q: What makes this model unique?

UNO's distinctive feature is its ability to achieve high consistency in multi-subject image generation while maintaining controllability. Its less-to-more generalization approach and in-context generation capabilities set it apart from traditional image generation models.

Q: What are the recommended use cases?

The model is particularly well-suited for applications requiring consistent multi-subject image generation, academic research, and scenarios where precise control over generated content is necessary. It's designed for responsible usage within the bounds of local laws and ethical guidelines.

Socials
PromptLayer
Company
All services online
Location IconPromptLayer is located in the heart of New York City
PromptLayer © 2026