zeroscope_v2_576w

cerspense

Text-to-video AI model optimized for 16:9 compositions, generates high-quality watermark-free videos at 576x320 resolution with smooth output. Uses 7.9GB VRAM for 30 frames.

Property	Value
License	CC-BY-NC-4.0
Author	cerspense
Downloads	82,736
Training Data	9,923 clips, 29,769 tagged frames

What is zeroscope_v2_576w?

zeroscope_v2_576w is a specialized text-to-video generation model designed to create high-quality 16:9 video compositions at 576x320 resolution. This model is particularly notable for producing watermark-free outputs and is optimized for smooth video generation. It's built upon the ModelScope framework and has been specifically trained using nearly 10,000 video clips.

Implementation Details

The model requires 7.9GB of VRAM for generating 30 frames at 576x320 resolution. It's implemented using the Diffusers library and can be easily integrated into existing pipelines. The model is particularly designed to work in conjunction with zeroscope_v2_XL for high-resolution upscaling.

Built on ModelScope architecture
Optimized for 24fps video generation
Supports DPMSolverMultistepScheduler
Compatible with Diffusers pipeline

Core Capabilities

Generates smooth, high-quality video outputs
Produces watermark-free content
Optimized for 16:9 aspect ratio
Supports upscaling integration
Efficient memory usage with model CPU offloading

Frequently Asked Questions

Q: What makes this model unique?

This model's unique strength lies in its optimization for 16:9 compositions and its ability to produce watermark-free outputs. It's specifically designed as a preliminary step for upscaling with zeroscope_v2_XL, allowing for efficient workflow in video generation.

Q: What are the recommended use cases?

The model is best suited for generating initial video content at 576x320 resolution that will be upscaled to higher resolutions. It's particularly effective when used in conjunction with the zeroscope_v2_XL model for upscaling to 1024x576 resolution with a denoise strength between 0.66 and 0.85.