zeroscope_v2_576w

Maintained By
cerspense

zeroscope_v2_576w

PropertyValue
LicenseCC-BY-NC-4.0
Authorcerspense
Downloads82,736
Training Data9,923 clips, 29,769 tagged frames

What is zeroscope_v2_576w?

zeroscope_v2_576w is a specialized text-to-video generation model designed to create high-quality 16:9 video compositions at 576x320 resolution. This model is particularly notable for producing watermark-free outputs and is optimized for smooth video generation. It's built upon the ModelScope framework and has been specifically trained using nearly 10,000 video clips.

Implementation Details

The model requires 7.9GB of VRAM for generating 30 frames at 576x320 resolution. It's implemented using the Diffusers library and can be easily integrated into existing pipelines. The model is particularly designed to work in conjunction with zeroscope_v2_XL for high-resolution upscaling.

  • Built on ModelScope architecture
  • Optimized for 24fps video generation
  • Supports DPMSolverMultistepScheduler
  • Compatible with Diffusers pipeline

Core Capabilities

  • Generates smooth, high-quality video outputs
  • Produces watermark-free content
  • Optimized for 16:9 aspect ratio
  • Supports upscaling integration
  • Efficient memory usage with model CPU offloading

Frequently Asked Questions

Q: What makes this model unique?

This model's unique strength lies in its optimization for 16:9 compositions and its ability to produce watermark-free outputs. It's specifically designed as a preliminary step for upscaling with zeroscope_v2_XL, allowing for efficient workflow in video generation.

Q: What are the recommended use cases?

The model is best suited for generating initial video content at 576x320 resolution that will be upscaled to higher resolutions. It's particularly effective when used in conjunction with the zeroscope_v2_XL model for upscaling to 1024x576 resolution with a denoise strength between 0.66 and 0.85.

🍰 Interesting in building your own agents?
PromptLayer provides Huggingface integration tools to manage and monitor prompts with your whole team. Get started here.