CogVideoX1.0-LoRA-Arcane-v1

Cseti

CogVideoX LoRA model trained on Arcane style, enables text-to-video generation with anime-style aesthetics. Non-commercial research project with character tokens.

Property	Value
Base Model	THUDM/CogVideoX-5b
Training Steps	17,000
Dataset Size	136 videos (49x720x480)
Papers	CogVideoX & CogVideo Research Papers
License	Research/Non-commercial Use Only

What is CogVideoX1.0-LoRA-Arcane-v1?

CogVideoX1.0-LoRA-Arcane-v1 is a specialized LoRA adaptation of the CogVideoX model, fine-tuned to generate videos in the style of the Arcane animated series. This model represents a fan project developed for research purposes, enabling text-to-video generation with specific aesthetic qualities reminiscent of the show.

Implementation Details

The model was trained using adamw optimizer with a learning rate of 1e-4 and cosine_with_restarts scheduler. It features a rank/alpha of 128/128 and was trained on a curated dataset of 136 videos. The implementation requires the base CogVideoX-5b model and supports specific character tokens for enhanced generation control.

Compatible exclusively with CogVideoX 1.0
Includes specialized character tokens for consistent character generation
Trained with batch size 1 for optimal results
No quantization or safe attention recommended for best quality

Core Capabilities

Text-to-video generation in Arcane-inspired style
Character-specific generation using dedicated tokens
Expression and lighting control through prompt engineering
Smooth transition animations between frames

Frequently Asked Questions

Q: What makes this model unique?

This model specifically targets the Arcane animation style and includes character-specific tokens for more controlled generation, though with the caveat that character reproduction accuracy is limited.

Q: What are the recommended use cases?

The model is intended for research and non-commercial purposes only, specifically for generating Arcane-style video content. It's particularly effective for creating mood-driven character animations with specific lighting and expression transitions.