Hierarchical audio-driven portrait animation model from Fudan University. Enables realistic facial animation from audio input with ethical considerations built-in."