This is the project for 'Any2Caption', Interpreting Any Condition to Caption for Controllable Video Generation
video-editing diffusion video-captioning dit video-generation controllable-generation mllm multimodal-large-language-models video-diffusion video-dit
-
Updated
Apr 3, 2025