LongVideoAgent: Multi-Agent Reasoning with Long Videos Paper • 2512.20618 • Published 5 days ago • 49
Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing Paper • 2510.19808 • Published Oct 22 • 29
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance Paper • 2512.08765 • Published 19 days ago • 126
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework Paper • 2512.03041 • Published 26 days ago • 62
Can World Simulators Reason? Gen-ViRe: A Generative Visual Reasoning Benchmark Paper • 2511.13853 • Published Nov 17 • 34
MultiBooth: Towards Generating All Your Concepts in an Image from Text Paper • 2404.14239 • Published Apr 22, 2024 • 9