Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds • Paper • 2511.08892 • Published Nov 12, 2025 • 200
VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning • Paper • 2510.25772 • Published Oct 29, 2025 • 32
AdaViewPlanner: Adapting Video Diffusion Models for Viewpoint Planning in 4D Scenes • Paper • 2510.10670 • Published Oct 12, 2025 • 18
VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning • Paper • 2510.08555 • Published Oct 9, 2025 • 63
UniMMVSR: A Unified Multi-Modal Framework for Cascaded Video Super-Resolution • Paper • 2510.08143 • Published Oct 9, 2025 • 20
data-is-better-together/open-image-preferences-v1-binarized • Viewer • Updated Dec 9, 2024 • 7.46k • 109 • 57
The Unanticipated Asymmetry Between Perceptual Optimization and Assessment • Paper • 2509.20878 • Published Sep 25, 2025 • 3
DiffusionNFT: Online Diffusion Reinforcement with Forward Process • Paper • 2509.16117 • Published Sep 19, 2025 • 22
RewardDance: Reward Scaling in Visual Generation • Paper • 2509.08826 • Published Sep 10, 2025 • 73
AnyCap Project: A Unified Framework, Dataset, and Benchmark for Controllable Omni-modal Captioning • Paper • 2507.12841 • Published Jul 17, 2025 • 41
Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control • Paper • 2506.01943 • Published Jun 2, 2025 • 25
Scaling Image and Video Generation via Test-Time Evolutionary Search • Paper • 2505.17618 • Published May 23, 2025 • 41
VisualQuality-R1: Reasoning-Induced Image Quality Assessment via Reinforcement Learning to Rank • Paper • 2505.14460 • Published May 20, 2025 • 32