Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation Paper • 2403.12042 • Published Mar 18, 2024
Versatile Multimodal Controls for Whole-Body Talking Human Animation Paper • 2503.08714 • Published Mar 10 • 1
Designing a Better Asymmetric VQGAN for StableDiffusion Paper • 2306.04632 • Published Jun 7, 2023 • 3