5 55 31

QINGHE WANG

DecoderWQH666

https://qinghew.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper about 4 hours ago

SemanticGen: Video Generation in Semantic Space

upvoted a paper about 15 hours ago

DreaMontage: Arbitrary Frame-Guided One-Shot Video Generation

upvoted a paper 2 days ago

SemanticGen: Video Generation in Semantic Space

View all activity

Organizations

upvoted a paper about 15 hours ago

DreaMontage: Arbitrary Frame-Guided One-Shot Video Generation

Paper • 2512.21252 • Published 1 day ago • 22

upvoted a paper 2 days ago

SemanticGen: Video Generation in Semantic Space

Paper • 2512.20619 • Published 2 days ago • 83

upvoted 2 papers 3 days ago

StoryMem: Multi-shot Long Video Storytelling with Memory

Paper • 2512.19539 • Published 3 days ago • 15

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published 3 days ago • 60

upvoted a paper 7 days ago

Kling-Omni Technical Report

Paper • 2512.16776 • Published 7 days ago • 155

upvoted 4 papers 9 days ago

WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling

Paper • 2512.14614 • Published 9 days ago • 63

LongVie 2: Multimodal Controllable Ultra-Long Video World Model

Paper • 2512.13604 • Published 10 days ago • 70

Scone: Bridging Composition and Distinction in Subject-Driven Image Generation via Unified Understanding-Generation Modeling

Paper • 2512.12675 • Published 11 days ago • 40

MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives

Paper • 2512.14699 • Published 9 days ago • 27

upvoted a paper 10 days ago

KlingAvatar 2.0 Technical Report

Paper • 2512.13313 • Published 10 days ago • 40

upvoted a paper 11 days ago

SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder

Paper • 2512.11749 • Published 13 days ago • 36

upvoted 2 papers 23 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 24 days ago • 231

MultiShotMaster: A Controllable Multi-Shot Video Generation Framework

Paper • 2512.03041 • Published 23 days ago • 62

upvoted 2 papers about 2 months ago

AdaViewPlanner: Adapting Video Diffusion Models for Viewpoint Planning in 4D Scenes

Paper • 2510.10670 • Published Oct 12 • 18

VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning

Paper • 2510.25772 • Published Oct 29 • 32

upvoted 2 papers 2 months ago

Latent Diffusion Model without Variational Autoencoder

Paper • 2510.15301 • Published Oct 17 • 49

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

Paper • 2510.14979 • Published Oct 16 • 66

upvoted 3 papers 3 months ago

UniMMVSR: A Unified Multi-Modal Framework for Cascaded Video Super-Resolution

Paper • 2510.08143 • Published Oct 9 • 20

VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning

Paper • 2510.08555 • Published Oct 9 • 63

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

Paper • 2510.02283 • Published Oct 2 • 95

QINGHE WANG

AI & ML interests

Recent Activity

Organizations

DecoderWQH666's activity