Kyu Song's picture

Kyu Song

kyunocap

·

AI & ML interests

None yet

Recent Activity

liked a model 9 days ago

facebook/pe-av-large

upvoted a paper 9 days ago

EgoX: Egocentric Video Generation from a Single Exocentric Video

upvoted a paper 9 days ago

Memory in the Age of AI Agents

View all activity

Organizations

None yet

upvoted 2 papers 9 days ago

EgoX: Egocentric Video Generation from a Single Exocentric Video

Paper • 2512.08269 • Published 17 days ago • 109

Memory in the Age of AI Agents

Paper • 2512.13564 • Published 10 days ago • 112

upvoted a paper 13 days ago

MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos

Paper • 2512.10881 • Published 14 days ago • 29

upvoted a paper 14 days ago

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Paper • 2510.04618 • Published Oct 6 • 125

upvoted 5 papers 15 days ago

Composing Concepts from Images and Videos via Concept-prompt Binding

Paper • 2512.09824 • Published 15 days ago • 27

OmniPSD: Layered PSD Generation with Diffusion Transformer

Paper • 2512.09247 • Published 16 days ago • 46

UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation

Paper • 2512.07831 • Published 17 days ago • 16

Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality

Paper • 2512.07951 • Published 17 days ago • 47

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Paper • 2512.08765 • Published 16 days ago • 125

upvoted 3 papers 16 days ago

RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards

Paper • 2512.00473 • Published 27 days ago • 25

Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs

Paper • 2512.07525 • Published 18 days ago • 55

Unified Video Editing with Temporal Reasoner

Paper • 2512.07469 • Published 18 days ago • 45

upvoted 5 papers 17 days ago

TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows

Paper • 2512.05150 • Published 23 days ago • 74

EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture

Paper • 2512.04810 • Published 22 days ago • 25

EditThinker: Unlocking Iterative Reasoning for Any Image Editor

Paper • 2512.05965 • Published 20 days ago • 38

PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling

Paper • 2512.04784 • Published 24 days ago • 24

SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations

Paper • 2512.05905 • Published 20 days ago • 19

upvoted 3 papers about 2 months ago

Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals

Paper • 2510.27684 • Published Oct 31 • 22

UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions

Paper • 2511.03334 • Published Nov 5 • 52

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30 • 119