Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning (arXiv:2601.06943, published 27 days ago)
HiF-VLA: Hindsight, Insight and Foresight through Motion Representation for Vision-Language-Action Models (arXiv:2512.09928, published Dec 10, 2025)
UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios (arXiv:2511.18050, published Nov 22, 2025)
VLA^2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation (arXiv:2510.14902, published Oct 16, 2025)
Spatial Forcing: Implicit Spatial Representation Alignment for Vision-Language-Action Model (arXiv:2510.12276, published Oct 14, 2025)
VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model (arXiv:2509.09372, published Sep 11, 2025)
OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation (arXiv:2505.03912, published May 6, 2025)