The Language of Motion: Unifying Verbal and Non-verbal Language of 3D Human Motion Paper • 2412.10523 • Published Dec 13, 2024
Decoding Visual Experience and Mapping Semantics through Whole-Brain Analysis Using fMRI Foundation Models Paper • 2411.07121 • Published Nov 11, 2024 • 2
Re-thinking Temporal Search for Long-Form Video Understanding Paper • 2504.02259 • Published Apr 3 • 1
VideoMultiAgents: A Multi-Agent Framework for Video Question Answering Paper • 2504.20091 • Published Apr 25
UniEgoMotion: A Unified Model for Egocentric Motion Reconstruction, Forecasting, and Generation Paper • 2508.01126 • Published Aug 2 • 5
TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation Paper • 2102.04306 • Published Feb 8, 2021 • 1
QuantiPhy: A Quantitative Benchmark Evaluating Physical Reasoning Abilities of Vision-Language Models Paper • 2512.19526 • Published 4 days ago • 10
SCOPE: Structural Continuity Preservation for Medical Image Segmentation Paper • 2304.14572 • Published Apr 28, 2023
DIAMANT: Dual Image-Attention Map Encoders For Medical Image Segmentation Paper • 2304.14571 • Published Apr 28, 2023