New Research Alert - ICCV 2025 (Oral)!
Title: Understanding Co-speech Gestures in-the-wild
Description: JEGAL is a tri-modal model that learns from gestures, speech and text simultaneously, enabling devices to interpret co-speech gestures in the wild.
Authors: @sindhuhegde, K R Prajwal, Taein Kwon, and Andrew Zisserman