The model proposed in TOGA: Temporally Grounded Open-Ended Video QA with Weak Supervision, published at ICCV 2025. This model can perform temporal grounding and video question answering.

The corresponding code can be found at GitHub.

Downloads last month
2
Safetensors
Model size
8B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support