The model proposed in TOGA: Temporally Grounded Open-Ended Video QA with Weak Supervision, published at ICCV 2025. This model can perform temporal grounding and video question answering.

The corresponding code can be found at GitHub.

Downloads last month: 2

Safetensors

Model size

8B params

Tensor type

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support