Enxin
/

VideoNSA

Video-Text-to-Text

video-understanding

sparse-attention

vision-language

Model card Files Files and versions

17.3 GB

1 contributor

History: 8 commits

Enxin's picture

Update README.md

d18c083 verified 3 months ago