CASA: Cross-Attention via Self-Attention for Efficient Vision-Language Fusion
Paper
•
2512.19535
•
Published
•
10
Artificial Intelligence, Computer Vision, Machine Learning, Computational Photography, Image Enhancement, Super-Resolution, Compression, Streaming