Efficient Audio-Visual Speech Separation with Discrete Lip Semantics and Multi-Scale Global-Local Attention Paper • 2509.23610 • Published Sep 28
Advances in Speech Separation: Techniques, Challenges, and Future Trends Paper • 2508.10830 • Published Aug 14
AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models Paper • 2505.16211 • Published May 22
Adapting Vision Foundation Models for Robust Cloud Segmentation in Remote Sensing Images Paper • 2411.13127 • Published Nov 20, 2024
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios Paper • 2410.01481 • Published Oct 2, 2024