Thinking While Listening: Simple Test Time Scaling For Audio Classification Paper • 2509.19676 • Published Sep 24 • 4 • 2
Large Language Models Implicitly Learn to See and Hear Just By Reading Paper • 2505.17091 • Published May 20 • 5 • 3
Large Language Models Implicitly Learn to See and Hear Just By Reading Paper • 2505.17091 • Published May 20 • 5 • 3
Whisper-GPT: A Hybrid Representation Audio Large Language Model Paper • 2412.11449 • Published Dec 16, 2024 • 4 • 2