Puzzled by Puzzles: When Vision-Language Models Can't Take a Hint Paper • 2505.23759 • Published May 29, 2025 • 5
Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling Paper • 2504.13169 • Published Apr 17, 2025 • 39
TULIP: Towards Unified Language-Image Pretraining Paper • 2503.15485 • Published Mar 19, 2025 • 49