PRISMM-Bench: A Benchmark of Peer-Review Grounded Multimodal Inconsistencies Paper • 2510.16505 • Published Oct 18 • 3 • 2
LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content Paper • 2410.10783 • Published Oct 14, 2024 • 26 • 2