greattkiffy's Collections
ToBReviewed
updated
Personalize Anything for Free with Diffusion Transformer
Paper • 2503.12590 • Published • 44
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization
Paper • 2503.12937 • Published • 30
Exploring the Vulnerabilities of Federated Learning: A Deep Dive into Gradient Inversion Attacks
Paper • 2503.11514 • Published • 18
Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
Paper • 2502.19328 • Published • 23
GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning
Paper • 2504.00891 • Published • 14
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems
Paper • 2504.01990 • Published • 301
HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection
Paper • 2505.00506 • Published
MLLM-as-a-Judge for Image Safety without Human Labeling
Paper • 2501.00192 • Published • 31
DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research
Paper • 2505.19253 • Published • 32
Surfer-H Meets Holo1: Cost-Efficient Web Agent Powered by Open Weights
Paper • 2506.02865 • Published • 33
LLMalMorph: On The Feasibility of Generating Variant Malware using Large-Language-Models
Paper • 2507.09411 • Published • 3
F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions
Paper • 2509.06951 • Published • 32