ToBReviewed - a greattkiffy Collection

greattkiffy 's Collections

ToBReviewed

updated Sep 11, 2025

Personalize Anything for Free with Diffusion Transformer

Paper • 2503.12590 • Published Mar 16, 2025 • 44
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

Paper • 2503.12937 • Published Mar 17, 2025 • 30
Exploring the Vulnerabilities of Federated Learning: A Deep Dive into Gradient Inversion Attacks

Paper • 2503.11514 • Published Mar 13, 2025 • 18
Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems

Paper • 2502.19328 • Published Feb 26, 2025 • 23
GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

Paper • 2504.00891 • Published Apr 1, 2025 • 14
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31, 2025 • 301
HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection

Paper • 2505.00506 • Published May 1, 2025
MLLM-as-a-Judge for Image Safety without Human Labeling

Paper • 2501.00192 • Published Dec 31, 2024 • 31
Running

2

Code Explainer Pilot

⚡

2

code-explainer-pilot
DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research

Paper • 2505.19253 • Published May 25, 2025 • 32
Surfer-H Meets Holo1: Cost-Efficient Web Agent Powered by Open Weights

Paper • 2506.02865 • Published Jun 3, 2025 • 33
LLMalMorph: On The Feasibility of Generating Variant Malware using Large-Language-Models

Paper • 2507.09411 • Published Jul 12, 2025 • 3
F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions

Paper • 2509.06951 • Published Sep 8, 2025 • 32