nanatata

AI & ML interests

None yet

Organizations

None yet

upvoted 2 papers 26 days ago

Thinking with Programming Vision: Towards a Unified View for Thinking with Images

Paper • 2512.03746 • Published 26 days ago • 15

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26 • 144

liked a dataset about 1 month ago

We-Math/VTBench

Viewer • Updated Nov 26 • 500 • 122 • 7

upvoted 2 papers about 1 month ago

MedSAM3: Delving into Segment Anything with Medical Concepts

Paper • 2511.19046 • Published Nov 24 • 49

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20 • 92

upvoted 2 papers about 2 months ago

DeepEyesV2: Toward Agentic Multimodal Model

Paper • 2511.05271 • Published Nov 7 • 42

V-Thinker: Interactive Thinking with Images

Paper • 2511.04460 • Published Nov 6 • 96

liked 2 datasets about 2 months ago

We-Math/V-Perception-40K

Viewer • Updated Nov 7 • 36.7k • 166 • 7

We-Math/V-Interaction-400K

Viewer • Updated Nov 7 • 253k • 1.74k • 14

liked a model about 2 months ago

We-Math/V-Thinker

8B • Updated Nov 6 • 36 • 9

liked a dataset about 2 months ago

dongguanting/ARPO-RL-DeepSearch-1K

Viewer • Updated Oct 17 • 1.07k • 97 • 6

upvoted 3 papers about 2 months ago

π_RL: Online RL Fine-tuning for Flow-based Vision-Language-Action Models

Paper • 2510.25889 • Published Oct 29 • 64

INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

Paper • 2510.25602 • Published Oct 29 • 77

OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows

Paper • 2510.24411 • Published Oct 28 • 71

upvoted 3 papers 2 months ago

upvoted 3 papers 3 months ago

ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level Entropy Shaping

Paper • 2510.08457 • Published Oct 9 • 12

SpaceVista: All-Scale Visual Spatial Reasoning from mm to km

Paper • 2510.09606 • Published Oct 10 • 17

MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization

Paper • 2510.08540 • Published Oct 9 • 109
