1 14 5

Sheng Yang

SuunnYang

AI & ML interests

None yet

Recent Activity

upvoted a collection 19 days ago

GLM-4.6V

upvoted a collection 3 months ago

Qwen3-VL

upvoted a paper 4 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

View all activity

Organizations

None yet

upvoted a collection 19 days ago

GLM-4.6V

Collection

3 items • Updated 20 days ago • 47

upvoted a collection 3 months ago

Qwen3-VL

Collection

37 items • Updated Nov 1 • 542

upvoted 4 papers 4 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31 • 84

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7 • 180

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21 • 259

DINOv3

Paper • 2508.10104 • Published Aug 13 • 291

upvoted 2 papers 5 months ago

We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning

Paper • 2508.10433 • Published Aug 14 • 144

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8 • 194

liked a model 5 months ago

zai-org/GLM-4.5V

Image-Text-to-Text • 108B • Updated Oct 25 • 44.4k • • 699

upvoted a collection 5 months ago

GLM-4.5

Collection

GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai • 11 items • Updated Aug 11 • 250

liked a model 5 months ago

zai-org/GLM-4.5

Text Generation • 358B • Updated Aug 11 • 22.9k • • 1.39k

upvoted 2 papers 6 months ago

Pre-Trained Policy Discriminators are General Reward Models

Paper • 2507.05197 • Published Jul 7 • 39

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published Jul 2 • 56

liked a model 6 months ago

internlm/POLAR-7B

Text Classification • Updated Jul 15 • 98 • 25

upvoted 2 papers 6 months ago

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published Jul 2 • 130

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 431

authored 3 papers 6 months ago

Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transformers

Paper • 2405.10612 • Published May 17, 2024

Backdoor Defense via Suppressing Model Shortcuts

Paper • 2211.05631 • Published Nov 2, 2022

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 246

liked a model 6 months ago

zai-org/GLM-4.1V-9B-Thinking

Image-Text-to-Text • 10B • Updated Oct 25 • 208k • • 760

Sheng Yang

AI & ML interests

Recent Activity

Organizations

SuunnYang's activity