rp-yu's Collections
VPT Models
Updated Feb 20, 2025
Qwen2-VL models equipped with Visual Perception Tokens, or used in the training process.
rp-yu/Qwen2-VL-2b-VPT-Seg • Image-Text-to-Text • 3B • Updated Jul 14, 2025 • 8 • 1
rp-yu/Qwen2-VL-2b-VPT-CLIP • Image-Text-to-Text • Updated Mar 11, 2025 • 9 • 1
rp-yu/Qwen2-VL-2b-VPT-Seg-Alignment • Image-Text-to-Text • Updated Mar 11, 2025 • 8
rp-yu/Qwen2-VL-2b-VPT-Det-Alignment • Image-Text-to-Text • Updated Mar 11, 2025 • 9
rp-yu/Qwen2-VL-2b-VPT-Det • Image-Text-to-Text • Updated Mar 11, 2025 • 8
rp-yu/Qwen2-VL-7b-VPT-CLIP • Image-Text-to-Text • 8B • Updated Jul 7, 2025 • 15 • 1
rp-yu/Qwen2-VL-2b-VPT-Det-NoPrompt • Image-Text-to-Text • Updated Mar 11, 2025 • 9