rp-yu's Collections
VPT Models

updated Feb 20, 2025

Qwen2-VL models with Visual Perception Tokens, together with models used in the training process.

  • rp-yu/Qwen2-VL-2b-VPT-Seg

    Image-Text-to-Text • 3B • Updated Jul 14, 2025 • 8 • 1

  • rp-yu/Qwen2-VL-2b-VPT-CLIP

    Image-Text-to-Text • Updated Mar 11, 2025 • 9 • 1

  • rp-yu/Qwen2-VL-2b-VPT-Seg-Alignment

    Image-Text-to-Text • Updated Mar 11, 2025 • 8

  • rp-yu/Qwen2-VL-2b-VPT-Det-Alignment

    Image-Text-to-Text • Updated Mar 11, 2025 • 9

  • rp-yu/Qwen2-VL-2b-VPT-Det

    Image-Text-to-Text • Updated Mar 11, 2025 • 8

  • rp-yu/Qwen2-VL-7b-VPT-CLIP

    Image-Text-to-Text • 8B • Updated Jul 7, 2025 • 15 • 1

  • rp-yu/Qwen2-VL-2b-VPT-Det-NoPrompt

    Image-Text-to-Text • Updated Mar 11, 2025 • 9