GongDengxian

godx7

AI & ML interests

None yet

Recent Activity

liked a model 10 days ago

zhouyik/mask_tokenizer

updated a dataset 20 days ago

godx7/vector_plus_samtok

published a dataset 21 days ago

godx7/vector_plus_samtok

upvoted a collection 2 months ago

Describe Anything

Multimodal Large Language Models for Detailed Localized Image and Video Captioning • 7 items • Updated 9 days ago • 61

upvoted 2 papers 3 months ago

DeH4R: A Decoupled and Hybrid Method for Road Network Graph Extraction

Paper • 2508.13669 • Published Aug 19, 2025 • 1

The 1st Solution for 7th LSVOS RVOS Track: SaSaSa2VA

Paper • 2509.16972 • Published Sep 21, 2025 • 2

upvoted a paper 4 months ago

SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment

Paper • 2507.02705 • Published Jul 3, 2025 • 2

upvoted a paper 6 months ago

ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing

Paper • 2506.19848 • Published Jun 24, 2025 • 26

upvoted a collection 6 months ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated 1 day ago • 550

upvoted a paper 9 months ago

Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding

Paper • 2504.10465 • Published Apr 14, 2025 • 27

upvoted a paper over 1 year ago

OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding

Paper • 2406.19389 • Published Jun 27, 2024 • 54