Barry Li

Brilliant-B

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 12 days ago

Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

Paper • 2512.20605 • Published 15 days ago • 60

upvoted a paper about 2 months ago

Continuous Autoregressive Language Models

Paper • 2510.27688 • Published Oct 31, 2025 • 70

upvoted a paper 3 months ago

Unified Reinforcement and Imitation Learning for Vision-Language Models

Paper • 2510.19307 • Published Oct 22, 2025 • 30

upvoted a paper 4 months ago

Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search

Paper • 2509.07969 • Published Sep 9, 2025 • 58

liked a Space 4 months ago

FineVision: Open Data is All You Need

Space • 215

A new open-source dataset for training VLMs

liked a model 5 months ago

Qwen/Qwen-Image

Text-to-Image • Updated Aug 18, 2025 • 234k • 2.34k

upvoted 4 papers 5 months ago

AnyCap Project: A Unified Framework, Dataset, and Benchmark for Controllable Omni-modal Captioning

Paper • 2507.12841 • Published Jul 17, 2025 • 41

Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning

Paper • 2507.14137 • Published Jul 18, 2025 • 34

ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning

Paper • 2507.16815 • Published Jul 22, 2025 • 39

Pixels, Patterns, but No Poetry: To See The World like Humans

Paper • 2507.16863 • Published Jul 21, 2025 • 68

upvoted a paper 6 months ago

Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models

Paper • 2507.07104 • Published Jul 9, 2025 • 45

liked 2 models 6 months ago

google/siglip-so400m-patch14-384

Zero-Shot Image Classification • 0.9B • Updated Sep 26, 2024 • 4.2M • 635

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27, 2025 • 671k • 12.1k

upvoted 3 papers 7 months ago

UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation

Paper • 2506.17202 • Published Jun 20, 2025 • 10

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Paper • 2506.03147 • Published Jun 3, 2025 • 58

TIIF-Bench: How Does Your T2I Model Follow Your Instructions?

Paper • 2506.02161 • Published Jun 2, 2025 • 13

upvoted 3 papers 9 months ago

SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

Paper • 2504.11468 • Published Apr 10, 2025 • 30

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18, 2025 • 139

Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs

Paper • 2504.15280 • Published Apr 21, 2025 • 25

liked a dataset 9 months ago

TIGER-Lab/ViRL39K

Preview • Updated Apr 23, 2025 • 325 • 32
