Tre Ci's picture

12 3

Tre Ci

Tressuras

·

AI & ML interests

None yet

Organizations

None yet

upvoted 4 papers 3 months ago

Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training

Paper • 2509.21500 • Published Sep 25, 2025 • 18

Beyond Log Likelihood: Probability-Based Objectives for Supervised Fine-Tuning across the Model Capability Continuum

Paper • 2510.00526 • Published Oct 1, 2025 • 8

On the Use of Agentic Coding: An Empirical Study of Pull Requests on GitHub

Paper • 2509.14745 • Published Sep 18, 2025 • 4

ExGRPO: Learning to Reason from Experience

Paper • 2510.02245 • Published Oct 2, 2025 • 80

upvoted 6 papers 8 months ago

The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think

Paper • 2505.10185 • Published May 15, 2025 • 26

UniVLA: Learning to Act Anywhere with Task-centric Latent Actions

Paper • 2505.06111 • Published May 9, 2025 • 25

OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning

Paper • 2505.04601 • Published May 7, 2025 • 29

A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency

Paper • 2505.01658 • Published May 3, 2025 • 39

MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning

Paper • 2505.10557 • Published May 15, 2025 • 47

Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models

Paper • 2505.14810 • Published May 20, 2025 • 62

upvoted 2 papers 9 months ago

MR. Video: "MapReduce" is the Principle for Long Video Understanding

Paper • 2504.16082 • Published Apr 22, 2025 • 5

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21, 2025 • 88