1 15 11

J Li

jiazhengli

Monta3Pt's profile picture

Asad981's profile picture

Junrulu's profile picture

https://jiazhengli.com/

lijiazheng99

AI & ML interests

AI for Education

Recent Activity

commented on a paper about 1 month ago

SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space

authored a paper about 1 month ago

SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space

upvoted a paper about 1 month ago

SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space

View all activity

Organizations

None yet

jiazhengli 's collections 5

DARS

Two Heads Are Better Than One: Dual-Model Verbal Reflection at Inference-Time

Two Heads Are Better Than One: Dual-Model Verbal Reflection at Inference-Time

Paper • 2502.19230 • Published Feb 26, 2025 • 2
jiazhengli/DARS_synthethsis_reflection

Viewer • Updated Oct 22, 2025 • 43.5k • 24
jiazhengli/Qwen2.5-3B-Instruct-Critic

3B • Updated Oct 20, 2025 • 8
jiazhengli/Qwen2.5-3B-Instruct-Reasoner

3B • Updated Oct 20, 2025 • 6

MCTS with Preference Optimisation

Resources for EMNLP 2024 Paper: Calibrating LLMs with Preference Optimization on Thought Trees for Generating Rationale in Science Question Scoring

Calibrating LLMs with Preference Optimization on Thought Trees for Generating Rationale in Science Question Scoring

Paper • 2406.19949 • Published Jun 28, 2024 • 1
jiazhengli/Rationale_MCTS

Viewer • Updated Oct 14, 2024 • 8.71k • 42 • 2
jiazhengli/Synthetic_Rationale

Viewer • Updated Oct 14, 2024 • 32.9k • 66 • 1
jiazhengli/deberta-v3-large-Rationale-to-Score

Text Classification • 0.4B • Updated Jul 4, 2024 • 8 • 1

AERA

Resources for EMNLP 2023 Paper: Distilling ChatGPT for Explainable Automated Student Answer Assessment

Distilling ChatGPT for Explainable Automated Student Answer Assessment

Paper • 2305.12962 • Published May 22, 2023
jiazhengli/AERA

Viewer • Updated Oct 14, 2024 • 17.4k • 49
jiazhengli/long-t5-tglobal-large-AERA

Updated Oct 14, 2024 • 12

RoleMRC

A Fine-Grained Composite Benchmark for Role-Playing and Instruction-Following

RoleMRC: A Fine-Grained Composite Benchmark for Role-Playing and Instruction-Following

Paper • 2502.11387 • Published Feb 17, 2025 • 1
Junrulu/RoleMRC

Preview • Updated Mar 20, 2025 • 120 • 5
jiazhengli/Llama-3.1-8B-RoleMRC-dpo

8B • Updated Mar 11, 2025 • 6
jiazhengli/Llama-3.1-8B-RoleMRC-sft

8B • Updated Mar 11, 2025 • 17

SamPO

Resources for EMNLP 2024 Paper: Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence

Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence

Paper • 2406.10957 • Published Jun 16, 2024 • 2
jiazhengli/Pythia-2.8B-HH-RLHF-Iterative-SamPO

Text Generation • 3B • Updated Jun 17, 2024 • 11
jiazhengli/Pythia-2.8B-TLDR-Iterative-SamPO

Text Generation • 3B • Updated Jun 17, 2024 • 6
Junrulu/Llama-3-8B-Instruct-Iterative-SamPO

Text Generation • 8B • Updated Jun 14, 2024 • 8 • 1

DARS

Two Heads Are Better Than One: Dual-Model Verbal Reflection at Inference-Time

Two Heads Are Better Than One: Dual-Model Verbal Reflection at Inference-Time

Paper • 2502.19230 • Published Feb 26, 2025 • 2
jiazhengli/DARS_synthethsis_reflection

Viewer • Updated Oct 22, 2025 • 43.5k • 24
jiazhengli/Qwen2.5-3B-Instruct-Critic

3B • Updated Oct 20, 2025 • 8
jiazhengli/Qwen2.5-3B-Instruct-Reasoner

3B • Updated Oct 20, 2025 • 6

RoleMRC

A Fine-Grained Composite Benchmark for Role-Playing and Instruction-Following

RoleMRC: A Fine-Grained Composite Benchmark for Role-Playing and Instruction-Following

Paper • 2502.11387 • Published Feb 17, 2025 • 1
Junrulu/RoleMRC

Preview • Updated Mar 20, 2025 • 120 • 5
jiazhengli/Llama-3.1-8B-RoleMRC-dpo

8B • Updated Mar 11, 2025 • 6
jiazhengli/Llama-3.1-8B-RoleMRC-sft

8B • Updated Mar 11, 2025 • 17

MCTS with Preference Optimisation

Resources for EMNLP 2024 Paper: Calibrating LLMs with Preference Optimization on Thought Trees for Generating Rationale in Science Question Scoring

Calibrating LLMs with Preference Optimization on Thought Trees for Generating Rationale in Science Question Scoring

Paper • 2406.19949 • Published Jun 28, 2024 • 1
jiazhengli/Rationale_MCTS

Viewer • Updated Oct 14, 2024 • 8.71k • 42 • 2
jiazhengli/Synthetic_Rationale

Viewer • Updated Oct 14, 2024 • 32.9k • 66 • 1
jiazhengli/deberta-v3-large-Rationale-to-Score

Text Classification • 0.4B • Updated Jul 4, 2024 • 8 • 1

SamPO

Resources for EMNLP 2024 Paper: Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence

Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence

Paper • 2406.10957 • Published Jun 16, 2024 • 2
jiazhengli/Pythia-2.8B-HH-RLHF-Iterative-SamPO

Text Generation • 3B • Updated Jun 17, 2024 • 11
jiazhengli/Pythia-2.8B-TLDR-Iterative-SamPO

Text Generation • 3B • Updated Jun 17, 2024 • 6
Junrulu/Llama-3-8B-Instruct-Iterative-SamPO

Text Generation • 8B • Updated Jun 14, 2024 • 8 • 1

AERA

Resources for EMNLP 2023 Paper: Distilling ChatGPT for Explainable Automated Student Answer Assessment

Distilling ChatGPT for Explainable Automated Student Answer Assessment

Paper • 2305.12962 • Published May 22, 2023
jiazhengli/AERA

Viewer • Updated Oct 14, 2024 • 17.4k • 49
jiazhengli/long-t5-tglobal-large-AERA

Updated Oct 14, 2024 • 12