SafeSwitch: Steering Unsafe LLM Behavior via Internal Activation Signals Paper • 2502.01042 • Published Feb 3, 2025 • 1
Energy-Based Transformers are Scalable Learners and Thinkers Paper • 2507.02092 • Published Jul 2, 2025 • 69
EscapeBench: Towards Advancing Creative Intelligence of Language Model Agents Paper • 2412.13549 • Published Dec 18, 2024
Self-Aligned Reward: Towards Effective and Efficient Reasoners Paper • 2509.05489 • Published Sep 5, 2025 • 1
DRPG (Decompose, Retrieve, Plan, Generate): An Agentic Framework for Academic Rebuttal Paper • 2601.18081 • Published 7 days ago • 7
DRPG (Decompose, Retrieve, Plan, Generate): An Agentic Framework for Academic Rebuttal Paper • 2601.18081 • Published 7 days ago • 7
Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model Paper • 2601.15892 • Published 10 days ago • 53
Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model Paper • 2601.15892 • Published 10 days ago • 53
Selecting and Merging: Towards Adaptable and Scalable Named Entity Recognition with Large Language Models Paper • 2506.22813 • Published Jun 28, 2025 • 7
Time-R1: Towards Comprehensive Temporal Reasoning in LLMs Paper • 2505.13508 • Published May 16, 2025 • 16
ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind Paper • 2505.22961 • Published May 29, 2025 • 8
SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents Paper • 2505.23559 • Published May 29, 2025 • 11
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment Paper • 2502.16894 • Published Feb 24, 2025 • 32
Augmentation-Adapted Retriever Improves Generalization of Language Models as Generic Plug-In Paper • 2305.17331 • Published May 27, 2023 • 1
An In-depth Look at Gemini's Language Abilities Paper • 2312.11444 • Published Dec 18, 2023 • 1
MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models Paper • 2406.06046 • Published Jun 10, 2024 • 1
Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning Paper • 2410.14208 • Published Oct 18, 2024 • 3
Data-Efficient Pretraining with Group-Level Data Influence Modeling Paper • 2502.14709 • Published Feb 20, 2025 • 1
Fusion-in-T5: Unifying Document Ranking Signals for Improved Information Retrieval Paper • 2305.14685 • Published May 24, 2023 • 1
Augmentation-Adapted Retriever Improves Generalization of Language Models as Generic Plug-In Paper • 2305.17331 • Published May 27, 2023 • 1