Ax H M Quamran Hasan, Housam Khalifa Bashier, Jiayi Dai, Mi-Young Kim, Randy Goebel 3/17/2026

Reason2Decide: Rationale-Driven Multi-Task Learning

Reason2Decide: two-stage training framework for clinical decision support LLMs to generate predictions with self-aligned explanations.

Ax Minhua Lin, Hanqing Lu, Zhan Shi, Bing He, Rui Mao, Zhiwei Zhang, Zongyu Wu, Xianfeng Tang, Hui Liu, Zhenwei Dai, Xiang Zhang, Suhang Wang, Benoit Dumoulin, Jian Pei 3/17/2026

Position: Agentic Evolution is the Path to Evolving LLMs

Position paper arguing agentic evolution via deployment-time adaptation is needed to close the train-deploy gap in LLM systems.

Ax Tony Feng, Junehyuk Jung, Sang-hyun Kim, Carlo Pagano, Sergei Gukov, Chiang-Chiang Tsai, David Woodruff, Adel Javanmard, Aryan Mokhtari, Dawsen Hwang, Yuri Chervonyi, Jonathan N. Lee, Garrett Bingham, Trieu H. Trinh, Vahab Mirrokni, Quoc V. Le, Thang Luong 3/17/2026

Aletheia tackles FirstProof autonomously

Aletheia mathematics research agent solved 6 of 10 FirstProof challenge problems autonomously using Gemini 3 Deep Think reasoning.

Ax Chen Bo Calvin Zhang, Christina Q. Knight, Nicholas Kruus, Jason Hausenloy, Pedro Medeiros, Nathaniel Li, Aiden Kim, Yury Orlovskiy, Coleman Breen, Bryce Cai, Jasper G\"otting, Andrew Bo Liu, Samira Nedungadi, Paula Rodriguez, Yannis Yiming He, Mohamed Shaaban, Zifan Wang, Seth Donoughe, Julian Michael 3/17/2026

LLM Novice Uplift on Dual-Use, In Silico Biology Tasks

Human study measuring whether LLM access improves novice performance on biology tasks versus internet-only baselines, with dual-use risk implications.

Ax Shiya Zhang, Yuhan Zhan, Ruixi Su, Ruihan Sun, Ziyi Song, Zhaohan Chen, Xiaofan Zhang 3/17/2026

EMPA: Evaluating Persona-Aligned Empathy as a Process

EMPA framework evaluates how well LLM dialogue agents maintain persona-aligned empathy across multi-turn conversations using process-oriented metrics.

Ax Ann Yuan, Asma Ghandeharioun, Carter Blum, Alicia Machado, Jessica Hoffmann, Daphne Ippolito, Martin Wattenberg, Lucas Dixon, Katja Filippova 3/17/2026

Think Before You Lie: How Reasoning Leads to Honesty

Study showing reasoning and deliberation increase honesty in LLM responses on moral trade-off scenarios.

Ax I. de Zarz\`a, J. de Curt\`o, Jordi Cabot, Pietro Manzoni, Carlos T. Calafate 3/17/2026

Semantic Invariance in Agentic AI

Research on semantic invariance property of LLM-based autonomous agents under input variations to ensure stable reasoning.

Ax Adrian de Wynter, Xun Wang, Qilong Gu, Si-Qing Chen 3/17/2026

On Meta-Prompting

Research on automated prompt generation and optimization techniques for improving LLM performance through meta-prompting approaches.

Ax Mosam Dabhi, Laszlo A. Jeni, Simon Lucey 3/17/2026

3D-LFM: Lifting Foundation Model

Deep learning approach for 3D structure and camera reconstruction from 2D landmarks using foundation models.

Ax Yan Zhuang, Qi Liu, Haoyang Bi, Zhenya Huang, Weizhe Huang, Jiatong Li, Junhao Yu, Zirui Liu, Zirui Hu, Yuting Hong, Zachary A. Pardos, Haiping Ma, Mengxiao Zhu, Shijin Wang, Enhong Chen 3/17/2026

Survey of Computerized Adaptive Testing: A Machine Learning Perspective

Survey examining computerized adaptive testing through machine learning lens, covering personalized assessment methods across domains.