Ax Reva Schwartz, Carina Westling, Morgan Briggs, Marzieh Fadaee, Isar Nejadgholi, Matthew Holmes, Fariza Rashid, Maya Carlyle, Afaf Ta\"ik, Kyra Wilson, Peter Douglas, Theodora Skeadas, Gabriella Waters, Rumman Chowdhury, Thiago Lacerda 3/20/2026

CIRCLE: A Framework for Evaluating AI from a Real-World Lens

CIRCLE lifecycle framework bridging gap between AI model metrics and real-world deployment outcomes through six-stage evaluation.

Ax Jakub Grudzien Kuba, Benjamin Kurt Miller, Sergey Levine, Pieter Abbeel 3/20/2026

Offline Materials Optimization with CliqueFlowmer

CliqueFlowmer approach for computational materials discovery using neural networks for offline optimization of material properties.

Ax Yulin Li, Tengyao Tu, Li Ding, Junjie Wang, Huiling Zhen, Yixin Chen, Yong Li, Zhuotao Tian 3/20/2026

Efficient Reasoning with Balanced Thinking

Method to reduce overthinking and underthinking in Large Reasoning Models through balanced token allocation for efficient inference.

Ax Ruijiang Gao, Steven Chong Xiao 3/20/2026

Nonstandard Errors in AI Agents

Study of nonstandard errors in AI coding agents deploying 150 Claude agents on market analysis tasks, showing agent-to-agent variation in analytical choices.

Ax Jillian Fisher, Shangbin Feng, Robert Aron, Thomas Richardson, Yejin Choi, Daniel W. Fisher, Jennifer Pan, Yulia Tsvetkov, Katharina Reinecke 3/20/2026

Biased AI can Influence Political Decision-Making

Experimental study measuring how partisan biases in LLMs influence human political opinions and decision-making.

Ax Alexandru Apostu, Silviu Gheorghe, Andrei H\^iji, Nicolae Cleju, Andrei P\u{a}tra\c{s}cu, Cristian Rusu, Radu Ionescu, Paul Irofti 3/20/2026

Detecting and Mitigating DDoS Attacks with AI: A Survey

Survey of AI-based detection and mitigation methods for DDoS attacks with taxonomy of attack categories.

Ax Tzeh Yuan Neoh, Jannik Peters, Nicholas Teh 3/20/2026

Online Fair Division with Additional Information

Online fair allocation of indivisible goods with sequential arrival, analyzing fairness guarantees with access to future information.

Ax Xiaoyuan Zhu, Yaowen Ye, Tianyi Qiu, Hanlin Zhu, Sijun Tan, Ajraf Mannan, Jonathan Michala, Raluca Ada Popa, Willie Neiswanger 3/20/2026

Auditing Black-Box LLM APIs with a Rank-Based Uniformity Test

Rank-based uniformity test for detecting undisclosed substitutions or quantization of black-box LLM APIs without access to model weights.

Ax Antonio Ferrara, Francesco Cozzi, Alan Perotti, Andr\'e Panisson, Francesco Bonchi 3/20/2026

Size-adaptive Hypothesis Testing for Fairness

Statistical methods for fairness testing in algorithmic decision-making systems accounting for sampling error and demographic subgroups.