Ax Stephan Rabanser, Sayash Kapoor, Peter Kirgis, Kangheng Liu, Saiteja Utpala, Arvind Narayanan 2/19/2026

Towards a Science of AI Agent Reliability

Research on AI agent reliability evaluation beyond accuracy metrics. Analyzes consistency, perturbation robustness, and operational failures in deployed agents.

Ax Yonghoon Lee, Meshi Bashari, Edgar Dobriban, Yaniv Romano 2/19/2026

Synthetic-Powered Multiple Testing with FDR Control

Research on FDR-controlled hypothesis testing using synthetic data from generative models. Statistical inference with auxiliary data.

Ax Rosie Zhao, Tian Qin, David Alvarez-Melis, Sham Kakade, Naomi Saphra 2/19/2026

Random Scaling of Emergent Capabilities

Research on emergent capabilities in language models via scaling. Analyzes breakthrough performance vs metric thresholding.

Ax Robin Staab, Nikola Jovanović, Kimberly Mai, Prakhar Ganesh, Martin Vechev, Ferdinando Fioretto, Matthew Jagielski 2/19/2026

SoK: Data Minimization in Machine Learning

Systematization of knowledge on data minimization principles in ML with focus on GDPR/CPRA regulatory compliance.

Ax Filip Szatkowski, Patryk Będkowski, Alessio Devoto, Jan Dubiński, Pasquale Minervini, Mikołaj Piórczyński, Simone Scardapane, Bartosz Wójcik 2/19/2026

Universal Properties of Activation Sparsity in Modern Large Language Models

Research characterizing universal activation sparsity properties in modern LLMs with implications for efficiency and interpretability.

Ax Prajit Bhaskaran, Tom Viering 2/19/2026

Transformers can do Bayesian Clustering

Transformer-based model for Bayesian clustering on datasets with missing values. Unsupervised learning extension of prior-data fitting.

Ax S Nanayakkara 2/19/2026

Adaptive Aggregation with Two Gains in QFL

Adaptive aggregation method for quantum federated learning handling client quality variation, teleportation fidelity, and device instability.