Ax Max Rudolph, Nathan Lichtle, Sobhan Mohammadpour, Alexandre Bayen, J. Zico Kolter, Amy Zhang, Gabriele Farina, Eugene Vinitsky, Samuel Sokota 3/18/2026

Reevaluating Policy Gradient Methods for Imperfect-Information Games

Reevaluation of policy gradient methods (PPO) for imperfect-information games, questioning necessity of complex DRL algorithms based on fictitious play and CFR.

Ax Donato Crisostomi, Alessandro Zirilli, Antonio Andrea Gargiulo, Maria Sofia Bucarelli, Simone Scardapane, Fabrizio Silvestri, Iacopo Masi, Emanuele Rodol\`a 3/18/2026

MASS: MoErging through Adaptive Subspace Selection

MASS: adaptive subspace selection method for model merging, combining multiple fine-tuned models without training overhead while matching separate endpoint accuracy.

Ax Zhe Ye, Zhengxu Yan, Jingxuan He, Timothe Kasriel, Kaiyu Yang, Dawn Song 3/18/2026

VERINA: Benchmarking Verifiable Code Generation

VERINA benchmark for evaluating LLM code generation with jointly generated specifications and proofs, addressing correctness verification challenges.

Ax Yidi Wang, Ziyue Qiao, Jiawei Gu, Xubin Zheng, Pengyang Wang, Xiaobing Pei, Xiao Luo 3/18/2026

Out-of-Distribution Graph Models Merging

Graph model merging technique for combining GNN models pre-trained on different domains with distribution discrepancy to create generalized models.

Ax Weihua Du, Hailei Gong, Zhan Ling, Kang Liu, Lingfeng Shen, Xuesong Yao, Yufei Xu, Dingyuan Shi, Yiming Yang, Jiecao Chen 3/18/2026

Generalizable End-to-End Tool-Use RL with Synthetic CodeGym

Tool-augmented LLM agents trained with synthetic code environments via RL to improve generalization on tool-use tasks, addressing brittleness with new tools and unseen workflows.

Ax Piotr Komorowski, Elena Golimblevskaia, Reduan Achtibat, Thomas Wiegand, Sebastian Lapuschkin, Wojciech Samek 3/18/2026

Attribution-Guided Decoding

Attribution-Guided Decoding uses interpretability to improve LLM instruction-following and factual accuracy.

Ax Bahrul Ilmi Nasution, Floor Eijkelboom, Mark Elliot, Richard Allmendinger, Christian A. Naesseth 3/18/2026

Flow Matching for Tabular Data Synthesis

Empirical comparison of flow matching variants with diffusion models for privacy-preserving tabular data synthesis.

Ax Yunni Qu (The University of North Carolina at Chapel Hill), Dzung Dinh (The University of North Carolina at Chapel Hill), Grant King (University of Michigan), Whitney Ringwald (University of Minnisota Twin Cities), Bing Cai Kok (The University of North Carolina at Chapel Hill), Kathleen Gates (The University of North Carolina at Chapel Hill), Aidan Wright (University of Michigan), Junier Oliva (The University of North Carolina at Chapel Hill) 3/18/2026

Relaxed Efficient Acquisition of Context and Temporal Features

Active feature acquisition method for biomedical applications optimizing measurement selection under temporal and cost constraints.

Ax Vincent Zhihao Zheng, \'Etienne Marcotte, Arjun Ashok, Andrew Robert Williams, Lijun Sun, Alexandre Drouin, Valentina Zantedeschi 3/18/2026

Overcoming the Modality Gap in Context-Aided Forecasting

Research addressing multimodal model underperformance in context-aided forecasting via improved context quality assessment.