Ax Maurits Kaptein, Vassilis-Javed Khan, Andriy Podstavnychy 3/18/2026

Runtime Governance for AI Agents: Policies on Paths

Framework for runtime governance of LLM-based AI agents, balancing task completion with legal and reputational costs through execution-path monitoring.

Ax Ming Li, Xirui Li, Tianyi Zhou 3/18/2026

When AI Navigates the Fog of War

Analyzes AI reasoning about geopolitical conflicts using temporally grounded case study of 2026 Middle East conflict after model training cutoffs.

Ax Jian Yang, Wei Zhang, Shawn Guo, Zhengmao Ye, Lin Jing, Shark Liu, Yizhi Li, Jiajun Wu, Cening Liu, X. Ma, Yuyang Song, Siwei Wu, Yuwen Li, L. Liao, T. Zheng, Ziling Huang, Zelong Huang, Che Liu, Yan Xing, Renyuan Li, Qingsong Cai, Hanxu Yan, Siyue Wang, Shikai Li, Jason Klein Liu, An Huang, Yongsheng Kang, Jinxing Zhang, Chuan Hao, Haowen Wang, Weicheng Gu, Ran Tao, Mingjie Tang, Peihao Wu, Jianzhou Wang, Xianglong Liu, Weifeng Lv, Bryan Dai 3/18/2026

IQuest-Coder-V1 Technical Report

Code LLM series (7B-40B) using code-flow multi-stage training paradigm to capture dynamic software logic evolution.

Ax Ruijiang Gao, Steven Chong Xiao 3/18/2026

Nonstandard Errors in AI Agents

Study of reproducibility in AI coding agents, showing agent-to-agent variation produces nonstandard errors in empirical results.

Ax Yongyuan Liang, Shijie Zhou, Yu Gu, Hao Tan, Gang Wu, Franck Dernoncourt, Jihyung Kil, Ryan A. Rossi, Ruiyi Zhang 3/18/2026

Anticipatory Planning for Multimodal AI Agents

Two-stage RL framework training multimodal agents for anticipatory reasoning and long-term planning in multi-step tasks.

Ax Rui Ge, Yichao Fu, Yuyang Qian, Junda Su, Yiming Zhao, Peng Zhao, Hao Zhang 3/18/2026

Internalizing Agency from Reflective Experience

Method for training LLM agents to leverage rich environment feedback through reflective experience and post-training, improving long-horizon planning.

Ax Kyle Dumont, Nicholas Herbert, Hayder Tirmazi, Shrikanth Upadhayaya 3/18/2026

DRCY: Agentic Hardware Design Reviews

AI agent system for hardware design reviews using LLMs to verify semantic correctness of component connections against datasheets.

Ax Yulin Peng, Haowen Hou, Xinxin Zhu, Ying Tiffany He, F. Richard Yu 3/18/2026

SEMAG: Self-Evolutionary Multi-Agent Code Generation

SEMAG: self-evolutionary multi-agent code generation framework that decomposes programming tasks into planning, coding, debugging stages with adaptive workflow selection.