Ax Shripad Vilasrao Deshmukh, Will Schwarzer, Scott Niekum 3/23/2026

Evaluation-Aware Reinforcement Learning

Evaluation-Aware RL framework considers policy evaluation accuracy during training to reduce variance and bias.

Ax Moshe Kimhi, Nimrod Shabtay, Raja Giryes, Chaim Baskin, Eli Schwartz 3/23/2026

CARES: Context-Aware Resolution Selector for VLMs

CARES lightweight module selects appropriate image resolution for vision-language models to reduce token overhead and latency.

Ax Mikael Lundb\"ack, Erik Wallin, Carola H\"aggstr\"om, Mattias Nystr\"om, Andreas Gr\"onlund, Mats Richardson, Petrus J\"onsson, William Arnvik, Lucas Hedstr\"om, Arvid F\"alldin, Martin Servin 3/23/2026

FORWARD: Dataset of a forwarder operating in rough terrain

FORWARD dataset of heavy machinery operating in rough terrain with multimodal sensor data from Swedish forestry.

Ax George Pu, Michael S. Lee, Udari Madhushani Sehwag, David J. Lee, Bryan Zhu, Yash Maurya, Mohit Raghavendra, Yuan Xue, Samuel Marc Denton 3/23/2026

LHAW: Controllable Underspecification for Long-Horizon Tasks

Framework for managing ambiguity in long-horizon workflow agents. Task-agnostic approach for curating and measuring impact of underspecified instructions on agent execution.

Ax Andrew Seohwan Yu, Mohsen Hariri, Kunio Nakamura, Mingrui Yang, Xiaojuan Li, Vipin Chaudhary 3/23/2026

Medical Image Spatial Grounding with Semantic Sampling

Study of vision language models for spatial grounding in 3D medical imaging. Examines VLM performance across imaging modalities and slice directions.

Ax Yihao Zhang, Zeming Wei, Xiaokun Luan, Chengcan Wu, Zhixin Zhang, Jiangrong Wu, Haolin Wu, Huanran Chen, Jun Sun, Meng Sun 3/23/2026

ClawWorm: Self-Propagating Attacks Across LLM Agent Ecosystems

Security research on ClawWorm, self-propagating attacks across multi-agent LLM ecosystems. First study of attack propagation in interconnected agent systems like OpenClaw.

Ax Chun-Jui Wang, Jian-Ting Guo, Hung Guei, Chung-Chin Shih, Ti-Rong Wu, I-Chen Wu 3/23/2026

Evaluating Game Difficulty in Tetris Block Puzzle

Research using Stochastic Gumbel AlphaZero to evaluate game difficulty in Tetris Block Puzzle variants. Applies game-playing AI as evaluation metric.