Ax Jingyu Zhang, Tianjian Li, William Jurayj, Hongyuan Zhan, Benjamin Van Durme, Daniel Khashabi 12d ago

Many-Tier Instruction Hierarchy in LLM Agents

Instruction Hierarchy in LLM Agents arXiv paper addressing multi-source conflicting instructions in LLM systems. Examines privilege levels for safe instruction following.

Ax Maksim Anisimov (Imperial College London), Francesco Belardinelli (Imperial College London), Matthew Wicker (Imperial College London) 12d ago

SafeAdapt: Provably Safe Policy Updates in Deep Reinforcement Learning

SafeAdapt arXiv paper on provably safe policy updates in deep RL for non-stationary environments. Addresses safety preservation during policy changes.

Ax Stefan Andreas Baumann, Jannik Wiese, Tommaso Martorella, Mahdi M. Kalayeh, Bj\"orn Ommer 12d ago

Envisioning the Future, One Step at a Time

Method for predicting future scene evolution by modeling uncertainty and simulating trajectories rather than dense pixel-level changes.

Ax Shahab Rahimirad, Guven Gergerli, Lucia Romero, Angela Qian, Matthew Lyle Olson, Simon Stepputtis, Joseph Campbell 12d ago

Bayesian Social Deduction with Graph-Informed Language Models

Study evaluating LLM performance on social reasoning tasks in the Avalon game, testing inference capabilities and model distillation effects.

Ax Zhenfeng Lin, Haoji Hu, Ming Hao, Xuchao Zhang, Ryan Zhang, Junhao Li, Ze Li, Oleg Kulygin, Chetan Bansal, Hatay Tuna, Murali Chintalapati, Sheila Jiang, Salman Zafar, Angie Anderson 12d ago

ActionNex: A Virtual Outage Manager for Cloud Computing

Production agentic system for cloud outage management with real-time updates, knowledge distillation, and conditioned action recommendations.

Ax Jingyang Qiao, Weicheng Meng, Yu Cheng, Zhihang Lin, Zhizhong Zhang, Xin Tan, Jingyu Gong, Kun Shao, Yuan Xie 12d ago

Memory Intelligence Agent

Memory system for deep research agents enabling efficient evolution and reasoning through intelligent trajectory memory management.

Ax Wenxuan Liu, Zixuan Li, Long Bai, Chunmao Zhang, Fenghui Zhang, Zhuo Chen, Wei Li, Yuxin Zuo, Fei Wang, Bingbing Xu, Xuhui Jiang, Jin Zhang, Xiaolong Jin, Jiafeng Guo, Tat-Seng Chua, Xueqi Cheng 12d ago

Towards Knowledgeable Deep Research: Framework and Benchmark

Framework and benchmark for deep research agents using structured knowledge alongside unstructured web content for comprehensive reports.

Ax Monishwaran Maheswaran, Leon Lakhani, Zhongzhu Zhou, Shijia Yang, Junxiong Wang, Coleman Hooper, Yuezhou Hu, Rishabh Tiwari, Jue Wang, Harman Singh, Qingyang Wu, Yuqing Jian, Ce Zhang, Kurt Keutzer, Tri Dao, Xiaoxia Wu, Ben Athiwaratkun, James Zou, Chenfeng Xu 12d ago

Squeeze Evolve: Unified Multi-Model Orchestration for Verifier-Free Evolution

Multi-model orchestration framework for verifier-free evolutionary inference balancing diversity and computational efficiency.

Ax Zixuan Hu, Yongxian Wei, Li Shen, Zhenyi Wang, Baoyuan Wu, Chun Yuan, Dacheng Tao 12d ago

Task-Distributionally Robust Data-Free Meta-Learning

Data-free meta-learning from pre-trained models without original training data, analyzing robustness and failure modes.