Ax Hanchen Li, Runyuan He, Qizheng Zhang, Changxiu Ji, Qiuyang Mang, Xiaokun Chen, Lakshya A Agrawal, Wei-Liang Liao, Eric Yang, Alvin Cheung, James Zou, Kunle Olukotun, Ion Stoica, Joseph E. Gonzalez 27d ago

Combee: Scaling Prompt Learning for Self-Improving Language Model Agents

Combee framework for scaling prompt learning in LLM agents enabling efficient self-improvement through system prompt optimization across parallel runs.

Ax Anushree Sinha, Srivaths Ranganathan, Debanshu Das, Abhishek Dharmaratnakar 27d ago

Beyond Fluency: Toward Reliable Trajectories in Agentic IR

Position paper analyzing failure modes in agentic IR systems where early errors cascade despite linguistic fluency, causing misalignment between reasoning and execution.

Ax Edward Hirst, Henrique N. S\'a Earp, Tom\'as S. R. Silva 27d ago

Minimising Willmore Energy via Neural Flow

Neural Willmore flow approach using neural architectures and PINN-style loss functions to minimize Willmore energy on 2D surfaces.

Ax Saad Alqithami 27d ago

Soft Tournament Equilibrium

Soft Tournament Equilibrium framework for evaluating LLM-based agents in non-transitive competitive settings using set-valued rankings instead of linear orderings.

Ax Saurav Jha, Maryam Hashemzadeh, Ali Saheb Pasand, Ali Parviz, Min-Joong Lee, Boris Knyazev 27d ago

REAM: Merging Improves Pruning of Experts in LLMs

REAM method for pruning mixture-of-experts in large language models by merging experts, addressing memory challenges in deployment of billion-parameter models.

Ax Maohao Shen, Kaiwen Zha, Zexue He, Zhang-Wei Hong, Siru Ouyang, J. Jon Ryu, Prasanna Sattigeri, Suhas Diggavi, Gregory Wornell 27d ago

Decocted Experience Improves Test-Time Inference in LLM Agents

Proposes methods to improve LLM agent performance at test-time without parameter updates by optimizing inference-time computation for complex reasoning tasks.

Ax Martin Kristjansen (Aalborg University), Kim Guldstrand Larsen (Aalborg University), Marius Miku\v{c}ionis (Aalborg University), Christian Schilling (Aalborg University) 27d ago

Safe and Near-Optimal Gate Control: A Case Study from the Danish West Coast

Optimization framework for automated water gate control balancing safety and performance requirements for Danish fjord management.

Ax Chenxi Wang, Zhuoyun Yu, Xin Xie, Wuguannan Yao, Runnan Fang, Shuofei Qiao, Kexin Cao, Guozhou Zheng, Xiang Qi, Peng Zhang, Shumin Deng 27d ago

SkillX: Automatically Constructing Skill Knowledge Bases for Agents

SkillX framework automatically constructs reusable skill knowledge bases for LLM agents, enabling efficient learning and generalization across tasks.