Ax Dimitri Staufer, Kirsten Morehouse, David Hartmann, Bettina Berendt 3/13/2026

Human-Centred LLM Privacy Audits: Findings and Frictions

LMP2: browser-based self-audit tool for inspecting LLM associations with individuals, with user study findings on privacy and model behavior.

Ax William Brach, Tomas Bedej, Jacob Nielsen, Jacob Pichna, Juraj Bedej, Eemeli Saarensilta, Julie Dupouy, Gianluca Barmina, Andrea Blasi N\'u\~nez, Peter Schneider-Kamp, Kristian Ko\v{s}\v{t}\'al, Michal Ries, Lukas Galke Poech 3/13/2026

SommBench: Assessing Sommelier Expertise of Language Models

SommBench: multilingual benchmark assessing LLM capabilities in sommelier expertise, evaluating cultural knowledge beyond linguistic encoding.

Ax Zhoujun Cheng, Yutao Xie, Yuxiao Qu, Amrith Setlur, Shibo Hao, Varad Pimpalkhute, Tongtong Liang, Feng Yao, Zhengzhong Liu, Eric Xing, Virginia Smith, Ruslan Salakhutdinov, Zhiting Hu, Taylor Killian, Aviral Kumar 3/13/2026

IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL

IsoCompute Playbook: scaling laws for optimal compute allocation in LLM reinforcement learning post-training across rollouts, batches, and update steps.

Ax {\L}ukasz Borchmann, Jordy Van Landeghem, Micha{\l} Turski, Shreyansh Padarha, Ryan Othniel Kearns, Adam Mahdi, Niels Rogge, Cl\'ementine Fourrier, Siwei Han, Huaxiu Yao, Artemis Llabr\'es, Yiming Xu, Dimosthenis Karatzas, Hao Zhang, Anupam Datta 3/13/2026

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

MADQA: benchmark with 2,250 questions over 800 PDFs evaluating whether multimodal agents use strategic reasoning or stochastic search in document-intensive workflows.

Ax Ryo Kuroiwa, J. Christopher Beck 3/13/2026

Domain-Independent Dynamic Programming

Domain-independent dynamic programming paradigm decoupling modeling from solving combinatorial optimization problems with problem-agnostic approach.

Ax Sayan Nag, K J Joseph, Koustava Goswami, Vlad I Morariu, Balaji Vasan Srinivasan 3/13/2026

Agentic Design Review System

Multi-agent system orchestrating collaborative design review where agents analyze graphics holistically with novel exemplar selection approach.

Ax Carlos N\'u\~nez-Molina, Vicen\c{c} G\'omez, Hector Geffner 3/13/2026

From Next Token Prediction to (STRIPS) World Models

Study on whether next-token prediction yields usable world models, introducing STRIPS Transformer for symbolic planning from action traces.

Ax Weixun Wang, XiaoXiao Xu, Wanhe An, Fangwen Dai, Wei Gao, Yancheng He, Ju Huang, Qiang Ji, Hanqi Jin, Xiaoyang Li, Yang Li, Zhongwen Li, Shirong Lin, Jiashun Liu, Zenan Liu, Tao Luo, Dilxat Muhtar, Yuanbin Qu, Jiaqiang Shi, Qinghui Sun, Yingshui Tan, Hao Tang, Runze Wang, Yi Wang, Zhaoguo Wang, Yanan Wu, Shaopan Xiong, Binchen Xu, Xander Xu, Yuchi Xu, Qipeng Zhang, Xixia Zhang, Haizhou Zhao, Jie Zhao, Shuaibing Zhao, Baihui Zheng, Jianhui Zheng, Suhang Zheng, Yanni Zhu, Mengze Cai, Kerui Cao, Xitong Chen, Yue Dai, Lifan Du, Tao Feng, Tao He, Jin Hu, Yijie Hu, Ziyu Jiang, Cheng Li, Xiang Li, Jing Liang, Xin Lin, Chonghuan Liu, ZhenDong Liu, Zhiqiang Lv, Haodong Mi, Yanhu Mo, Junjia Ni, Shixin Pei, Jingyu Shen, XiaoShuai Song, Cecilia Wang, Chaofan Wang, Kangyu Wang, Pei Wang, Tao Wang, Wei Wang, Ke Xiao, Mingyu Xu, Tiange Xu, Nan Ya, Siran Yang, Jianan Ye, Yaxing Zang, Duo Zhang, Junbo Zhang, Boren Zheng, Wanxi Deng, Ling Pan, Lin Qu, Wenbo Su, Jiamang Wang, Wei Wang, Hu Wei, Minggang Wu, Cheng Yu, Bing Zhao, Zhicheng Zheng, Bo Zheng 3/13/2026

Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

Agentic Learning Ecosystem (ALE) infrastructure for end-to-end agent development, enabling LLMs to operate in real-world environments with iterative refinement.

Ax Junnan Dong, Chuang Zhou, Zheng Yuan, Yifei Yu, Qiufeng Wang, Yinghui Li, Siyu An, Di Yin, Xing Sun, Feiyue Huang 3/13/2026

Deep Tabular Research via Continual Experience-Driven Execution

Agentic framework for multi-step reasoning over complex tabular data with hierarchical headers using closed-loop decision-making.

Ax Xin An, Jingyi Cai, Xiangyang Chen, Huayao Liu, Peiting Liu, Peng Wang, Bei Yang, Xiuwen Zhu, Yongfan Chen, Yan Gao, Yuan Gao, Baoyu Hou, Guangzheng Hu, Shuzhao Li, Weixu Qiao, Weidong Ren, Yanan Wang, Boyu Yang, Fan Yang, Jiangtao Zhang, Lixin Zhang, Lin Qu, Hu Wei, Xiaoxiao Xu, Bing Zhao 3/13/2026

Logics-Parsing-Omni Technical Report

Omni Parsing: Framework for multimodal parsing across documents, images, audio-visual with unified taxonomy and hierarchical levels.