Ax Weixun Wang, XiaoXiao Xu, Wanhe An, Fangwen Dai, Wei Gao, Yancheng He, Ju Huang, Qiang Ji, Hanqi Jin, Xiaoyang Li, Yang Li, Zhongwen Li, Shirong Lin, Jiashun Liu, Zenan Liu, Tao Luo, Dilxat Muhtar, Yuanbin Qu, Jiaqiang Shi, Qinghui Sun, Yingshui Tan, Hao Tang, Runze Wang, Yi Wang, Zhaoguo Wang, Yanan Wu, Shaopan Xiong, Binchen Xu, Xander Xu, Yuchi Xu, Qipeng Zhang, Xixia Zhang, Haizhou Zhao, Jie Zhao, Shuaibing Zhao, Baihui Zheng, Jianhui Zheng, Suhang Zheng, Yanni Zhu, Mengze Cai, Kerui Cao, Xitong Chen, Yue Dai, Lifan Du, Tao Feng, Tao He, Jin Hu, Yijie Hu, Ziyu Jiang, Cheng Li, Xiang Li, Jing Liang, Xin Lin, Chonghuan Liu, ZhenDong Liu, Zhiqiang Lv, Haodong Mi, Yanhu Mo, Junjia Ni, Shixin Pei, Jingyu Shen, XiaoShuai Song, Cecilia Wang, Chaofan Wang, Kangyu Wang, Pei Wang, Tao Wang, Wei Wang, Ke Xiao, Mingyu Xu, Tiange Xu, Nan Ya, Siran Yang, Jianan Ye, Yaxing Zang, Duo Zhang, Junbo Zhang, Boren Zheng, Wanxi Deng, Ling Pan, Lin Qu, Wenbo Su, Jiamang Wang, Wei Wang, Hu Wei, Minggang Wu, Cheng Yu, Bing Zhao, Zhicheng Zheng, Bo Zheng 3/13/2026

Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

Agentic Learning Ecosystem (ALE) infrastructure for end-to-end agent development, enabling LLMs to operate in real-world environments with iterative refinement.

Ax Junnan Dong, Chuang Zhou, Zheng Yuan, Yifei Yu, Qiufeng Wang, Yinghui Li, Siyu An, Di Yin, Xing Sun, Feiyue Huang 3/13/2026

Deep Tabular Research via Continual Experience-Driven Execution

Agentic framework for multi-step reasoning over complex tabular data with hierarchical headers using closed-loop decision-making.

Ax Xin An, Jingyi Cai, Xiangyang Chen, Huayao Liu, Peiting Liu, Peng Wang, Bei Yang, Xiuwen Zhu, Yongfan Chen, Yan Gao, Yuan Gao, Baoyu Hou, Guangzheng Hu, Shuzhao Li, Weixu Qiao, Weidong Ren, Yanan Wang, Boyu Yang, Fan Yang, Jiangtao Zhang, Lixin Zhang, Lin Qu, Hu Wei, Xiaoxiao Xu, Bing Zhao 3/13/2026

Logics-Parsing-Omni Technical Report

Omni Parsing: Framework for multimodal parsing across documents, images, and audio-visual content, with a unified taxonomy and hierarchical levels.

Ax Benjamin A. T. Grahama, Lauren Brown, Georgios Chochlakis, Morteza Dehghani, Raquel Delerme, Brittany Friedman, Ellie Graeden, Preni Golazizian, Rajat Hebbar, Parsa Hejabi, Aditya Kommineni, Mayagüez Salinas, Michael Sierra-Arévalo, Jackson Trager, Nicholas Weller, Shrikanth Narayanan 3/13/2026

Community-Informed AI Models for Police Accountability

AI models for analyzing police bodycam footage to improve accountability and government transparency.

Ax Cornelius V. Braun, Robert T. Lange, Marc Toussaint 3/13/2026

Stein Variational Evolution Strategies

Stein Variational Evolution Strategies: Gradient-free variant of SVGD for sampling from unnormalized distributions.

Ax Jun Liu, Zhenglun Kong, Peiyan Dong, Changdi Yang, Tianqi Li, Hao Tang, Geng Yuan, Wei Niu, Wenbin Zhang, Pu Zhao, Xue Lin, Dong Huang, Yanzhi Wang 3/13/2026

Structured Agent Distillation for Large Language Model

Structured Agent Distillation compresses LLM-based agents into smaller student models while preserving reasoning and action consistency.

Ax Chengyu Shen, Zhen Hao Wong, Runming He, Hao Liang, Meiyi Qiang, Zimo Meng, Zhengyang Zhao, Bohan Zeng, Zhengzhou Zhu, Bin Cui, Wentao Zhang 3/13/2026

Let's Verify Math Questions Step by Step

Framework for verifying correctness of math questions used in LLM training. Focuses on QA data quality beyond answer correctness.

Ax Kai Li, Can Shen, Yile Liu, Jirui Han, Kelong Zheng, Xuechao Zou, Lionel Z. Wang, Shun Zhang, Xingjian Du, Hanjun Luo, Yingbin Jin, Xinxin Xing, Ziyang Ma, Yue Liu, Yifan Zhang, Junfeng Fang, Kun Wang, Yibo Yan, Gelei Deng, Haoyang Li, Yiming Li, Xiaobin Zhuang, Tianlong Chen, Qingsong Wen, Tianwei Zhang, Yang Liu, Haibo Hu, Zhizheng Wu, Xiaolin Hu, Eng-Siong Chng, Wenyuan Xu, XiaoFeng Wang, Wei Dong, Xinfeng Li 3/13/2026

AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models

AudioTrust benchmark evaluating trustworthiness of audio LLMs. Reveals vulnerabilities from non-semantic acoustic cues like timbre and accent.

Ax Sirui Lu, Zhijing Jin, Terry Jingchen Zhang, Pavel Kos, J. Ignacio Cirac, Bernhard Schölkopf 3/13/2026

Can Theoretical Physics Research Benefit from Language Agents?

Investigates LLM limitations in theoretical physics. Identifies gaps in physical intuition and constraint satisfaction beyond prompting improvements.

Ax Nadav Kunievsky, James A. Evans 3/13/2026

Measuring Intent Comprehension in LLMs

Study measuring how well LLMs comprehend user intent beyond surface-level text matching. Analyzes gap between token prediction and actual user goals.

Ax Zhejun Zhao, Yuchen Li, Alley Liu, Yuehu Dong, Xiaolong Wei, Lixue Zheng, Pingsheng Liu, Dongdong Shen, Long Xia, Jiashu Zhao, Dawei Yin 3/13/2026

TURA: Tool-Augmented Unified Retrieval Agent for AI Search

TURA is a tool-augmented retrieval agent for conversational AI search that handles real-time data and structured queries beyond the limits of traditional RAG.