Ax Varun Ursekar, Apaar Shanker, Veronica Chatrath, Yuan (Emily) Xue, Sam Denton 2/27/2026

VeRO: An Evaluation Harness for Agents to Optimize Agents

VeRO: Evaluation framework for assessing coding agents that optimize other agents through iterative edit-execute-evaluate cycles.

Ax Zhan Su, Fengran Mo, Jinghan Zhang, Yuchen Hui, Jia Ao Sun, Bingbing Wen, Jian-Yun Nie 2/27/2026

Towards Dynamic Dense Retrieval with Routing Strategy

Dynamic dense retrieval with a routing strategy for adapting information retrieval models across domains without full retraining.

Ax Zhanhui Zhou, Lingjie Chen, Hanghang Tong, Dawn Song 2/27/2026

dLLM: Simple Diffusion Language Modeling

dLLM: Unified framework for diffusion language models. Standardizes components across research implementations for reproducibility.

Ax Ben Xue, Dan Liu, Lixiang Wang, Mingjie Sun, Peng Wang, Pengfei Zhang, Shaoyun Shi, Tianyu Xu, Yunhao Sha, Zhiqiang Liu, Bo Kong, Bo Wang, Hang Yang, Jieting Xue, Junhao Wang, Shengyu Wang, Shuping Hui, Wencai Ye, Xiao Lin, Yongzhi Li, Yuhang Chen, Zhihui Yin, Quan Chen, Shiyang Wen, Wenjin Wu, Han Li, Guorui Zhou, Changcheng Li, Peng Jiang 2/27/2026

Generative Recommendation for Large-Scale Advertising

GR4AD: Production generative recommendation system for large-scale advertising using LLMs. Architecture, learning, and serving optimization.

Ax Yujie Zhao, Boqin Yuan, Junbo Huang, Haocheng Yuan, Zhongming Yu, Haozhou Xu, Lanxiang Hu, Abhilash Shankarampeta, Zimeng Huang, Wentao Ni, Yuandong Tian, Jishen Zhao 2/27/2026

AMA-Bench: Evaluating Long-Horizon Memory for Agentic Applications

AMA-Bench: Benchmark evaluating long-horizon memory for LLM-based agentic applications. Addresses gap between dialogue benchmarks and real agent scenarios.

Ax Yinan Zheng, Tianyi Tan, Bin Huang, Enguang Liu, Ruiming Liang, Jianlin Zhang, Jianwei Cui, Guang Chen, Kun Ma, Hangjun Ye, Long Chen, Ya-Qin Zhang, Xianyuan Zhan, Jingjing Liu 2/27/2026

Unleashing the Potential of Diffusion Models for End-to-End Autonomous Driving

Research on diffusion models for decision-making in end-to-end autonomous driving in real-world settings.

Ax Xiaoxi Li, Wenxiang Jiao, Jiarui Jin, Shijian Wang, Guanting Dong, Jiajie Jin, Hao Wang, Yinuo Wang, Ji-Rong Wen, Yuan Lu, Zhicheng Dou 2/27/2026

OmniGAIA: Towards Native Omni-Modal AI Agents

OmniGAIA benchmark for evaluating omni-modal AI agents with unified vision, audio, and language perception capabilities.

Ax Bangrui Xu, Qihang Yao, Zirui Tang, Xuanhe Zhou, Yeye He, Shihan Yu, Qianqian Xu, Bin Wang, Guoliang Li, Conghui He, Fan Wu 2/27/2026

MoDora: Tree-Based Semi-Structured Document Analysis System

MoDora: Tree-based system for analyzing semi-structured documents with mixed content types, enabling question answering over complex layouts.

Ax Yang Yang, Yuzhu Long, Han Fang, Zhaoyun Chen, Zhonghui Li, Weiming Zhang, Guoping Guo 2/27/2026

Q-Tag: Watermarking Quantum Circuit Generative Models

Watermarking technique for protecting quantum circuits as intellectual property in quantum cloud computing platforms.

Ax Shuang Liang (Shanghai Jiao Tong University), Yang Hua (Queen's University Belfast), Linshan Jiang (National University of Singapore), Peishen Yan (Shanghai Jiao Tong University), Tao Song (Shanghai Jiao Tong University), Bin Yao (Shanghai Jiao Tong University), Haibing Guan (Shanghai Jiao Tong University) 2/27/2026

SettleFL: Trustless and Scalable Reward Settlement Protocol for Federated Learning on Permissionless Blockchains (Extended version)

SettleFL protocol for decentralized reward settlement in federated learning using blockchain, addressing scalability via off-chain batching.