Ax Ben Chen, Siyuan Wang, Yufei Ma, Zihan Liang, Xuxin Zhang, Yue Lv, Ying Yang, Huangyu Dai, Lingtao Mao, Tong Zhao, Zhipeng Qian, Xinyu Sun, Zhixin Zhai, Yang Zhao, Bochao Liu, Jingshan Lv, Xiao Liang, Hui Kong, Jing Chen, Han Li, Chenyi Lei, Wenwu Ou, Kun Gai 3/26/2026

OneSearch-V2: The Latent Reasoning Enhanced Self-distillation Generative Search Framework

OneSearch-V2 improves generative retrieval for search systems with latent reasoning and self-distillation. Industrial-scale framework.

Ax Zichuan Lin, Feiyu Liu, Yijun Yang, Jiafei Lyu, Yiming Gao, Yicheng Liu, Zhicong Lu, Yangbin Yu, Mingyu Yang, Junyou Li, Deheng Ye, Jie Jiang 3/26/2026

UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience

Mobile GUI agent using rejection fine-tuning to learn from failed trajectories and improve credit assignment for long-horizon tasks.

Ax Reva Schwartz, Carina Westling, Morgan Briggs, Marzieh Fadaee, Isar Nejadgholi, Matthew Holmes, Fariza Rashid, Maya Carlyle, Afaf Ta\"ik, Kyra Wilson, Peter Douglas, Theodora Skeadas, Gabriella Waters, Rumman Chowdhury, Thiago Lacerda 3/26/2026

CIRCLE: A Framework for Evaluating AI from a Real-World Lens

CIRCLE framework for evaluating AI systems across six lifecycle stages, bridging gap between benchmarks and real-world deployment outcomes.

Ax Vishnu Narayanan Anilkumar, Abhijith Sreesylesh Babu, Trieu Hai Vo, Mohankrishna Kolla, Alexander Cuneo 3/26/2026

Relationship-Aware Safety Unlearning for Multimodal LLMs

Framework for relationship-aware safety unlearning in multimodal LLMs addressing relational safety failures without collateral damage.

Ax Dmitrii Krylov, Armin Karamzade, Roy Fox 3/26/2026

Moonwalk: Inverse-Forward Differentiation

Moonwalk: Inverse-forward differentiation technique addressing backpropagation's memory limitation for training deeper neural networks.

Ax Nina Corvelo Benz, Stratis Tsirtsis, Eleni Straitouri, Ivi Chatzi, Ander Artola Velasco, Suhas Thejaswi, Manuel Gomez-Rodriguez 3/26/2026

Evaluation of Large Language Models via Coupled Token Generation

Evaluation framework for large language models addressing randomization in coupled token generation with causal modeling approach.