Ax Chenhe Dong, Shaowei Yao, Pengkun Jiao, Jianhui Yang, Yiming Jin, Zerui Huang, Xiaojiang Zhou, Dan Ou, Haihong Tang, Bo Zheng 3/11/2026

TaoSR1: The Thinking Model for E-commerce Relevance Search

TaoSR1 deploys LLMs directly for e-commerce query-product relevance prediction using chain-of-thought reasoning with error mitigation.

Ax Yen-Ju Lu, Yashesh Gaur, Wei Zhou, Benjamin Muller, Jesus Villalba, Najim Dehak, Luke Zettlemoyer, Gargi Ghosh, Mike Lewis, Srinivasan Iyer, Duc Le 3/11/2026

Latent Speech-Text Transformer

Latent Speech-Text Transformer improves compute efficiency of auto-regressive speech-text models through latent representation compression.

Ax Haolin Yang, Yuxing Long, Zhuoyuan Yu, Zihan Yang, Minghan Wang, Jiapeng Xu, Yihan Wang, Ziyan Yu, Wenzhe Cai, Lei Kang, Hao Dong 3/11/2026

NavSpace: How Navigation Agents Follow Spatial Intelligence Instructions

NavSpace benchmark with 1,228 trajectory-instruction pairs evaluates spatial reasoning and perception capabilities of embodied navigation agents.

Ax Marcus Hoerger, Muhammad Sudrajat, Hanna Kurniawati 3/11/2026

Vectorized Online POMDP Planning

Vectorized parallel algorithm for POMDP planning under partial observability for autonomous robots leveraging modern hardware parallelization.

Ax Heisei Yonezawa, Ansei Yonezawa, Itsuro Kajiwara 3/11/2026

Continual uncertainty learning

Deep reinforcement learning approach for robust control of mechanical systems handling multiple sources of uncertainty.

Ax Yifei Zhang, Xu Yang, Xiao Yang, Bowen Xian, Qizheng Li, Shikai Fang, Jingyuan Li, Jian Wang, Mingrui Xu, Weiqing Liu, Jiang Bian 3/11/2026

Reasoning as Gradient: Scaling MLE Agents Beyond Tree Search

GOME: MLE agent framework replacing tree search with gradient-based optimization for machine learning engineering tasks using LLM reasoning.

Ax Siddharth Boppana, Annabel Ma, Max Loeffler, Raphael Sarfati, Eric Bigelow, Atticus Geiger, Owen Lewis, Jack Merullo 3/11/2026

Reasoning Theater: Disentangling Model Beliefs from Chain-of-Thought

Analysis of performative chain-of-thought in reasoning models, showing hidden beliefs diverge from generated reasoning tokens at task-specific difficulty levels.