Ax Cursor Reseach, :, Aaron Chan, Ahmed Shalaby, Alexander Wettig, Aman Sanger, Andrew Zhai, Anurag Ajay, Ashvin Nair, Charlie Snell, Chen Lu, Chen Shen, Emily Jia, Federico Cassano, Hanpeng Liu, Haoyu Chen, Henry Wildermuth, Jacob Jackson, Janet Li, Jediah Katz, Jiajun Yao, Joey Hejna, Josh Warner, Julius Vering, Kevin Frans, Lee Danilek, Less Wright, Lujing Cen, Luke Melas-Kyriazi, Michael Truell, Michiel de Jong, Naman Jain, Nate Schmidt, Nathan Wang, Niklas Muennighoff, Oleg Rybkin, Paul Loh, Phillip Kravtsov, Rishabh Yadav, Sahil Shah, Sam Kottler, Alexander M Rush, Shengtong Zhang, Shomil Jain, Sriram Sankar, Stefan Heule, Stuart H. Sul, Sualeh Asif, Victor Rong, Wanqi Zhu, William Lin, Yuchen Wu, Yuri Volkov, Yury Zemlyanskiy, Zack Holbrook, Zhiyuan Zhang 3/26/2026

Composer 2 Technical Report

Composer 2 model specialized for agentic software engineering with long-term planning and coding abilities trained via continued pretraining and reinforcement learning.

Ax Dmitrii Krylov, Armin Karamzade, Roy Fox 3/26/2026

Moonwalk: Inverse-Forward Differentiation

Inverse-forward differentiation method to reduce memory requirements for backpropagation by avoiding activation storage.

Ax Kefan Song, Amir Moeini, Peng Wang, Lei Gong, Rohan Chandra, Shangtong Zhang, Yanjun Qi 3/26/2026

Reward Is Enough: LLMs Are In-Context Reinforcement Learners

Research paper demonstrating LLMs perform in-context reinforcement learning during inference. ICRL prompting framework enables inference-time self-improvement.

Ax Divyat Mahajan, Sachin Goyal, Badr Youbi Idrissi, Mohammad Pezeshki, Ioannis Mitliagkas, David Lopez-Paz, Kartik Ahuja 3/26/2026

Beyond Multi-Token Prediction: Pretraining LLMs with Future Summaries

Proposes future summary pretraining for LLMs as alternative to next-token prediction, addressing limitations in long-horizon reasoning and planning tasks.

Ax Sebasti\'an Andr\'es Cajas Ord\'o\~nez, Luis Fernando Torres Torres, Mackenzie J. Meni, Carlos Andr\'es Duran Paredes, Eric Arazo, Cristian Bosch, Ricardo Simon Carbajo, Yuan Lai, Leo Anthony Celi 3/26/2026

Uncertainty Makes It Stable: Curiosity-Driven Quantized Mixture-of-Experts

Proposes curiosity-driven quantized Mixture-of-Experts framework using Bayesian uncertainty for deploying neural networks on resource-constrained devices.

Ax Ziwei Liu, Borui Kang, Hangjie Yuan, Zixiang Zhao, Wei Li, Yifan Zhu, Tao Feng 3/26/2026

Continual GUI Agents

Introduces continual learning task for GUI agents that must adapt to shifting domains and resolutions over time, identifying failure modes in existing agent methods.

Ax Bjarni Haukur Bjarnason, Andr\'e Silva, Martin Monperrus 3/26/2026

On Randomness in Agentic Evals

Study of variance in agentic system evaluations using 60,000 trajectories on SWE-Bench-Verified, showing pass@1 estimates vary significantly across runs, questioning single-run reliability assumptions.