Ax Peter Holderrieth, Ezra Erives 3/19/2026

An Introduction to Flow Matching and Diffusion Models

Tutorial on diffusion and flow-based generative models covering mathematical foundations, ODEs, SDEs, and core algorithms for image, video, and multi-modal generation.

Ax Yuxiang Ji, Ziyu Ma, Yong Wang, Guanhua Chen, Xiangxiang Chu, Liaoni Wu 3/19/2026

Tree Search for LLM Agent Reinforcement Learning

Tree-based group relative policy optimization for LLM agents addressing sparse supervision in multi-turn tasks.

Ax Nathan Breslow, Aayush Mishra, Mahler Revsine, Michael C. Schatz, Anqi Liu, Daniel Khashabi 3/19/2026

Genomic Next-Token Predictors are In-Context Learners

Demonstrates in-context learning emerges organically in genomic sequence models trained with next-token prediction on DNA sequences.

Ax Leo Elmecker-Plakolm, Pierre Fasterling, Philip Sosnin, Calvin Tsay, Matthew Wicker 3/19/2026

Provably Safe Model Updates

Develops methods for provably safe ML model updates preventing catastrophic forgetting and alignment drift in dynamic environments.

Ax Vedant Shah, Johan Obando-Ceron, Vineet Jain, Brian Bartoldson, Bhavya Kailkhura, Sarthak Mittal, Glen Berseth, Pablo Samuel Castro, Yoshua Bengio, Nikolay Malkin, Moksh Jain, Siddarth Venkatraman, Aaron Courville 3/19/2026

A Comedy of Estimators: On KL Regularization in RL Training of LLMs

Analyzes KL regularization estimators in RL training of LLMs, comparing bias-variance tradeoffs of different approximation methods.

Ax Teng Xiao, Yige Yuan, Hamish Ivison, Huaisheng Zhu, Faeze Brahman, Nathan Lambert, Pradeep Dasigi, Noah A. Smith, Hannaneh Hajishirzi 3/19/2026

Meta-Reinforcement Learning with Self-Reflection for Agentic Search

MR-Search proposes meta-reinforcement learning with self-reflection for agentic search, enabling agents to adapt strategies across episodes and improve in-context exploration.

Ax Jingpu Cheng, Qianxiao Li, Ting Lin, Zuowei Shen 3/19/2026

Deep learning and the rate of approximation by flows

Theoretical investigation of deep residual networks' approximation capacity in continuous dynamical systems, quantifying minimal time-horizons for diffeomorphism approximation.

Ax Jackson Trager, Alireza S. Ziabari, Elnaz Rahmati, Aida Mostafazadeh Davani, Preni Golazizian, Farzan Karimi-Malekabadi, Ali Omrani, Zhihe Li, Brendan Kennedy, Georgios Chochlakis, Nils Karl Reimer, Melissa Reyes, Kelsey Cheng, Mellow Wei, Christina Merrifield, Arta Khosravi, Evans Alvarez, Morteza Dehghani 3/19/2026

The Moral Foundations Reddit Corpus

Reddit corpus annotated with moral sentiment and framing for NLP tasks related to moral language detection and analysis.