Ax Mingyuan Zhang, Yue Bai, Huan Wang, Yizhou Wang, Qihua Dong, Yitian Zhang, Yun Fu 3/17/2026

Boosting Large Language Models with Mask Fine-Tuning

Mask Fine-Tuning (MFT) introduces a novel LLM fine-tuning method that improves performance by selectively masking model components without updating weights.

Ax Juntao Zhao, Qi Lu, Wei Jia, Borui Wan, Lei Zuo, Junda Feng, Jianyu Jiang, Yangrui Chen, Shuaishuai Cao, Jialing He, Kaihua Jiang, Yuanzhe Hu, Shibiao Nong, Yanghua Peng, Haibin Lin, Chuan Wu 3/17/2026

MegaScale-Data: Scaling Dataloader for Multisource Large Foundation Model Training

MegaScale-Data addresses computational challenges in training large foundation models from multiple data sources by optimizing dataloader distribution across parallel ranks.

Ax Syeda Nahida Akter, Shrimai Prabhumoye, Matvei Novikov, Seungju Han, Ying Lin, Evelina Bakhturina, Eric Nyberg, Yejin Choi, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro 3/17/2026

Nemotron-CrossThink: Scaling Self-Learning beyond Math Reasoning

Nemotron-CrossThink extends RL-based self-learning from math reasoning to broader domains using verifiable reward structures and diverse tasks.

Ax Yige Yuan, Teng Xiao, Shuchang Tao, Xue Wang, Jinyang Gao, Bolin Ding, Bingbing Xu 3/17/2026

Incentivizing Strong Reasoning from Weak Supervision

Method for improving LLM reasoning without expensive RL or high-quality demonstrations using weak supervision and incentive signals.

Ax Yige Yuan, Teng Xiao, Li Yunfan, Bingbing Xu, Shuchang Tao, Yunqi Qiu, Huawei Shen, Xueqi Cheng 3/17/2026

Inference-time Alignment in Continuous Space

Inference-time alignment method for LLMs that searches in continuous response space using reward models for improved exploration.

Ax Jonathan Wenger, Beau Coker, Juraj Marusic, John P. Cunningham 3/17/2026

Variational Deep Learning via Implicit Regularization

Analysis of implicit regularization in overparametrized deep neural networks and improved out-of-distribution generalization via variational methods.

Ax Yajie Zhou, Jiajun Ruan, Eric S. Wang, Sadjad Fouladi, Francis Y. Yan, Kevin Hsieh, Zaoxing Liu 3/17/2026

NetArena: Dynamic Benchmarks for AI Agents in Network Automation

Dynamic benchmark framework (NetArena) for evaluating AI agents in network automation with production-level complexity and reduced contamination risk.

Ax Yuan-An Xiao, Pengfei Gao, Chao Peng, Yingfei Xiong 3/17/2026

Reducing Cost of LLM Agents with Trajectory Reduction

Method for reducing LLM agent inference costs through trajectory reduction. Addresses token cost efficiency in multi-turn agent systems for software engineering.

Ax R\u{a}zvan-Andrei Mati\c{s}an, Vincent Tao Hu, Grigory Bartosh, Bj\"orn Ommer, Cees G. M. Snoek, Max Welling, Jan-Willem van de Meent, Mohammad Mahdi Derakhshani, Floor Eijkelboom 3/17/2026

Purrception: Variational Flow Matching for Vector-Quantized Image Generation

Variational flow matching approach for vector-quantized image generation combining categorical supervision with continuous transport dynamics.