Ax Hengjie Cao, Mengyi Chen, Yifeng Yang, Fang Dong, Ruijun Huang, Anrui Chen, Jixian Zhou, Mingzhi Dong, Yujiang Wang, Dongsheng Li, Wenyi Fang, Yuanyi Lin, Fan Wu, Li Shang 2/16/2026

Dispelling the Curse of Singularities in Neural Network Optimizations

Analysis of optimization instability in deep networks caused by singularities in parameter and representation space.

Ax Dongyeop Woo, Marta Skreta, Seonghyun Park, Kirill Neklyudov, Sungsoo Ahn 2/16/2026

Riemannian MeanFlow

Flow models for efficient generative modeling on Riemannian manifolds with reduced inference evaluations.

Ax Tiwei Bie, Maosong Cao, Xiang Cao, Bingsen Chen, Fuyuan Chen, Kun Chen, Lun Du, Daozhuo Feng, Haibo Feng, Mingliang Gong, Zhuocheng Gong, Yanmei Gu, Jian Guan, Kaiyuan Guan, Hongliang He, Zenan Huang, Juyong Jiang, Zhonghui Jiang, Zhenzhong Lan, Chengxi Li, Jianguo Li, Zehuan Li, Huabin Liu, Lin Liu, Guoshan Lu, Yuan Lu, Yuxin Ma, Xingyu Mou, Zhenxuan Pan, Kaida Qiu, Yuji Ren, Jianfeng Tan, Yiding Tian, Zian Wang, Lanning Wei, Tao Wu, Yipeng Xing, Wentao Ye, Liangyu Zha, Tianze Zhang, Xiaolu Zhang, Junbo Zhao, Da Zheng, Hao Zhong, Wanli Zhong, Jun Zhou, Junlin Zhou, Liwang Zhu, Muzhi Zhu, Yihong Zhuang 2/16/2026

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

LLaDA2.1 text diffusion improvement combining token-to-token and mask-to-token editing for faster generation.

Ax Sedigheh Eslami, Maksim Gaiduk, Markus Krimmel, Louis Milliken, Bo Wang, Denis Bykov 2/16/2026

Diffusion-Pretrained Dense and Contextual Embeddings

Multilingual embedding models using contrastive learning on diffusion-pretrained backbone for web-scale retrieval.

Ax Yuanyong Luo, Jing Huang, Yu Cheng, Ziwei Yu, Kaihua Tang, Xinda Ma, Xin Wang, Anping Tong, Guipeng Hu, Yun Xu, Mehran Taghian, Peng Wu, Guanglin Li, Yunke Peng, Tianchi Hu, Minqi Chen, Michael Bi Mi, Hu Liu, Xiping Zhou, Junsong Wang, Qiang Lin, Heng Liao 2/16/2026

HiFloat4 Format for Language Model Inference

HiFloat4 block floating-point format for efficient LLM inference, achieving 4.5 bits per value with three-level scaling.

Ax Xin Wen, Will Wei Sun, Yichen Zhang 2/16/2026

Online Tensor Inference

Online tensor inference method for real-time processing of sequentially arriving high-dimensional data with statistical capabilities.

Ax Anton Baumann, Rui Li, Marcus Klasson, Santeri Mentu, Shyamgopal Karthik, Zeynep Akata, Arno Solin, Martin Trapp 2/16/2026

Post-hoc Probabilistic Vision-Language Models

Post-hoc method adding probabilistic uncertainty to vision-language models like CLIP to better handle domain shifts in downstream tasks.

Ax Dheeraj Vattikonda, Santhoshi Ravichandran, Emiliano Penaloza, Hadi Nekoei, Megh Thakkar, Thibault Le Sellier de Chezelles, Nicolas Gontier, Miguel Mu\~noz-M\'armol, Sahar Omidi Shayegan, Stefania Raimondo, Xue Liu, Alexandre Drouin, Laurent Charlin, Alexandre Pich\'e, Alexandre Lacoste, Massimo Caccia 2/16/2026

How to Train Your LLM Web Agent: A Statistical Diagnosis

Statistical diagnosis and training methods for LLM-based web agents addressing multi-step interactions and reducing post-training compute costs.

Ax Sarah McClure, Evyatar Cohen, Alex Shpiner, Mark Silberstein, Sylvia Ratnasamy, Scott Shenker, Isaac Keslassy 2/16/2026

Load Balancing for AI Training Workloads

Technical analysis of load-balancing designs for AI training workloads, comparing approaches and establishing optimality bounds for distributed training.

Ax Gernot Fiala, Markus Plass, Robert Harb, Peter Regitnig, Kristijan Skok, Wael Al Zoughbi, Carmen Zerner, Paul Torke, Michaela Kargl, Heimo M\"uller, Tomas Brazdil, Matej Gallo, Jaroslav Kub\'in, Roman Stoklasa, Rudolf Nenutil, Norman Zerbe, Andreas Holzinger, Petr Holub 2/16/2026

From slides to AI-ready maps: Standardized multi-layer tissue maps as metadata for artificial intelligence in digital pathology

Standardized multi-layer tissue maps as metadata format for whole slide image AI algorithm development in digital pathology.

Ax Gabriela Pinto, Palash Goyal, Mihir Parmar, Yiwen Song, Souradip Chakraborty, Zifeng Wang, Jinsung Yoon, Hamid Palangi, Tomas Pfister 2/16/2026

HEART: Emotionally-Driven Test-Time Scaling of Language Models

HEART framework uses emotional cues during test-time scaling to improve LLM problem-solving by preventing repetitive thought patterns through alternating critical and encouraging tones.

Ax Leonardo Christov-Moore, Arthur Juliani, Alex Kiefer, Joel Lehman, Nicco Reggente, B. Scot Rousse, Adam Safron, Nicol\'as Hinrichs, Daniel Polani, Antonio Damasio 2/16/2026

The Conditions of Physical Embodiment Enable Generalization and Care

Research on physical embodiment constraints for AI agents in eldercare and disaster response scenarios, addressing generalization and care provision under uncertainty.

Ax Anastasiia Bakhmach, Paul Dufoss\'e, Andrea Vaglio, Florence Monville, Laurent Greillier, Fabrice Barl\'esi, S\'ebastien Benzekry 2/16/2026

ROOFS: RObust biOmarker Feature Selection

Feature selection methodology for biomarker discovery in high-dimensional biomedical data with low sample sizes.