Yeva Gabrielyan (Akian College of Science and Engineering, American University of Armenia, Yerevan, Armenia), Varduhi Yeghiazaryan (Akian College of Science and Engineering, American University of Armenia, Yerevan, Armenia), Irina Voiculescu (Department of Computer Science, University of Oxford, Oxford, UK) 2/13/2026

PLESS: Pseudo-Label Enhancement with Spreading Scribbles for Weakly Supervised Segmentation

Weakly supervised segmentation method improving pseudo-label quality for scribble-based medical image annotations.

MiniCPM Team, Wenhao An, Yingfa Chen, Yewei Fang, Jiayi Li, Xin Li, Yaohui Li, Yishan Li, Yuxuan Li, Biyuan Lin, Chuan Liu, Hezi Liu, Siyuan Liu, Hongya Lyu, Yinxu Pan, Shixin Ren, Xingyu Shen, Zhou Su, Haojun Sun, Yangang Sun, Zhen Leng Thai, Xin Tian, Rui Wang, Xiaorong Wang, Yudong Wang, Bo Wu, Xiaoyue Xu, Dong Xu, Shuaikang Xue, Jiawei Yang, Bowen Zhang, Jinqian Zhang, Letian Zhang, Shengnan Zhang, Xinyu Zhang, Xinyuan Zhang, Zhu Zhang, Hengyu Zhao, Jiacheng Zhao, Jie Zhou, Zihan Zhou, Shuo Wang, Chaojun Xiao, Xu Han, Zhiyuan Liu, Maosong Sun 2/13/2026

MiniCPM-SALA: Hybridizing Sparse and Linear Attention for Efficient Long-Context Modeling

9B-parameter LLM architecture combining sparse and linear attention mechanisms for efficient long-context processing and reduced computational costs.

Mikko Honkala, Dani Korpi, Elias Raninen, Janne M. J. Huttunen 2/13/2026

EqDeepRx: Learning a Scalable MIMO Receiver

Deep learning approach for MIMO receiver design combining linear signal processing with ML blocks for improved scalability and explainability.

Tom Kempton, Julia Rozanova, Parameswaran Kamalaruban, Maeve Madigan, Karolina Wresilo, Yoann L. Launay, David Sutton, Stuart Burrell 2/13/2026

DMAP: A Distribution Map for Text

DMAP method for analyzing text using LLM next-token probability distributions, improving on perplexity metrics for context-dependent interpretation.

Alon Beck, Yohai Bar Sinai, Noam Levi 2/13/2026

The Implicit Bias of Logit Regularization

Analysis of implicit bias from logit regularization in linear classifiers. Theoretical ML research, limited practical developer relevance.

John Muchovej, Amanda Royka, Shane Lee, Julian Jara-Ettinger 2/13/2026

GPT-4o Lacks Core Features of Theory of Mind

Evaluation framework testing whether GPT-4o possesses theory of mind via causal mental state models. LLM capability research and evaluation.

Krish Agarwal, Zhuoming Chen, Cheng Luo, Yongqi Chen, Haizhong Zheng, Xun Huang, Atri Rudra, Beidi Chen 2/13/2026

MonarchRT: Efficient Attention for Real-Time Video Generation

Efficient attention mechanism for real-time video generation using diffusion transformers. Developer tool for reducing computational bottlenecks.

Leon Liangyu Chen, Haoyu Ma, Zhipeng Fan, Ziqi Huang, Animesh Sinha, Xiaoliang Dai, Jialiang Wang, Zecheng He, Jianwei Yang, Chunyuan Li, Junzhe Sun, Chu Wang, Serena Yeung-Levy, Felix Juefei-Xu 2/13/2026

UniT: Unified Multimodal Chain-of-Thought Test-time Scaling

Unified multimodal model with test-time scaling via chain-of-thought reasoning for complex tasks. LLM application combining vision and language.