Ax Daiwei Chen, Zhoutong Fu, Chengming Jiang, Haichao Zhang, Ran Zhou, Tan Wang, Chunnan Yao, Guoyao Li, Rui Cai, Yihan Cao, Ruijie Jiang, Fedor Borisyuk, Jianqiang Shen, Jingwei Wu, Ramya Korlakai Vinayak 4/3/2026

Grounded Token Initialization for New Vocabulary in LMs for Generative Recommendation

Analysis of token initialization strategies for new vocabulary in language models used for generative recommendation systems.

Ax Dmitrii Krylov, Armin Karamzade, Roy Fox 4/3/2026

Moonwalk: Inverse-Forward Differentiation

Moonwalk: inverse-forward differentiation technique addressing backpropagation's memory requirements for training deeper networks.

Ax Hrayr Harutyunyan, Rafayel Darbinyan, Samvel Karapetyan, Hrant Khachatrian 4/3/2026

In-context Learning in Presence of Spurious Correlations

Study of in-context learning in LLMs with spurious correlations, examining transformer robustness to spurious features in classification.

Ax Alexis Chevalier, Soumya Ghosh, Urvi Awasthi, James Watkins, Julia Bieniewska, Nichita Mitrea, Olga Kotova, Kirill Shkura, Andrew Noble, Michael Steinbaugh, Vijay Sadashivaiah, George Dasoulas, Julien Delile, Christoph Meier, Leonid Zhukov, Iya Khalil, Srayanta Mukherjee, Judith Mueller 4/3/2026

TEDDY: A Family Of Foundation Models For Understanding Single Cell Biology

Foundation model for single-cell RNA sequencing analysis in disease biology and drug discovery applications.

Ax Abigail J. Hayes, Tobias Schumacher, Markus Strohmaier 4/3/2026

What Do Temporal Graph Learning Models Learn?

Analysis of what temporal graph learning models learn, addressing reliability concerns in benchmark evaluation protocols.

Ax Yuen Chen, Yulun Wu, Samuel Sharpe, Igor Melnyk, Nam H. Nguyen, Furong Huang, C. Bayan Bruss, Rizal Fathony 4/3/2026

Bridging the Divide: End-to-End Sequence-Graph Learning

Research on jointly learning sequential and relational data for prediction tasks involving entities, integrating sequence and graph modeling.

Ax Yifan Zhang, Zixiang Chen, Yifeng Liu, Zhen Qin, Huizhuo Yuan, Kangping Xu, Yang Yuan, Quanquan Gu, Andrew Chi-Chih Yao 4/3/2026

Group Representational Position Encoding

Unified positional encoding framework using group actions for transformers, unifying rotational and additive approaches.