Ax Mariana A. Fazio, Manel Martinez-Ramon, Salvador Sosa Güitron, Marcus Babzien, Mikhail Fedurin, Junjie Li, Mark Palmer, Sandra S. Biedron 3/16/2026

Unsupervised anomaly detection in MeV ultrafast electron diffraction

Applies unsupervised anomaly detection to ultrafast electron diffraction data to identify beam instabilities in materials science experiments.

Ax Sibylle Marcotte, Gabriel Peyré, Rémi Gribonval 3/16/2026

Intrinsic training dynamics of deep neural networks

Theoretical study of implicit bias in deep neural network training showing gradient flow induces learning of lower-dimensional parameter structures.

Ax Giorgos Nikolaou, Tommaso Mencattini, Donato Crisostomi, Andrea Santilli, Yannis Panagakis, Emanuele Rodolà 3/16/2026

Language Models are Injective and Hence Invertible

Mathematical proof that transformer language models are injective, enabling exact input recovery from representations despite nonlinear components.

Ax Kemou Li, Qizhou Wang, Yue Wang, Fengpeng Li, Jun Liu, Bo Han, Jiantao Zhou 3/16/2026

LLM Unlearning with LLM Beliefs

Method for unlearning harmful content from LLMs by analyzing belief redistribution in probability space, avoiding unwanted side effects of gradient ascent.

Ax Yichuan Deng, Zhao Song, Kaijun Yuan, Tianyi Zhou 3/16/2026

Why Softmax Attention Outperforms Linear Attention

Comparative analysis of softmax vs linear attention mechanisms in transformer architectures, examining computational efficiency tradeoffs.

Ax Jianwei Li, Jung-Eun Kim 3/16/2026

Superficial Safety Alignment Hypothesis

Analyzes the brittleness of LLM safety alignment mechanisms, proposing the superficial safety alignment hypothesis to explain why standard alignment approaches are vulnerable.