Ax Barak Gahtan, Alex M. Bronstein 4/3/2026

Coupled Query-Key Dynamics for Attention

arXiv paper on coupled query-key dynamics for scaled dot-product attention. Improves language modeling perplexity by 6-7% on WikiText-103.

Ax Kang-Sin Choi 4/3/2026

Learn by Surprise, Commit by Proof

LSCP: Self-gated post-training framework for autonomous knowledge acquisition using self-generated Q&A chains and adaptive learning rates based on model conviction.

Ax Nathan Benjamin, A. Liam Fitzpatrick, Wei Li, Jesse Thaler 4/3/2026

Descending into the Modular Bootstrap

Machine learning optimization applied to solve modular bootstrap equations for exploring 2D conformal field theories.

Ax Neo Christopher Chung, Maxim Laletin 4/3/2026

Regularizing Attention Scores with Bootstrapping

Research on regularizing attention scores in vision transformers using bootstrapping to improve interpretability and reduce noisy attention maps.