Ax Yuma Aoki, Joon Park, Koh Takeuchi, Hisashi Kashima, Shinya Akimoto, Ryuichi Hashimoto, Takahiro Adachi, Takeshi Kishikawa, Takamitsu Sasaki 3/24/2026

Long-Term Outlier Prediction Through Outlier Score Modeling

Novel unsupervised method for long-term outlier prediction in time series data using outlier score modeling.

Ax Tasmay Pankaj Tibrewal, Pritish Saha, Ankit Meda, Kunal Singh, Pradeep Moturi 3/24/2026

Mixture of Chapters: Scaling Learnt Memory in Transformers

Learnable sparse memory banks with chapter-based routing for scaling knowledge storage in Transformers without prohibitive attention costs.

Ax Uzay Macar, Li Yang, Atticus Wang, Peter Wallich, Emmanuel Ameisen, Jack Lindsey 3/24/2026

Mechanisms of Introspective Awareness

Study of introspective awareness mechanisms in LLMs, investigating whether steering detection reflects genuine circuitry or shallow heuristics.

Ax Yawen Li, Tao Hu, Zhouhui Lian, Wan Tian, Yijie Peng, Huiming Zhang, Zhongyi Li 3/24/2026

Sharper Generalization Bounds for Transformer

Derives sharper generalization error bounds for Transformer architectures using offset Rademacher complexity across single and multi-head, multi-layer variants.