Ax Shervin Ghasemlou 2/13/2026

Dopamine: Brain Modes, Not Brains

Parameter-efficient fine-tuning method viewing adaptation as neuromodulation-inspired mode selection and rescaling of pretrained computations.

Ax Cl\'audio Correia, Alberto E. A. Ferreira, Lucas Martins, Miguel P. Bento, Sofia Guerreiro, Ricardo Ribeiro Pereira, Ana Sofia Gomes, Jacopo Bono, Hugo Ferreira, Pedro Bizarro 2/13/2026

MUSE: Multi-Tenant Model Serving With Seamless Model Updates

Multi-tenant model serving system handling seamless model updates with dynamic decision threshold management.

Ax Sebastian Zeng, Andreas Petersson, Wolfgang Bock 2/13/2026

Latent-Variable Learning of SPDEs via Wiener Chaos

Method for learning stochastic partial differential equations from spatiotemporal observations using latent-variable formulation and deep learning.

Ax Zixi Zhang, Zhiwen Mo, Yiren Zhao, Robert Mullins 2/13/2026

Deep Kernel Fusion for Transformers

DeepFusionKernel optimizes agentic LLM inference by fusing SwiGLU blocks, reducing memory bandwidth bottlenecks by 9-13% on A100/H100.

Ax Elif Akata, Konstantinos Voudouris, Vincent Fortuin, Eric Schulz 2/13/2026

In-Context Function Learning in Large Language Models

Analyzes in-context learning in LLMs through Gaussian Process lens, studying function learning from few demonstrations at inference.

Ax Roberto Molinaro, Niall Siegenheim, Henry Martin, Mark Frey, Niels Poulsen, Philipp Seitz, Marvin Vincent Gabler 2/13/2026

Universal Diffusion-Based Probabilistic Downscaling

Diffusion-based framework downscales low-resolution weather forecasts to probabilistic high-resolution predictions without model-specific tuning.

Ax Marco Bressan, Nataly Brukhim, Nicolo Cesa-Bianchi, Emmanuel Esposito, Yishay Mansour, Shay Moran, Maximilian Thiessen 2/13/2026

Learning Conditional Averages

Introduces PAC learning framework for predicting conditional averages over arbitrary neighborhoods rather than target concepts.

Ax Akhiad Bercovich, Nir Ailon, Vladimir Anisimov, Tomer Asida, Nave Assaf, Mohammad Dabbah, Ido Galil, Amnon Geifman, Yonatan Geifman, Izhak Golan, Roi Koren, Itay Levy, Zach Moshe, Pavlo Molchanov, Najeeb Nabwani, Mostofa Patwari, Omri Puny, Tomer Ronen, Itamar Schen, Elad Segal, Ido Shahaf, Oren Tropp, Ran Zilberstein, Ran El-Yaniv 2/13/2026

Extending Puzzle for Mixture-of-Experts Reasoning Models with Application to GPT-OSS Acceleration

Extends Puzzle NAS framework to optimize gpt-oss-120B into 88B model using MoE pruning and attention optimization for inference acceleration.

Ax Jihao Andreas Lin, Sebastian Ament, Louis C. Tiao, David Eriksson, Maximilian Balandat, Eytan Bakshy 2/13/2026

Empirical Gaussian Processes

Empirical GPs: principled framework for learning kernel functions automatically rather than handcrafting from standard functions.

Ax Yujun Zhou, Yue Huang, Han Bao, Kehan Guo, Zhenwen Liang, Pin-Yu Chen, Tian Gao, Werner Geyer, Nuno Moniz, Nitesh V Chawla, Xiangliang Zhang 2/13/2026

Capability-Oriented Training Induced Alignment Risk

Investigates whether LLMs trained with RL spontaneously exploit reward function loopholes without malicious intent, examining alignment risks.

Ax Zhaoxin Wang, Jiaming Liang, Fengbin Zhu, Weixiang Zhao, Junfeng Fang, Jiayi Ji, Handing Wang, Tat-Seng Chua 2/13/2026

SafeNeuron: Neuron-Level Safety Alignment for Large Language Models

SafeNeuron provides neuron-level safety alignment mechanism for LLMs by targeting safety-critical parameters to prevent alignment bypass attacks.

Ax Muhammad bin Javaid, Hasham Hussain, Ashima Khanna, Berke Kisin, Jonathan Pirnay, Alexander Mitsos, Dominik G. Grimm, Martin Grohe 2/13/2026

Amortized Molecular Optimization via Group Relative Policy Optimization

Group Relative Policy Optimization enables amortized molecular design learning transferable to unseen molecules rather than instance-specific optimization.