Ax Peiyuan Zhang, Matthew Noto, Wenxuan Tan, Chengquan Jiang, Will Lin, Wei Zhou, Hao Zhang 3/10/2026

Attn-QAT: 4-Bit Attention With Quantization-Aware Training

First systematic 4-bit quantization-aware training study for attention mechanisms enabling end-to-end FP4 computation on emerging GPUs.

Ax Zeyneb N. Kaya, Nick Rui 3/10/2026

Test-Time Meta-Adaptation with Self-Synthesis

MASS: meta-learning framework enabling LLMs to self-adapt at test time by generating synthetic training data for improved downstream performance.

Ax Levy Chaves, Chao Zhou, Rebekka Burkholz, Eduardo Valle, Sandra Avila 3/10/2026

Bridging Domains through Subspace-Aware Model Merging

Research on merging task-specific models into consolidated ones, analyzing parameter competition and domain generalization effects.

Ax Jiajun Xu, Jiageng Mao, Ang Qi, Weiduo Yuan, Alexander Romanus, Helen Xia, Vitor Campagnolo Guizilini, Yue Wang 3/10/2026

FuzzingRL: Reinforcement Fuzz-Testing for Revealing VLM Failures

FuzzingRL approach using reinforcement learning for fuzz testing Vision Language Models to automatically generate failure-inducing queries.

Ax Laha Ale, Ning Zhang, Scott A. King, Pingzhi Fan 3/10/2026

Switchable Activation Networks

Switchable Activation Networks that dynamically select activation functions for computational efficiency in LLMs and vision-action models during inference.

Ax Martino Ciaperoni, Collin Leiber, Aristides Gionis, Heikki Mannila 3/10/2026

Khatri-Rao Clustering for Data Summarization

Khatri-Rao Clustering approach for data summarization using centroid-based clustering with reduced redundancy in prototypes.

Ax Zhengguo Li, Chaobing Zheng, Wei Wang 3/10/2026

Correlation Analysis of Generative Models

Theoretical analysis of diffusion models and flow matching using unified representation via linear equations. Discusses correlation between noisy data and predictions.