Ax Jiace Zhu, Wentao Chen, Qi Fan, Zhixing Ren, Junying Wu, Xing Zhe Chai, Chotiwit Rungrueangwutthinon, Yehan Ma, An Zou 3/4/2026

CUDABench: Benchmarking LLMs for Text-to-CUDA Generation

CUDABench benchmark for evaluating LLM text-to-CUDA code generation with performance assessment metrics for GPU kernels.

Ax Laziz U. Abdullaev, Noelle Y. L. Wong, Ryan T. Z. Lee, Shiqi Jiang, Khoi N. M. Nguyen, Tan M. Nguyen 3/4/2026

Concept Heterogeneity-aware Representation Steering

Method for steering LLM behavior via representation manipulation that accounts for heterogeneous concept encoding across embedding spaces.

Ax Andy Yang, Pascal Bergstr\"a{\ss}er, Georg Zetzsche, David Chiang, Anthony W. Lin 3/4/2026

Length Generalization Bounds for Transformers

Theoretical analysis of length generalization bounds for transformers on CRASP language class, addressing model generalization guarantees.

Ax Shadab Ahamed, Eshed Gal, Simon Ghyselincks, Md Shahriar Rahim Siddiqui, Moshe Eliasof, Eldad Haber 3/4/2026

Preconditioned Score and Flow Matching

Preconditioning techniques for flow matching and score-based diffusion to improve optimization by handling ill-conditioned covariance matrices.

Ax Stefan Ankirchner, Maximilian Philipp Thiel 3/4/2026

Learning Optimal Search Strategies

Learning optimal threshold-based stopping rules for parking problems with unknown Poisson arrival processes via jump intensity estimation.

Ax Satish Chandran, Nicolas Roque dos Santos, Yunshu Wu, Greg Ver Steeg, Evangelos Papalexakis 3/4/2026

Spectral Regularization for Diffusion Models

Introduces loss-level spectral regularization using Fourier and wavelet-domain losses to improve diffusion model training without architecture changes.

Ax Zhaoyu Zhu, Shuhan Zhang, Rui Gao, Shuang Li 3/4/2026

Wasserstein Proximal Policy Gradient

Derives Wasserstein Proximal Policy Gradient using optimal transport geometry for continuous-action entropy-regularized RL without policy log-density evaluation.

Ax Yunxiang Li, Mark Schmidt, Reza Babanezhad, Sharan Vaswani 3/4/2026

Towards Parameter-Free Temporal Difference Learning

Develops parameter-free temporal difference learning for RL that avoids requiring problem-dependent quantities like feature covariance eigenvalues.

Ax Zhixia Zhang, Zixuan Huang, Xin Xia, Deqing Wang, Fuzhen Zhuang, Shuai Ma, Ning Ding, Yaodong Yang, Jianxin Li, Yikun Ban 3/4/2026

Heterogeneous Agent Collaborative Reinforcement Learning

Introduces HACRL, a collaborative reinforcement learning paradigm where heterogeneous agents share verified rollouts during training but execute independently at inference.

Ax Ryan Feng Lin, Yuantao Wei, Huiling Liao, Xiaoning Qian, Shuai Huang 3/4/2026

Causal Learning Should Embrace the Wisdom of the Crowd

Paradigm for causal structure learning from observational data leveraging human causal knowledge to address combinatorial explosion of possible graphs.