Ax Jialin Chen, Haolan Zuo, Haoyu Peter Wang, Siqi Miao, Pan Li, Rex Ying 2/24/2026

Towards A Universal Graph Structural Encoder

Universal graph encoder for learning transferable structural representations across diverse graph domains.

Ax Yaoyu Zhu, Di Huang, Hanqi Lyu, Xiaoyun Zhang, Chongxiao Li, Wenxuan Shi, Yutong Wu, Jianan Mu, Jinghua Wang, Yang Zhao, Pengwei Jin, Shuyao Cheng, Shengwen Liang, Xishan Zhang, Rui Zhang, Zidong Du, Qi Guo, Xing Hu, Yunji Chen 2/24/2026

QiMeng-CodeV-R1: Reasoning-Enhanced Verilog Generation

QiMeng-CodeV-R1 applies RLVR to Verilog generation from natural language, introducing verifiable reward signal for hardware design automation.

Ax Guojiang Zhao, Zixiang Lu, Yutang Ge, Sihang Li, Zheng Cheng, Haitao Lin, Lirong Wu, Hanchen Xia, Hengxing Cai, Wentao Guo, Hongshuai Wang, Mingjun Xu, Siyu Zhu, Guolin Ke, Linfeng Zhang, Zhifeng Gao 2/24/2026

MolReasoner: Toward Effective and Interpretable Reasoning for Molecular LLMs

MolReasoner enhances LLM reasoning for molecular tasks with domain-specific semantics, reducing hallucinations and improving interpretability.

Ax Francesco Leonardi, Markus Orsi, Jean-Louis Reymond, Kaspar Riesen 2/24/2026

GEDAN: Learning the Edit Costs for Graph Edit Distance

GEDAN learns optimal edit costs for graph edit distance computation, improving upon NP-hard approximation methods with learnable cost functions.

Ax Minqi Jiang, Andrei Lupu, Yoram Bachrach 2/24/2026

Bootstrapping Task Spaces for Self-Improvement

Presents Exploratory Iteration (ExIt), RL methods enabling agents to self-improve through iterative refinement without fixed iteration limits.

Ax Geon Lee, Bhuvesh Kumar, Clark Mingxuan Ju, Tong Zhao, Kijung Shin, Neil Shah, Liam Collins 2/24/2026

Sequential Data Augmentation for Generative Recommendation

Data augmentation strategies for generative recommendation systems improving generalization in sequential user behavior prediction.

Ax Aman Gupta, Rafael Celente, Abhishek Shivanna, D. T. Braithwaite, Gregory Dexter, Shao Tang, Hiroto Udagawa, Daniel Silva, Rohan Ramanath, S. Sathiya Keerthi 2/24/2026

Effective Quantization of Muon Optimizer States

8-bit blockwise quantization of Muon optimizer states reducing memory overhead for large-scale LLM pretraining.

Ax Jubayer Ibn Hamid, Ifdita Hasan Orney, Ellen Xu, Chelsea Finn, Dorsa Sadigh 2/24/2026

Polychromic Objectives for Reinforcement Learning

Polychromic objectives for RL fine-tuning preventing policy collapse and preserving diversity in pretrained model behaviors.

Ax Jaewoo Lee, Minsu Kim, Sanghyeok Choi, Inhyuck Song, Sujin Yun, Hyeongyu Kang, Woocheol Shin, Taeyoung Yun, Kiyoung Om, Jinkyoo Park 2/24/2026

Diffusion Alignment as Variational Expectation-Maximization

Diffusion Alignment as Variational EM framework addressing reward over-optimization and mode collapse in diffusion model alignment.

Ax Nirjhar Das, Mohit Sharma, Praharsh Nanavati, Kirankumar Shiragur, Amit Deshpande 2/24/2026

Cost Efficient Fairness Audit Under Partial Feedback

Fairness auditing framework for classifiers with partial feedback using cost-aware data acquisition strategies.