Ax Benjamin Gutteridge, Michael Bronstein, Xiaowen Dong 3/25/2026

Can Graph Foundation Models Generalize Over Architecture?

Studies whether graph foundation models can generalize across different GNN architectures and graph characteristics, revealing limitations in current approaches.

Ax Davide Scassola, Dylan Ponsford, Adri\'an Javaloy, Sebastiano Saccani, Luca Bortolussi, Henry Gouk, Antonio Vergari 3/25/2026

A Sobering Look at Tabular Data Generation via Probabilistic Circuits

Critical analysis of tabular data generation via probabilistic circuits, questioning progress claims and evaluation protocols in current benchmarks.

Ax Yutang Ge, Yaning Cui, Hanzheng Li, Jun-Jie Wang, Fanjie Xu, Jinhan Dong, Yongqi Jin, Dongxu Cui, Peng Jin, Guojiang Zhao, Hengxing Cai, Rong Zhu, Linfeng Zhang, Xiaohong Ji, Zhifeng Gao 3/25/2026

SpecXMaster Technical Report

AI system for automated spectroscopy interpretation in scientific discovery, reducing human bias in spectral analysis.

Ax Edoardo Cetin, Stefano Peluchetti, Emilio Castillo, Akira Naruse, Mana Murakami, Llion Jones 3/25/2026

Sparser, Faster, Lighter Transformer Language Models

Sparse packing format and CUDA kernels leveraging unstructured sparsity in LLM feedforward layers to reduce computational costs and model size.

Ax Miao Yu, Siyuan Fu, Moayad Aloqaily, Zhenhong Zhou, Safa Otoum, Xing fan, Kun Wang, Yufei Guo, Qingsong Wen 3/25/2026

SafeSeek: Universal Attribution of Safety Circuits in Language Models

Mechanistic interpretability framework identifying and attributing safety circuits in LLMs responsible for alignment, jailbreak, and backdoor behaviors.

Ax Peng-Yuan Wang, Ziniu Li, Tian Xu, Bohan Yang, Tian-Shuo Liu, ChenYang Wang, Xiong-Hui Chen, Yi-Chen Li, Tianyun Yang, Congliang Chen, Yang Yu 3/25/2026

Off-Policy Value-Based Reinforcement Learning for Large Language Models

Off-policy value-based reinforcement learning framework for LLMs enabling improved data utilization and sample efficiency for long-horizon tasks.

Ax Jenny Gao (College of Arts and Science, New York University, New York, NY), Yongfeng Zhang (Department of Computer Sciences, School of Arts & Sciences, Rutgers University, Piscataway, NJ), Mary L Disis (UW Medicine Cancer Vaccine Institute University of Washington, Seattle, WA), Lanjing Zhang (Department of Chemical Biology, Ernest Mario School of Pharmacy, Rutgers University, Piscataway, NJ, Department of Pathology, Princeton Medical Center, Plainsboro, NJ, Rutgers Cancer Institute, New Brunswick, NJ) 3/25/2026

Errors in AI-Assisted Retrieval of Medical Literature: A Comparative Study

Quantitative assessment of reference retrieval errors from 5 LLM platforms on 2,000 medical literature references. Evaluates Grok-2, ChatGPT, Gemini, Perplexity, DeepSeek.

Ax Ricardo Olmedo, Bernhard Sch\"olkopf, Moritz Hardt 3/25/2026

Computational Arbitrage in AI Model Markets

Framework for computational arbitrage in AI model markets where arbitrageurs allocate inference budget across competing providers to undercut pricing.

Ax Tom Ulanovski (Tel Aviv University), Eyal Blyachman (Tel Aviv University), Maya Bechler-Speicher (Meta) 3/25/2026

Improving LLM Predictions via Inter-Layer Structural Encoders

Method leveraging intermediate layer representations in LLMs via Inter-Layer Structural Encoders to improve task-specific predictions beyond final-layer features.