Ax Willem Schooltink, Fabio Massimo Zennaro 3/2/2026

Multi-Level Causal Embeddings

Framework for causal embeddings enabling multiple detailed models to map into sub-systems of coarser causal models.

Ax Zhuoran Li, Xun Wang, Hai Zhong, Qingxin Xia, Lihua Zhang, Longbo Huang 3/2/2026

OM2P: Offline Multi-Agent Mean-Flow Policy

OM2P: offline multi-agent reinforcement learning using flow-based generative models with improved sampling efficiency.

Ax Shuofei Qiao, Yanqiu Zhao, Zhisong Qiu, Xiaobin Wang, Jintian Zhang, Zhao Bin, Ningyu Zhang, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen 3/2/2026

Scaling Generalist Data-Analytic Agents

DataMind: scalable data-analytic agent system with open-source training recipes for multi-step reasoning over diverse-format, large-scale data files.

Ax Haining Pan, James V. Roggeveen, Erez Berg, Juan Carrasquilla, Debanjan Chowdhury, Surya Ganguli, Federico Ghimenti, Juraj Hasik, Henry Hunt, Hong-Chen Jiang, Mason Kamb, Ying-Jer Kao, Ehsan Khatami, Michael J. Lawler, Di Luo, Titus Neupert, Xiaoliang Qi, Michael P. Brenner, Eun-Ah Kim 3/2/2026

CMT-Benchmark: A Benchmark for Condensed Matter Theory Built by Expert Researchers

CMT-Benchmark: 50 expert-level condensed matter theory problems for evaluating LLMs on advanced scientific reasoning and code generation.