Ax Jun Yang, Yuechun Sun, Yi Wu, Rodrigo Caridad, Yongwei Yuan, Jianan Yao, Shan Lu, Kexin Pei 3/30/2026

ExVerus: Verus Proof Repair via Counterexample Reasoning

LLM framework for formal proof repair using counterexample-guided reasoning and behavioral feedback to improve automated verification.

Ax Hyukjun Lim, Soojung Yang, Lucas Pin\`ede, Miguel Steiner, Yuanqi Du, Rafael G\'omez-Bombarelli 3/30/2026

A Priori Sampling of Transition States with Guided Diffusion

Method using guided diffusion to sample transition states on potential energy surfaces for chemical reaction and conformational change prediction.

Ax Afonso Simpl\'icio, Gon\c{c}alo Vinagre, Miguel Moura Ramos, Diogo Tavares, Rafael Ferreira, Giuseppe Attanasio, Duarte M. Alves, In\^es Calvo, In\^es Vieira, Rui Guerra, James Furtado, Beatriz Canaverde, Iago Paulo, Vasco Ramos, Diogo Gl\'oria-Silva, Miguel Faria, Marcos Treviso, Daniel Gomes, Pedro Gomes, David Semedo, Andr\'e Martins, Jo\~ao Magalh\~aes 3/30/2026

AMALIA Technical Report: A Fully Open Source Large Language Model for European Portuguese

AMALIA: fully open source LLM trained on high-quality European Portuguese data with native evaluation benchmark and improved pt-PT representation.

Ax Shaoxuan Li, Zhixuan Zhao, Hanze Deng, Zirun Ma, Shulin Tian, Zuyan Liu, Yushi Hu, Haoning Wu, Yuhao Dong, Benlin Liu, Ziwei Liu, Ranjay Krishna 3/30/2026

PerceptionComp: A Video Benchmark for Complex Perception-Centric Reasoning

Video benchmark for complex perception reasoning requiring multiple temporally separated visual evidence pieces and compositional logic.

Ax Md Ashiqur Rahman, Lim Jun Hao, Jeremiah Jiang, Teck-Yian Lim, Raymond A. Yeh 3/30/2026

Tunable Soft Equivariance with Guarantees

Framework for constructing soft equivariant computer vision models by projecting weights into designed subspaces with theoretical bounds.