Ax LM-Provers, Yuxiao Qu, Amrith Setlur, Jasper Dekoninck, Edward Beeching, Jia Li, Ian Wu, Lewis Tunstall, Aviral Kumar 27d ago

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

QED-Nano trains small neural networks to prove mathematical theorems, enabling reproducible and efficient theorem-proving without large models.

Ax Yikun Ban, Yuchen Yan, Arindam Banerjee, Jingrui He 27d ago

Neural Exploitation and Exploration of Contextual Bandits

Neural networks applied to contextual multi-armed bandits, comparing epsilon-greedy, Thompson Sampling, and UCB techniques for exploration-exploitation trade-offs.

Ax Ricardo Gama, Ricardo Cunha, Daniel Fuertes, Carlos R. del-Blanco, Hugo L. Fernandes 27d ago

Multi-Agent Environments for Vehicle Routing Problems

Open-source RL framework for vehicle routing problems, extending reinforcement learning to discrete optimization in operations research.

Ax Fengqing Jiang, Fengbo Ma, Zhangchen Xu, Yuetai Li, Zixin Rao, Bhaskar Ramasubramanian, Luyao Niu, Bo Li, Xianyan Chen, Zhen Xiang, Radha Poovendran 27d ago

SoSBench: Benchmarking Safety Alignment on Six Scientific Domains

SoSBench benchmarks safety alignment of LLMs across six scientific domains with sophisticated risks beyond basic misuse scenarios.

Ax Patrick Vossler, Fan Xia, Yifan Mai, Adarsh Subbaswamy, Jean Feng 27d ago

LLMs Judging LLMs: A Simplex Perspective

Studies the problem of using LLMs as judges for evaluating LLM outputs, addressing epistemic uncertainty in judge quality beyond sampling variability.

Ax Fouad Oubari, Mohamed El-Baha, Raphael Meunier, Rodrigue D\'ecatoire, Mathilde Mougeot 27d ago

Multi-Component VAE with Gaussian Markov Random Field

Multi-component VAE using Gaussian Markov Random Fields for generative modeling of complex datasets with intricate dependencies.

Ax Vincent Grari, Tim Arni, Thibault Laugel, Sylvain Lamprier, James Zou, Marcin Detyniecki 27d ago

ACT: Agentic Classification Tree

Agentic Classification Tree (ACT) combining LLMs with decision trees for transparent, interpretable decisions on unstructured data.

Ax Matthew Lowery, Zhitong Xu, Da Long, Keyan Chen, Daniel S. Johnson, Yang Bai, Varun Shankar, Shandian Zhe 27d ago

Deep Gaussian Processes for Functional Maps

Deep Gaussian Processes for learning mappings between functional spaces with uncertainty quantification for spatiotemporal forecasting and climate modeling.