Ax Junxiong Wang, Fengxiang Bie, Jisen Li, Zhongzhu Zhou, Zelei Shao, Yubo Wang, Yinghui Liu, Qingyang Wu, Avner May, Sri Yanamandra, Yineng Zhang, Ce Zhang, Tri Dao, Percy Liang, Ben Athiwaratkun, Shuaiwen Leon Song, Chenfeng Xu, Xiaoxia Wu 29d ago

When RL Meets Adaptive Speculative Training: A Unified Training-Serving System

Unified training-serving system combining RL with adaptive speculative decoding for accelerated LLM inference.

Ax Vignesh Gopakumar, Ander Gray, Dan Giles, Lorenzo Zanisi, Matt J. Kusner, Timo Betcke, Stanislas Pamela, Marc Peter Deisenroth 29d ago

Learning Physical Operators using Neural Operators

Physics-informed neural operators for solving PDEs with improved generalization beyond training distributions.

Ax Xiangyang Zhu, Yuan Tian, Qi Jia, Kaiwei Zhang, Zicheng Zhang, Chunyi Li, Kaiyuan Ji, Dongrui Liu, Zijian Chen, Lu Sun, Renrui Zhang, Yan Teng, Jing Shao, Wei Sun, Xia Hu, Yu Qiao, Guangtao Zhai 29d ago

SafeSci: Safety Evaluation of Large Language Models in Science Domains and Beyond

Framework for evaluating the safety of large language models in scientific domains, with comprehensive benchmarks.

Ax Aur Shalev Merin 29d ago

Temporal Credit Is Free

Recurrent network training without Jacobian propagation, using hidden-state temporal credit. Studies gradient normalization and online adaptation.

Ax Samuel Girard, Aurelien Bibaut, Arthur Gretton, Nathan Kallus, Houssam Zenati 29d ago

Fast Best-in-Class Regret for Contextual Bandits

Fast regret bounds for contextual bandits without realizability assumptions, using pessimistic policy updates.

Ax Toufique Ahmed, Jatin Ganhotra, Avraham Shinnar, Martin Hirzel 29d ago

Investigating Test Overfitting on SWE-bench

Investigation of test overfitting on SWE-bench issue resolution, where models pass tests but miss important cases.

HN XYen0n 29d ago

SSH to any machine without IP

SSH tool for connecting to machines behind NAT/firewalls without port forwarding. Infrastructure utility unrelated to AI.