Ax H\"useyin Tun\c{c}, Do\u{g}anay \"Ozese, \c{S}. \.Ilker Birbil, Donato Maragno, Marco Caserta, Mustafa Baydo\u{g}an 29d ago

Output-Constrained Decision Trees

Methods for training constrained regression trees incorporating domain-specific output constraints using mixed-integer programming and other approaches.

Ax Lorenzo Sciandra, Roberto Esposito, Andrea Cesare Grosso, Laura Sacerdote, Cristina Zucca 29d ago

Supplementary Materials to Graph Convolutional Branch and Bound

Integration of neural networks into combinatorial optimization for NP-hard problems, learning heuristics and optimality scores via graph convolutional networks.

Ax Shin'ya Yamaguchi, Kosuke Nishida, Daiki Chijiwa, Yasutoshi Ida 29d ago

Zero-shot Concept Bottleneck Models

Zero-shot concept bottleneck models enabling interpretable predictions without target task training by leveraging pre-trained vision-language models.

Ax Duo Su, Huyu Wu, Huanran Chen, Yiming Shi, Yuzhu Wang, Xi Ye, Jun Zhu 29d ago

Diffusion Models as Dataset Distillation Priors

Dataset distillation method leveraging diffusion models as priors to synthesize compact, representative datasets with improved diversity and generalization.

Ax Junxiong Wang, Fengxiang Bie, Jisen Li, Zhongzhu Zhou, Zelei Shao, Yubo Wang, Yinghui Liu, Qingyang Wu, Avner May, Sri Yanamandra, Yineng Zhang, Ce Zhang, Tri Dao, Percy Liang, Ben Athiwaratkun, Shuaiwen Leon Song, Chenfeng Xu, Xiaoxia Wu 29d ago

When RL Meets Adaptive Speculative Training: A Unified Training-Serving System

Unified training-serving system combining RL with adaptive speculative decoding for accelerated LLM inference.

Ax Vignesh Gopakumar, Ander Gray, Dan Giles, Lorenzo Zanisi, Matt J. Kusner, Timo Betcke, Stanislas Pamela, Marc Peter Deisenroth 29d ago

Learning Physical Operators using Neural Operators

Physics-informed neural operators for solving PDEs with improved generalization beyond training distributions.

Ax Xiangyang Zhu, Yuan Tian, Qi Jia, Kaiwei Zhang, Zicheng Zhang, Chunyi Li, Kaiyuan Ji, Dongrui Liu, Zijian Chen, Lu Sun, Renrui Zhang, Yan Teng, Jing Shao, Wei Sun, Xia Hu, Yu Qiao, Guangtao Zhai 29d ago

SafeSci: Safety Evaluation of Large Language Models in Science Domains and Beyond

SafeSci: Framework for evaluating safety of large language models in scientific domains with comprehensive benchmarks.

Ax Aur Shalev Merin 29d ago

Temporal Credit Is Free

Recurrent network training without Jacobian propagation using hidden state temporal credit. Studies gradient normalization and online adaptation.