Isolater - Feed

Ax Masahiro Kato 2/20/2026

genriesz: A Python Package for Automatic Debiased Machine Learning with Generalized Riesz Regression

Open-source Python package implementing debiased machine learning via Riesz regression for causal parameter estimation.

Ax Xiaokai Chen, Ilya Kuruzov, Gesualdo Scutari 2/20/2026

Adaptive Decentralized Composite Optimization via Three-Operator Splitting

Decentralized optimization method with adaptive stepsizes for multi-agent networks using three-operator splitting.

Ax Jyotin Goel, Souvik Maji, Pratik Mazumder 2/20/2026

Learning to Stay Safe: Adaptive Regularization Against Safety Degradation during Fine-Tuning

Adaptive regularization framework for maintaining LLM safety during fine-tuning while preserving utility.

Ax Hien Dang, Pratik Patil, Alessandro Rinaldo 2/20/2026

Optimal Unconstrained Self-Distillation in Ridge Regression: Strict Improvements, Precise Asymptotics, and One-Shot Tuning

Analysis of self-distillation in ridge regression showing generalization improvements with precise asymptotics.

Ax Lunjia Hu, Kevin Tian, Chutong Yang 2/20/2026

Simultaneous Blackwell Approachability and Applications to Multiclass Omniprediction

Omniprediction algorithm extending the binary case to multiclass settings, with bounds that hold across multiple losses.

Ax Antonio Guillen-Perez 2/20/2026

Conditional Flow Matching for Continuous Anomaly Detection in Autonomous Driving on a Manifold-Aware Spectral Space

Unsupervised anomaly detection framework using conditional flow matching for autonomous vehicle safety validation.

Ax Alhad Sethi, Kavali Sofia Sagar, Shubhada Agrawal, Debabrota Basu, P. N. Karthik 2/20/2026

Asymptotically Optimal Sequential Testing with Markovian Data

Sequential hypothesis testing algorithm for Markovian data with asymptotically optimal expected stopping time.

Ax Jowaria Khan, Anindya Sarkar, Yevgeniy Vorobeychik, Elizabeth Bondi-Kelly 2/20/2026

Adapting Actively on the Fly: Relevance-Guided Online Meta-Learning with Latent Concepts for Geospatial Discovery

Online meta-learning approach for geospatial discovery using latent concepts with strategic sampling under constraints.

Ax Jianda Du, Youran Sun, Haizhao Yang 2/20/2026

AutoNumerics: An Autonomous, PDE-Agnostic Multi-Agent Pipeline for Scientific Computing

Multi-agent autonomous framework for designing, implementing, and verifying numerical PDE solvers without manual tuning.

Ax Jiaqi Xi, Raghav Saboo, Luming Chen, Martin Wang, Sudeep Das 2/20/2026

Mine and Refine: Optimizing Graded Relevance in E-commerce Search Retrieval

Contrastive learning framework for semantic embeddings in e-commerce search with graded relevance.

Ax Aidar Myrzakhan, Tianyi Li, Bowei Guo, Shengkun Tang, Zhiqiang Shen 2/20/2026

Sink-Aware Pruning for Diffusion Language Models

Pruning technique for diffusion language models reducing inference cost by reconsidering attention sink preservation.

Ax Christoph Lange, Isabel Thiele, Lara Santolin, Sebastian L. Riedel, Maxim Borisyak, Peter Neubauer, M. Nicolas Cruz Bournazou 2/20/2026

Data Augmentation Scheme for Raman Spectra with Highly Correlated Annotations

Data augmentation scheme for Raman spectroscopy with correlated annotations in biotechnology applications.

Ax Seyedeh Baharan Khatami, Harsh Parikh, Haowei Chen, Sudeepa Roy, Babak Salimi 2/20/2026

Graph Machine Learning based Doubly Robust Estimator for Network Causal Effects

Graph machine learning method for estimating causal effects in social networks with interference and confounding.

Ax Xiwen Tao, Chenyi Zhang, Helin Wang, Yexin Zhang, Tongyang Li 2/20/2026

Gradient Testing and Estimation by Comparisons

Algorithm for gradient testing and estimation of smooth functions using only a comparison oracle.

Ax Nima Akbarzadeh, Yossiri Adulyasak, Erick Delage 2/20/2026

Risk-Aware Decision Making in Restless Bandits: Theory and Algorithms for Planning and Learning

Risk-aware decision making algorithms for restless bandits incorporating downside risk mitigation.

Ax Yung-Chen Tang, Pin-Yu Chen, Tsung-Yi Ho 2/20/2026

Defining and Evaluating Physical Safety for Large Language Models

Benchmark for evaluating physical safety risks of LLMs controlling robotic systems like drones with threat classification.

Ax Jangseop Park, Namwoo Kang 2/20/2026

Point-DeepONet: Predicting Nonlinear Fields on Non-Parametric Geometries under Variable Load Conditions

Surrogate model combining PointNet and DeepONet for predicting nonlinear fields on complex 3D geometries.

Ax Maria-Florina Balcan, Martino Bernasconi, Matteo Castiglioni, Andrea Celli, Keegan Harris, Zhiwei Steven Wu 2/20/2026

Nearly-Optimal Bandit Learning in Stackelberg Games with Side Information

Online learning algorithm for Stackelberg games with contextual information achieving improved regret bounds.

Ax Sanghyeon Lee, Sangjun Bae, Yisak Park, Seungyul Han 2/20/2026

Self-Improving Skill Learning for Robust Skill-based Meta-Reinforcement Learning

Hierarchical meta-reinforcement learning approach using self-improving skills to handle noisy offline demonstrations.

Ax Zander W. Blasingame, Chen Liu 2/20/2026

Rex: A Family of Reversible Exponential (Stochastic) Runge-Kutta Solvers

Reversible Runge-Kutta solvers for neural differential equations in generative models with improved numerical stability.

Ax Murat Onur Yildirim, Elif Ceren Gok Yildirim, Joaquin Vanschoren 2/20/2026

Unlocking [CLS] Features for Continual Post-Training

Continual learning approach for foundation models addressing stability-plasticity trade-off during post-training on new classes/domains.

Ax Fei Wu, Jia Hu, Geyong Min, Shiqiang Wang 2/20/2026

Efficient Orthogonal Fine-Tuning with Principal Subspace Adaptation

Parameter-efficient fine-tuning method using orthogonal adaptation on principal subspaces for adapting large models efficiently.

Ax Adrian Arnaiz-Rodriguez, Federico Errica 2/20/2026

Oversmoothing, Oversquashing, Heterophily, Long-Range, and more: Demystifying Common Beliefs in Graph Machine Learning

Survey demystifying common beliefs about oversmoothing, oversquashing, heterophily and long-range tasks in graph neural networks.

Ax Yifan Zhang, Yifeng Liu, Huizhuo Yuan, Yang Yuan, Quanquan Gu, Andrew Chi-Chih Yao 2/20/2026

On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning

Analyzes KL-regularization design choices in policy gradient algorithms for LLM reasoning, comparing forward/reverse KL variants.

Ax Sho Oshima, Yuji Okamoto, Taisei Tosaki, Ryosuke Kojima 2/20/2026

Supervised Graph Contrastive Learning for Gene Regulatory Networks

Supervised graph contrastive learning framework for gene regulatory networks addressing biological validity of perturbations.

Ax Elif Yılmaz, Christos Dimitrakakis 2/20/2026

Two-Player Zero-Sum Games with Bandit Feedback

Game theory algorithms for two-player zero-sum games with unknown payoffs estimated through bandit feedback.

Ax Alba Carballo-Castro, Manuel Madeira, Yiming Qin, Dorina Thanou, Pascal Frossard 2/20/2026

Generating Directed Graphs with Dual Attention and Asymmetric Encoding

Generative model for directed graphs using dual attention and asymmetric encoding to capture ordered relationships.

Ax Jaebak Hwang, Sanghyeon Lee, Jeongmo Kim, Seungyul Han 2/20/2026

Strict Subgoal Execution: Reliable Long-Horizon Planning in Hierarchical Reinforcement Learning

Strict Subgoal Execution method for hierarchical RL that improves long-horizon planning by validating subgoal feasibility.

Ax Mark Lee, Chang Lan, Tom Gunter, John Peebles, Hanzhi Zhou, Kelvin Zou, Sneha Bangalore, Chung-Cheng Chiu, Nan Du, Xianzhi Du, Philipp Dufter, Ruixuan Hou, Haoshuo Huang, Dongseong Hwang, Xiang Kong, Jinhao Lei, Tao Lei, Meng Li, Li Li, Jiarui Lu, Zhiyun Lu, Yiping Ma, David Qiu, Vivek Rathod, Senyu Tong, Zhucheng Tu, Jianyu Wang, Yongqiang Wang, Zirui Wang, Floris Weers, Sam Wiseman, Guoli Yin, Bowen Zhang, Xiyou Zhou, Danyang Zhuo, Cheng Leong, Ruoming Pang 2/20/2026

AXLearn: Modular, Hardware-Agnostic Large Model Training

AXLearn production system for scalable hardware-agnostic training of large models with modular software architecture.

Ax Jiequn Han, Kui Ren, Nathan Soedjak 2/20/2026

Instance-Wise Adaptive Sampling for Dataset Construction in Approximating Inverse Problem Solutions

Instance-wise adaptive sampling framework for constructing efficient training datasets for supervised inverse problem learning.

Ax Xuefeng Wang, Lei Zhang, Henglin Pu, Ahmed H. Qureshi, Husheng Li 2/20/2026

Continuous-Time Value Iteration for Multi-Agent Reinforcement Learning

Extends reinforcement learning to continuous-time systems using Hamilton-Jacobi-Bellman equations for irregular interaction frequencies.

Ax Shi Yin, Zujian Dai, Xinyang Pan, Lixin He 2/20/2026

Advancing Universal Deep Learning for Electronic-Structure Hamiltonian Prediction of Materials

Deep learning methods for predicting electronic-structure Hamiltonians in materials, advancing generalization across diverse atomic systems.

Ax Thibaud Gloaguen, Robin Staab, Nikola Jovanović, Martin Vechev 2/20/2026

Watermarking Diffusion Language Models

First watermarking method for diffusion language models that generate tokens non-sequentially, addressing unique DLM challenges.

Ax Xi Wang, James McInerney, Lequn Wang, Nathan Kallus 2/20/2026

Entropy After $\langle \texttt{/Think} \rangle$ for reasoning model early exiting

Analyzes overthinking in reasoning LLMs and proposes early exit using entropy after thinking tags to improve efficiency.

Ax Yumin Choi, Dongki Kim, Jinheon Baek, Sung Ju Hwang 2/20/2026

Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs

Extends prompt optimization to multimodal LLMs, proposing methods to optimize across text, images, video and other modalities.

Ax Andrew B. Kahng, Seokhyeong Kang, Seonghyeon Park, Dooseok Yoon 2/20/2026

ArtNet: Hierarchical Clustering-Based Artificial Netlist Generator for ML and DTCO Application

Hierarchical clustering-based generator of synthetic netlists for ML training and DTCO, avoiding long design turnaround times.

Ax Ruchi Sandilya, Sumaira Perez, Charles Lynch, Lindsay Victoria, Benjamin Zebley, Derrick Matthew Buchanan, Mahendra T. Bhati, Nolan Williams, Timothy J. Spellman, Faith M. Gunning, Conor Liston, Logan Grosenick 2/20/2026

Contrastive Diffusion Alignment: Learning Structured Latents for Controllable Generation

Introduces ConDA, a contrastive learning layer for organizing diffusion model latent spaces to enable controllable generation.

Ax Hansheng Chen, Kai Zhang, Hao Tan, Leonidas Guibas, Gordon Wetzstein, Sai Bi 2/20/2026

pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation

Proposes pi-Flow, a policy-based approach to improve few-step diffusion model distillation by predicting network-free policies.

Ax Iván Ojeda-Ruiz, Young Ju Lee, Malcolm Dickens, Leonardo Cambisaca 2/20/2026

Alternatives to the Laplacian for Scalable Spectral Clustering with Group Fairness Constraints

Scalable spectral clustering under group fairness constraints, using alternatives to the Laplacian to mitigate algorithmic bias and balance group representation.

Ax Zhuojin Li, Marco Paolieri, Leana Golubchik 2/20/2026

Accelerating Mobile Inference through Fine-Grained CPU-GPU Co-Execution

System for accelerating neural network inference on mobile devices through fine-grained CPU-GPU co-execution and synchronization optimization.

Ax Ximan Sun, Xiang Cheng 2/20/2026

LRT-Diffusion: Calibrated Risk-Aware Guidance for Diffusion Policies

LRT-Diffusion applies risk-aware sequential hypothesis testing to improve diffusion policy guidance for offline reinforcement learning.

Ax Seonggyun Lee, Sungjun Lim, Seojin Park, Soeun Cheon, Kyungwoo Song 2/20/2026

Semi-Supervised Preference Optimization with Limited Feedback

Semi-supervised preference optimization framework for LLM alignment using limited labeled paired feedback data.

Ax Yan Sun, Jia Guo, Stanley Kok, Zihao Wang, Zujie Wen, Zhiqiang Zhang 2/20/2026

Efficient Reinforcement Learning for Large Language Models with Intrinsic Exploration

PREPO method improving data efficiency for LLM reinforcement learning with verifiable rewards using intrinsic exploration signals.

Ax Sanjeev Shrestha, Rahul Dubey, Hui Liu 2/20/2026

Beyond Linear Surrogates: High-Fidelity Local Explanations for Black-Box Models

Model-agnostic local explanation method using MARS and N-ball sampling for high-fidelity black-box model interpretability.

Ax Debamita Ghosh, George K. Atia, Yue Wang 2/20/2026

Online Robust Reinforcement Learning with General Function Approximation

Distributionally robust reinforcement learning approach using general function approximation for policy robustness under environment shift.

Ax Nathan Buskulic, Luca Calatroni, Lorenzo Rosasco, Silvia Villa 2/20/2026

On the Sample Complexity of Learning for Blind Inverse Problems

Theoretical analysis of sample complexity for data-driven approaches to blind inverse problems with interpretability concerns.

Ax Kiattikun Chobtham 2/20/2026

Reinforcement Learning to Discover a North-East Monsoon Index for Rainfall Prediction in Thailand

Reinforcement learning approach to discover local climate indices for improving rainfall prediction in Thailand.

Ax Yilong Dai, Shengyu Chen, Ziyi Wang, Xiaowei Jia, Yiqun Xie, Vipin Kumar, Runlong Yu 2/20/2026

Learning PDE Solvers with Physics and Data: A Unifying View of Physics-Informed Neural Networks and Neural Operators

Unified framework connecting physics-informed neural networks and neural operators for learning PDE solvers.

Ax Arshia Soltani Moakhar, Tanapoom Laoaron, Faraz Ghahremani, Kiarash Banihashem, MohammadTaghi Hajiaghayi 2/20/2026

Active Learning for Decision Trees with Provable Guarantees

Theoretical analysis of active learning label complexity for decision trees with provable polylogarithmic guarantees.

Ax Yijun Ma, Zehong Wang, Weixiang Sun, Yanfang Ye 2/20/2026

Temporal Graph Pattern Machine

Temporal graph pattern machine for learning transferable representations in dynamic networks without restrictive assumptions.