Isolater - Feed

Ax Junkang Liu, Fanhua Shang, Hongying Liu, Jin Liu, Weixin An, Yuanyuan Liu 2/24/2026

Taming Preconditioner Drift: Unlocking the Potential of Second-Order Optimizers for Federated Learning on Non-IID Data

Addresses preconditioner drift in federated second-order optimizers on non-IID data through curvature alignment techniques.

Ax Jiangjie Qiu, Wentao Li, Honghao Chen, Leyi Zhao, Xiaonan Wang 2/24/2026

AdsorbFlow: energy-conditioned flow matching enables fast and realistic adsorbate placement

Proposes AdsorbFlow, energy-conditioned flow matching for fast adsorbate placement prediction on catalytic surfaces.

Ax Svetlana Glazyrina, Maksim Kryzhanovskiy, Roman Ischenko 2/24/2026

Soft Sequence Policy Optimization: Bridging GMPO and SAPO

Bridges GMPO and SAPO by combining sequence-level importance sampling with soft clipping alternatives for improved LLM policy optimization.

Ax Barsat Khadka, Kawsher Roxy, Md Rubel Ahmed 2/24/2026

CTS-Bench: Benchmarking Graph Coarsening Trade-offs for GNNs in Clock Tree Synthesis

Benchmarks graph coarsening trade-offs for GNNs applied to clock tree synthesis in electronic design automation.

Ax Rishabh Bhattacharya, Vikaskumar Kalsariya, Naresh Manwani 2/24/2026

Training-Free Cross-Architecture Merging for Graph Neural Networks

Introduces H-GRAMA for merging heterogeneous GNN architectures through routing and message aggregation without retraining.

Ax Egor Denisov, Svetlana Glazyrina, Maksim Kryzhanovskiy, Roman Ischenko 2/24/2026

Smooth Gate Functions for Soft Advantage Policy Optimization

Proposes Soft Adaptive Policy Optimization (SAPO) replacing hard clipping with smooth sigmoid gate functions to stabilize LLM training and reasoning in GRPO framework.

Ax David Rawlinson, Gideon Kowadlo 2/24/2026

Active perception and disentangled representations allow continual, episodic zero and few-shot learning

Framework combining active perception and disentangled representations for continual and few-shot learning without destructive interference.

Ax Daniel Ritter, Owen Oertell, Bradley Guo, Jonathan Chang, Kiant\'e Brantley, Wen Sun 2/24/2026

LLMs Can Learn to Reason Via Off-Policy RL

Method for training LLMs to reason using off-policy reinforcement learning, addressing policy lag in distributed training architectures.

Ax Ali Saheb, Johan Obando-Ceron, Aaron Courville, Pouya Bashivan, Pablo Samuel Castro 2/24/2026

Stable Deep Reinforcement Learning via Isotropic Gaussian Representations

Analysis showing isotropic Gaussian representations improve stability in deep RL under non-stationary training dynamics.

Ax Jing Ren, Jiapeng Du, Bowen Li, Ziqi Xu, Xin Zheng, Hong Jia, Suyu Ma, Xiwei Xu, Feng Xia 2/24/2026

Spiking Graph Predictive Coding for Reliable OOD Generalization

Spiking graph neural network with predictive coding framework for out-of-distribution generalization in dynamic web environments.

Ax Phillip Si, Peng Chen 2/24/2026

LEVDA: Latent Ensemble Variational Data Assimilation via Differentiable Dynamics

Latent ensemble variational data assimilation method for long-range geophysical forecasts using differentiable dynamics.

Ax Nazal Mohamed, Ayush Mohanty, Nagi Gebraeel 2/24/2026

Federated Causal Representation Learning in State-Space Systems for Decentralized Counterfactual Reasoning

Federated learning approach for causal representation learning in state-space systems enabling decentralized counterfactual reasoning across networked assets.

Ax Pranay Anchuri 2/24/2026

RAmmStein: Regime Adaptation in Mean-reverting Markets with Stein Thresholds -- Optimal Impulse Control in Concentrated AMMs

Optimal impulse control framework for concentrated liquidity provision in decentralized exchanges using Stein thresholds.

Ax Qianfeng Yu, Ningkang Peng, Yanhui Gu 2/24/2026

PIS: A Physics-Informed System for Accurate State Partitioning of $A\beta_{42}$ Protein Trajectories

Physics-informed system for analyzing conformational state transitions in Aβ₄₂ protein trajectories for Alzheimer's disease research.

Ax Zelin He, Boran Han, Xiyuan Zhang, Shuai Zhang, Haotian Lin, Qi Zhu, Haoyang Fang, Danielle C. Maddix, Abdul Fatir Ansari, Akash Chandrayan, Abhinav Pradhan, Bernie Wang, Matthew Reimherr 2/24/2026

SenTSR-Bench: Thinking with Injected Knowledge for Time-Series Reasoning

Benchmark combining general reasoning LLMs with domain-specific time-series knowledge for improved time-series diagnostic reasoning tasks.

Ax Arjun Chatterjee, Sayeed Sajjad Razin, John Wu, Siddhartha Laghuvarapu, Jathurshan Pradeepkumar, Jimeng Sun 2/24/2026

Making Conformal Predictors Robust in Healthcare Settings: a Case Study on EEG Classification

Evaluation of conformal prediction methods for EEG classification handling distribution shifts in healthcare without standard i.i.d. assumptions.

Ax Bryan Guanrong Shan, Alysa Ziying Tan, Han Yu 2/24/2026

Federated Learning Playground

Interactive browser-based educational platform for learning Federated Learning concepts with real-time visualization of heterogeneous data effects.

Ax Rudrajit Das, Neel Patel, Meisam Razaviyayn, Vahab Mirrokni 2/24/2026

Less is More: Convergence Benefits of Fewer Data Weight Updates over Longer Horizon

Bilevel optimization analysis showing convergence benefits of fewer domain weight updates over longer training horizons.

Ax Pengxi Liu, Zeyu Michael Li, Xiang Cheng 2/24/2026

Variational Trajectory Optimization of Anisotropic Diffusion Schedules

Variational framework for diffusion models with matrix-valued anisotropic noise schedules that jointly train score networks.

Ax Dingyi Nie, Yixing Wu, C. -C. Jay Kuo 2/24/2026

A Statistical Approach for Modeling Irregular Multivariate Time Series with Missing Observations

Method for modeling irregular multivariate time series with missing values using time-agnostic summary statistics instead of deep learning.

Ax Pascal Jr Tikeng Notsawo, Guillaume Dumas, Guillaume Rabusseau 2/24/2026

Grokking Finite-Dimensional Algebra

Investigation of grokking phenomenon in neural networks learning multiplication in finite-dimensional algebras beyond group operations.

Ax Kasper Green Larsen, Markus Engelund Mathiasen, Chirag Pabbaraju, Clement Svendsen 2/24/2026

The Sample Complexity of Replicable Realizable PAC Learning

Theoretical analysis of sample complexity bounds in replicable realizable PAC learning using Cayley graphs and spectral analysis.

Ax Jeremy McEntire 2/24/2026

Leap+Verify: Regime-Adaptive Speculative Weight Prediction for Accelerating Neural Network Training

Leap+Verify applies speculative execution to accelerate neural network training by predicting and validating future weights across detected regimes.

Ax Shenghong He 2/24/2026

Advantage-based Temporal Attack in Reinforcement Learning

Investigates adversarial attacks on deep reinforcement learning agents using advantage-based temporal perturbations.

Ax Biswajit Sadhu, Kalpak Gupte, Trijit Sadhu, S. Anand 2/24/2026

Interpolation-Driven Machine Learning Approaches for Plume Shine Dose Estimation: A Comparison of XGBoost, Random Forest, and TabNet

Compares XGBoost, Random Forest, and TabNet for radiation dose estimation in nuclear safety using interpolation-driven ML approaches.

Ax Yijiashun Qi, Hanzhe Guo, Yijiazhen Qi 2/24/2026

Detecting High-Potential SMEs with Heterogeneous Graph Neural Networks

SME-HGT uses heterogeneous graph transformers to predict high-potential small and medium enterprises using public data from SBIR Phase I awardees.

Ax Ayush Nangia, Shikhar Mishra, Aman Gokrani, Paras Chopra 2/24/2026

ISO-Bench: Can Coding Agents Optimize Real-World Inference Workloads?

ISO-Bench benchmark evaluates coding agents on real-world LLM inference optimization tasks from vLLM and SGLang frameworks with 54 curated tasks.

Ax Luigi Simeone 2/24/2026

Variational Inference for Bayesian MIDAS Regression

Coordinate Ascent Variational Inference algorithm for Bayesian MIDAS regression with bilinear structure.

Ax Luhan Tang, Longxuan Yu, Shaorong Zhang, Greg Ver Steeg 2/24/2026

Is Your Diffusion Sampler Actually Correct? A Sampler-Centric Evaluation of Discrete Diffusion Language Models

Framework for evaluating discrete diffusion language models by separating sampler-induced error from denoiser approximation error.

Ax Jingbo Zhou, Jun Xia, Siyuan Li, Yunfan Liu, Wenjun Wang, Yufei Huang, Changxi Chi, Mutian Hong, Zhuoli Ouyang, Shu Wang, Zhongqi Wang, Xingyu Wu, Chang Yu, Stan Z. Li 2/24/2026

VecFormer: Towards Efficient and Generalizable Graph Transformer with Graph Token Attention

VecFormer improves graph transformers with token-level attention to reduce computational complexity and improve generalization.

Ax Jesse Farebrother, Matteo Pirotta, Andrea Tirinzoni, Marc G. Bellemare, Alessandro Lazaric, Ahmed Touati 2/24/2026

Compositional Planning with Jumpy World Models

Compositional planning with world models enabling agents to compose pre-trained policies for solving complex tasks.

Ax Marvin Chen, Manuel Eberhardinger, Johannes Maucher 2/24/2026

Evaluating the Impact of Data Anonymization on Image Retrieval

Study of how data anonymization impacts performance of content-based image retrieval systems.

Ax Pablo Herrero G\'omez, Antonio Jimeno Morenilla, David Mu\~noz-Hern\'andez, Higinio Mora Mora 2/24/2026

Spectral Phase Encoding for Quantum Kernel Methods

Analysis of quantum kernel methods under data corruption with introduction of Spectral Phase Encoding technique.

Ax Rampunit Kumar, Aditya Maheshwari 2/24/2026

NEXUS : A compact neural architecture for high-resolution spatiotemporal air quality forecasting in Delhi Nationa Capital Region

Neural architecture for high-resolution spatiotemporal air quality forecasting in Delhi using four years of data.

Ax Vishnu Subramanian 2/24/2026

Representation Stability in a Minimal Continual Learning Agent

Study of representational dynamics in minimal continual learning agents across multiple task executions.

Ax Kihyuk Yoon, Lingchao Mao, Catherine Chong, Todd J. Schwedt, Chia-Chun Chiang, Jing Li 2/24/2026

PaReGTA: An LLM-based EHR Data Encoding Approach to Capture Temporal Information

PaReGTA encodes temporal electronic health records using LLMs with lightweight fine-tuning for domain adaptation.

Ax Xinyu Yuan, Xixian Liu, Ya Shi Zhang, Zuobai Zhang, Hongyu Guo, Jian Tang 2/24/2026

PerturbDiff: Functional Diffusion for Single-Cell Perturbation Modeling

PerturbDiff uses functional diffusion models for predicting cellular responses to perturbations in biology.

Ax Johanna S. Fr\"ohlich, Bastian Heinlein, Jan U. Claar, Hans Rosenberger, Vasileios Belagiannis, Ralf R. M\"uller 2/24/2026

The Confusion is Real: GRAPHIC - A Network Science Approach to Confusion Matrices in Deep Learning

GRAPHIC: network science approach to visualize and analyze class confusion patterns in deep learning models.

Ax Shimeng Huang, Matthew Robinson, Francesco Locatello 2/24/2026

Addressing Instrument-Outcome Confounding in Mendelian Randomization through Representation Learning

Representation learning approach to address confounding in Mendelian Randomization epidemiological research.

Ax Dylan Baptiste (CRESTIC), Ramla Saddem (CRESTIC), Alexandre Philippot (CRESTIC), Fran\c{c}ois Foyer 2/24/2026

Unsupervised Anomaly Detection in NSL-KDD Using $\beta$-VAE: A Latent Space and Reconstruction Error Approach

Unsupervised anomaly detection for network intrusion detection using β-VAE on NSL-KDD dataset.

Ax Lotta M\"akinen, Jorge Lor\'ia, Samuel Kaski 2/24/2026

Bayesian Meta-Learning with Expert Feedback for Task-Shift Adaptation through Causal Embeddings

Bayesian meta-learning method using causal embeddings to improve adaptation to out-of-distribution tasks.

Ax Sophia N. Wilson, Gu{\dh}r\'un Fj\'ola Gu{\dh}mundsd\'ottir, Andrew Millard, Raghavendra Selvan, Sebastian Mair 2/24/2026

Stop Preaching and Start Practising Data Frugality for Responsible Development of AI

Position paper advocating adoption of data frugal machine learning practices to reduce computational and environmental costs.

Ax Hyunwoo Park 2/24/2026

I Dropped a Neural Net

Method to recover the correct ordering of shuffled neural network layers using only the dataset and layer information.

Ax Soumen Pachal, Prashanth L. A., Shalabh Bhatnagar, Avinash Achar 2/24/2026

Generalized Random Direction Newton Algorithms for Stochastic Optimization

Novel Hessian estimators using random direction stochastic approximation for optimization with noisy measurements.

Ax Zhongwei Wan, Yun Shen, Zhihao Dou, Donghao Zhou, Yu Zhang, Xin Wang, Hui Shen, Jing Xiong, Chaofan Tao, Zixuan Zhong, Peizhou Huang, Mi Zhang 2/24/2026

DSDR: Dual-Scale Diversity Regularization for Exploration in LLM Reasoning

DSDR: dual-scale diversity regularization method to improve exploration in LLM reasoning tasks with reinforcement learning from verifiers.

Ax Ghaith Mqawass (TUM School of Life Sciences Weihenstephan, Technical University of Munich, Germany, Machine Learning and Computational Sciences, Pfizer Research & Development, Berlin, Germany), Tuan Le (Machine Learning and Computational Sciences, Pfizer Research & Development, Berlin, Germany), Fabian Theis (TUM School of Life Sciences Weihenstephan, Technical University of Munich, Germany, TUM School of Computation, Information and Technology, Technical University of Munich, Germany, Institute of Computational Biology, Helmholtz Center Munich, Germany), Djork-Arn\'e Clevert (Machine Learning and Computational Sciences, Pfizer Research & Development, Berlin, Germany) 2/24/2026