Score Distillation Beyond Acceleration: Generative Modeling from Corrupted Data
arXiv paper: Restoration Score Distillation framework for learning generative models from corrupted data. Novel ML research.
Foundation model for time series forecasting using mixture-of-experts architecture with decoupled training to handle diverse temporal patterns and multi-variable correlations.
Tutorial on diffusion and flow-based generative models covering mathematical foundations, ODEs, SDEs, and core algorithms for image, video, and multi-modal generation.
Offline reinforcement learning method for mismatched dynamics leveraging model-based approaches to explore high-reward states.
Self-organizing map extension addressing catastrophic forgetting in continual learning scenarios.
Privacy-preserving neural network framework integrating epidemiological modeling with differential privacy guarantees.
Learning to Reject framework extending ML models to abstain from low-quality predictions and explanations.
Method predicting better pre-trained weights by leveraging structural properties and retrodiction of forgetting.
Fast weight programmers with 2D matrix hidden states connecting RNNs, language modeling, and neurobiology.
Inverse game theory algorithm learning constraints from Nash equilibrium demonstrations using MILP formulations.
Transfer learning framework for EEG-based emotion recognition using domain-class prototypes.
Theoretical analysis of implicit regularization in diagonal linear networks via Lasso regularization path.
Tree-based group relative policy optimization for LLM agents addressing sparse supervision in multi-turn tasks.
Method aligning supervised fine-tuning with in-context learning activations to improve LLM generalization and calibration.
Offline reinforcement learning using in-context learning with linear Transformers for compositional Q-function estimation.
Parameter-efficient unlearning method for foundation models addressing privacy/safety with bounded weight growth.
Slow-Fast Policy Optimization framework for improving LLM reasoning via reinforcement learning with stable gradient updates.
Energy-based models combining classification and generation via adversarial training to improve SGLD stability.
Proposes continual low-rank adapters for LLM-based recommender systems handling evolving users and preferences without catastrophic forgetting.
Develops calibration-free soil sensing system using contrastive learning to predict moisture and macronutrients without retraining.
Presents augmentation-free graph contrastive learning via fractional-order neural diffusion networks for multi-scale structure learning.
Proposes standardized methodology for evaluating long-term sustainability and efficiency of ML models addressing Green AI gaps.
Demonstrates in-context learning emerges organically in genomic sequence models trained with next-token prediction on DNA sequences.
Develops methods for provably safe ML model updates preventing catastrophic forgetting and alignment drift in dynamic environments.
Proposes data filtering method for cross-domain offline RL addressing dynamics misalignment between source and target domains.
Analyzes KL regularization estimators in RL training of LLMs, comparing bias-variance tradeoffs of different approximation methods.
VL-RouterBench benchmark for evaluating vision-language model routing systems with quality-cost tradeoff assessment at scale.
Evaluates feature-dependent noise in preference-based reinforcement learning with realistic noise patterns correlated to observations.
Proposes GIFT method reconciling SFT and RL post-training for Large Reasoning Models via Gibbs initialization to prevent distributional collapse.
Solves constrained optimization problems via gradient-based methods using hierarchical score-matching spaces to overcome local optima.
Proposes neural characteristic function approach for graph domain adaptation addressing distributional shifts without manual feature design.
Studies sequential prediction with option to abstain in semi-adversarial settings mixing adversarial and stochastic instances.
Creates a deep surrogate model for blast wave prediction that generalizes to out-of-distribution urban scenarios.
Develops federated causal representation learning for decentralized counterfactual reasoning across coupled industrial systems while preserving data privacy.
Introduces SpeedTransformer, a transformer-based model for detecting transportation modes from smartphone GPS speed data.
Proposes Hyperparameter Trajectory Inference to adjust neural network hyperparameters post-deployment without full retraining using optimal transport.
Studies how pretrained Vision-Language-Action models resist catastrophic forgetting during continual learning in robot policy training.
Reduces transformer KV cache by using low-dimensional keys for attention selection while maintaining high-dimensional values, achieving O(log N) dimensional compression.
JAWS improves neural PDE solvers' long-term rollouts using spatially-adaptive Jacobian regularization to prevent spectral blow-up and unphysical divergence.
Adaptive channel pruning technique reduces communication overhead in split learning by selectively transmitting intermediate feature representations.
MR-Search proposes meta-reinforcement learning with self-reflection for agentic search, enabling agents to adapt strategies across episodes and improve in-context exploration.
Method for embodied agents to autonomously discover symmetry group structure for disentangled representation learning without requiring prior knowledge of group properties.
Theoretical investigation of deep residual networks' approximation capacity in continuous dynamical systems, quantifying minimal time-horizons for diffeomorphism approximation.
OMNIFLOW is a multimodal agent combining LLMs with physics-grounded reasoning for scientific tasks involving PDEs, addressing hallucinations through cross-domain generalization.
PhasorFlow: open-source Python library for computing on unit circle using complex phasors and unitary wave interference gates.
Time reparameterization technique for machine-learning reduced-order models of stiff dynamical systems improving training efficiency.
Reddit corpus annotated with moral sentiment and framing for NLP tasks related to moral language detection and analysis.
QFT: quantization-based approach for full-parameter fine-tuning of large language models with limited computational resources.
Reinforcement learning method for quantum circuit design handling device noise and connectivity constraints on real quantum hardware.
Byte-token enhanced language models for temporal point process analysis, modeling event sequences with temporal dynamics and textual descriptions.