Genomic Next-Token Predictors are In-Context Learners
Demonstrates that in-context learning emerges organically in genomic sequence models trained with next-token prediction on DNA sequences.
Develops methods for provably safe ML model updates preventing catastrophic forgetting and alignment drift in dynamic environments.
Proposes data filtering method for cross-domain offline RL addressing dynamics misalignment between source and target domains.
Analyzes KL regularization estimators in RL training of LLMs, comparing bias-variance tradeoffs of different approximation methods.
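The bias-variance comparison above concerns the standard Monte-Carlo KL approximations often labeled k1, k2, and k3 (unbiased/high-variance, biased/low-variance, and unbiased/low-variance, respectively). A minimal numpy sketch, not the paper's analysis: two unit-variance Gaussians stand in for the policy q and reference p, where log p(x) - log q(x) has a closed form.

```python
import numpy as np

rng = np.random.default_rng(0)
mu = 0.1                        # p = N(mu, 1), q = N(0, 1)
true_kl = mu**2 / 2             # closed-form KL(q || p) for unit-variance Gaussians

x = rng.standard_normal(200_000)        # samples from q
logr = mu * x - mu**2 / 2               # log p(x) - log q(x), per sample

k1 = -logr                      # unbiased estimator of KL(q || p), high variance
k2 = logr**2 / 2                # biased estimator, low variance
k3 = np.expm1(logr) - logr      # (r - 1) - log r: unbiased and low variance
```

Averaging each estimator over the samples recovers the true KL for k1 and k3, while k3's per-sample variance is far smaller than k1's, which is the practical reason it is favored in RLHF-style training loops.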
VL-RouterBench benchmark for evaluating vision-language model routing systems with quality-cost tradeoff assessment at scale.
Evaluates feature-dependent noise in preference-based reinforcement learning with realistic noise patterns correlated to observations.
Proposes GIFT method reconciling SFT and RL post-training for Large Reasoning Models via Gibbs initialization to prevent distributional collapse.
Solves constrained optimization problems via gradient-based methods using hierarchical score-matching spaces to overcome local optima.
Proposes neural characteristic function approach for graph domain adaptation addressing distributional shifts without manual feature design.
Studies sequential prediction with option to abstain in semi-adversarial settings mixing adversarial and stochastic instances.
Creates a deep-learning surrogate model for blast-wave prediction that generalizes to out-of-distribution urban scenarios.
Develops federated causal representation learning for decentralized counterfactual reasoning across coupled industrial systems while preserving data privacy.
Introduces SpeedTransformer, a transformer-based model for detecting transportation modes from smartphone GPS speed data.
Proposes Hyperparameter Trajectory Inference to adjust neural network hyperparameters post-deployment without full retraining using optimal transport.
Studies how pretrained Vision-Language-Action models resist catastrophic forgetting during continual learning in robot policy training.
Reduces transformer KV cache by using low-dimensional keys for attention selection while maintaining high-dimensional values, achieving O(log N) dimensional compression.
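The idea of selecting with cheap low-dimensional keys and attending with full-dimensional values can be sketched as a two-stage attention pass. This is an illustrative numpy sketch under assumed shapes, using a random projection as a stand-in for whatever learned compression the paper actually uses:

```python
import numpy as np

rng = np.random.default_rng(0)
N, d, d_low, topk = 512, 64, 8, 32      # seq len, full dim, compressed dim, tokens kept

q = rng.standard_normal(d)              # current query
K = rng.standard_normal((N, d))         # full-dimensional key cache
V = rng.standard_normal((N, d))         # full-dimensional value cache
P = rng.standard_normal((d, d_low)) / np.sqrt(d_low)  # stand-in projection (assumption)

# Stage 1: cheap token selection using low-dimensional keys only.
scores_low = (K @ P) @ (q @ P)          # O(N * d_low) instead of O(N * d)
idx = np.argsort(scores_low)[-topk:]

# Stage 2: exact softmax attention restricted to the selected tokens,
# using the original high-dimensional keys and values.
s = K[idx] @ q / np.sqrt(d)
w = np.exp(s - s.max()); w /= w.sum()
out = w @ V[idx]
```

Only the low-dimensional keys need to be scanned for every token; the full-dimensional values (and keys) are touched for just the top-k selected positions, which is where the cache-bandwidth savings come from.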
JAWS improves neural PDE solvers' long-term rollouts using spatially-adaptive Jacobian regularization to prevent spectral blow-up and unphysical divergence.
Adaptive channel pruning technique reduces communication overhead in split learning by selectively transmitting intermediate feature representations.
MR-Search proposes meta-reinforcement learning with self-reflection for agentic search, enabling agents to adapt strategies across episodes and improve in-context exploration.
Method for embodied agents to autonomously discover symmetry group structure for disentangled representation learning without requiring prior knowledge of group properties.
Theoretical investigation of deep residual networks' approximation capacity in continuous dynamical systems, quantifying minimal time-horizons for diffeomorphism approximation.
OMNIFLOW is a multimodal agent combining LLMs with physics-grounded reasoning for scientific tasks involving PDEs, addressing hallucinations through cross-domain generalization.
PhasorFlow: open-source Python library for computing on unit circle using complex phasors and unitary wave interference gates.
Time reparameterization technique for machine-learning reduced-order models of stiff dynamical systems improving training efficiency.
Reddit corpus annotated with moral sentiment and framing for NLP tasks related to moral language detection and analysis.
QFT: quantization-based approach for full-parameter fine-tuning of large language models with limited computational resources.
Reinforcement learning method for quantum circuit design handling device noise and connectivity constraints on real quantum hardware.
Byte-token-enhanced language models for temporal point process analysis, modeling event sequences with temporal dynamics and textual descriptions.
Method for improving mathematical reasoning in smaller LLMs by integrating arithmetic learning with knowledge distillation and data augmentation.
Survey of edge-cloud collaborative computing paradigms for distributed AI deployment, covering model optimization and LLM inference strategies.
Framework for constraint learning using pruned neural networks as tractable surrogates in optimization problems.
CIM-Explorer tool for optimizing binary and ternary neural networks on RRAM crossbar hardware architectures.
Information Imbalance metric for analyzing semantic information alignment in deep representations across text and image models.
Methods for constructing confidence intervals and hypothesis tests for functionals derived from online/sequential algorithms with computational constraints.
BiomedSQL benchmark for evaluating text-to-SQL systems on biomedical knowledge bases requiring implicit domain reasoning and scientific understanding.
Study on how prompt variability affects LLM code generation quality and functionality across different user backgrounds and expertise levels.
Deep learning models integrated with satellite data to reconstruct global forest carbon dynamics from 1988 to 2021 with uncertainty quantification.
Hebbian Physics Networks: self-organizing computational architecture using plastic transport geometry for solving physical dynamics problems.
SHAP-based framework for analyzing urban exercise inequality using spatial theory and machine learning on Shenzhen street data.
QR-learner model for estimating conditional treatment effects in trials using external data.
ToolRegistry: protocol-agnostic tool management library for function-calling LLMs, addressing fragmentation in tool integration.
Analysis of generalization error in graph neural networks to explain performance variance and benchmark skew.
Benchmark study showing large multimodal models fail at inductive physical reasoning beyond training distribution.
EdiVal-Agent framework for automated, fine-grained evaluation of multi-turn image editing using object-centric assessment.
Detection methods for data contamination in RL post-training phase of LLMs, addressing evaluation validity gap.
Causal discovery method using multi-environment data to achieve full causal graph identifiability with minimal environments.
CBF-RL integrates Control Barrier Functions into RL training to enforce safety constraints during policy learning.
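The core mechanism of a Control Barrier Function safety layer can be shown on a toy system; this is a generic CBF filter sketch, not the paper's training procedure. For a 1-D single integrator x' = u with safe set {x : h(x) = 1 - x >= 0}, the CBF condition h' + alpha*h >= 0 reduces to an upper bound on the action, so the RL policy's action is clipped before it reaches the environment:

```python
alpha, dt = 1.0, 0.01

def h(x):
    # Barrier function: the safe set is {x : h(x) >= 0}, i.e. x <= 1.
    return 1.0 - x

def cbf_filter(x, u_rl):
    # For x' = u, the condition h' + alpha*h >= 0 becomes -u + alpha*(1-x) >= 0,
    # i.e. u <= alpha * h(x). Clip the policy's action to satisfy it.
    return min(u_rl, alpha * h(x))

# An aggressive policy that always pushes toward the unsafe region.
x = 0.0
for _ in range(2000):
    u = cbf_filter(x, u_rl=5.0)
    x += dt * u
```

Despite the policy constantly requesting a large unsafe action, the filtered trajectory approaches the boundary x = 1 asymptotically and never leaves the safe set; integrating such a filter into training (rather than only at deployment) is what the CBF-RL entry above describes.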
Unified optimization framework for jointly inferring time-varying network topologies and imputing missing graph signal data.
Neighbor GRPO extends Group Relative Policy Optimization to flow matching models with contrastive ODE-based approach for generative model alignment.
Knowledge Immunization Framework for selective knowledge erasure from LLMs via representation-aware activation signatures, addressing GDPR and safety.