Isolater - Feed

Ax Mayank Mishra, Shawn Tan, Ion Stoica, Joseph Gonzalez, Tri Dao 3/17/2026

M$^2$RNN: Non-Linear RNNs with Matrix-Valued States for Scalable Language Modeling

M²RNN introduces matrix-valued hidden states in RNNs to achieve greater expressive power than transformers for language modeling tasks.

Ax Asela Hevapathige, Yu Xia, Sachith Seneviratne, Saman Halgamuge 3/17/2026

From Specification to Architecture: A Theory Compiler for Knowledge-Guided Machine Learning

Theory compiler framework automates translation of formal domain knowledge into neural network architectural constraints with correctness guarantees.

Ax Parth Patne, Mahdi Taheri, Ali Mahani, Maksim Jenihhin, Reza Mahani, Christian Herglotz 3/17/2026

SPARQ: Spiking Early-Exit Neural Networks for Energy-Efficient Edge AI

SPARQ framework combines spiking neural networks with quantization-aware training and RL-guided early exits for energy-efficient edge AI deployment.

Ax Xiaoliang Fu, Jiaye Lin, Yangyi Fang, Chaowen Hu, Cong Qin, Zekai Shao, Binbin Zheng, Lu Pan, Ke Zeng 3/17/2026

From $\boldsymbol{\log\pi}$ to $\boldsymbol{\pi}$: Taming Divergence in Soft Clipping via Bilateral Decoupled Decay of Probability Gradient Weight

Bilateral Decoupled Decay improves soft clipping in LLM reasoning with RLVR, addressing gradient divergence and enabling better exploration during policy optimization.

Ax Wonbin Lee, Dongki Kim, Sung Ju Hwang 3/17/2026

ES-Merging: Biological MLLM Merging via Embedding Space Signals

ES-Merging merges biological multimodal LLMs using embedding space signals to enable cross-modal scientific discovery beyond single-modality specialization.

Ax AbdulQoyum A. Olowookere, Adewale U. Oguntola, Ebenezer. Leke Odekanle 3/17/2026

Graph-Based Deep Learning for Intelligent Detection of Energy Losses, Theft, and Operational Inefficiencies in Oil & Gas Production Networks

Spatiotemporal graph-based deep learning for detecting energy losses, theft, and inefficiencies in oil and gas production networks under distribution shifts.

Ax Shiyuan Li, Yixin Liu, Yu Zheng, Xiaofeng Cao, Shirui Pan, Heng Tao Shen 3/17/2026

Towards One-for-All Anomaly Detection for Tabular Data

OFA-TAD proposes generalist one-for-all anomaly detection for tabular data with cross-domain generalization, replacing dataset-specific training approaches.

Ax Yuantong Li, Lei Yuan, Zhihao Zheng, Weimiao Wu, Songbin Liu, Jeong Min Lee, Ali Selman Aydin, Shaofeng Deng, Junbo Chen, Xinyi Zhang, Hongjing Xia, Sam Fieldman, Matthew Kosko, Wei Fu, Du Zhang, Peiyu Yang, Albert Jin Chung, Xianlei Qiu, Miao Yu, Zhongwei Teng, Hao Chen, Sunny Baek, Hui Tang, Yang Lv, Renze Wang, Qifan Wang, Zhan Li, Tiantian Xu, Peng Wu, Ji Liu 3/17/2026

MBD: A Model-Based Debiasing Framework Across User, Content, and Model Dimensions

Model-based debiasing framework for recommendation systems addressing heterogeneous biases across user, content, and model dimensions in ranking aggregation.

Ax Ziwei Liu, Tao Feng, Borui Kang, Yanbing Yang, Jun Luo 3/17/2026

Zoom to Essence: Trainless GUI Grounding by Inferring upon Interface Elements

Trainless GUI grounding for MLLM-based agents using element-level inference to map natural language to UI components without fine-tuning or large datasets.

Ax Sungwoo Kang 3/17/2026

STAG-CN: Spatio-Temporal Apiary Graph Convolutional Network for Disease Onset Prediction in Beehive Sensor Networks

Graph convolutional network for disease prediction in beehives using spatio-temporal modeling of inter-hive relationships from sensor networks.

Ax Xinyu Yuan, Yan Qiao, Zonghui Wang, Wenzhi Chen 3/17/2026

On the (Generative) Linear Sketching Problem

Addresses linear sketching problem for data streaming, achieving near-perfect recovery from compact sketch summaries with lightweight computational procedures.

Ax Akshansh Mishra 3/17/2026

Geometric and Topological Deep Learning for Predicting Thermo-mechanical Performance in Cold Spray Deposition Process Modeling

Geometric deep learning framework using GNNs to predict cold spray particle impact responses from simulation data for deposition process modeling.

Ax Markus W. Baumgartner, Anson Lei, Joe Watson, Ingmar Posner 3/17/2026

Disentangling Dynamical Systems: Causal Representation Learning Meets Local Sparse Attention

Combines causal representation learning with local sparse attention for system identification, enabling interpretable deep learning of dynamical systems without predefined function libraries.

Ax Michal Wozniak, Marek Klonowski, Maciej Maczynski, Bartosz Krawczyk 3/17/2026

Unlearning-based sliding window for continual learning under concept drift

Unlearning-based sliding window approach for continual learning under concept drift, enabling models to adapt to non-stationary data streams without explicit task boundaries.

Ax Chenglong Duan, Dazhong Wu 3/17/2026

Predicting Stress-strain Behaviors of Additively Manufactured Materials via Loss-based and Activation-based Physics-informed Machine Learning

Physics-informed ML framework predicts stress-strain behavior of additively manufactured materials by combining data-driven models with physical constraints.

Ax Niklas Schweiger, Daniel Cremers, Karnik Ram 3/17/2026

Trust-Region Noise Search for Black-Box Alignment of Diffusion and Flow Models

Proposes trust-region search algorithm for aligning diffusion and flow models to target rewards at inference time without requiring differentiable reward models.

Ax Jingyi Liu, Jian Guo, Eberhard Gill 3/17/2026

Visualizing Critic Match Loss Landscapes for Interpretation of Online Reinforcement Learning Control Algorithms

Visualizes critic loss landscapes in online actor-critic reinforcement learning to improve interpretability of algorithm performance under changing system dynamics.

Ax Jan Kobiolka, Christian Frey, Arlind Kadra, Gresa Shala, Josif Grabocka 3/17/2026

Learning to Order: Task Sequencing as In-Context Optimization

Demonstrates deep neural networks can meta-learn task sequencing from few demonstrations, enabling generalization to new sequencing problems without task-specific training.

Ax Yongqiang Chen, Chenxi Liu, Zhenhao Chen, Tongliang Liu, Bo Han, Kun Zhang 3/17/2026

CausalEvolve: Towards Open-Ended Discovery with Causal Scratchpad

CausalEvolve improves LLM-based AI agents for open-ended scientific discovery by adding causal guidance and knowledge organization mechanisms to program evolution.

Ax Jingyi Liu, Jian Guo, Eberhard Gill 3/17/2026

Adapting Critic Match Loss Landscape Visualization to Off-policy Reinforcement Learning

Extends critic match loss landscape visualization from online to off-policy reinforcement learning to reveal optimization geometry in critic learning.

Ax Wilhelm Tranheden, Shahnawaz Ahmed, Devdatt Dubhashi, Jonna Matthiesen, Hannes von Essen 3/17/2026

FlashHead: Efficient Drop-In Replacement for the Classification Head in Language Model Inference

FlashHead provides efficient drop-in replacement for classification head in language model inference, reducing parameters and compute by ~50%.

Ax Yiming Lei, Qiannan Shen, Junhao Song 3/17/2026

A Multi-Scale Graph Learning Framework with Temporal Consistency Constraints for Financial Fraud Detection in Transaction Networks under Non-Stationary Conditions

Multi-scale graph learning framework for fraud detection in transaction networks handling sparse anomalies and temporal drift.

Ax Jingyi Liu, Jian Guo, Eberhard Gill 3/17/2026

A Loss Landscape Visualization Framework for Interpreting Reinforcement Learning: An ADHDP Case Study

Loss landscape visualization framework for interpreting reinforcement learning algorithms, demonstrated on critic-based control methods.

Ax Ian Osband 3/17/2026

Delightful Policy Gradient

Delightful policy gradient method that addresses variance issues in policy gradient updates by accounting for action likelihood under current policy.

Ax Iqtedar Uddin, Mazin Khider, Andr\'e Bauer 3/17/2026

Proactive Routing to Interpretable Surrogates with Distribution-Free Safety Guarantees

Proactive routing system that selects between black-box models and interpretable surrogates with distribution-free safety guarantees.

Ax Sai P. Selvaraj, Khadija Mahmoud, Anuj Iravane 3/17/2026

Anterior's Approach to Fairness Evaluation of Automated Prior Authorization System

Fairness evaluation framework for automated prior authorization systems addressing demographic differences in clinical decision-making.

Ax Anirudh Tunga, Michael J. Mueterthies, Jonathan Nistor 3/17/2026

A Methodology for Thermal Limit Bias Predictability Through Artificial Intelligence

Deep learning methodology to predict and correct thermal limit bias in boiling water reactors for nuclear power plant operations.

Ax Mike Amega 3/17/2026

EARCP: Self-Regulating Coherence-Aware Ensemble Architecture for Sequential Decision Making -- Ensemble Auto-Regule par Coherence et Performance

EARCP ensemble architecture dynamically weights heterogeneous expert models based on performance and inter-model coherence for sequential decision making.

Ax Zhaohui Geoffrey Wang 3/17/2026

AgentTrace: Causal Graph Tracing for Root Cause Analysis in Deployed Multi-Agent Systems

AgentTrace provides causal graph tracing for post-hoc failure diagnosis in deployed multi-agent systems through execution log analysis.

Ax Ping Chen, Xiang Liu, Xingpeng Zhang, Fei Shen, Xun Gong, Zhaoxiang Liu, Zezhou Chen, Huan Hu, Kai Wang, Shiguo Lian 3/17/2026

Chain-of-Trajectories: Unlocking the Intrinsic Generative Optimality of Diffusion Models via Graph-Theoretic Planning

Chain-of-Trajectories framework enables content-aware sampling schedules in diffusion models via graph-theoretic planning without additional training.

Ax Seunghan Lee, Jaehoon Lee, Jun Seo, Sungdong Yoo, Minjae Kim, Tae Yoon Lim, Dongwan Kang, Hwanil Choi, SoonYoung Lee, Wonbin Ahn 3/17/2026

Cross-RAG: Zero-Shot Retrieval-Augmented Time Series Forecasting via Cross-Attention

Cross-RAG applies retrieval-augmented generation with cross-attention to improve zero-shot time series forecasting using foundation models.

Ax Jeffrey D. Varner 3/17/2026

Training-Free Generation of Protein Sequences from Small Family Alignments via Stochastic Attention

Training-free protein sequence generation using stochastic attention over Hopfield energy without requiring training or pretraining data.

Ax Binesh Sadanandan 3/17/2026

Multimodal Deep Learning for Early Prediction of Patient Deterioration in the ICU: Integrating Time-Series EHR Data with Clinical Notes

Multimodal deep learning approach combining time-series EHR data and clinical notes for predicting patient deterioration in ICU settings.

Ax Zhiyu Wang, Mohammad Goudarzi, Mingming Gong, Rajkumar Buyya 3/17/2026

DeFRiS: Silo-Cooperative IoT Applications Scheduling via Decentralized Federated Reinforcement Learning

DeFRiS applies decentralized federated reinforcement learning for IoT application scheduling across heterogeneous devices while preserving privacy.

Ax Yu Hao (Beijing University of Posts and Telecommunications), Qiuyu Wang (Beijing University of Posts and Telecommunications), Cheng Yang (Beijing University of Posts and Telecommunications), Yawen Li (Beijing University of Posts and Telecommunications), Zhiqiang Zhang (Ant Group), Chuan Shi (Beijing University of Posts and Telecommunications) 3/17/2026

GNNVerifier: Graph-based Verifier for LLM Task Planning

GNNVerifier uses graph neural networks to verify and correct task plans generated by LLMs in autonomous agent systems, reducing hallucinations.

Ax Huijie Guo, Jingyao Wang, Lingyu Si, Jiahuan Zhou, Changwen Zheng, Wenwen Qiang 3/17/2026

CAMD: Coverage-Aware Multimodal Decoding for Efficient Reasoning of Multimodal Large Language Models

CAMD proposes coverage-aware decoding for multimodal LLMs to allocate compute efficiently by identifying easy vs hard reasoning cases.

Ax Matthew Burfitt, Jacek Brodzki, Pawel D{\l}otko 3/17/2026

Understanding the geometry of deep learning with decision boundary volume

Method to measure decision boundary geometry of neural networks using local surface volumes to analyze model accuracy and robustness properties.

Ax Xuanfei Ren, Allen Nie, Tengyang Xie, Ching-An Cheng 3/17/2026

POLCA: Stochastic Generative Optimization with LLM

POLCA framework uses LLMs as optimizers to automatically improve complex systems like prompts and multi-turn agents through numerical rewards and text feedback.

Ax Qiyuan Chen, Xian Wu, Yi Wang, Xianhao Chen 3/17/2026

HO-SFL: Hybrid-Order Split Federated Learning with Backprop-Free Clients and Dimension-Free Aggregation

HO-SFL proposes hybrid-order split federated learning to reduce memory costs of backpropagation on edge devices while maintaining convergence speed.

Ax Qing-Yuan Wen, Da-Qing Zhang 3/17/2026

Orthogonal Subspace Clustering: Enhancing High-Dimensional Data Analysis through Adaptive Dimensionality Reduction and Efficient Clustering

Orthogonal Subspace Clustering method for high-dimensional data using matrix decomposition and factor analysis with theoretical guarantees.

Ax Zihan Dun, Liuyi Xu, An-Yang Lu, Shuang Li, Yining Qian 3/17/2026

LaPro-DTA: Latent Dual-View Drug Representations and Salient Protein Feature Extraction for Generalizable Drug--Target Affinity Prediction

LaPro-DTA framework for generalizable drug-target affinity prediction using latent dual-view drug representations and salient protein features.

Ax Wen-Jing Li, Da-Qing Zhang 3/17/2026

GARCH-FIS: A Hybrid Forecasting Model with Dynamic Volatility-Driven Parameter Adaptation

GARCH-FIS hybrid model combining fuzzy inference with GARCH for financial time series forecasting with dynamic parameter adaptation.

Ax Yiming Gao, Liuyi Xu, Pengshan Cui, Yining Qian, An-Yang Lu, Xianpeng Wang 3/17/2026

Multi-Task Genetic Algorithm with Multi-Granularity Encoding for Protein-Nucleotide Binding Site Prediction

Multi-task genetic algorithm with multi-granularity encoding for protein-nucleotide binding site prediction.

Ax Zhaohui Geoffrey Wang 3/17/2026

Universe Routing: Why Self-Evolving Agents Need Epistemic Control

Universe Routing framework addressing epistemic control in self-evolving agents by managing epistemologically incompatible reasoning frameworks.

Ax Jan Williams, Dima Tretiak, Steven L. Brunton, J. Nathan Kutz, Krithika Manohar 3/17/2026

OpenReservoirComputing: GPU-Accelerated Reservoir Computing in JAX

OpenReservoirComputing: Python library for GPU-accelerated reservoir computing in JAX with automatic differentiation and JIT compilation.

Ax Yuri Kinoshita, Naoki Nishikawa, Taro Toyoizumi 3/17/2026

Dataset Distillation Efficiently Encodes Low-Dimensional Representations from Gradient-Based Learning of Non-Linear Tasks

Theoretical analysis of dataset distillation showing how gradient-based learning extracts and encodes task-relevant information into synthetic data.

Ax William Peng, Josheev Rai, Kevin Tseng, Siwei Wang, Sean Wu 3/17/2026