Isolater - Feed

Ax Xinyu Zhang 3/24/2026

What Do World Models Learn in RL? Probing Latent Representations in Learned Environment Simulators

Interpretability study probing internal representations of world models (IRIS and DIAMOND) in RL using linear/nonlinear probing and causal interventions.

Ax Andrii Shportko 3/24/2026

Kolmogorov Complexity Bounds for LLM Steganography and a Perplexity-Based Detection Proxy

Information-theoretic analysis of LLM steganography showing Kolmogorov complexity bounds on hidden payload embedding in text while preserving semantic meaning.

Ax Md Kaykobad Reza, Ameya Patil, Edward Ayrapetian, M. Salman Asif 3/24/2026

SSAM: Singular Subspace Alignment for Merging Multimodal Large Language Models

SSAM method merges multiple pre-trained multimodal LLMs without additional training by aligning singular subspaces, enabling efficient multi-modality integration.

Ax Devashish Chaudhary, Sutharshan Rajasegarar, Shiva Raj Pokhrel, Lei Pan, Ruby D 3/24/2026

In-network Attack Detection with Federated Deep Learning in IoT Networks: Real Implementation and Analysis

Lightweight autoencoder-based anomaly detection using federated learning for IoT networks, enabling privacy-preserving security monitoring on resource-constrained devices.

Ax Philip S. Yu, Li Sun 3/24/2026

Riemannian Geometry Speaks Louder Than Words: From Graph Foundation Model to Next-Generation Graph Intelligence

Framework for building general-purpose Graph Foundation Models using Riemannian geometry principles, analogous to large language models for graph-structured data.

Ax Woosung Koh, Jeyoung Jeon, Youngjin Song, Yujin Cheon, Soowon Oh, Jaehyeong Choi, Se-Young Yun 3/24/2026

mSFT: Addressing Dataset Mixtures Overfitting Heterogeneously in Multi-task SFT

mSFT algorithm for multi-task supervised fine-tuning that addresses heterogeneous overfitting by dynamically adjusting compute budget per dataset to balance learning rates.

Ax Abdou-Raouf Atarmla 3/24/2026

Rule-State Inference (RSI): A Bayesian Framework for Compliance Monitoring in Rule-Governed Domains

Bayesian framework for compliance monitoring in rule-governed domains, inferring latent states given known rules rather than learning rules from data.

Ax Shiyan Hu, Jianxin Jin, Yang Shu, Peng Chen, Bin Yang, Chenjuan Guo 3/24/2026

Towards Multimodal Time Series Anomaly Detection with Semantic Alignment and Condensed Interaction

Multimodal time series anomaly detection model combining numerical and semantic data with alignment and interaction mechanisms for dynamic system monitoring.

Ax Yuehu Gong, Zeyuan Wang, Yulin Chen, Yanwei Fu 3/24/2026

Proximal Policy Optimization in Path Space: A Schrödinger Bridge Perspective

GSB-PPO extends proximal policy optimization to trajectory-level generative policies using a Schrödinger Bridge perspective, enabling diffusion- and flow-based policy optimization.

Ax Yunchi Yang, Longlong Li, Jianliang Wu, Cunquan Qu 3/24/2026

MISApp: Multi-Hop Intent-Aware Session Graph Learning for Next App Prediction

Session-based graph learning model for predicting next mobile app launches by modeling multi-hop intent patterns and handling sparse/cold-start user profiles.

Ax Vagish Kumar, Syed Bahauddin Alam, Souvik Chakraborty 3/24/2026

TrustFed: Enabling Trustworthy Medical AI under Data Privacy Constraints

Federated learning framework for privacy-preserving medical AI training across healthcare institutions while addressing data heterogeneity and deployment challenges.

Ax Tian Xia 3/24/2026

Data-Free Layer-Adaptive Merging via Fisher Information for Long-to-Short Reasoning LLMs

Model merging technique using Fisher Information to combine long-chain-of-thought and base LLMs, preserving reasoning accuracy while reducing output length without additional training.

Ax Bahar Dibaei Nia, Farzan Farnia 3/24/2026

When Exploration Comes for Free with Mixture-Greedy: Do we need UCB in Diversity-Aware Multi-Armed Bandits?

Multi-armed bandit approach for selecting among generative models under diversity-aware metrics, addressing efficient model selection in generative AI without relying on classical UCB algorithms.

Ax Dongxia Wu, Yuhui Zhang, Serena Yeung-Levy, Emma Lundberg, Emily B. Fox 3/24/2026

Uncertainty Quantification for Distribution-to-Distribution Flow Matching in Scientific Imaging

Uncertainty quantification for distribution-to-distribution flow matching in scientific imaging applications.

Ax Bulent Haznedar, Levent Karacan 3/24/2026

FISformer: Replacing Self-Attention with a Fuzzy Inference System in Transformer Models for Time Series Forecasting

FISformer replaces self-attention with fuzzy inference systems in transformers for time series forecasting, addressing uncertainty modeling limitations of dot-product attention.

Ax Dongxia Wu, Shiye Su, Yuhui Zhang, Elaine Sui, Emma Lundberg, Emily B. Fox, Serena Yeung-Levy 3/24/2026

CellFluxRL: Biologically-Constrained Virtual Cell Modeling via Reinforcement Learning

Post-training virtual cell models with RL using biologically-constrained reward functions for drug discovery simulation.

Ax Yuze Qin, Qingyong Li, Zhiqing Guo, Wen Wang, Yan Liu, Yangli-ao Geng 3/24/2026

Extending Precipitation Nowcasting Horizons via Spectral Fusion of Radar Observations and Foundation Model Priors

Precipitation nowcasting approach combining radar imagery with weather foundation model predictions via spectral fusion.

Ax Armand Rousselot, Joran Wendebourg, Ullrich Köthe 3/24/2026

Show Me What You Don't Know: Efficient Sampling from Invariant Sets for Model Validation

Method for analyzing feature invariances in ML models by sampling from learned equivalence classes without dedicated generators.

Ax Hanyin Cheng, Xingjian Wu, Yang Shu, Zhongwen Rao, Lujia Pan, Bin Yang, Chenjuan Guo 3/24/2026

CoRA: Boosting Time Series Foundation Models for Multivariate Forecasting through Correlation-aware Adapter

Lightweight adapter module enhancing time series foundation models by incorporating correlation information across channels.

Ax Mohammad Moulaeifard, Philip J. Aston, Peter H. Charlton, Nils Strodthoff 3/24/2026

Deriving Health Metrics from the Photoplethysmogram: Benchmarks and Insights from MIMIC-III-Ext-PPG

Benchmark dataset and baselines for PPG-based clinical prediction tasks from MIMIC-III data.

Ax Marc Franquesa Monés, Jiaqi Zhang, Caroline Uhler 3/24/2026

On the Number of Conditional Independence Tests in Constraint-based Causal Discovery

Analysis of computational complexity in constraint-based causal discovery algorithms using conditional independence tests.

Ax Weilin Wan, Jingtao Han, Weizhong Zhang, Cheng Jin 3/24/2026

Holistic Scaling Laws for Optimal Mixture-of-Experts Architecture Optimization

Scaling laws for Mixture-of-Experts architecture design balancing global interactions and MoE-specific variables in LLMs.

Ax Xinyu Lu, Kaiqi Zhang, Jinglin Yang, Boxi Cao, Yaojie Lu, Hongyu Lin, Min He, Xianpei Han, Le Sun 3/24/2026

P^2O: Joint Policy and Prompt Optimization

Joint optimization of RL policies and LLM prompts for improving reasoning with verifiable rewards on hard samples.

Ax Nikolas Stavrou, Siamak Mehrkanoon 3/24/2026

SmaAT-QMix-UNet: A Parameter-Efficient Vector-Quantized UNet for Precipitation Nowcasting

Parameter-efficient vector-quantized UNet variant for weather precipitation nowcasting with reduced computational requirements.

Ax Ziyang Zhang, Zheshun Wu, Jie Liu, Luca Mottola 3/24/2026

SparseDVFS: Sparse-Aware DVFS for Energy-Efficient Edge Inference

Energy optimization technique for edge device inference using fine-grained DVFS scaling aware of network sparsity.

Ax Juan Sebastian Rojas, Chi-Guhn Lee 3/24/2026

Deep Reinforcement Learning and The Tale of Two Temporal Difference Errors

Analysis of temporal difference error interpretations in deep reinforcement learning and impact on critic loss formulation.

Ax Xixi Wu, Qianguo Sun, Ruiyang Zhang, Chao Song, Junlong Wu, Yiyan Qi, Hong Cheng 3/24/2026

Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe

Systematic empirical study on scaling RL for autonomous LLM agents with long-horizon tool orchestration, using the TravelPlanner benchmark.

Ax Ehimare Okoyomon, Christoph Goebel 3/24/2026

BOOST-RPF: Boosted Sequential Trees for Radial Power Flow

Gradient-boosted decision trees method for power flow analysis in distribution systems using sequential path-based learning.

Ax Dilina Rajapakse, Juan C. Rosero, Ivana Dusparic 3/24/2026

TREX: Trajectory Explanations for Multi-Objective Reinforcement Learning

Framework for explaining trajectories in multi-objective reinforcement learning agents handling conflicting objectives.

Ax Cristian Pérez-Corral, Alberto Fernández-Hernández, Jose I. Mestre, Manuel F. Dolz, Enrique S. Quintana-Ortí 3/24/2026

λ-GELU: Learning Gating Hardness for Controlled ReLU-ization in Deep Networks

Learning-based approach to parameterize GELU activation functions for converting smooth networks to piecewise-linear ReLU equivalents.

Ax Paolo Toccaceli 3/24/2026

CRPS-Optimal Binning for Conformal Regression

Non-parametric conformal regression method using binning optimization with CRPS metric for conditional distribution estimation.

Ax Xinyan Wang, Xiaogeng Liu, Chaowei Xiao 3/24/2026

ROM: Real-time Overthinking Mitigation via Streaming Detection and Intervention

Method to reduce overthinking in Large Reasoning Models by detecting and stopping redundant reasoning steps, lowering latency and compute costs.

Ax Peter Pak, Amir Barati Farimani 3/24/2026

AdditiveLLM2: A Multi-modal Large Language Model for Additive Manufacturing

AdditiveLLM2, a domain-adapted multimodal LLM based on Gemma 3, targets additive manufacturing through instruction tuning on a domain corpus.

Ax Tianxiang Xu, Xiaoyan Zhu, Xin Lai, Sizhe Dang, Xin Lian, Hangyu Cheng, Jiayin Wang 3/24/2026

Do Papers Match Code? A Benchmark and Framework for Paper-Code Consistency Detection in Bioinformatics Software

Framework and benchmark for detecting inconsistencies between research papers and their implementations in bioinformatics software.

Ax Julius Kobialka, Emanuel Sommer, Chris Kolb, Juntae Kwon, Daniel Dold, David Rügamer 3/24/2026

On the Interplay of Priors and Overparametrization in Bayesian Neural Network Posteriors

Analysis of how overparametrization and priors interact in Bayesian neural network posteriors and their effects on inference.

Ax Valentin Petrov 3/24/2026

On the Failure of Topic-Matched Contrast Baselines in Multi-Directional Refusal Abliteration

Study on why topic-matched contrast baselines fail in directional refusal abliteration for removing safety behaviors from LLMs.

Ax Aurora Esteban, Amelia Zafra, Sebastián Ventura 3/24/2026

MIHT: A Hoeffding Tree for Time Series Classification using Multiple Instance Learning

MIHT algorithm for time series classification using multi-instance learning on variable-length and high-dimensional temporal data.

Ax Kexin Huang, Haoming Meng, Junkang Wu, Jinda Lu, Chiyu Ma, Ziqian Chen, Xue Wang, Bolin Ding, Jiancan Wu, Xiang Wang, Xiangnan He, Guoyin Wang, Jingren Zhou 3/24/2026

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

Analysis of reinforcement learning with verifiable rewards for LLM reasoning, focusing on direction rather than magnitude of weight updates.

Ax Shreeram Murali, Cristian R. Rojas, Dominik Baumann 3/24/2026

Computationally lightweight classifiers with frequentist bounds on predictions

Computationally efficient classifier with frequentist uncertainty bounds suitable for safety-critical applications.

Ax Alois Bachmann 3/24/2026

dynActivation: A Trainable Activation Family for Adaptive Nonlinearity

Trainable activation function family (dynActivation) providing adaptive nonlinearity for vision and language modeling tasks.

Ax Abolfazl Hashemi 3/24/2026

RAMPAGE: RAndomized Mid-Point for debiAsed Gradient Extrapolation

RAMPAGE algorithm addressing discretization bias in extragradient methods for variational inequalities with variance reduction.

Ax Moritz Gögl, Christopher Yau 3/24/2026

Multimodal Survival Analysis with Locally Deployable Large Language Models

Multimodal survival analysis combining clinical text, tabular data, and genomics using locally deployable lightweight LLMs for privacy-constrained settings.

Ax Dharshan Kumaran, Nathaniel Daw, Simon Osindero, Petar Velickovic, Viorica Patraucean 3/24/2026

Causal Evidence that Language Models use Confidence to Drive Behavior

Causal investigation of whether LLMs use internal confidence estimates to regulate behavior, based on experiments with an abstention paradigm.

Ax Yurong Chen, Zhiyi Huang, Michael I. Jordan, Haipeng Luo 3/24/2026

Calibeating Made Simple

Theoretical framework reducing calibration of forecasts to online learning techniques with results for general proper losses.

Ax Oscar Novo, Oscar Bastidas-Jossa, Alberto Calvo, Antonio Peris, Carlos Kuchkovsky 3/24/2026

Revisiting Quantum Code Generation: Where Should Domain Knowledge Live?

Study on where to incorporate domain knowledge in LLM-based code generation for quantum software development while keeping the resulting system maintainable.

Ax Kangqi Ni, Wenyue Hua, Xiaoxiang Shi, Jiang Guo, Shiyu Chang, Tianlong Chen 3/24/2026

Chimera: Latency- and Performance-Aware Multi-agent Serving for Heterogeneous LLMs

Chimera serving system for multi-agent LLM workflows optimizing latency and performance on heterogeneous model deployments.

Ax Kexian Tang, Jiani Wang, Shaowen Wang, Kaifeng Lyu 3/24/2026

SPA: A Simple but Tough-to-Beat Baseline for Knowledge Injection

SPA baseline method using prompt engineering to generate synthetic data for knowledge injection into LLMs in specialized domains.

Ax Qilin Wang 3/24/2026

Noise Titration: Exact Distributional Benchmarking for Probabilistic Time Series Forecasting

Benchmarking methodology for probabilistic time series forecasting using noise titration to test model robustness to non-stationarity.

Ax Changxiao Cai, Gen Li 3/24/2026

Confidence-Based Decoding is Provably Efficient for Diffusion Language Models

Decoding strategy analysis for diffusion language models showing confidence-based decoding is provably efficient for parallel token generation.

Ax Zakaria Mhammedi, James Cohan 3/24/2026

Decoupling Exploration and Policy Optimization: Uncertainty Guided Tree Search for Hard Exploration

Reinforcement learning approach decoupling exploration and policy optimization using uncertainty-guided tree search for autonomous agent exploration.