S0 Tuning: Zero-Overhead Adaptation of Hybrid Recurrent-Attention Models
S0 tuning: zero-overhead adaptation of hybrid recurrent-attention models, outperforming LoRA on code generation.
Function-based uncertainty quantification for safe learning-based control in safety-critical systems.
Learning to generate mixed quantum states prepared by shallow channel circuits in trivial phases.
RELISH lightweight architecture for text regression with LLMs using iterative latent state refinement.
Survey on Graph Neural Network acceleration techniques across algorithms, systems, and customized hardware.
RobustRAG defense framework with certifiable robustness against retrieval corruption attacks on RAG systems.
Inductive manifold learning approach for nonlinear dimensionality reduction preserving both local and global structure.
Domain adaptation with distribution shifts and unobserved confounding using linear structural causal models.
Topological Alignment Spectra method for analyzing multi-scale structural relationships in neural network representations.
Gaussian Process interpretation of wide neural networks with observation noise and arbitrary prior means.
Gradient-based hyperparameter learning via evidence lower bound objective from Bayesian variational methods.
Transformer-based decoder for Varshamov-Tenengolts codes correcting insertion, deletion, and substitution errors.
Multi-armed bandit algorithm using local graph structure to minimize regret under network interference.
Compositional automata learning technique for inferring models of concurrent systems through alphabet refinement.
SetONet neural operator for solving PDEs with variable sensor layouts by treating inputs as unordered sets.
Neural framework for learning conditional optimal transport maps using hypernetworks to generate adaptive transport parameters.
JUSSA framework uses steering vectors to improve LLM-as-a-judge reliability, detecting and mitigating sycophancy through honesty-promoting alternatives.
Binned semiparametric Bayesian networks for efficient kernel density estimation using data binning to reduce computational cost.
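The entry above hinges on data binning to cut the cost of kernel density estimation. A minimal pure-Python sketch of that general idea (my own illustration, not the paper's semiparametric Bayesian-network construction): instead of summing one kernel per data point, pool points into bins and sum one kernel per bin center weighted by the bin count.

```python
import math

def binned_kde(data, query, n_bins=64, bandwidth=0.5):
    """Approximate Gaussian KDE at `query`: bin the data, then sum one
    kernel per bin center weighted by its count, so per-query cost
    drops from O(n) to O(n_bins)."""
    lo, hi = min(data), max(data)
    width = (hi - lo) / n_bins or 1.0
    counts = [0] * n_bins
    for x in data:
        counts[min(int((x - lo) / width), n_bins - 1)] += 1
    centers = [lo + (i + 0.5) * width for i in range(n_bins)]
    norm = len(data) * bandwidth * math.sqrt(2 * math.pi)
    return sum(c * math.exp(-0.5 * ((query - m) / bandwidth) ** 2)
               for c, m in zip(counts, centers) if c) / norm

data = [0.1 * i for i in range(100)]  # roughly uniform sample on [0, 9.9]
print(f"binned KDE at 5.0: {binned_kde(data, 5.0):.3f}")
```

The approximation error shrinks as bins get narrower relative to the bandwidth; with 64 bins the estimate here is close to the exact O(n) KDE.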
Double-Diffusion integrates ODE-prior with denoising diffusion models for spatio-temporal graph forecasting, balancing deterministic and stochastic components.
Klear-Reasoner model with long reasoning capabilities using gradient-preserving clipping policy optimization, with detailed training disclosures.
Knowledge component discovery in programming using representation learning on student code for personalized instruction systems.
Thompson sampling analysis for Sharpe ratio optimization in the multi-armed bandit setting, addressing the fractional objective with dependent mean and variance.
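For context on the entry above, here is baseline Thompson sampling with Gaussian rewards, the standard algorithm the Sharpe-ratio variant builds on (this sketch optimizes the mean; the paper's contribution is handling the fractional mean/variance objective, which is not shown here):

```python
import math
import random

def thompson_gaussian(pull, n_arms=3, horizon=500, sigma=1.0):
    """Baseline Thompson sampling: N(0, 1) prior on each arm's mean,
    known noise scale `sigma`. Each round, sample a mean from every
    arm's posterior and pull the argmax."""
    n = [0] * n_arms    # pull counts
    s = [0.0] * n_arms  # reward sums
    for _ in range(horizon):
        # posterior over arm a's mean is N(s[a]/(n[a]+1), sigma^2/(n[a]+1))
        samples = [random.gauss(s[a] / (n[a] + 1),
                                sigma / math.sqrt(n[a] + 1))
                   for a in range(n_arms)]
        a = samples.index(max(samples))
        r = pull(a)
        n[a] += 1
        s[a] += r
    return n

# toy bandit: arm means 0.0, 0.5, 1.0 with unit Gaussian noise
random.seed(0)
means = [0.0, 0.5, 1.0]
counts = thompson_gaussian(lambda a: random.gauss(means[a], 1.0))
print(counts)  # the best arm (index 2) should accumulate most pulls
```

The Sharpe-ratio setting is harder because the objective mean/std is a ratio of two coupled posterior quantities, so the per-arm sampling step above no longer decomposes cleanly.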
LSTM-based machine learning calibrator for agent-based epidemic models, learning inverse mapping from time series to SIR parameters.
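The calibrator in the entry above learns the inverse map from epidemic curves to SIR parameters. A minimal sketch of the forward model it inverts (a discrete-time SIR simulator of my own; the paper's agent-based model is richer):

```python
def sir_simulate(beta, gamma, s0=0.99, i0=0.01, steps=100):
    """Discrete-time SIR dynamics over population fractions.
    beta: transmission rate, gamma: recovery rate. Returns the
    infected-fraction time series a calibrator would map back
    to (beta, gamma)."""
    s, i, r = s0, i0, 0.0
    curve = []
    for _ in range(steps):
        new_inf = beta * s * i  # S -> I transitions this step
        new_rec = gamma * i     # I -> R transitions this step
        s, i, r = s - new_inf, i + new_inf - new_rec, r + new_rec
        curve.append(i)
    return curve

curve = sir_simulate(beta=0.3, gamma=0.1)  # basic reproduction number R0 = 3
print(f"peak infection fraction: {max(curve):.3f}")
```

Training data for the inverse model would pair many sampled (beta, gamma) values with their simulated curves; the LSTM then regresses parameters from an observed time series.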
EEG classification study comparing neural network architectures and optimizers across brain hemisphere frequency bands using TensorFlow/PyTorch.
Comprehensive survey of intrinsic dimension estimators under manifold hypothesis, reviewing theoretical foundations and comparing eight methods.
Analysis of weight constraints in linear smoothers for causal inference, balancing feature imbalance against parametric modeling assumptions.
Polychromic objectives approach to prevent mode collapse in reinforcement learning fine-tuning, preserving policy diversity during exploration.
Convergence analysis for decentralized SGD with high-probability guarantees, removing restrictive assumptions on gradient bounds and noise.
Mathematical analysis of incoherence in goal-conditioned autoregressive models, studying policy improvement through fine-tuning with online RL.
Empirical study examining the R-Learner framework limitations for network causal inference with graph-dependent heterogeneous treatment effects.
Theoretical analysis of diffusion models on discrete state spaces, establishing convergence guarantees for masked and random walk dynamics.
Tomographic Quantile Forests (TQF) for nonparametric uncertainty quantification in multivariate regression tasks.
Meta-probabilistic modeling framework for discovering latent structure across collections of related datasets using probabilistic graphical models.
Research on learnable Gray-Wyner networks for disentangling common and task-specific information in computer vision.
SAU method for machine unlearning in sparse LLMs via gradient masking and importance redistribution for privacy.
Research showing activation steering vectors in LLMs are fundamentally non-identifiable with large equivalence classes.
FIRE method for reinitialization in continual learning that balances stability and plasticity in neural networks.
Research on Natural Hypergradient Descent for bilevel optimization using Fisher information matrix as Hessian surrogate.
Evaluation of scaling laws for Chemical Language Models on downstream molecular property prediction tasks.
Adaptive backbone scaling framework for class-incremental learning that balances plasticity and stability while reducing memory overhead.
Framework for learning interpretable nonlocal operator kernels from data for climate process modeling.
Standardized benchmark dataset for epitope-specific antibody design with unified evaluation metrics for generative methods.
Preconditioned optimization method using row-momentum normalization for scalable matrix-based neural network training.
Projection-free algorithm for contextual bandits achieving logarithmic regret with improved efficiency over Online Newton Step.
Skill routing system for LLM agents that identifies relevant skills from large ecosystems before planning or execution.
Framework using LLMs to automatically design reward programs for cooperative multi-agent RL systems with sparse task feedback.
DreamerAD uses latent world models for efficient RL in autonomous driving, compressing diffusion sampling by 80x with visual interpretability.
ERL framework enabling LLM agents to self-improve through experiential learning from past interactions and reflective adaptation.
Neuro-symbolic method for process anomaly detection combining neural networks with domain knowledge from process mining.
Federated learning approach for livestock growth prediction addressing privacy concerns and limited datasets in agricultural applications.