Isolater - Feed

Ax Yubo Zhou, Luo Luo, Guang Dai, Haishan Ye 3/2/2026

On the Convergence of Single-Loop Stochastic Bilevel Optimization with Approximate Implicit Differentiation

Theoretical analysis of single-loop stochastic bilevel optimization convergence for meta-learning and hyperparameter optimization.

Ax Zhihao Ding, Jinming Li, Ze Lu, Jieming Shi 3/2/2026

FlexGuard: Continuous Risk Scoring for Strictness-Adaptive LLM Content Moderation

FlexGuard proposes continuous risk scoring for LLM content moderation that adapts to varying strictness levels across platforms and time.

Ax Haoran Zhang, Dongjun Kim, Seohyeon Cha, Haris Vikalo 3/2/2026

FedRot-LoRA: Mitigating Rotational Misalignment in Federated LoRA

FedRot-LoRA addresses rotational misalignment in federated LoRA fine-tuning of LLMs, improving communication-efficient training on decentralized data.

Ax Kohei Obata, Zheng Chen, Yasuko Matsubara, Lingwei Zhu, Yasushi Sakurai 3/2/2026

Selective Denoising Diffusion Model for Time Series Anomaly Detection

Diffusion-based method for time series anomaly detection using selective denoising instead of conditional reconstruction strategies.

Ax Kohei Obata, Taichi Murayama, Zheng Chen, Yasuko Matsubara, Yasushi Sakurai 3/2/2026

Disentangled Mode-Specific Representations for Tensor Time Series via Contrastive Learning

MoST contrastive learning method for disentangled mode-specific representations in multi-mode tensor time series.

Ax Yongzhong Xu 3/2/2026

Optimizer-Induced Low-Dimensional Drift and Transverse Dynamics in Transformer Training

Geometric analysis of transformer training trajectories revealing low-dimensional drift direction and transverse oscillatory dynamics.

Ax Hanping Zhang, Yuhong Guo 3/2/2026

Bridging Dynamics Gaps via Diffusion Schr\"odinger Bridge for Cross-Domain Reinforcement Learning

BDGxRL uses Diffusion Schrödinger Bridge to address dynamics gaps in cross-domain reinforcement learning without target reward supervision.

Ax Yuyu Geng, Lei Sun, Yao Gao, Xinxin Hu, Zhonghua Yi, Xiaolong Qian, Weijian Hu, Jian Bai, Kaiwei Wang 3/2/2026

OPTIAGENT: A Physics-Driven Agentic Framework for Automated Optical Design

OPTIAGENT uses LLM-based agentic framework with physics-driven optimization for automated optical design and lens system configuration.

Ax Chenxing Lin, Xinhui Gao, Haipeng Zhang, Xinran Li, Haitao Wang, Songzhu Mei, Chenglu Wen, Weiquan Liu, Siqi Shen, Cheng Wang 3/2/2026

MAGE: Multi-scale Autoregressive Generation for Offline Reinforcement Learning

MAGE multi-scale autoregressive generation framework for offline RL addressing long-horizon tasks with sparse rewards via hierarchical decomposition.

Ax Zhiwei Han, Stefan Matthes, Hao Shen 3/2/2026

Provable Subspace Identification of Nonlinear Multi-view CCA

Provable identifiability framework for nonlinear multi-view canonical correlation analysis via subspace identification.

Ax Aleksandr Ananikian (Saint-Petersburg University), Daniil Drozdov (Saint-Petersburg University), Konstantin Yakovlev (Saint-Petersburg University) 3/2/2026

UPath: Universal Planner Across Topological Heterogeneity For Grid-Based Pathfinding

Learning-based pathfinding using neural networks to approximate informed heuristics for grid-based search across different map topologies.

Ax Tiantong Wang, Xinyu Yan, Tiantong Wu, Yurong Hao, Yong Jiang, Fei Huang, Wei Yang Bryan Lim 3/2/2026

MPU: Towards Secure and Privacy-Preserving Knowledge Unlearning for Large Language Models

MPU framework for privacy-preserving knowledge unlearning in LLMs without sharing server parameters or client forget sets.

Ax Andreas Kernbach, Amr Elsheikh, Nicolas Grupp, Ren\'e Nagel, Marco F. Huber 3/2/2026

Actor-Critic Pretraining for Proximal Policy Optimization

Actor-critic pretraining approach for PPO that leverages expert data to reduce environment interactions required for RL training.

Ax Xiang Li, Nan Jiang, Yuheng Zhang 3/2/2026

Beyond State-Wise Mirror Descent: Offline Policy Optimization with Parameteric Policies

Theoretical investigation of offline reinforcement learning with general function approximation and parametric policies beyond state-wise methods.

Ax George Papadopoulos, George A. Vouros 3/2/2026

Learning to maintain safety through expert demonstrations in settings with unknown constraints: A Q-learning perspective

Q-learning approach for learning safe policies from expert demonstrations with unknown constraints in constrained MDPs.

Ax Junkang Liu, Fanhua Shang, Yuxuan Tian, Hongying Liu, Yuanyuan Liu 3/2/2026

FedNSAM:Consistency of Local and Global Flatness for Federated Learning

FedNSAM addresses sharpness-aware minimization in federated learning under high data heterogeneity, ensuring both local and global model flatness.

Ax Zhaowen Wang, Dongdong Zhou, Qi Xu, Fengyu Cong, Mohammad Al-Sa'd, Jenni Raitoharju 3/2/2026

ULW-SleepNet: An Ultra-Lightweight Network for Multimodal Sleep Stage Scoring

Ultra-lightweight neural network for automated sleep stage classification from multimodal polysomnography data.

Ax Zhang Wan, Tingting Mu, Samuel Kaski 3/2/2026

A Theory of Random Graph Shift in Truncated-Spectrum vRKHS

Theory of graph classification under domain shift using random graph models and domain adaptation techniques for structured data.

Ax Alexander Samarin, Sergei Krutikov, Anton Shevtsov, Sergei Skvortsov, Filipp Fisin, Alexander Golubev 3/2/2026

LK Losses: Direct Acceptance Rate Optimization for Speculative Decoding

LK Losses directly optimize acceptance rate in speculative decoding for LLM inference, improving upon KL divergence proxy objectives for draft model training.

Ax Oscar Hill, Mateo Espinosa Zarlenga, Mateja Jamnik 3/2/2026

Hierarchical Concept-based Interpretable Models

Hierarchical Concept Embedding Models improve interpretability of deep neural networks by mapping inputs to human-interpretable concept representations with inter-concept relationships.

Ax David Fox, Sam Bowyer, Song Liu, Laurence Aitchison, Raul Santos-Rodriguez, Mengyue Yang 3/2/2026

Learning Generation Orders for Masked Discrete Diffusion Models via Variational Inference

Learns optimal generation orders for masked discrete diffusion models via variational inference to balance parallel generation and sample quality.

Ax Xianglong Shi, Ziheng Chen, Yunhan Jiang, Nicu Sebe 3/2/2026

Intrinsic Lorentz Neural Network

Proposes Intrinsic Lorentz Neural Network for fully intrinsic hyperbolic geometry operations on hierarchical data representations.

Ax Vrushank Ahire, Yogesh Kumar, Anouck Girard, M. A. Ganaie 3/2/2026

MINT: Multimodal Imaging-to-Speech Knowledge Transfer for Early Alzheimer's Screening

Transfers knowledge from multimodal neuroimaging to speech analysis for early Alzheimer's disease screening via speech-based classifiers.

Ax Florent Delgrange 3/2/2026

Foundation World Models for Agents that Learn, Verify, and Adapt Reliably Beyond Static Environments

Outlines vision for foundation world models as persistent compositional representations enabling agents to learn and adapt in open worlds.

Ax Roy Betser, Eyal Gofer, Meir Yossef Levi, Guy Gilboa 3/2/2026

InfoNCE Induces Gaussian Distribution

Theoretical analysis showing InfoNCE contrastive loss induces Gaussian structure in learned representations for foundation models.

Ax Daniel Yang, Samuel Stante, Florian Redhardt, Lena Libon, Parnian Kassraie, Ido Hakimi, Barna P\'asztor, Andreas Krause 3/2/2026

RewardUQ: A Unified Framework for Uncertainty-Aware Reward Models

Proposes RewardUQ framework for uncertainty-aware reward models in LLM alignment that reduces annotation costs and prevents overoptimization.

Ax Tobias Nygaard 3/2/2026

pathsig: A GPU-Accelerated Library for Truncated and Projected Path Signatures

Introduces pathsig, a PyTorch-native GPU-accelerated library for computing path signatures as trainable features for sequential data.

Ax Ryan DeWolfe 3/2/2026

Leveraging Non-linear Dimension Reduction and Random Walk Co-occurrence for Node Embedding

Develops high-dimensional node embedding method using non-linear dimension reduction and random walk co-occurrence for graph tasks.

Ax Viet Bac Nguyen, Phuong Thai Nguyen 3/2/2026

Adaptive Correlation-Weighted Intrinsic Rewards for Reinforcement Learning

Proposes ACWI framework that adaptively balances intrinsic and extrinsic rewards online for sparse reward reinforcement learning exploration.

Ax Xinlong Du, Harsha Honnappa, Vinayak Rao 3/2/2026

Neural Diffusion Intensity Models for Point Process Data

Introduces Neural Diffusion Intensity Models using variational framework with neural SDEs for intractable Cox process inference.

Ax Zhizhou He, Yang Luo, Xinkai Liu, Mahdi Boloursaz Mashhadi, Mohammad Shojafar, Merouane Debbah, Rahim Tafazolli 3/2/2026

Agentic AI-RAN: Enabling Intent-Driven, Explainable and Self-Evolving Open RAN Intelligence

Surveys agentic AI systems with planning, tool use, and self-management capabilities applied to Open RAN network control and optimization.

Ax Zitian Li, Wang Chi Cheung 3/2/2026

Learning with a Budget: Identifying the Best Arm with Resource Constraints

Studies best arm identification problem with heterogeneous resource costs and constraints across multiple resource types.

Ax Daniel S. Berman, Brian Merritt, Stanley Ta, Dana Udwin, Amanda Ernlund, Jeremy Ratcliff, Vijay Narayan 3/2/2026

What You Read is What You Classify: Highlighting Attributions to Text and Text-Like Inputs

Proposes explainable AI method for discrete token inputs like text using attribution highlighting to identify important tokens in transformers.

Ax Adam R. Klivans, Konstantinos Stavropoulos, Arsen Vasilyan 3/2/2026

Sandwiching Polynomials for Geometric Concepts with Low Intrinsic Dimension

Develops sandwiching polynomial approximators for learning with distribution shift and contaminated data in low intrinsic dimension settings.

Ax Sikata Sengupta, Guangyi Liu, Omer Gottesman, Joseph W Durham, Michael Kearns, Aaron Roth, Michael Caldara 3/2/2026

Multi-Objective Reinforcement Learning for Large-Scale Tote Allocation in Human-Robot Collaborative Fulfillment Centers

Applies multi-objective reinforcement learning to optimize container consolidation in human-robot collaborative fulfillment centers.

Ax Egor Antipov, Alessandro Palma, Lorenzo Consoli, Stephan G\"unnemann, Andrea Dittadi, Fabian J. Theis 3/2/2026

Flow-Based Density Ratio Estimation for Intractable Distributions with Applications in Genomics

Uses normalizing flows to estimate density ratios between intractable distributions with applications to genomics data analysis.

Ax Mohsen Tajgardan, Atena Shiranzaei, Mahdi Rabbani, Reza Khoshkangini, Mahtab Jamali 3/2/2026

An Efficient Unsupervised Federated Learning Approach for Anomaly Detection in Heterogeneous IoT Networks

Proposes federated learning approach for anomaly detection in heterogeneous IoT networks while preserving privacy through distributed training.

Ax Miras Seilkhan, Adilbek Taizhanov 3/2/2026

Comparing Classical and Quantum Variational Classifiers on the XOR Problem

Compares classical logistic regression and MLPs with variational quantum classifiers on XOR problem using quantum superposition principles.

Ax Hongrui Xie, Junyu Cao, Kan Xu 3/2/2026

Adaptive Combinatorial Experimental Design: Pareto Optimality for Decision-Making and Inference

Investigates trade-off between regret minimization and statistical power in combinatorial multi-armed bandits using Pareto optimality framework.

Ax Javier Pulido, Filipe Rodrigues 3/2/2026

Time Series Foundation Models as Strong Baselines in Transportation Forecasting: A Large-Scale Benchmark Analysis

arXiv paper benchmarking general-purpose time-series foundation models for zero-shot transportation forecasting across multiple datasets.

Ax Xiaolong Zhang, Jianwei Zhang, Selim Sevim, Emek Demir, Ece Eksi, Xubo Song 3/2/2026

Histopathology Image Normalization via Latent Manifold Compaction

arXiv paper on Latent Manifold Compaction for unsupervised harmonization of histopathology images across different batch effects and scanners.

Ax Yijiashun Qi, Yijiazhen Qi, Tanmay Wagh 3/2/2026

Coverage-Aware Web Crawling for Domain-Specific Supplier Discovery via a Web--Knowledge--Web Pipeline

arXiv paper proposing Web-Knowledge-Web pipeline for discovering suppliers in specialized industries via iterative web crawling and knowledge base integration.

Ax Shruti Joshi, Th\'eo Saulus, Wieland Brendel, Philippe Brouillard, Dhanya Sridhar, Patrik Reizinger 3/2/2026

Who Guards the Guardians? The Challenges of Evaluating Identifiability of Learned Representations

Analyzes limitations of standard identifiability metrics (MCC, DCI, R²) on synthetic benchmarks, revealing implicit structural assumptions in representation learning evaluation.

Ax Ali Behrouz, Zeman Li, Yuan Deng, Peilin Zhong, Meisam Razaviyayn, Vahab Mirrokni 3/2/2026

Memory Caching: RNNs with Growing Memory

Memory caching architecture enabling RNNs with growing memory capacity and subquadratic complexity as alternative to Transformers for sequence modeling.

Ax Zhengbo Wang, Jian Liang, Ran He, Zilei Wang, Tieniu Tan 3/2/2026

Taming Momentum: Rethinking Optimizer States Through Low-Rank Approximation

Low-rank approximation method (LoRA-Pre) for optimizer memory efficiency in Adam and Muon, reducing overhead for large language model training.

Ax Weinan Dai, Hanlin Wu, Qiying Yu, Huan-ang Gao, Jiahao Li, Chengquan Jiang, Weiqiang Lou, Yufan Song, Hongli Yu, Jiaze Chen, Wei-Ying Ma, Ya-Qin Zhang, Jingjing Liu, Mingxuan Wang, Xin Liu, Hao Zhou 3/2/2026

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Agentic RL system using LLMs for high-performance CUDA kernel generation at scale, outcompeting traditional compiler-based approaches.

Ax Vugar Ismailov 3/2/2026

Universality of Shallow and Deep Neural Networks on Non-Euclidean Spaces

Framework establishing universal approximation properties for shallow and deep neural networks on non-Euclidean topological spaces.

Ax Mingkai Liao 3/2/2026

Pacing Opinion Polarization via Graph Reinforcement Learning

Graph reinforcement learning approach to moderate opinion polarization in social networks under Friedkin-Johnsen model with improved scalability.

Ax George Bird 3/2/2026

On De-Individuated Neurons: Continuous Symmetries Enable Dynamic Topologies

Methodology for dynamic neural networks using isotropic activation functions enabling real-time architectural growth and shrinkage via symmetry-principled primitives.

Ax Yifan Li, Mehrdad Salimitari, Taiyu Zhang, Guang Li, David Dreizin 3/2/2026

SALIENT: Frequency-Aware Paired Diffusion for Controllable Long-Tail CT Detection

Frequency-aware diffusion model for detecting rare lesions in CT scans with extreme class imbalance using controlled synthetic augmentation.