Isolater - Feed

Ax Benjamin Plaut 3/4/2026

Safety Training Persists Through Helpfulness Optimization in LLM Agents

Study of safety training persistence in multi-step agentic LLM settings when optimizing for helpfulness, comparing DPO effects.

Ax Linxuan Wang, Ziyi Wang, Yikun Bai, Wei Deng, Guang Lin, Qifan Song 3/4/2026

Generalized Discrete Diffusion with Self-Correction

Generalized discrete diffusion model with self-correction during pretraining using uniform-absorbing objective.

Ax Amirhossein Afsharrad, Ruida Zhou, Luca Viano, Sanjay Lall, Mohammad Ghavamzadeh 3/4/2026

Beyond Binary Preferences: A Principled Framework for Reward Modeling with Ordinal Feedback

Principled mathematical framework for reward modeling leveraging ordinal preference feedback from human annotators for LLM alignment.

Ax Jean-Baptiste Fermanian (PREMEDICAL), Batiste Le Bars (MAGNET, CRIStAL), Aurélien Bellet (PREMEDICAL) 3/4/2026

Adaptive Personalized Federated Learning via Multi-task Averaging of Kernel Mean Embeddings

Personalized federated learning approach using kernel mean embeddings to learn inter-agent weight combinations without raw data sharing.

Ax Yizhak Y. Elboher, Reuven Peleg, Zhouxing Shi, Guy Katz, Jan Křetínský 3/4/2026

Talking with Verifiers: Automatic Specification Generation for Neural Network Verification

Framework for automatic specification generation to improve neural network verification tool adoption by supporting higher-level semantic constraints.

Ax Jiace Zhu, Wentao Chen, Qi Fan, Zhixing Ren, Junying Wu, Xing Zhe Chai, Chotiwit Rungrueangwutthinon, Yehan Ma, An Zou 3/4/2026

CUDABench: Benchmarking LLMs for Text-to-CUDA Generation

CUDABench benchmark for evaluating LLM text-to-CUDA code generation with performance assessment metrics for GPU kernels.

Ax Laziz U. Abdullaev, Noelle Y. L. Wong, Ryan T. Z. Lee, Shiqi Jiang, Khoi N. M. Nguyen, Tan M. Nguyen 3/4/2026

Concept Heterogeneity-aware Representation Steering

Method for steering LLM behavior via representation manipulation that accounts for heterogeneous concept encoding across embedding spaces.

Ax Andy Yang, Pascal Bergsträßer, Georg Zetzsche, David Chiang, Anthony W. Lin 3/4/2026

Length Generalization Bounds for Transformers

Theoretical analysis of length generalization bounds for transformers on the C-RASP language class, addressing model generalization guarantees.

Ax Shibing Mo, Jiarui Zhang, Jiayu Xie, Xiangyi Teng, Jing Liu 3/4/2026

High-order Knowledge Based Network Controllability Robustness Prediction: A Hypergraph Neural Network Approach

Hypergraph neural network approach for predicting network controllability robustness against attacks, replacing computationally expensive simulations.

Ax Yunlong Gao, Xinyue Liu, Yingbo Wang, Linlin Zong, Bo Xu 3/4/2026

Boosting Meta-Learning for Few-Shot Text Classification via Label-guided Distance Scaling

Label-guided distance scaling method for few-shot text classification, improving meta-learner effectiveness with selective label guidance.

Ax Jeet Bandhu Lahiri, Parshva Runwal, Arvasu Kulkarni, Mahir Jain, Aditya Ray Mishra, Siddharth Panwar, Sandeep Singh 3/4/2026

PRISM: Exploring Heterogeneous Pretrained EEG Foundation Model Transfer to Clinical Differential Diagnosis

PRISM foundation model for EEG diagnosis using masked autoencoders, ablated across pretraining populations and clinical adaptation domains.

Ax Binon Teji, Subhajit Bandyopadhyay, Swarup Roy 3/4/2026

Graph Attention Based Prioritization of Disease Responsible Genes from Multimodal Alzheimer's Network

Graph transformer framework (NETRA) for prioritizing disease genes in Alzheimer's networks using multimodal biological data.

Ax Guanzhe Zhang, Shanshan Ding, Zhezhen Jin 3/4/2026

A Comparative Study of UMAP and Other Dimensionality Reduction Methods

Comparative analysis of UMAP with PCA, Kernel PCA, SIR variants, and t-SNE for dimensionality reduction across benchmarks.

Ax Jinge Ma, Fengqing Zhu 3/4/2026

Temporal Imbalance of Positive and Negative Supervision in Class-Incremental Learning

Addressing catastrophic forgetting in class-incremental learning by analyzing temporal imbalance of positive/negative supervision signals.

Ax Kaiyang Xing, Han Fang, Zhaoyun Chen, Zhonghui Li, Yang Yang, Weiming Zhang, Guoping Guo 3/4/2026

Quantum-Inspired Fine-Tuning for Few-Shot AIGC Detection via Phase-Structured Reparameterization

Quantum-enhanced LoRA fine-tuning method for few-shot AI-generated content detection, combining quantum neural networks with low-rank adaptation.

Ax Shadab Ahamed, Eshed Gal, Simon Ghyselincks, Md Shahriar Rahim Siddiqui, Moshe Eliasof, Eldad Haber 3/4/2026

Preconditioned Score and Flow Matching

Preconditioning techniques for flow matching and score-based diffusion to improve optimization by handling ill-conditioned covariance matrices.

Ax Haochuan Kevin Wang 3/4/2026

Diffusion-MPC in Discrete Domains: Feasibility Constraints, Horizon Effects, and Critic Alignment: Case study with Tetris

Diffusion-based model predictive control with discrete denoising for game playing, tested on Tetris with feasibility constraints and critic alignment.

Ax Xin Li, Jonathan Cohen, Shai Pilosof, Rami Puzis 3/4/2026

Learning graph topology from metapopulation epidemic encoder-decoder

Joint inference of epidemic parameters and mobility networks in metapopulation models using encoder-decoder architecture.

Ax Stefan Ankirchner, Maximilian Philipp Thiel 3/4/2026

Learning Optimal Search Strategies

Learning optimal threshold-based stopping rules for parking problems with unknown Poisson arrival processes via jump intensity estimation.

Ax Zhanghan Ni, Yanjing Li, Zeju Qiu, Bernhard Schölkopf, Hongyu Guo, Weiyang Liu, Shengchao Liu 3/4/2026

Rigidity-Aware Geometric Pretraining for Protein Design and Conformational Ensembles

Proposes rigidity-aware geometric pretraining for protein design and conformational ensembles using global geometric representations.

Ax Leo (Muxing) Wang, Pengkun Yang, Lili Su 3/4/2026

Personalized Multi-Agent Average Reward TD-Learning via Joint Linear Approximation

Studies personalized multi-agent average reward TD learning with joint linear approximation, inspired by federated learning approaches.

Ax Logan Frank, Jim Davis 3/4/2026

A Unified Revisit of Temperature in Classification-Based Knowledge Distillation

Analyzes temperature parameter selection in knowledge distillation and its interaction with optimizer, pretraining, and finetuning choices.

Ax Satish Chandran, Nicolas Roque dos Santos, Yunshu Wu, Greg Ver Steeg, Evangelos Papalexakis 3/4/2026

Spectral Regularization for Diffusion Models

Introduces loss-level spectral regularization using Fourier and wavelet-domain losses to improve diffusion model training without architecture changes.

Ax Semih Cantürk, Thomas Sabourin, Frederik Wenkel, Michael Perlmutter, Guy Wolf 3/4/2026

Can Computational Reducibility Lead to Transferable Models for Graph Combinatorial Optimization?

Studies computational reducibility in neural solvers for graph combinatorial optimization, enabling model generalization across task distributions.

Ax Zhongxi Wang, Yueqian Lin, Jingyang Zhang, Hai Helen Li, Yiran Chen 3/4/2026

MUSE: A Run-Centric Platform for Multimodal Unified Safety Evaluation of Large Language Models

MUSE is an open-source platform for multimodal safety evaluation of LLMs with cross-modal payload generation and multi-turn attack algorithms.

Ax Aran Nayebi 3/4/2026

What Capable Agents Must Know: Selection Theorems for Robust Decision-Making under Uncertainty

Proves selection theorems showing that low average-case regret forces AI agents to develop internal world models or belief states for robust decision-making.

Ax Liu Yang, Zeyu Nie, Andrew Liu, Felix Zou, Deniz Altinbüken, Amir Yazdanbakhsh, Quanquan C. Liu 3/4/2026

ParEVO: Synthesizing Code for Irregular Data: High-Performance Parallelism through Agentic Evolution

ParEVO uses LLM-based agentic evolution to synthesize parallel code for irregular data structures, addressing limitations of standard models on concurrent programming.

Ax Görkem Can Süleymanoğlu 3/4/2026

Thermodynamic Regulation of Finite-Time Gibbs Training in Energy-Based Models: A Restricted Boltzmann Machine Study

Studies thermodynamic regulation of finite-time Gibbs chain training in Restricted Boltzmann Machines, analyzing energy landscape evolution during learning.

Ax Kwanyoung Kim 3/4/2026

Bridging Diffusion Guidance and Anderson Acceleration via Hopfield Dynamics

Establishes theoretical connection between classifier-free guidance in diffusion models and Anderson acceleration via Hopfield dynamics.

Ax Yuchen Shi, Qijun Hou, Pingyi Fan, Khaled B. Letaief 3/4/2026

EdgeFLow: Serverless Federated Learning via Sequential Model Migration in Edge Networks

Presents EdgeFLow, a federated learning framework using sequential model migration in edge networks to reduce communication bottlenecks in IoT systems.

Ax Zhaoyu Zhu, Shuhan Zhang, Rui Gao, Shuang Li 3/4/2026

Wasserstein Proximal Policy Gradient

Derives Wasserstein Proximal Policy Gradient using optimal transport geometry for continuous-action entropy-regularized RL without policy log-density evaluation.

Ax Yunxiang Li, Mark Schmidt, Reza Babanezhad, Sharan Vaswani 3/4/2026

Towards Parameter-Free Temporal Difference Learning

Develops parameter-free temporal difference learning for RL that avoids requiring problem-dependent quantities like feature covariance eigenvalues.

Ax Mengru Wu, Jiawei Li, Jiaqi Wei, Bin Lyu, Kai-Kit Wong, Hyundong Shin 3/4/2026

Joint Optimization of Model Partitioning and Resource Allocation for Anti-Jamming Collaborative Inference Systems

Studies DNN partitioning and resource allocation for device-edge collaborative inference under jamming attacks on resource-constrained systems.

Ax Zhixia Zhang, Zixuan Huang, Xin Xia, Deqing Wang, Fuzhen Zhuang, Shuai Ma, Ning Ding, Yaodong Yang, Jianxin Li, Yikun Ban 3/4/2026

Heterogeneous Agent Collaborative Reinforcement Learning

Introduces HACRL, a collaborative reinforcement learning paradigm where heterogeneous agents share verified rollouts during training but execute independently at inference.

Ax Tianze Zhu, Yinuo Wang, Wenjun Zou, Tianyi Zhang, Likun Wang, Letian Tao, Feihong Zhang, Yao Lyu, Shengbo Eben Li 3/4/2026

Real-Time Generative Policy via Langevin-Guided Flow Matching for Autonomous Driving

Proposes diffusion actor-critic with flow matching for real-time autonomous driving policies, addressing inference latency in generative RL approaches.

Ax Federico Vittorio Cortesi, Giuseppe Iannone, Giulia Crippa, Tomaso Poggio, Pierfrancesco Beneventano 3/4/2026

Same Error, Different Function: The Optimizer as an Implicit Prior in Financial Time Series

Neural networks on financial time series show underspecification where different optimizers produce identical test loss but learn different functions in volatility forecasting.

Ax Jiawen Li 3/4/2026

Implicit Bias in Deep Linear Discriminant Analysis

Theoretical analysis of implicit regularization in Deep Linear Discriminant Analysis for metric learning objectives.

Ax Raghav Thakar, Gaurav Dixit, Kagan Tumer 3/4/2026

Post Hoc Extraction of Pareto Fronts for Continuous Control

Multi-objective reinforcement learning method for extracting Pareto fronts of policies in continuous control tasks, addressing trade-offs between multiple objectives.

Ax Zhi Hong, Qian Zhang, Jiahang Sun, Zhiwei Shang, Mingze Kong, Xiangyi Wang, Yao Shu, Zhongxiang Dai 3/4/2026

MASPOB: Bandit-Based Prompt Optimization for Multi-Agent Systems with Graph Neural Networks

Bandit-based prompt optimization for multi-agent systems using graph neural networks to improve LLM-powered workflow performance without modifying workflows.

Ax Mohammed Nowaz Rabbani Chowdhury, Hsinyu Tsai, Geoffrey W. Burr, Kaoutar El Maghraoui, Liu Liu, Meng Wang 3/4/2026

Robust Heterogeneous Analog-Digital Computing for Mixture-of-Experts Models with Theoretical Generalization Guarantees

Heterogeneous analog-digital computing approach for efficient Mixture-of-Experts inference with theoretical generalization guarantees and hardware nonideality mitigation.

Ax Zixuan Xu, Tiancheng He, Huahui Yi, Kun Wang, Xi Chen, Gongli Xi, Qiankun Li, Kang Li, Yang Liu, Zhigang Zeng 3/4/2026

SaFeR-ToolKit: Structured Reasoning via Virtual Tool Calling for Multimodal Safety

SaFeR-ToolKit formalizes multimodal safety as a checkable protocol using virtual tool calling for vision-language models to prevent jailbreaks.

Ax Feihu Huang, Guanyi Zhang, Songcan Chen 3/4/2026

HomeAdam: Adam and AdamW Algorithms Sometimes Go Home to Obtain Better Provable Generalization

HomeAdam variant of Adam optimizer improves generalization bounds to match SGD convergence rates for deep learning model training.

Ax Yuan Lu, Dongqi Han, Yansen Wang, Dongsheng Li 3/4/2026

Improving Diffusion Planners by Self-Supervised Action Gating with Energies

SAGE method improves diffusion planners for offline RL by using latent consistency signals to penalize dynamically inconsistent plans at inference time.

Ax Shuyi Zhou, Zeen Song, Wenwen Qiang, Jiyan Sun, Yao Zhou, Yinlong Liu, Wei Ma 3/4/2026

From Shallow to Deep: Pinning Semantic Intent via Causal GRPO

Two-Stage Causal-GRPO framework addresses shallow safety alignment in LLMs vulnerable to adversarial prefix attacks through semantic intent pinning.

Ax Ryan Feng Lin, Yuantao Wei, Huiling Liao, Xiaoning Qian, Shuai Huang 3/4/2026

Causal Learning Should Embrace the Wisdom of the Crowd

Paradigm for causal structure learning from observational data leveraging human causal knowledge to address combinatorial explosion of possible graphs.

Ax Sijie Mai, Shiqin Han, Haifeng Hu 3/4/2026

Addressing Missing and Noisy Modalities in One Solution: Unified Modality-Quality Framework for Low-quality Multimodal Data

Unified framework addressing both missing and noisy modalities in multimodal learning to improve robustness on low-quality real-world data.

Ax L. Julián Lechuga López, Farah E. Shamout, Tim G. J. Rudner 3/4/2026