Isolater - Feed

Ax Steven Motta, Gioele Nanni 3/16/2026

Federated Few-Shot Learning on Neuromorphic Hardware: An Empirical Study Across Physical Edge Nodes

Empirical study of federated few-shot learning on neuromorphic hardware using spike-timing-dependent plasticity.

Ax Noel Smith, Andrzej Ruszczynski 3/16/2026

Convergence Rate of a Functional Learning Method for Contextual Stochastic Optimization

Convergence analysis of functional learning methods for contextual stochastic optimization problems.

Ax Pierre Moreau, Emeline Pineau Ferrand, Yann Choho, Benjamin Wong, Annabelle Blangero, Milan Bhan 3/16/2026

Towards Faithful Multimodal Concept Bottleneck Models

Research on interpretable multimodal concept bottleneck models ensuring faithful explanations through proper concept detection.

Ax Xiangyu Liu, Kaiqing Zhang 3/16/2026

Partially Observable Multi-Agent Reinforcement Learning with Information Sharing

Provable multi-agent reinforcement learning in partially observable stochastic games leveraging information sharing among agents.

Ax Wasu Top Piriyakulkij, Yingheng Wang, Volodymyr Kuleshov 3/16/2026

Denoising Diffusion Variational Inference: Diffusion Models as Expressive Variational Posteriors

Introduces diffusion models as expressive variational posteriors for black-box inference in latent variable models.

Ax Alejandro Parada-Mayorga, Alejandro Ribeiro 3/16/2026

Sampling and Uniqueness Sets in Graphon Signal Processing

Graph signal processing research extending sampling theory to graphon signals using limits of large graphs.

Ax Jeonghye Kim, Suyoung Lee, Woojun Kim, Youngchul Sung 3/16/2026

Adaptive $Q$-Aid for Conditional Supervised Learning in Offline Reinforcement Learning

Research on offline reinforcement learning combining return-conditioned supervised learning with Q-functions to improve stitching capability and stability.

Ax Yinan Huang, Haoyu Wang, Pan Li 3/16/2026

What Are Good Positional Encodings for Directed Graphs?

Introduces Walk Profile method and explores positional encodings for directed graphs in graph neural networks and graph transformers.

Ax Elyssa Sliheet, Md Abu Talha, Weihua Geng 3/16/2026

A DNN Biophysics Model with Topological and Electrostatic Features

Develops DNN model using topological and electrostatic features to predict protein biophysics properties like Coulomb and solvation energies.

Ax Hemanth Saratchandran, Jianqiao Zheng, Yiping Ji, Wenbo Zhang, Simon Lucey 3/16/2026

Rethinking Attention: Polynomial Alternatives to Softmax in Transformers

Explores polynomial attention alternatives to softmax in transformers, arguing regularization rather than probability distribution drives performance.

Ax Felix Lehner, Pasquale Lombardo, Susana Castillo, Oliver Hupe, Marcus Magnor 3/16/2026

RadField3D: A Data Generator and Data Format for Deep Learning in Radiation-Protection Dosimetry for Medical Applications

Open-source Geant4-based tool and data format for generating 3D radiation field datasets for dosimetry deep learning research.

Ax Antoine Moulin, Gergely Neu, Luca Viano 3/16/2026

Optimistically Optimistic Exploration for Provably Efficient Infinite-Horizon Reinforcement and Imitation Learning

Proposes first computationally efficient algorithm with optimal regret for infinite-horizon discounted reinforcement and imitation learning.

Ax Ruta Binkyte, Ivaxi Sheth, Zhijing Jin, Mohammad Havaei, Bernhard Sch\"olkopf, Mario Fritz 3/16/2026

Causality Is Key to Understand and Balance Multiple Goals in Trustworthy ML and Foundation Models

Advocates integrating causal methods into ML to balance trustworthiness objectives like fairness, privacy, robustness, and explainability.

Ax Heng-Sheng Chang, Prashant G. Mehta 3/16/2026

Dual Filter: A Transformer-like Inference Architecture for Hidden Markov Models

Proposes Dual Filter framework connecting Hidden Markov Models to transformer decoder architecture for causal nonlinear prediction.

Ax Mariana A. Fazio, Manel Martinez-Ramon, Salvador Sosa G\"uitron, Marcus Babzien, Mikhail Fedurin, Junjie Li, Mark Palmer, Sandra S. Biedron 3/16/2026

Unsupervised anomaly detection in MeV ultrafast electron diffraction

Applies unsupervised anomaly detection to ultrafast electron diffraction data to identify beam instabilities in materials science experiments.

Ax Yueheng Li, Guangming Xie, Zongqing Lu 3/16/2026

Guided Policy Optimization under Partial Observability

Introduces Guided Policy Optimization framework for RL in partially observable environments using privileged information from simulators.

Ax Nicolas Keriven 3/16/2026

Backward Oversmoothing: why is it hard to train deep Graph Neural Networks?

Analyzes oversmoothing problem in deep Graph Neural Networks and explores why networks fail to learn non-oversmoothed representations.

Ax Lakshita Dodeja, Karl Schmeckpeper, Shivam Vats, Thomas Weng, Mingxi Jia, George Konidaris, Stefanie Tellex 3/16/2026

Accelerating Residual Reinforcement Learning with Uncertainty Estimation

Proposes uncertainty estimation improvements to Residual Reinforcement Learning for faster adaptation of pretrained policies with sparse rewards.

Ax Rui Huang, Shitong Shao, Zikai Zhou, Pukun Zhao, Hangyu Guo, Tian Ye, Lichen Bai, Shuo Yang, Zeke Xie 3/16/2026

Accelerating Diffusion Model Training under Minimal Budgets: A Condensation-Based Perspective

Data condensation approach for training diffusion models with minimal computational budget by constructing smaller synthetic training datasets.

Ax Tianyin Liao, Ziwei Zhang, Yufei Sun, Chunyu Hu, Jianxin Li 3/16/2026

Invariant Graph Transformer for Out-of-Distribution Generalization

Graph transformer architecture designed for invariant learning to improve out-of-distribution generalization on graph-structured data.

Ax Sibylle Marcotte, Gabriel Peyr\'e, R\'emi Gribonval 3/16/2026

Intrinsic training dynamics of deep neural networks

Theoretical study of implicit bias in deep neural network training showing gradient flow induces learning of lower-dimensional parameter structures.

Ax Gyutae Oh, Jitae Shin 3/16/2026

UniPrompt-CL: Sustainable Continual Learning in Medical AI with Unified Prompt Pools

Continual learning framework with unified prompt pools for medical imaging tasks, addressing domain-specific challenges in adaptive AI.

Ax Arwen Bradley 3/16/2026

Local Mechanisms of Compositional Generalization in Conditional Diffusion

Analysis of compositional generalization mechanisms in conditional diffusion models, studying length generalization on controlled image generation tasks.

Ax Prabhat Karmakar, Sayan Gupta, Ilaksh Adlakha 3/16/2026

Extended Low-Rank Approximation Accelerates Learning of Elastic Response in Heterogeneous Materials

Low-rank approximation technique for accelerating machine learning models predicting mechanical properties of heterogeneous materials.

Ax Abhishek Moturu, Muhammad Muzammil, Anna Goldenberg, Babak Taati 3/16/2026

LiLAW: Lightweight Learnable Adaptive Weighting to Meta-Learn Sample Difficulty, Improve Noisy Training, Increase Fairness, and Effectively Use Synthetic Data

Lightweight meta-learning method using three parameters to dynamically adjust sample loss weights for noisy training, fairness, and synthetic data utilization.

Ax Krishu K Thapa, Reet Barik, Krishna Teja Chitty-Venkata, Murali Emani, Venkatram Vishwanath 3/16/2026

PreLoRA: Hybrid Pre-training of Vision Transformers with Full Training and Low-Rank Adapters

Hybrid pre-training approach using low-rank adapters alongside full training to reduce computational cost for vision transformer training.

Ax Xvyuan Liu, Xiangfei Qiu, Hanyin Cheng, Xingjian Wu, Chenjuan Guo, Bin Yang, Jilin Hu 3/16/2026

ASTGI: Adaptive Spatio-Temporal Graph Interactions for Irregular Multivariate Time Series Forecasting

Graph-based method for forecasting irregular multivariate time series in healthcare and finance with adaptive spatio-temporal interactions.

Ax Jonas Ngnaw\'e, Maxime Heuillet, Sabyasachi Sahoo, Yann Pequignot, Ola Ahmad, Audrey Durand, Fr\'ed\'eric Precioso, Christian Gagn\'e 3/16/2026

Robust Fine-Tuning from Non-Robust Pretrained Models: Mitigating Suboptimal Transfer With Epsilon-Scheduling

Method for robust fine-tuning non-robust pretrained models using epsilon-scheduling to achieve adversarial robustness and task adaptation simultaneously.

Ax Jiayi Li, Flora D. Salim 3/16/2026

DRIFT-Net: A Spectral--Coupled Neural Operator for PDEs Learning

Neural operator architecture combining spectral and coupling methods for efficiently learning partial differential equation dynamics.

Ax Harshwardhan Fartale, Ashish Kattamuri, Rahul Raja, Arpita Vats, Ishita Prasad, Akshata Kishore Moharir 3/16/2026

Disentangling Recall and Reasoning in Transformer Models through Layer-wise Attention and Activation Analysis

Analysis of transformer internals distinguishing recall from reasoning mechanisms through layer-wise attention and activation patterns for interpretability.

Ax Giorgos Nikolaou, Tommaso Mencattini, Donato Crisostomi, Andrea Santilli, Yannis Panagakis, Emanuele Rodol\`a 3/16/2026

Language Models are Injective and Hence Invertible

Mathematical proof that transformer language models are injective, enabling exact input recovery from representations despite nonlinear components.

Ax Rikard Vinge, Isabelle Wittmann, Jannik Schneider, Michael Marszalek, Luis Gilch, Thomas Brunschwiler, Conrad M Albrecht 3/16/2026

NeuCo-Bench: A Novel Benchmark Framework for Neural Embeddings in Earth Observation

Benchmark framework for evaluating neural compression and representation learning on earth observation satellite imagery tasks.

Ax Kemou Li, Qizhou Wang, Yue Wang, Fengpeng Li, Jun Liu, Bo Han, Jiantao Zhou 3/16/2026

LLM Unlearning with LLM Beliefs

Method for unlearning harmful content from LLMs by analyzing belief redistribution in probability space, avoiding unwanted side effects of gradient ascent.

Ax Tingkai Yan, Haodong Wen, Binghui Li, Kairong Luo, Wenguang Chen, Kaifeng Lyu 3/16/2026

Larger Datasets Can Be Repeated More: A Theoretical Analysis of Multi-Epoch Scaling in Linear Regression

Theoretical analysis of data scaling laws in linear regression when training multiple epochs on limited datasets, relevant to LLM training efficiency.

Ax Pramudita Satria Palar, Paul Saves, Rommel G. Regis, Koji Shimoyama, Shigeru Obayashi, Nicolas Verstaevel, Joseph Morlier 3/16/2026

Global Sensitivity Analysis for Engineering Design Based on Individual Conditional Expectations

Global sensitivity analysis technique for engineering design using individual conditional expectations to improve interpretability of black-box models.

Ax Sonal Prabhune, Balaji Padmanabhan, Kaushik Dutta 3/16/2026

Information-Consistent Language Model Recommendations through Group Relative Policy Optimization

Method to improve LLM consistency and reliability across semantically equivalent prompts using group relative policy optimization for business-critical applications.

Ax Damian Hodel, Jevin D. West 3/16/2026

Epistemic diversity across language models mitigates knowledge collapse

Study demonstrating that ensemble diversity across language models mitigates knowledge collapse from training on model-generated outputs.

Ax Taeyun Kim 3/16/2026

Structural Incompatibility of Differentiable Sorting and Within-Vector Rank Normalization

Theoretical analysis proving structural incompatibility between differentiable sorting operators and rank normalization techniques.

Ax Jiawen Chen, Qi Shao, Mingtong Zhou, Duxin Chen, Wenwu Yu 3/16/2026

CCMamba: Topologically-Informed Selective State-Space Networks on Combinatorial Complexes for Higher-Order Graph Learning

Selective state-space networks on combinatorial complexes for higher-order graph learning using topological deep learning.

Ax Evandro S. Ortigossa, Guy Lutsker, Eran Segal 3/16/2026