Isolater - Feed

Ax Anirudh Satheesh, Pankaj Kumar Barman, Washim Uddin Mondal, Vaneet Aggarwal 3/10/2026

Global Convergence of Average Reward Constrained MDPs with Neural Critic and General Policy Parameterization

Primal-dual natural actor-critic algorithm for constrained MDPs with neural network critics and general policy parameterization, enabling high-dimensional continuous control.

Ax Pablo M. Bern\'a 3/10/2026

Step-Size Decay and Structural Stagnation in Greedy Sparse Learning

Theoretical analysis of greedy sparse learning algorithms examining convergence failure with step-size decay in matching pursuit and boosting methods.

Ax Darius Catrina, Christian Bepler, Samuel Sledzieski, Rohit Singh 3/10/2026

Reverse Distillation: Consistently Scaling Protein Language Model Representations

Reverse Distillation framework addressing poor scaling in protein language models by decomposing large model representations using smaller model guidance.

Ax Jinshan Liu, Ken Li, Jiazhe Wei, Bin Shi, Bo Dong 3/10/2026

Hide and Find: A Distributed Adversarial Attack on Federated Graph Learning

FedShift: distributed adversarial attack on federated graph learning systems with two-stage hide-and-find approach for model poisoning.

Ax Christopher Brix, Julia Walczak, Nils Lommen, Thomas Noll 3/10/2026

Using GPUs And LLMs Can Be Satisfying for Nonlinear Real Arithmetic Problems

GANRA: GPU-accelerated SMT solver combining LLMs and gradient descent for solving quantifier-free nonlinear real arithmetic problems.

Ax Zongqian Li, Shaohan Huang, Zewen Chi, Yixuan Su, Lexin Zhou, Li Dong, Nigel Collier, Furu Wei 3/10/2026

Breaking Training Bottlenecks: Effective and Stable Reinforcement Learning for Coding Models

MicroCoder-GRPO: improved training approach for code generation models using Group Relative Policy Optimization with conditional truncation masking for handling longer outputs.

Ax Jinzhou Tan, Gabriel Adineera, Jinoh Kim 3/10/2026

ProgAgent:A Continual RL Agent with Progress-Aware Rewards

ProgAgent: continual reinforcement learning agent using progress-aware reward learning from unlabeled expert videos, addresses catastrophic forgetting in robotic learning with JAX architecture.

Ax Caihao Sun, Mingqi Yuan, Shiyuan Wang, Jiayu Chen 3/10/2026

Vision Transformers that Never Stop Learning

arXiv paper investigating loss of plasticity in Vision Transformers for continual learning, examining why attention-based models struggle to adapt to new tasks over time.

Ax Zaid Abdullah, Merouane Debbah, Symeon Chatzinotas, Bjorn Ottersten 3/10/2026

Neural Precoding in Complex Projective Spaces

Deep learning approach for multi-user MIMO wireless precoding using complex projective space parameterization of neural network outputs.

Ax Th\'eo Vincent, Kevin Gerhardt, Yogesh Tripathi, Habib Maraqten, Adam White, Martha White, Jan Peters, Carlo D'Eramo 3/10/2026

Gradient Iterated Temporal-Difference Learning

Temporal-difference reinforcement learning algorithm that incorporates gradients of bootstrapped estimates to improve stability over semi-gradient approaches.

Ax Abduragim Shtanchaev, Albina Ilina, Yazid Janati, Arip Asadulaev, Martin Tak\'ac, Eric Moulines 3/10/2026

Guess & Guide: Gradient-Free Zero-Shot Diffusion Guidance

Gradient-free guidance method for diffusion models in Bayesian inverse problems avoiding computationally expensive vector-Jacobian products.

Ax Noah Golowich, Fan Chen, Dhruv Rohatgi, Raghav Singhal, Carles Domingo-Enrich, Dylan J. Foster, Akshay Krishnamurthy 3/10/2026

Reject, Resample, Repeat: Understanding Parallel Reasoning in Language Model Inference

Particle filtering analysis of inference-time aggregation and pruning methods for steering LLMs using process reward models to optimize accuracy-cost tradeoffs.

Ax Colin Aitken, Rajat Masiwal, Adam Marchakitus, Katherine Kowal, Mayank Gupta, Tyler Yang, Amir Jina, Pedram Hassanzadeh, William R. Boos, Michael Kremer 3/10/2026

Designing probabilistic AI monsoon forecasts to inform agricultural decision-making

Decision-theory framework for designing probabilistic weather forecasts tailored to heterogeneous farmer decision-making contexts.

Ax Lizhi Ma, Yi-Xiang Hu, Yihui Ren, Feng Wu, Xiang-Yang Li 3/10/2026

LeJOT-AutoML: LLM-Driven Feature Engineering for Job Execution Time Prediction in Databricks Cost Optimization

LLM-driven feature engineering pipeline for predicting job execution times in Databricks cloud systems to optimize cost allocation.

Ax Sajib Debnath, Md. Uzzal Mia 3/10/2026

Bayesian Transformer for Probabilistic Load Forecasting in Smart Grids

Bayesian Transformer framework for probabilistic power grid load forecasting with uncertainty quantification under distributional shifts.

Ax Zihao Zheng, Hangyu Cao, Sicheng Tian, Jiayu Chen, Maoliang Li, Xinhao Sun, Hailong Zou, Zhaobo Zhang, Xuanzhe Liu, Donggang Cao, Hong Mei, Xiang Chen 3/10/2026

DyQ-VLA: Temporal-Dynamic-Aware Quantization for Embodied Vision-Language-Action Models

Quantization technique for Vision-Language-Action models that adapts precision dynamically across inference stages to reduce computational overhead for edge deployment.

Ax Yusong Wang, Chuang Yang, Jiawei Wang, Xiaohang Xu, Jiayi Xu, Dongyuan Li, Chuan Xiao, Renhe Jiang 3/10/2026

ELLMob: Event-Driven Human Mobility Generation with Self-Aligned LLM Framework

ELLMob generates human trajectories during large-scale events using LLM framework with event-annotated mobility datasets capturing deviations from routine patterns.

Ax Boris Kriuk, Fedor Kriuk 3/10/2026

PSTNet: Physically-Structured Turbulence Network

PSTNet estimates atmospheric turbulence intensity using physics-structured ML models respecting conservation laws for real-time aircraft safety applications.

Ax Qianyu Yang, Yang Liu, Jiaqi Li, Jun Bai, Hao Chen, Kaiyuan Chen, Tiliang Duan, Jiayun Dong, Xiaobo Hu, Zixia Jia, Yang Liu, Tao Peng, Yixin Ren, Ran Tian, Zaiyuan Wang, Yanglihong Xiao, Gang Yao, Lingyue Yin, Ge Zhang, Chun Zhang, Jianpeng Jiao, Zilong Zheng, Yuan Gong 3/10/2026

\$OneMillion-Bench: How Far are Language Agents from Human Experts?

$OneMillion-Bench evaluates language agents on 400 expert-curated real-world tasks across Law, Finance, Healthcare, Industry, and Science requiring multi-step reasoning and tool use.

Ax Bhavesh Kumar, Dylan Feng, Leonard Tang 3/10/2026

MJ1: Multimodal Judgment via Grounded Verification

MJ1 is a multimodal judge trained with RL to enforce visual grounding through structured verification chains and counterfactual consistency rewards.

Ax Theo X. Olausson, Jo\~ao Monteiro, Michal Klein, Marco Cuturi 3/10/2026

Amortizing Maximum Inner Product Search with Learned Support Functions

Amortized MIPS uses neural networks to predict maximum inner product search solutions, reducing computational cost for fixed query and key distributions.

Ax Peishen Yan, Yang Hua, Hao Wang, Jiaru Zhang, Xiaoyu Wu, Tao Song, Haibing Guan 3/10/2026

FedMomentum: Preserving LoRA Training Momentum in Federated Fine-Tuning

FedMomentum preserves optimization momentum during federated LoRA fine-tuning of LLMs through noise-free aggregation maintaining structural expressiveness.

Ax Jingwei Li, Xinran Gu, Jingzhao Zhang 3/10/2026

Capacity-Aware Mixture Law Enables Efficient LLM Data Optimization

Compute-efficient pipeline for data mixture scaling in LLM training, enabling extrapolation to large models without costly searches on target models.

Ax Jiayu Huang, Xiaohu Wu, Tiantian He, Qicheng Lao 3/10/2026

Stabilized Fine-Tuning with LoRA in Federated Learning: Mitigating the Side Effect of Client Size and Rank via the Scaling Factor

Stabilized LoRA fine-tuning for federated LLM training using scaling factors to mitigate client heterogeneity effects and aggregation instability in distributed settings.

Ax Kevin Dradjat, Massinissa Hamidi, Blaise Hanczar 3/10/2026

Adversarial Domain Adaptation Enables Knowledge Transfer Across Heterogeneous RNA-Seq Datasets

Adversarial domain adaptation for RNA-seq phenotype prediction addressing data scarcity through knowledge transfer between heterogeneous transcriptomic datasets.

Ax Weiyu Huang, Pengle Zhang, Xiaolu Zhang, Jun Zhou, Jun Zhu, Jianfei Chen 3/10/2026

Deterministic Differentiable Structured Pruning for Large Language Models

Deterministic differentiable structured pruning method for LLMs using l0 sparsity constraints, eliminating train-test mismatch from stochastic relaxations in prior work.

Ax Paulius Rauba, Claudio Fanconi, Mihaela van der Schaar 3/10/2026

Tiny Autoregressive Recursive Models

Explores autoregressive tiny recursive models for general prediction tasks, extending TRM mechanism beyond ARC-AGI to support iterative refinement in diverse domains.

Ax Chang Han, Yijie Hu, Jingling Liu 3/10/2026

EAGLE-Pangu: Accelerator-Safe Tree Speculative Decoding on Ascend NPUs

EAGLE-Pangu implements tree speculative decoding for LLM acceleration on Ascend NPUs, optimizing inference speed through multi-token verification with hardware compatibility.

Ax Guangnian Wan, Xinyin Ma, Gongfan Fang, Xinchao Wang 3/10/2026

Invisible Safety Threat: Malicious Finetuning for LLM via Steganography

Demonstrates safety vulnerability in LLMs where steganographic fine-tuning allows models to maintain safety facade while covertly generating harmful content through hidden instructions.

Ax Zhongjian Qiao, Jiafei Lyu, Boxiang Lyu, Yao Shu, Siyang Gao, Shuang Qiu 3/10/2026

Model-based Offline RL via Robust Value-Aware Model Learning with Implicitly Differentiable Adaptive Weighting

Model-based offline RL method using adversarial model learning with adaptive weighting to mitigate model exploitation in policy exploration from limited offline data.

Ax Aurelio Raffa Ugolini, Jessica Leoni, Valentina Breschi, Damiano Paniccia, Francesco Aldo Tucci, Luigi Capone, Mara Tanelli 3/10/2026

Explainable Condition Monitoring via Probabilistic Anomaly Detection Applied to Helicopter Transmissions

Probabilistic anomaly detection methodology for condition monitoring of helicopter transmissions using Bayesian approach trained only on healthy operational data.

Ax Yunhui Liu, Qizhuo Xie, Yinfeng Chen, Xudong Jin, Tao Zheng, Bin Chong, Tieke He 3/10/2026

Mitigating Homophily Disparity in Graph Anomaly Detection: A Scalable and Adaptive Approach

SAGAD addresses graph anomaly detection with scalable GNN-based approach handling homophily disparity and computational efficiency challenges in node classification.

Ax Mingxi Zou, Jiaxiang Chen, Junfan Li, Langzhang Liang, Qifan Wang, Xu Yinghui, Zenglin Xu 3/10/2026

DARC: Disagreement-Aware Alignment via Risk-Constrained Decoding

DARC proposes an inference-time method for aligning LLMs with heterogeneous human preferences by framing response selection as a risk-constrained decoding problem, avoiding retraining.

Ax Lukas K\"onig, Manuel Kuhn, David Kappel, Anand Subramoney 3/10/2026

Training event-based neural networks with exact gradients via Differentiable ODE Solving in JAX

JAX-based framework for training spiking neural networks with exact gradients via differentiable ODE solving, enabling flexible neuron models.

Ax Jiayang Gao, Tianyi Zheng, Jiayang Zou, Fengxiang Yang, Shice Liu, Luyao Fan, Zheyu Zhang, Hao Zhang, Jinwei Chen, Peng-Tao Jiang, Bo Li, Jia Wang 3/10/2026

C$^2$FG: Control Classifier-Free Guidance via Score Discrepancy Analysis

Theoretical analysis of classifier-free guidance in diffusion models with adaptive score discrepancy-based control for better conditional generation.

Ax Thanapol Phungtua-eng, Yoshitaka Yamamoto 3/10/2026

Are We Winning the Wrong Game? Revisiting Evaluation Practices for Long-Term Time Series Forecasting

Critique of evaluation practices in long-term time series forecasting, questioning reliance on pointwise error metrics for progress assessment.

Ax Yunhui Liu, Yongchao Liu, Yinfeng Chen, Chuntao Hong, Tao Zheng, Tieke He 3/10/2026

Learning Hierarchical Knowledge in Text-Rich Networks with Taxonomy-Informed Representation Learning

Taxonomy-informed representation learning for text-rich networks, leveraging hierarchical knowledge structures for better semantic understanding.

Ax Sidharth Sinha, Anson Bastos, Xuchao Zhang, Akshay Nambi, Chetan Bansal, Saravan Rajmohan 3/10/2026

AutoAdapt: An Automated Domain Adaptation Framework for LLMs

AutoAdapt automated framework for domain adaptation in LLMs, handling hyperparameter selection and evolving knowledge without manual tuning.

Ax Yeonsik Park, Hyeonseong Kim, Seungkyu Choi 3/10/2026

SERQ: Saliency-Aware Low-Rank Error Reconstruction for LLM Quantization

SERQ: post-training quantization method for LLMs using saliency-aware low-rank error reconstruction for efficient deployment.

Ax Tingting Chen, Feng Chu, Jiantong Zhang 3/10/2026

Sequential Service Region Design with Capacity-Constrained Investment and Spillover Effect

Operations research framework for sequential geographic service network expansion under capacity constraints and demand uncertainty.

Ax Jonas Landsgesell, Pascal Knoll 3/10/2026

Distributional Regression with Tabular Foundation Models: Evaluating Probabilistic Predictions via Proper Scoring Rules

Distributional regression using TabPFN and TabICL foundation models for tabular data with probabilistic scoring evaluation.

Ax Patrick Wilhelm, Odej Kao 3/10/2026

Revisiting Gradient Staleness: Evaluating Distance Metrics for Asynchronous Federated Learning Aggregation

Evaluation of distance metrics for staleness measurement in asynchronous federated learning aggregation methods.

Ax Dai Shi, Luke Thompson, Andi Han, Peiyan Hu, Junbin Gao, Jos\'e Miguel Hern\'andez-Lobato 3/10/2026

Wiener Chaos Expansion based Neural Operator for Singular Stochastic Partial Differential Equations

Wiener Chaos Expansion-based neural operator using FiLM for solving singular stochastic partial differential equations.

Ax Chang Li, Tshihao Tsu, Yaren Zhang, Chao Xue, Xiaodong He 3/10/2026

Fibration Policy Optimization

Fibration Policy Optimization introduces APC-Obj for training heterogeneous LLM systems with multi-scale hierarchical stability control.

Ax Magnus Ross, Nel Swanepoel, Akish Luintel, Emma McGuire, Ingemar J. Cox, Steve Harris, Vasileios Lampos 3/10/2026

Optimising antibiotic switching via forecasting of patient physiology

Neural forecasting approach for predicting patient physiology to optimize antibiotic therapy transitions and reduce hospital stays.

Ax Prakash Kumbhakar, Shrey Srivastava, Haroon R Lone 3/10/2026

FedPrism: Adaptive Personalized Federated Learning under Non-IID Data

FedPrism framework for federated learning with non-IID data, using adaptive personalization strategies under statistical heterogeneity.

Ax Antonia Hager, Sven Nebendahl, Alexej Klushyn, Jasper Krauser, Torleiv H. Bryne, Tor Arne Johansen 3/10/2026

Airborne Magnetic Anomaly Navigation with Neural-Network-Augmented Online Calibration

Neural network-augmented calibration system for airborne magnetic anomaly navigation without extensive offline pre-training.

Ax Yuxiang Zhang, Enyan Dai 3/10/2026

SCL-GNN: Towards Generalizable Graph Neural Networks via Spurious Correlation Learning

SCL-GNN addresses spurious correlations in graph neural networks to improve generalization across diverse graph tasks.

Ax Yilin Wen, Yi Guo, Bo Zhao, Wei Qi, Zechun Hu, Colin Jones, Jian Sun 3/10/2026

PolyFormer: learning efficient reformulations for scalable optimization under complex physical constraints

Physics-informed ML method (PolyFormer) for solving constrained optimization problems using transformer architecture and geometric knowledge.

Ax Chaewon Moon, Dongkuk Si, Chulhee Yun 3/10/2026

Minor First, Major Last: A Depth-Induced Implicit Bias of Sharpness-Aware Minimization

Theoretical analysis of implicit bias in Sharpness-Aware Minimization showing depth-dependent behavior in linear networks diverges from gradient descent.