Isolater - Feed

Ax Zichen Xie, Wenxi Wang 3/20/2026

Can LLMs Reason Like Automated Theorem Provers for Rust Verification? VCoT-Bench: Evaluating via Verification Chain of Thought

VCoT-Bench evaluates LLMs on Rust program verification via chain-of-thought reasoning, testing logical deduction abilities beyond binary pass/fail.

Ax Yanchuan Tang, Taowen Wang, Yuefei Chen, Boxuan Zhang, Qiang Guan, Ruixiang Tang 3/20/2026

Shifting Uncertainty to Critical Moments: Towards Reliable Uncertainty Quantification for VLA Model

Method for reliable uncertainty quantification in Vision-Language-Action models by shifting focus to safety-critical moments in robotic control.

Ax Hashini Senaratne, Richard Attfield, Samith Widhanapathirana, David Howard, Cecile Paris, Dana Kulic, Leimin Tian 3/20/2026

HRI-SA: A Multimodal Dataset for Online Assessment of Human Situational Awareness during Remote Human-Robot Teaming

Dataset for detecting human situational awareness gaps in remote human-robot teaming through multimodal sensor data.

Ax Ruishuo Chen, Yu Chen, Zhuoran Li, Longbo Huang 3/20/2026

PowerFlow: Unlocking the Dual Nature of LLMs via Principled Distribution Matching

PowerFlow applies principled distribution matching to unsupervised reinforcement learning from LLM internal feedback without external supervision.

Ax Mingda Qiao 3/20/2026

Computational and Statistical Hardness of Calibration Distance

Theoretical study of computational and statistical hardness in computing calibration distance for probabilistic predictor evaluation.

Ax Bohan Wu, Julius von K\"ugelgen, David M. Blei 3/20/2026

Multi-Domain Causal Empirical Bayes Under Linear Mixing

Research on estimating causal representations from multi-domain data using empirical Bayes methods for causal representation learning.

Ax Arushi Rai, Qiang Zhang, Hanqing Zeng, Yunkai Zhang, Dipesh Tamboli, Xiangjun Fan, Zhuokai Zhao 3/20/2026

TARo: Token-level Adaptive Routing for LLM Test-time Alignment

TARo enables frozen LLMs to perform structured reasoning at inference time through token-level adaptive routing, avoiding expensive post-training alignment.

Ax Yugo Miyata, Tomohiro Shiraishi, Shunichi Nishino, Ichiro Takeuchi 3/20/2026

Statistical Testing Framework for Clustering Pipelines by Selective Inference

Statistical framework for quantifying reliability of results from data analysis pipelines using selective inference techniques.

Ax Jason Dury 3/20/2026

From Topic to Transition Structure: Unsupervised Concept Discovery at Corpus Scale via Predictive Associative Memory

Unsupervised discovery of transition-structure concepts in text via temporal co-occurrence patterns using contrastive learning on large corpus.

Ax Lang Zhou, Shuxuan Li, Zhuohao Li, Shi Liu, Zhilin Zhao, Wei-Shi Zheng 3/20/2026

UT-ACA: Uncertainty-Triggered Adaptive Context Allocation for Long-Context Inference

Adaptive context allocation method for LLM long-context inference using uncertainty-triggered token-level budgeting to address attention dilution.

Ax Aditi Naiknaware, Salimeh Sekeh 3/20/2026

T-QPM: Enabling Temporal Out-Of-Distribution Detection and Domain Generalization for Vision-Language Models in Open-World

Vision-language model method for temporal out-of-distribution detection and domain generalization in open-world settings using adaptive pattern matching.

Ax Esteban Garces Arias, Nurzhan Sapargali, Christian Heumann, Matthias A{\ss}enmacher 3/20/2026

The Truncation Blind Spot: How Decoding Strategies Systematically Exclude Human-Like Token Choices

Analysis of how standard LLM decoding strategies (top-k, nucleus sampling) exclude contextually appropriate but statistically rare tokens compared to human language production.

Ax Reza Ghane, Danil Akhtiamov, Babak Hassibi 3/20/2026

Precise Performance of Linear Denoisers in the Proportional Regime

Theoretical analysis of linear denoisers for noisy data, studying performance in proportional regime without known covariance.

Ax Yixuan Zhang, Ruihao Zhu, Qiaomin Xie 3/20/2026

On the Peril of (Even a Little) Nonstationarity in Satisficing Regret Minimization

Analyzes optimal satisficing regret bounds for nonstationary K-armed bandits with piecewise-stationary segments.

Ax Abhinaba Basu, Pavan Chakraborty 3/20/2026

When Names Change Verdicts: Intervention Consistency Reveals Systematic Bias in LLM Decision-Making

ICE-Guard detects spurious feature reliance in LLM decision-making through intervention consistency testing on demographic, authority, and framing features.

Ax Andrew Choi, Xinjie Wang, Zhizhong Su, Wei Xu 3/20/2026

Scaling Sim-to-Real Reinforcement Learning for Robot VLAs with Generative 3D Worlds

Addresses sim-to-real transfer for vision-language-action models in robotics by generating diverse 3D simulation worlds for RL fine-tuning.

Ax Jiangtao Luo, Bingbing Xu, Shaohua Xia, Yongyi Ran 3/20/2026

iSatCR: Graph-Empowered Joint Onboard Computing and Routing for LEO Data Delivery

iSatCR optimizes onboard computing and routing for LEO satellite data processing using graph neural networks to reduce ground transmission bottlenecks.

Ax Yuhan Ye, Saurabh Amin, Asuman Ozdauglar 3/20/2026

Learning Decision-Sufficient Representations for Linear Optimization

Studies construction of compressed decision-sufficient datasets for linear programs with unknown cost vectors using decision-relevant dimension theory.

Ax Jiacheng Tang, Zhiyuan Zhou, Zhuolin He, Jia Zhang, Kai Zhang, Jian Pu 3/20/2026

CausalVAD: De-confounding End-to-End Autonomous Driving via Causal Intervention

CausalVAD applies causal intervention to de-confound end-to-end autonomous driving models, addressing dataset bias and improving reliability.

Ax Abhinaba Basu, Pavan Chakraborty 3/20/2026

ICE: Intervention-Consistent Explanation Evaluation with Statistical Grounding for LLMs

ICE framework evaluates explanation faithfulness in LLMs via randomization tests with multiple intervention operators, distinguishing genuine faithfulness from chance.

Ax Haotian Lu, Jincong Lu, Sachin Sachdeva, Sheldon X. -D. Tan 3/20/2026

WarPGNN: A Parametric Thermal Warpage Analysis Framework with Physics-aware Graph Neural Network

WarPGNN applies physics-aware graph neural networks for efficient thermal warpage analysis in chiplet-package systems, replacing costly numerical simulations.

Ax Eduar Castrillo Velilla 3/20/2026

Breaking Hard Isomorphism Benchmarks with DRESS

DRESS graph fingerprinting achieves unique fingerprints across 51,718 non-isomorphic strongly regular graphs using single-deletion vertex operations.

Ax Mohammadhossein Homaei, Iman Khazrak, Rub\'en Molano, Andr\'es Caro, Mar \'Avila 3/20/2026

Cyber-Resilient Digital Twins: Discriminating Attacks for Safe Critical Infrastructure Control

i-SDT combines predictive modelling and multi-class attack discrimination for detecting and responding to cyber-physical system attacks without full shutdowns.

Ax Rong Fu, Jiekai Wu, Haiyun Wei, Xiaowen Ma, Shiyin Lin, Kangan Qian, Chuang Liu, Jianyuan Ni, Simon James Fong 3/20/2026

SwiftGS: Episodic Priors for Immediate Satellite Surface Recovery

SwiftGS enables rapid 3D satellite surface reconstruction via meta-learned Gaussian primitives predicted in a single forward pass for environmental monitoring.

Ax Samuel Gruffaz, Kyurae Kim, Fares Guehtar, Hadrien Duval-decaix, Pac\^ome Trautmann 3/20/2026

A Theoretical Comparison of No-U-Turn Sampler Variants: Necessary and Su?cient Convergence Conditions and Mixing Time Analysis under Gaussian Targets

Theoretical analysis comparing NUTS-mul and NUTS-BPS variants for Bayesian sampling with convergence guarantees under Gaussian targets.

Ax Donglin Xie, Qingshuo Zhao, Jingyu Wang, Shijia Geng, Jiarui Jin, Jun Li, Rongrong Guo, Guangkun Nie, Gongzheng Tang, Yuxi Zhou, Thomas Penzel, Shenda Hong 3/20/2026

Holter-to-Sleep: AI-Enabled Repurposing of Single-Lead ECG for Sleep Phenotyping

Applies AI to repurpose single-lead ECG from Holter devices for sleep phenotyping, linking cardiovascular monitoring to sleep assessment.

Ax Huichi Zhou, Siyuan Guo, Anjie Liu, Zhongwei Yu, Ziqin Gong, Bowen Zhao, Zhixun Chen, Menglong Zhang, Yihang Chen, Jinsong Li, Runyu Yang, Qiangbin Liu, Xinlei Yu, Jianmin Zhou, Na Wang, Chunyang Sun, Jun Wang 3/20/2026

Memento-Skills: Let Agents Design Agents

Memento-Skills introduces an LLM agent that autonomously designs and improves task-specific agents through continual learning with stateful prompts and reusable skills.

Ax Yufei Zhang, Tao Wang, Jingyi Zhang 3/20/2026

SRRM: Improving Recursive Transport Surrogates in the Small-Discrepancy Regime

Studies consistency and convergence rates of Recursive Rank Matching for computing Wasserstein distance surrogates in small-discrepancy regime.

Ax Samuel Ofosu Mensah, Maria Camila Roa Carvajal, Kerol Djoumessi, Philipp Berens 3/20/2026

Towards Interpretable Foundation Models for Retinal Fundus Images

Dual-IFM develops an interpretable-by-design foundation model for retinal fundus image analysis using self-supervised learning with local and global interpretability.

Ax Xiucheng Wang, Zhenye Chen, Nan Cheng 3/20/2026

Learn for Variation: Variationally Guided AAV Trajectory Learning in Differentiable Environments

Proposes variational guidance for autonomous aerial vehicle trajectory learning to address credit assignment and training instability in sparse reward RL settings.

Ax Xiucheng Wang, Yue Zhang, Nan Cheng 3/20/2026

BeamAgent: LLM-Aided MIMO Beamforming with Decoupled Intent Parsing and Alternating Optimization for Joint Site Selection and Precoding

BeamAgent combines LLMs with wireless beamforming optimization through decoupled intent parsing and alternating optimization, separating LLM reasoning from numerical computation.

Ax Xiao Feng, Bo Han, Zhanke Zhou, Jiaqi Fan, Jiangchao Yao, Ka Ho Li, Dahai Yu, Michael Kwok-Po Ng 3/20/2026

RewardFlow: Topology-Aware Reward Propagation on State Graphs for Agentic RL with Large Language Models

RewardFlow proposes topology-aware reward propagation on state graphs for RL-enhanced LLM agents, addressing sparse reward limitations without expensive dedicated reward models.

Ax Samuel Del Fr\'e, Gilberto A. Alou Angulo, Maurice Monnerville, Alejandro Rivero Santamar\'ia 3/20/2026

Data-driven construction of machine-learning-based interatomic potentials for gas-surface scattering dynamics: the case of NO on graphite

Machine-learning interatomic potential workflow for gas-surface scattering dynamics simulation on graphite.

Ax Xiucheng Wang, Zixuan Guo, Nan Cheng 3/20/2026

RadioDiff-FS: Physics-Informed Manifold Alignment in Few-Shot Diffusion Models for High-Fidelity Radio Map Construction

Physics-informed diffusion model for radio map construction using few-shot learning with manifold alignment.

Ax Tianci Luo, Jinpeng Wang, Shiyu Qin, Niu Lian, Yan Feng, Bin Chen, Chun Yuan, Shu-Tao Xia 3/20/2026

PromptHub: Enhancing Multi-Prompt Visual In-Context Learning with Locality-Aware Fusion, Concentration and Alignment

Introduces PromptHub for visual in-context learning using locality-aware fusion of multiple visual demonstrations with alignment and concentration mechanisms.

Ax Min Hun Lee 3/20/2026

From Accuracy to Readiness: Metrics and Benchmarks for Human-AI Decision-Making

Proposes evaluation framework beyond accuracy for human-AI collaborative decision-making, addressing miscalibrated reliance and team effectiveness.

Ax Sakshi Arya, Satarupa Bhattacharjee, Bharath K. Sriperumbudur 3/20/2026

Kernel Single-Index Bandits: Estimation, Inference, and Learning

Analyzes contextual bandits with single-index reward models where arms represent stable decisions and covariates evolve under bandit policy.

Ax Xinghao Zhao 3/20/2026

Entropy trajectory shape predicts LLM reasoning reliability: A diagnostic study of uncertainty dynamics in chain-of-thought

Studies entropy trajectory shape in chain-of-thought reasoning to predict LLM correctness without additional inference, testing on GSM8K with Qwen2.5-7B.

Ax Bruna Alves, Armando J. Pinho, S\'onia Gouveia 3/20/2026