Analysis of safety drift in tool-augmented LLM agents, showing ranking metrics miss unsafe recommendations in high-stakes financial advisor scenarios.
Surgical duration prediction using retrieval-augmented LLMs and Bayesian averaging without fine-tuning, applied to hospital resource management.
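As background for the Bayesian-averaging idea this summary names (not the paper's actual pipeline), here is a minimal, hypothetical sketch: several candidate duration estimates, each produced from a differently retrieved case context, are combined by weighting them with posterior probabilities proportional to their held-out log-likelihoods under a uniform prior. All names and numbers are illustrative assumptions.

```python
import numpy as np

def bayesian_average(predictions, log_likelihoods):
    """Bayesian model averaging: weight each candidate prediction by the
    posterior probability of the context that produced it, proportional to
    exp(log-likelihood) under a uniform prior (computed stably)."""
    w = np.exp(log_likelihoods - np.max(log_likelihoods))
    w /= w.sum()
    return float(np.dot(w, predictions))

# three hypothetical duration estimates (minutes) from differently
# retrieved case contexts, scored on held-out similar cases
est = bayesian_average([95.0, 120.0, 110.0], np.array([-2.1, -0.7, -1.3]))
print(round(est, 1))
```

The averaged estimate is pulled toward the best-scoring context's prediction while still hedging across the alternatives.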
Study on improving LLM code generation with private libraries, showing retrieval-based API documentation injection is insufficient for effective library usage.
Spectral Edge Dynamics quantifies transformer training trajectory structure through rolling SVD, identifying boundary between optimization directions and noise.
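To make the rolling-SVD idea concrete (a generic illustration, not the paper's actual algorithm), one can stack a sliding window of consecutive parameter updates, take its singular values, and mark the largest drop in the log-spectrum as the boundary between dominant optimization directions and the noise floor. Everything here (window size, toy trajectory) is a hypothetical assumption.

```python
import numpy as np

def rolling_svd_edge(snapshots, window=8):
    """For each rolling window of flattened parameter snapshots, stack the
    consecutive updates, compute singular values, and report the index of
    the largest drop in the log-spectrum as the signal/noise boundary."""
    edges = []
    for t in range(window, len(snapshots)):
        U = np.stack([snapshots[i + 1] - snapshots[i]
                      for i in range(t - window, t)])
        s = np.linalg.svd(U, compute_uv=False)
        gaps = np.diff(np.log(s + 1e-12))   # negative gaps; argmin = biggest drop
        edges.append(int(np.argmin(gaps)) + 1)
    return edges

# toy trajectory: updates live in a 2-direction subspace of 50 dims, plus noise
rng = np.random.default_rng(0)
basis = rng.standard_normal((2, 50))
traj = [np.zeros(50)]
for step in range(20):
    traj.append(traj[-1] + rng.standard_normal(2) @ basis
                + 0.01 * rng.standard_normal(50))
edges = rolling_svd_edge(traj, window=8)
print(edges)  # the detected edge sits at rank 2 for this 2-direction signal
```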
LICA dataset of 1.55M layered graphic design compositions with hierarchical metadata for layout understanding and generation.
Analysis of multimodal LLM-generated natural language explanations for face verification on unconstrained images using IJB-S dataset.
Survey of deployment constraints and mitigation strategies for foundation models in resource-constrained embodied edge systems.
HopChain improves vision-language reasoning through multi-hop data synthesis to address perception, reasoning, and hallucination errors in VLMs.
SCALE addresses bottlenecks in virtual cell perturbation prediction using foundation models for in silico experimentation.
Multimodal multilingual benchmark with 3000 texts and 6000 images for detecting harmful humor across English and Arabic.
TDAD is an open-source tool that performs impact analysis for AI coding agents, detecting and preventing regressions in test-driven agentic development.
Geometric analysis of Rotary Positional Embedding performance breakdown on long inputs, explaining channel rotation distribution shift.
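The distribution-shift phenomenon this summary refers to can be seen directly from the standard RoPE formulas (this is the well-known textbook formulation, not the paper's analysis): each channel pair rotates at frequency base^(-2i/d), so the slowest channels never complete a rotation within the trained context but enter unseen angle regimes at much longer positions.

```python
import numpy as np

def rope_angles(pos, dim=64, base=10000.0):
    """Per-channel-pair rotation angles of Rotary Positional Embedding
    at position `pos`: theta_i = pos * base**(-2i/dim)."""
    i = np.arange(dim // 2)
    return pos * base ** (-2 * i / dim)

def apply_rope(x, pos, base=10000.0):
    """Rotate consecutive channel pairs of vector x by position-dependent
    angles; a pure rotation, so vector norms are preserved."""
    theta = rope_angles(pos, x.shape[-1], base)
    c, s = np.cos(theta), np.sin(theta)
    x1, x2 = x[..., 0::2], x[..., 1::2]
    out = np.empty_like(x)
    out[..., 0::2] = x1 * c - x2 * s
    out[..., 1::2] = x1 * s + x2 * c
    return out

# Slowest channel: well under one radian at a 2k context, but several
# full radians at 64k -- an angle regime never seen during training.
short, long = rope_angles(2048), rope_angles(65536)
print(short[-1], long[-1])
```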
Architectural approach using per-layer supervision to expose hidden modularity in Transformers, enabling interpretability and causal control of components.
Methods for distinguishing system failures from domain shifts in industrial data streams using anomaly detection techniques.
Graph-regularized Koopman mean-field game framework for controlling high-dimensional neural dynamics during epileptic seizures.
Systematic methodology for fine-tuning domain-specific Japanese small language models, identifying optimal training scale (4k samples), base models, and quantization strategies.
Mathematical framework for comparing multi-agent swarm configurations using quotient geometry and persistence-stable metrics.
Zero-knowledge proof system enabling cryptographic verification that proprietary LLM API outputs come from claimed models, preventing model substitution or degradation.
Investigates how mechanistic interpretability features survive extreme neural network sparsification using adaptive sparsity scheduling in VAE-SAE architectures.
Lightweight adaptation method for LLM-based technical service agents using latent logic augmentation and noise reduction without full retraining.
Variational Phasor Circuit architecture for brain-computer interface classification using phase-native learnable parameters inspired by quantum circuits.
Step-level experience augmented reinforcement learning for multi-turn LLM agents that dynamically retrieve and refine experiences throughout episodes.
Meta-BayFL framework for federated learning with probabilistic approaches to handle data uncertainty and heterogeneity while managing computational overhead.
Proposes dynamic constraints for reinforcement learning fine-tuning that adapt to model capabilities, resolving tension between optimization and constraint satisfaction.
Neuro-symbolic framework combining neural operators with economic constraints for interpretable quantitative finance models respecting no-arbitrage principles.
Tula optimizes distributed large-batch training by balancing communication overhead, computation cost, and model generalization across scaling configurations.
Proposes VC-Soup method for aligning LLMs with multiple potentially conflicting human values through value-consistency guided optimization.
LLM-augmented computational phenotyping framework for discovering clinical subphenotypes in Long COVID through iterative hypothesis generation and evidence extraction.
Framework for detecting conflicts in policy languages that use probabilistic ML predicates, applied to semantic router DSL for LLM routing systems.
Improves PDE surrogate model training through gradient-informed temporal sampling strategies that optimize rollout accuracy under fixed data budgets.
Proposes AGRI-Fidelity framework to evaluate reliability of explainable AI for poultry disease detection in noisy farm environments.
Framework for evaluating reasoning-based LLMs on de novo molecular generation and drug discovery without requiring ground-truth molecule pairs.
Proposes Interventional Boundary Discovery to identify causal state dimensions agents can control, using Pearl's do-operator for causal identification.
Addresses the squeezing effect in Direct Preference Optimization (DPO) for LLM alignment using sharpness-aware minimization in logit space.
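The squeezing effect mentioned here is easy to exhibit from the standard DPO objective itself (a generic illustration with made-up log-probabilities, not the paper's method): the loss depends only on the margin between the chosen and rejected log-ratios, so lowering the log-probability of both responses can yield exactly the same loss as raising the chosen one.

```python
import numpy as np

def dpo_loss(logp_w, logp_l, ref_logp_w, ref_logp_l, beta=0.1):
    """Standard DPO loss: -log sigmoid(beta * (chosen log-ratio - rejected
    log-ratio)). Only the margin matters, so probability mass can be
    squeezed out of BOTH responses without changing the loss."""
    margin = beta * ((logp_w - ref_logp_w) - (logp_l - ref_logp_l))
    return -np.log(1.0 / (1.0 + np.exp(-margin)))

# identical margins, hence identical losses:
a = dpo_loss(-10.0, -14.0, -12.0, -12.0)  # chosen raised, rejected lowered
b = dpo_loss(-16.0, -20.0, -12.0, -12.0)  # both lowered ("squeezed")
print(a, b)
```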
Studies alignment evaluation in LLMs by examining political censorship in Chinese language models, focusing on routing mechanisms beyond concept detection and refusal behaviors.
Additive Gaussian processes for wind farm power prediction from a population-based structural health monitoring perspective.
Path-constrained mixture-of-experts architecture that restricts expert routing paths to improve statistical efficiency and induce meaningful parameter structure.
ALIGN: adversarial learning framework for session-invariant speech neuroprosthesis decoding from brain-computer interfaces.
Neural graph representation learning with RL for approximate subgraph matching, an NP-hard problem in graph analysis.
Autocurriculum training methods with provable benefits for chain-of-thought reasoning in language models at reduced data and compute costs.
Vector-field reward shaping for offline RL to enable safe exploration near dataset boundaries using simulator confidence.
Epistemic GANs using Dempster-Shafer theory, together with architectural enhancements, to improve output diversity in generative models.
Comprehensive book on mathematical foundations of deep learning covering neural network approximation theory, optimal control, RL, and generative models.
RE-SAC: ensemble deep reinforcement learning for bus fleet control that disentangles aleatoric and epistemic uncertainty.
Flow matching approach for de novo molecular structure elucidation from mass spectra using deep generative models.
AFBS-BO framework for automated hyperparameter optimization of sparse attention mechanisms in transformers via adaptive fidelity Bayesian optimization.
Quantum algorithms for multi-armed and stochastic linear bandits, robust to noise on NISQ devices and achieving quadratic speedups over classical methods.
Sample-efficient reward estimation method for RL with verifiable rewards in large language model post-training.
Training suite for film shot language understanding using vision-language models to match expert cinematographic analysis.
Distributed asynchronous RL framework for Vision-Language-Action models with integrated trainable world models.