Isolater - Feed

Ax Boyang Ma, Hechuan Guo, Peizhuo Lv, Minghui Xu, Xuelong Dai, YeChao Zhang, Yijun Yang, Yue Zhang 2/20/2026

What Breaks Embodied AI Security:LLM Vulnerabilities, CPS Flaws,or Something Else?

Analysis of security vulnerabilities in embodied AI systems including LLM-driven agents, autonomous vehicles, and service robots.

Ax Justyna Andrys-Olek, Paulina Tworek, Luca Gherardini, Mark W. Ruddock, Mary Jo Kurt, Peter Fitzgerald, Jose Sousa 2/20/2026

A feature-stable and explainable machine learning framework for trustworthy decision-making under incomplete clinical data

Machine learning framework for trustworthy clinical decision-making with feature stability under incomplete data.

Ax Nuno Saavedra, Pedro Ribeiro, Andr\'e Coelho, Rui Campos 2/20/2026

Voice-Driven Semantic Perception for UAV-Assisted Emergency Networks

Framework for integrating voice communications with UAV-assisted emergency networks using semantic perception.

Ax Lorenzo Caselli, Marco Mistretta, Simone Magistri, Andrew D. Bagdanov 2/20/2026

SpectralGCD: Spectral Concept Selection and Cross-modal Representation Learning for Generalized Category Discovery

SpectralGCD method for generalized category discovery using spectral concept selection and cross-modal representation learning.

Ax Bingqian Li, Bowen Zheng, Xiaolei Wang, Long Zhang, Jinpeng Wang, Sheng Chen, Wayne Xin Zhao, Ji-rong Wen 2/20/2026

Improving LLM-based Recommendation with Self-Hard Negatives from Intermediate Layers

Research on improving LLM-based recommendation systems using self-hard negatives from intermediate layers for better preference learning during fine-tuning.

Ax Dylan Bouchard, Mohit Singh Chauhan, Viren Bajaj, David Skarbrevik 2/20/2026

Fine-Grained Uncertainty Quantification for Long-Form Language Model Outputs: A Comparative Study

Taxonomy and comparative study of uncertainty quantification methods for detecting hallucinations in long-form LLM outputs.

Ax Amirereza Abbasi, Mohsen Hooshmand 2/20/2026

Beyond Pipelines: A Fundamental Study on the Rise of Generative-Retrieval Architectures in Web Research

Study on generative-retrieval architectures in web search and how LLMs have transformed information retrieval practices.

Ax Wyatt Benno, Alberto Centelles, Antoine Douchet, Khalil Gibran 2/20/2026

Jolt Atlas: Verifiable Inference via Lookup Arguments in Zero Knowledge

arXiv paper on Jolt Atlas, a zero-knowledge ML framework for verifiable ONNX tensor operation inference using lookup arguments.

Ax Dimitri Staufer, Kirsten Morehouse 2/20/2026

What Do LLMs Associate with Your Name? A Human-Centered Black-Box Audit of Personal Data

Audit of personal data associations in 8 LLMs using LMP2 privacy probe, examining how models retain and surface personal information.

Ax Yichen Lu, Siwei Nie, Minlong Lu, Xudong Yang, Xiaobo Zhang, Peng Zhang 2/20/2026

Tracing Copied Pixels and Regularizing Patch Affinity in Copy Detection

arXiv paper on image copy detection using self-supervised learning with patch-level contrastive learning for manipulated content.

Ax Veit Elser, Manish Krishan Lal 2/20/2026

Learning with Boolean threshold functions

arXiv paper on training neural networks with Boolean threshold functions where all node values are strictly ±1.

Ax Kasun Dewage, Marianna Pensky, Suranadi De Silva, Shankadeep Mondal 2/20/2026

LORA-CRAFT: Cross-layer Rank Adaptation via Frozen Tucker Decomposition of Pre-trained Attention Weights

arXiv paper introducing LORA-CRAFT, a parameter-efficient fine-tuning method using Tucker decomposition on transformer attention weights.

Ax Peter Balogh 2/20/2026

The Anxiety of Influence: Bloom Filters in Transformer Attention Heads

arXiv paper identifying transformer attention heads functioning as membership filters, analyzing their spectrum of testing strategies across language models.

Ax Zachary Berger, Daniel Prakah-Asante, John Guttag, Collin M. Stultz 2/20/2026

Position: Evaluation of ECG Representations Must Be Fixed

Position paper arguing current ECG representation learning benchmarks must be revised to align with clinically meaningful objectives.

Ax Ihor Kendiukhov 2/20/2026

Systematic Evaluation of Single-Cell Foundation Model Interpretability Reveals Attention Captures Co-Expression Rather Than Unique Regulatory Signal

arXiv paper systematically evaluating mechanistic interpretability in single-cell foundation models using 37 analyses and 153 tests.

Ax Chris Tennant 2/20/2026

Toward a Fully Autonomous, AI-Native Particle Accelerator

Position paper proposing AI co-design for autonomous particle accelerator operation with minimal human intervention.

Ax Xiaoliang Fu, Jiaye Lin, Yangyi Fang, Binbin Zheng, Chaowen Hu, Zekai Shao, Cong Qin, Lu Pan, Ke Zeng, Xunliang Cai 2/20/2026

MASPO: Unifying Gradient Utilization, Probability Mass, and Signal Reliability for Robust and Sample-Efficient LLM Reasoning

arXiv paper introducing MASPO, a reinforcement learning method improving gradient utilization and probability mass handling for LLM reasoning.

Ax Minheng Chen, Jing Zhang, Tong Chen, Chao Cao, Tianming Liu, Li Su, Dajiang Zhu 2/20/2026

Probability-Invariant Random Walk Learning on Gyral Folding-Based Cortical Similarity Networks for Alzheimer's and Lewy Body Dementia Diagnosis

arXiv paper on gyral folding-based cortical networks for Alzheimer's and Lewy body dementia diagnosis.

Ax Sofiane Ennadir, Tianze Wang, Oleg Smirnov, Sahar Asadi, Lele Cao 2/20/2026

Be Wary of Your Time Series Preprocessing

arXiv paper analyzing how normalization strategies impact Transformer expressivity for time series representation learning.

Ax Antonio Guillen-Perez 2/20/2026

Conditional Flow Matching for Continuous Anomaly Detection in Autonomous Driving on a Manifold-Aware Spectral Space

arXiv paper on Deep-Flow, an unsupervised anomaly detection framework for autonomous vehicles using optimal transport conditional flow matching.

Ax Jayadev Billa 2/20/2026

The Cascade Equivalence Hypothesis: When Do Speech LLMs Behave Like ASR$\rightarrow$LLM Pipelines?

arXiv paper testing whether speech LLMs behave identically to ASR-to-LLM cascades across four models and six tasks.

Ax Jowaria Khan, Anindya Sarkar, Yevgeniy Vorobeychik, Elizabeth Bondi-Kelly 2/20/2026

Adapting Actively on the Fly: Relevance-Guided Online Meta-Learning with Latent Concepts for Geospatial Discovery

arXiv paper on relevance-guided online meta-learning for geospatial discovery under resource constraints and dynamic environments.

Ax Baihe Huang, Eric Xu, Kannan Ramchandran, Jiantao Jiao, Michael I. Jordan 2/20/2026

Towards Anytime-Valid Statistical Watermarking

arXiv paper on anytime-valid statistical watermarking for distinguishing machine-generated content from human text in LLMs.

Ax Luke Huang, Zhuoyang Zhang, Qinghao Hu, Shang Yang, Song Han 2/20/2026

Stable Asynchrony: Variance-Controlled Off-Policy RL for LLMs

arXiv paper on variance control in asynchronous off-policy RL for LLMs, addressing high variance from stale rollouts in critic-free methods.

Ax Shayan Kiyani, Sima Noorani, George Pappas, Hamed Hassani 2/20/2026

When to Trust the Cheap Check: Weak and Strong Verification for Reasoning

arXiv paper analyzing weak vs strong verification mechanisms in LLM reasoning systems, examining cost-reliability tradeoffs in verification loops.

Ax Xinghong Fu, Yanhong Li, Georgios Papaioannou, Yoon Kim 2/20/2026

Reverso: Efficient Time Series Foundation Models for Zero-shot Forecasting

arXiv paper on Reverso, a time series foundation model for zero-shot forecasting that scales to hundreds of millions of parameters.

Ax Keith Burghardt, Jienan Liu, Sadman Sakib, Yuning Hao, Bo Li 2/20/2026

FAMOSE: A ReAct Approach to Automated Feature Discovery

FAMOSE: ReAct-based agent for automated feature engineering in tabular data that autonomously explores and generates optimal features without domain expertise.

Ax Xiaohan Zhao, Zhaoyi Li, Yaxin Luo, Jiacheng Cui, Zhiqiang Shen 2/20/2026

Pushing the Frontier of Black-Box LVLM Attacks via Fine-Grained Detail Targeting

Black-box adversarial attack method on Large Vision-Language Models using fine-grained detail targeting to address gradient-free optimization challenges.

Ax Payel Bhattacharjee, Osvaldo Simeone, Ravi Tandon 2/20/2026

MARS: Margin-Aware Reward-Modeling with Self-Refinement

MARS framework for reward modeling using margin-aware training and self-refinement to reduce reliance on costly human-labeled preference data.

Ax Aidar Myrzakhan, Tianyi Li, Bowei Guo, Shengkun Tang, Zhiqiang Shen 2/20/2026

Sink-Aware Pruning for Diffusion Language Models

Novel pruning technique for Diffusion Language Models that optimizes inference efficiency by reconsidering attention sink preservation assumptions.

Ax Rachel Ma, Jingyi Qu, Andreea Bobu, Dylan Hadfield-Menell 2/20/2026

Goal Inference from Open-Ended Dialog

Research on embodied AI agents using LLMs for open-ended dialog to infer and accomplish diverse user goals efficiently and robustly.

Ax Masahiro Sato 2/20/2026

GAI: Generative Agents for Innovation

GAI: multi-agent LLM framework with reflection and dialogue for collective reasoning to drive innovation.

Ax Gali Noti, Kate Donahue, Jon Kleinberg, Sigal Oren 2/20/2026

AI-Assisted Decision Making with Human Learning

Framework studying AI-assisted human decision-making where humans learn through repeated interactions with algorithms.

Ax Andr\'e Barreto, Vincent Dumoulin, Yiran Mao, Mark Rowland, Nicolas Perez-Nieves, Bobak Shahriari, Yann Dauphin, Doina Precup, Hugo Larochelle 2/20/2026

Capturing Individual Human Preferences with Reward Features

Method for learning user-specific reward models in RLHF to capture individual preferences in LLM training.

Ax Neil Mallinar, A. Ali Heydari, Xin Liu, Anthony Z. Faranesh, Brent Winslow, Nova Hammerquist, Benjamin Graef, Cathy Speed, Mark Malhotra, Shwetak Patel, Javier L. Prieto, Daniel McDuff, Ahmed A. Metwally 2/20/2026

A Scalable Framework for Evaluating Health Language Models

Evaluation framework for assessing health-focused LLMs on personalized response quality with scalable methodology.

Ax Bernardo Cuenca Grau, Eva Feng, Przemys{\l}aw Andrzej Wa{\l}\k{e}ga 2/20/2026

The Correspondence Between Bounded Graph Neural Networks and Fragments of First-Order Logic

Theoretical correspondence between bounded GNNs and first-order logic fragments characterizing expressive power.

Ax Bosung Kim, Prithviraj Ammanabrolu 2/20/2026

Beyond Needle(s) in the Embodied Haystack: Environment, Architecture, and Training Considerations for Long Context Reasoning

∞-THOR: framework for long-horizon embodied AI tasks with benchmark testing long-context reasoning across extended trajectories.

Ax Mert Cemri, Nived Rajaraman, Rishabh Tiwari, Xiaoxuan Liu, Kurt Keutzer, Ion Stoica, Kannan Ramchandran, Ahmad Beirami, Ziteng Sun 2/20/2026

$\texttt{SPECS}$: Faster Test-Time Scaling through Speculative Drafts

SPECS: method for faster test-time scaling in LLMs using speculative drafts to reduce latency while maintaining performance.

Ax David A Kelly, Hana Chockler 2/20/2026

Sufficient, Necessary and Complete Causal Explanations in Image Classification

Formal causal explanations for image classifier decisions using logic-based approaches.

Ax Szymon Pawlonka, Miko{\l}aj Ma{\l}ki\'nski, Jacek Ma\'ndziuk 2/20/2026

Bongard-RWR+: Real-World Representations of Fine-Grained Concepts in Bongard Problems

Bongard-RWR+: benchmark for abstract visual reasoning with fine-grained concepts using real-world images.

Ax Diego Ortiz Barbosa, Mohit Agrawal, Yash Malegaonkar, Luis Burbano, Axel Andersson, Gy\"orgy D\'an, Henrik Sandberg, Alvaro A. Cardenas 2/20/2026

Drones that Think on their Feet: Sudden Landing Decisions with Embodied AI

Embodied AI system enabling autonomous drones to make adaptive decisions for sudden events using visual language models.

Ax Gil Pasternak, Dheeraj Rajagopal, Julia White, Dhruv Atreja, Matthew Thomas, George Hurn-Maloney, Ash Lewis 2/20/2026

Beyond Reactivity: Measuring Proactive Problem Solving in LLM Agents

PROBE: benchmark for measuring proactive problem-solving in LLM agents across extended contexts and time horizons.

Ax Myung Ho Kim 2/20/2026

Bridging Symbolic Control and Neural Reasoning in LLM Agents: Structured Cognitive Loop with a Governance Layer

SCL: modular agent architecture separating cognition into five phases with soft symbolic control governance layer for LLM agents.

Ax Maohao Ran, Zhenglin Wan, Cooper Lin, Yanting Zhang, Hongyu Xin, Hongwei Fan, Yibo Xu, Beier Luo, Yaxin Zhou, Wangbo Zhao, Lijie Yang, Lang Feng, Fuchao Yang, Jingxuan Wu, Yiqiao Huang, Chendong Ma, Dailing Jiang, Jianbo Deng, Sirui Han, Yang You, Bo An, Yike Guo, Jun Song 2/20/2026