Isolater - Feed

Ax Fernando Ropero, Erkin Turkoz, Daniel Matos, Junqing Du, Antonio Ruiz, Yanfeng Zhang, Lu Liu, Mingwei Sun, Yongliang Wang 3/17/2026

RieMind: Geometry-Grounded Spatial Agent for Scene Understanding

Spatial reasoning agent decoupling perception from reasoning in visual language models for improved metric and geometric scene understanding.

Ax Yanning Dai, Yuhui Wang, Dylan R. Ashley, J\"urgen Schmidhuber 3/17/2026

Efficient Morphology-Control Co-Design via Stackelberg Proximal Policy Optimization

Bi-level optimization approach using Stackelberg game theory for coupled morphology-control co-design in embodied agents.

Ax Noe Claudel, Weisi Guo, Yang Xing 3/17/2026

AI Evasion and Impersonation Attacks on Facial Re-Identification with Activation Map Explanations

Adversarial patch framework for evasion and impersonation attacks against facial re-identification systems across non-overlapping cameras.

Ax Yu Pan, Wenlong Yu, Tiejun Wu, Xiaohu Ye, Qiannan Si, Guangquan Xu, Bin Wu 3/17/2026

SFCoT: Safer Chain-of-Thought via Active Safety Evaluation and Calibration

Safety defense mechanism for LLMs monitoring intermediate reasoning steps in chain-of-thought to prevent jailbreak attacks.

Ax Tingxu Han, Yi Zhang, Wei Song, Chunrong Fang, Zhenyu Chen, Youcheng Sun, Lijie Hu 3/17/2026

SWE-Skills-Bench: Do Agent Skills Actually Help in Real-World Software Engineering?

Benchmark evaluating marginal utility of agent skills for LLM-based software engineering agents on real GitHub issues and requirements.

Ax Jia Wang, Chuanyu Qin, Mingyu Zheng, Qingyi Si, Peize Li, Zheng Lin 3/17/2026

A Closer Look into LLMs for Table Understanding

Empirical study of 16 LLMs examining internal mechanisms for table understanding across attention dynamics, layer depth, and expert activation.

Ax Mohamed Aziz Younes, Nicolas Saunier, Guillaume-Alexandre Bilodeau 3/17/2026

Detection of Autonomous Shuttles in Urban Traffic Images Using Adaptive Residual Context

Video object detection method using adaptive residual context for identifying autonomous shuttles in urban traffic monitoring.

Ax Kai Wang, Biaojie Zeng, Zeming Wei, Chang Jin, Hefeng Zhou, Xiangtian Li, Chao Yang, Jingjing Qu, Xingcheng Xu, Xia Hu 3/17/2026

TrinityGuard: A Unified Framework for Safeguarding Multi-Agent Systems

Comprehensive safety evaluation and monitoring framework for LLM-based multi-agent systems addressing novel risks beyond single agents.

Ax Ali Soltan Mohammadi, Samira Nazari, Ali Azarpeyvand, Mahdi Taheri, Milos Krstic, Michael Huebner, Christian Herglotz, Tara Ghasempouri 3/17/2026

RESQ: A Unified Framework for REliability- and Security Enhancement of Quantized Deep Neural Networks

Framework for improving robustness of quantized DNNs through three-stage fine-tuning addressing both fault and attack resilience.

Ax Vanshaj Khattar, Md Rafi ur Rashid, Moumita Choudhury, Jing Liu, Toshiaki Koike-Akino, Ming Jin, Ye Wang 3/17/2026

Amplification Effects in Test-Time Reinforcement Learning: Safety and Reasoning Vulnerabilities

Analysis of safety vulnerabilities in test-time training methods for LLMs, examining susceptibility to prompt injection and adversarial attacks.

Ax Shahil Shaik, Aditya Parameshwaran, Anshul Nayak, Jonathon M. Smereka, Yue Wang 3/17/2026

MA-VLCM: A Vision Language Critic Model for Value Estimation of Policies in Multi-Agent Team Settings

Vision-language critic model leveraging pre-trained VLAs for multi-agent reinforcement learning value estimation with improved generalization.

Ax Taeyun Roh, Wonjune Jang, Junha Jung, Jaewoo Kang 3/17/2026

CLAG: Adaptive Memory Organization via Agent-Driven Clustering for Small Language Model Agents

Memory management framework for small language model agents using adaptive clustering to organize experiences and prevent knowledge corruption.

Ax Vlad Medvedev, Leon Armbruster, Christopher Straub, Georg Kruse, Andreas Rosskopf 3/17/2026

Physics-informed fine-tuning of foundation models for partial differential equations

Fine-tuning strategies for PDE foundation models using physics-informed training to adapt to new tasks with limited domain-specific data.

Ax Sachin Prajuli, Abhishek Karna, OmPrakash Dhakl 3/17/2026

Music Genre Classification: A Comparative Analysis of Classical Machine Learning and Deep Learning Approaches

Comparative study of classical ML and deep learning for music genre classification, focusing on underrepresented Nepali music traditions.

Ax Simone Aonzo, Merve Sahin, Aur\'elien Francillon, Daniele Perito 3/17/2026

Evasive Intelligence: Lessons from Malware Analysis for Evaluating AI Agents

Framework evaluating AI agent vulnerabilities by applying malware analysis concepts to test-time agent behavior and adversarial robustness.

Ax Haichao Liu, Yuheng Zhou, Zhenyu Wu, Ziheng Ji, Ziyu Shan, Qianzhun Wang, Ruixuan Liu, Zhiyuan Yang, Yejun Gu, Shalman Khan, Shijun Yan, Jun Liu, Haiyue Zhu, Changliu Liu, Jianfei Yang, Jingbing Zhang, Ziwei Wang 3/17/2026

RoCo Challenge at AAAI 2026: Benchmarking Robotic Collaborative Manipulation for Assembly Towards Industrial Automation

Benchmark challenge for robotic collaborative manipulation and assembly tasks in industrial automation settings.

Ax Shovon Niverd Pereira, Krishna Khadka, Yu Lei 3/17/2026

TabKD: Tabular Knowledge Distillation through Interaction Diversity of Learned Feature Bins

Knowledge distillation method for tabular models that addresses feature interactions without original training data, enabling privacy-preserving model compression.

Ax Xianbao Hou, Yonghao He, Zeyd Boukhers, John See, Hu Su, Wei Sui, Cong Yang 3/17/2026

RSGen: Enhancing Layout-Driven Remote Sensing Image Generation with Diverse Edge Guidance

RSGen: Framework for layout-driven remote sensing image generation using diffusion models with edge guidance.

Ax Andrew Ritchhart, Sarah I. Allec, Pravalika Butreddy, Krista Kulesa, Qingpu Wang, Dan Thien Nguyen, Maxim Ziatdinov, Elias Nakouzi 3/17/2026

Agentic workflow enables the recovery of critical materials from complex feedstocks via selective precipitation

Multi-agentic workflow deploying AI agents with automated instruments to recover critical materials via selective precipitation.

Ax Pratyush Acharya, Habish Dhakal 3/17/2026

Grokking as a Variance-Limited Phase Transition: Spectral Gating and the Epsilon-Stability Threshold

Analyzes grokking phenomenon in neural networks through spectral gating mechanism and optimizer noise interaction.

Ax Raeid Saqur, Christoph Bergmeir, Blanka Horvath, Daniel Schmidt, Frank Rudzicz, Terry Lyons 3/17/2026

Seeking SOTA: Time-Series Forecasting Must Adopt Taxonomy-Specific Evaluation to Dispel Illusory Gains

Argues for taxonomy-specific evaluation in time-series forecasting to accurately assess ML progress versus classical methods.

Ax David \v{S}teva\v{n}\'ak, Marek \v{S}uppa 3/17/2026

SlovKE: A Large-Scale Dataset and LLM Evaluation for Slovak Keyphrase Extraction

SlovKE: Dataset and LLM evaluation for keyphrase extraction in Slovak, a morphologically rich low-resource language.

Ax Aleksander Krasowski, Ren\'e P. Klausen, Aycan Celik, Sebastian Lapuschkin, Wojciech Samek, Jonas Naujoks 3/17/2026

Building Trust in PINNs: Error Estimation through Finite Difference Methods

Proposes error estimation method for Physics-informed neural networks using finite difference post-hoc validation.

Ax Yifan Wang, Debabrota Basu, Pierre Bourhis, Romain Rouvoy, Patrick Royer 3/17/2026

DOT: Dynamic Knob Selection and Online Sampling for Automated Database Tuning

DOT: Automated database tuning system using dynamic knob selection and online sampling to optimize DBMS performance.

Ax Shaojie Shi, Zhengyu Shi, Lingran Zheng, Xinyu Su, Anna Xie, Bohao Lv, Rui Xu, Zijian Chen, Zhichao Chen, Guolei Liu, Naifu Zhang, Mingjian Dong, Zhuo Quan, Bohao Chen, Teqi Hao, Yuan Qi, Yinghui Xu, Libo Wu 3/17/2026

InterveneBench: Benchmarking LLMs for Intervention Reasoning and Causal Study Design in Real Social Systems

InterveneBench: Benchmark evaluating LLMs on causal inference and intervention reasoning in realistic social science scenarios.

Ax Yanick Zengaffinen, Andreas Opedal, Donya Rooein, Kv Aditya Srivatsa, Shashank Sonkar, Mrinmaya Sachan 3/17/2026

Can LLMs Model Incorrect Student Reasoning? A Case Study on Distractor Generation

Studies how LLMs model student misconceptions when generating multiple-choice distractors, analyzing reasoning strategies.

Ax Seth Karten, Jake Grigsby, Tersoo Upaa Jr, Junik Bae, Seonghun Hong, Hyunyoung Jeong, Jaeyoon Jung, Kun Kerdthaisong, Gyungbo Kim, Hyeokgi Kim, Yujin Kim, Eunju Kwon, Dongyu Liu, Patrick Mariglia, Sangyeon Park, Benedikt Schink, Xianwei Shi, Anthony Sistilli, Joseph Twin, Arian Urdu, Matin Urdu, Qiao Wang, Ling Wu, Wenli Zhang, Kunsheng Zhou, Stephanie Milani, Kiran Vodrahalli, Amy Zhang, Fei Fang, Yuke Zhu, Chi Jin 3/17/2026

The PokeAgent Challenge: Competitive and Long-Context Learning at Scale

PokeAgent Challenge: Large-scale benchmark for competitive multi-agent decision-making with partial observability and long-horizon planning.

Ax Ivan Stetsenko 3/17/2026

Lore: Repurposing Git Commit Messages as a Structured Knowledge Protocol for AI Coding Agents

Lore: Protocol using structured Git commit messages to preserve decision context and institutional knowledge for AI coding agents.

Ax Vasiliy A. Es'kin, Egor V. Ivanov 3/17/2026

Physics-Informed Neural Systems for the Simulation of EUV Electromagnetic Wave Diffraction from a Lithography Mask

Physics-informed neural networks and neural operators for simulating EUV electromagnetic wave diffraction in lithography.

Ax Yibin Liu, Yaxing Lyu, Daqi Gao, Zhixuan Liang, Weiliang Tang, Shilong Mu, Xiaokang Yang, Yao Mu 3/17/2026

From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation

PRIMO R1: Framework using reinforcement learning to improve multimodal models for process reasoning in robotic manipulation.

Ax Lingyu Li, Yan Teng, Yingchun Wang 3/17/2026

Mechanistic Origin of Moral Indifference in Language Models

Analyzes moral indifference in LLMs due to compressed moral concepts and proposes remedial techniques.

Ax Lianghui Zhu, Yuxin Fang, Bencheng Liao, Shijie Wang, Tianheng Cheng, Zilong Huang, Chen Chen, Lai Wei, Yutao Zeng, Ya Wang, Yi Lin, Yu Li, Xinggang Wang 3/17/2026

Mixture-of-Depths Attention

Mixture-of-Depths Attention: Mechanism addressing signal degradation in deep LLMs by enabling attention to multiple depth levels.

Ax Zun Li, Marc Lanctot, Kevin R. McKee, Luke Marris, Ian Gemp, Daniel Hennes, Paul Muller, Kate Larson, Yoram Bachrach, Michael P. Wellman 3/17/2026

Combining Tree-Search, Generative Models, and Nash Bargaining Concepts in Game-Theoretic Reinforcement Learning

Combines tree-search, generative models, and Nash bargaining for opponent modeling in game-theoretic reinforcement learning.

Ax Guangkun Nie, Jiabao Zhu, Gongzheng Tang, Deyun Zhang, Shijia Geng, Qinghao Zhao, Shenda Hong 3/17/2026

A Review of Deep Learning Methods for Photoplethysmography Data

Review of deep learning methods for photoplethysmography signal analysis in clinical and wearable applications.

Ax Alessio Buscemi, Daniele Proverbio, Alessandro Di Stefano, The-Anh Han, German Castignani, Pietro Li\`o 3/17/2026

FAIRGAME: a Framework for AI Agents Bias Recognition using Game Theory

FAIRGAME: Framework using game theory to detect and recognize bias in multi-agent AI systems.

Ax Danlong Yuan, Tian Xie, Shaohan Huang, Zhuocheng Gong, Huishuai Zhang, Chong Luo, Furu Wei, Dongyan Zhao 3/17/2026

Shorten After You're Right: Lazy Length Penalties for Reasoning RL

Method to reduce reasoning path length in large reasoning models like o1 and R1 using reward designs in reinforcement learning.

Ax Dhaval Patel, Shuxin Lin, James Rayfield, Nianjun Zhou, Chathurangi Shyalika, Suryanarayana R Yarrabothula, Roman Vaculin, Natalia Martinez, Fearghal O'donncha, Jayant Kalagnanam 3/17/2026

AssetOpsBench: Benchmarking AI Agents for Task Automation in Industrial Asset Operations and Maintenance

AssetOpsBench: A benchmark framework for evaluating LLM agents on industrial asset operations tasks like condition monitoring and maintenance scheduling.

Ax Sydney Levine, Matija Franklin, Tan Zhi-Xuan, Secil Yanik Guyot, Lionel Wong, Daniel Kilov, Yejin Choi, Joshua B. Tenenbaum, Noah Goodman, Seth Lazar, Iason Gabriel 3/17/2026

Resource Rational Contractualism Should Guide AI Alignment

Framework for AI alignment grounded in resource-rational contractualism, enabling diverse stakeholders to reach agreements on AI decision-making.

Ax Monoshiz Mahbub Khan, Xiaoyin Xi, Andrew Meneely, Yiming Tang, Zhe Yu 3/17/2026

Efficient Story Point Estimation With Comparative Learning

Machine learning approach to automate story point estimation for software sprint planning using comparative learning from historical team decisions.

Ax Lei Chen, Xuanle Zhao, Zhixiong Zeng, Jing Huang, Yufeng Zhong, Lin Ma 3/17/2026

Chart-R1: Chain-of-Thought Supervision and Reinforcement for Advanced Chart Reasoner

Vision-language model for chart reasoning using chain-of-thought supervision and reinforcement learning to improve numerical comprehension and multi-level visual understanding.

Ax Zhuohang Jiang, Pangjing Wu, Xu Yuan, Wenqi Fan, Qing Li 3/17/2026

QA-Dragon: Query-Aware Dynamic RAG System for Knowledge-Intensive Visual Question Answering

Dynamic retrieval-augmented generation system for visual question answering that retrieves from both text and images to handle complex multimodal queries.

Ax Lei Chen, Xuanle Zhao, Zhixiong Zeng, Jing Huang, Liming Zheng, Yufeng Zhong, Lin Ma 3/17/2026

Breaking the SFT Plateau: Multimodal Structured Reinforcement Learning for Chart-to-Code Generation

Multimodal reinforcement learning approach for chart-to-code generation that combines structured output requirements with visual reasoning on information-rich images.

Ax Andrew Ferguson, Marisa LaFleur, Lars Ruthotto, Jesse Thaler, Yuan-Sen Ting, Pratyush Tiwary, Soledad Villar, E. Paulo Alves, Jeremy Avigad, Simon Billinge, Camille Bilodeau, Keith Brown, Emmanuel Candes, Arghya Chattopadhyay, Bingqing Cheng, Jonathan Clausen, Connor Coley, Andrew Connolly, Fred Daum, Sijia Dong, Chrisy Xiyu Du, Cora Dvorkin, Cristiano Fanelli, Eric B. Ford, Luis Manuel Frutos, Nicol\'as Garc\'ia Trillos, Cecilia Garraffo, Robert Ghrist, Rafael Gomez-Bombarelli, Gianluca Guadagni, Sreelekha Guggilam, Sergei Gukov, Juan B. Guti\'errez, Salman Habib, Johannes Hachmann, Boris Hanin, Philip Harris, Murray Holland, Elizabeth Holm, Hsin-Yuan Huang, Shih-Chieh Hsu, Nick Jackson, Olexandr Isayev, Heng Ji, Aggelos Katsaggelos, Jeremy Kepner, Yannis Kevrekidis, Michelle Kuchera, J. Nathan Kutz, Branislava Lalic, Ann Lee, Matt LeBlanc, Josiah Lim, Rebecca Lindsey, Yongmin Liu, Peter Y. Lu, Sudhir Malik, Vuk Mandic, Vidya Manian, Emeka P. Mazi, Pankaj Mehta, Peter Melchior, Brice M\'enard, Jennifer Ngadiuba, Stella Offner, Elsa Olivetti, Shyue Ping Ong, Christopher Rackauckas, Philippe Rigollet, Chad Risko, Philip Romero, Grant Rotskoff, Brett Savoie, Uros Seljak, David Shih, Gary Shiu, Dima Shlyakhtenko, Eva Silverstein, Taylor Sparks, Thomas Strohmer, Christopher Stubbs, Stephen Thomas, Suriyanarayanan Vaikuntanathan, Rene Vidal, Francisco Villaescusa-Navarro, Gregory Voth, Benjamin Wandelt, Rachel Ward, Melanie Weber, Risa Wechsler, Stephen Whitelam, Olaf Wiest, Mike Williams, Zhuoran Yang, Yaroslava G. Yingling, Bin Yu, Shuwen Yue, Ann Zabludoff, Huimin Zhao, Tong Zhang 3/17/2026