Breaking the Chain: A Causal Analysis of LLM Faithfulness to Intermediate Structures
Causal evaluation protocol measuring whether intermediate structures (rubrics, checklists) causally determine LLM outputs or merely accompany them.
Multimodal LLM (ExpressMind) for expressway operation, applying cognitive intelligence to transportation systems beyond rule-based approaches.
Investigates customization approaches for smaller open-source LLMs to improve domain-specific code generation without relying on large proprietary models.
Proposes guardrails for LLM-enabled robots allocating scarce assistance across multiple users with conflicting values and unpredictable LLM behavior.
BenchPreS evaluates whether memory-based LLM personalization appropriately suppresses user preferences in context-sensitive communication settings.
V-DyKnow benchmark evaluates how vision-language models handle time-sensitive knowledge that becomes outdated after training.
Framework for runtime governance of LLM-based AI agents, balancing task completion with legal and reputational costs through execution-path monitoring.
Analyzes AI reasoning about geopolitical conflicts using a temporally grounded case study of a 2026 Middle East conflict that postdates model training cutoffs.
Integrates constraint propagation into dynamic programming to bridge the gap between state-based and constraint-based paradigms for combinatorial problems.
Pipeline for developing norm-compliant reinforcement learning agents inspired by the Pinocchio story, addressing safe AI integration into society.
Fine-tuning LLMs on journal publication decisions to enable models to assess scientific merit and predict promising research directions.
Mobile app teaching digital literacy and prebunking misinformation tactics through interactive challenges in nine languages.
Code LLM series (7B-40B) using code-flow multi-stage training paradigm to capture dynamic software logic evolution.
Investigation of how user personalization and mental health disclosure affect harmful behavior in tool-using LLM agents.
Benchmark for evaluating continual learning in biomedical NLP across task-diverse datasets with robustness and efficiency metrics.
Study of reproducibility in AI coding agents, showing agent-to-agent variation produces nonstandard errors in empirical results.
Two-stage RL framework training multimodal agents for anticipatory reasoning and long-term planning in multi-step tasks.
Pipeline integrating forecasting models and ML regressors with inventory optimization, evaluated on M5 Walmart dataset.
Evaluation of conformal factuality as reliability guarantee for RAG-based LLMs with novel metrics and robustness analysis.
Large-scale multimodal surgical dataset and foundation models for cross-procedure generalization in surgical AI tasks.
Study of cultural bias in LLMs and prompt-based methods to improve cultural alignment for policy and decision-making tasks.
RL environment where LLM agents learn to generate professional presentations through research, planning, and tool use with multi-component reward system.
Method for training LLM agents to leverage rich environment feedback through reflective experience and post-training, improving long-horizon planning.
Benchmark evaluating audio-visual social interactivity capabilities of omni-modal LLMs in dynamic dialogue settings.
RL framework using Soft Actor-Critic to learn adaptive ray sampling policies for efficient neural radiance field rendering.
Multimodal AI search framework combining vector search, hybrid retrieval, and reasoning for pharmaceutical data across text, images, audio, and video.
Evaluation of VLMs (GPT-4V, Gemini, Claude, LLaVA) for navigation assistance tasks for people with vision impairments.
Framework extending RLHF using multi-dimensional rubric-based rewards instead of scalar signals for RL training.
Inference-time governance approach for LLMs using adaptive prompt routing to enable social alignment without retraining.
Federated learning framework integrating knowledge graphs and temporal transformers for early sepsis prediction in multi-center ICUs.
Study on recursive language models with self-reflective program search for long-context handling, addressing information extraction challenges.
Analysis of the Gini Index's role in prompt-based classification for detecting and mitigating class accuracy disparities in long-tailed datasets.
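The Gini Index used for disparity analysis is a standard inequality measure (mean absolute pairwise difference, normalized). A minimal sketch of computing it over per-class accuracies; the function name and the example numbers are illustrative, not taken from the paper:

```python
def gini_index(values):
    """Gini inequality coefficient: 0.0 when all values are equal,
    approaching 1.0 as the distribution becomes maximally unequal."""
    n = len(values)
    mean = sum(values) / n
    if mean == 0:
        return 0.0
    # Sum of absolute differences over all ordered pairs.
    total = sum(abs(a - b) for a in values for b in values)
    return total / (2 * n * n * mean)

# Per-class accuracies from a hypothetical long-tailed classifier:
balanced = gini_index([0.9, 0.9, 0.9, 0.9])    # no disparity -> 0.0
skewed = gini_index([0.95, 0.90, 0.40, 0.10])  # head classes dominate -> larger
```

A large coefficient over the per-class accuracy vector flags that the classifier's performance is concentrated on head classes, which is the disparity signal the summary refers to.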
Defense mechanism against steganographic collusion in multi-agent reinforcement learning using dynamic representational circuit breaking.
Model rectification framework using attribution-guided rank-one editing to fix unreliable neural network behaviors on corrupted samples.
Open-source pipeline extending single-agent AI orthodontic treatment planning to a dual-agent framework with improved tooth segmentation and landmark detection.
Application of quantum amplitude estimation to catastrophe insurance tail-risk pricing with convergence analysis and NISQ noise effects.
AI agent system for hardware design reviews using LLMs to verify semantic correctness of component connections against datasheets.
Framework for LLM application release management using automated self-testing with evidence-based quality gates across five dimensions.
Analysis of transformer training dynamics using Spectral Edge Dynamics to measure coherent optimization directions versus stochastic noise.
Context-aware safety framework for personalized text-to-image models that prevents misuse without broad concept erasure.
Analysis of multi-turn safety failures in LLMs through state-space perspective, showing structured contextual evolution enables jailbreaks.
Token compression method for omnimodal LLMs using dynamic audio-driven semantic chunking to reduce inference costs for audio-visual processing.
Domain adaptation approach for remaining useful life prediction using evidential learning under incomplete degradation trajectories.
Study on engineering challenges in LLM-based multi-agent systems, addressing context pressure, coordination errors, and system drift at scale.
Defense framework against backdoor attacks in LLMs using trigger generation and inversion to locate and mitigate malicious triggers.
Study on over-smoothing in hypergraph neural networks using Ricci flow theory to improve message passing and layer depth handling.
Research on using inference time as a proxy to estimate LLM energy consumption, addressing opacity in API-based model access and environmental impact.
SEMAG: self-evolutionary multi-agent code generation framework that decomposes programming tasks into planning, coding, and debugging stages with adaptive workflow selection.
Uncertainty-guided multi-expert framework for imbalanced sequence learning addressing poor expert specialization and prediction conflicts in long-tailed data.
Retrieval-augmented generation framework using GPT-4 to accelerate CO2 reduction catalyst discovery by exploring chemical spaces and interpreting results.
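The retrieve-then-prompt pattern underlying the catalyst-discovery framework can be sketched generically. Everything here (toy vectors, document texts, function names) is illustrative scaffolding, not the paper's pipeline:

```python
def retrieve(query_vec, corpus, k=2):
    """Rank documents by dot-product similarity to the query and return top k."""
    scored = sorted(
        corpus,
        key=lambda d: -sum(q * x for q, x in zip(query_vec, d["vec"])),
    )
    return scored[:k]

def build_prompt(question, docs):
    """Assemble a retrieval-augmented prompt: context passages, then question."""
    context = "\n".join(f"- {d['text']}" for d in docs)
    return f"Context:\n{context}\n\nQuestion: {question}"

# Toy corpus with hand-assigned 2-d embeddings:
corpus = [
    {"text": "Cu catalysts favor C2+ products in CO2 reduction.", "vec": [1.0, 0.2]},
    {"text": "Ag catalysts favor CO formation.", "vec": [0.1, 1.0]},
]
docs = retrieve([1.0, 0.0], corpus, k=1)
prompt = build_prompt("Which metal favors C2+ products?", docs)
```

In the actual framework the embeddings would come from a learned encoder and the assembled prompt would be sent to GPT-4; the sketch only shows the retrieval-and-assembly skeleton.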