Isolater - Feed

Ax Haocheng Ju, Guoxiong Gao, Jiedong Jiang, Bin Wu, Zeming Sun, Leheng Chen, Yutong Wang, Yuefeng Wang, Zichen Wang, Wanyi He, Peihao Wu, Liang Xiao, Ruochuan Liu, Bryan Dai, Bin Dong 27d ago

Automated Conjecture Resolution with Formal Verification

Automated framework for research-level mathematical problem solving combining LLMs with formal verification to reliably resolve conjectures and verify proofs.

Ax Dipkumar Patel 27d ago

Representational Collapse in Multi-Agent LLM Committees: Measurement and Diversity-Aware Consensus

Representational collapse in multi-agent LLM committees: measurement of similarity showing agents produce redundant rationales despite different role prompts, with diversity-aware consensus.

Ax Felix Stillger, Lukas Hahn, Frederik Hasecke, Tobias Meisen 27d ago

InCaRPose: In-Cabin Relative Camera Pose Estimation Model and Dataset

InCaRPose: Transformer-based model for relative camera pose estimation in automotive in-cabin monitoring with distorted imaging environments.

Ax Jonas De Schouwer, Haitz Sáez de Ocáriz Borde, Xiaowen Dong 27d ago

k-Maximum Inner Product Attention for Graph Transformers and the Expressive Power of GraphGPS

k-Maximum Inner Product Attention for efficient graph transformers, reducing quadratic complexity while maintaining expressiveness for large-scale graphs.

Ax Hope McGovern, Caroline Craig, Thomas Lippincott, Hale Sirin 27d ago

When Models Know More Than They Say: Probing Analogical Reasoning in LLMs

Analysis of analogical reasoning in LLMs comparing probed representations with prompted performance, revealing limitations in latent abstraction and generalization.

Ax Zonghan Li, Yi Liu, Chunyan Wang, Song Tong, Kaiping Peng, Feng Ji 27d ago

Enhancing behavioral nudges with large language model-based iterative personalization: A field experiment on electricity and hot-water conservation

Field experiment on LLM agent providing iterative personalized behavioral nudges for electricity and hot-water conservation across intervention rounds.

Ax Indar Kumar, Akanksha Tiwari 27d ago

Regime-Calibrated Demand Priors for Ride-Hailing Fleet Dispatch and Repositioning

Regime-calibrated demand priors for ride-hailing dispatch using historical segmentation and multi-metric similarity ensemble for fleet repositioning.

Ax Saad Alqithami 27d ago

Latency-Aware Resource Allocation over Heterogeneous Networks: A Lorentz-Invariant Market Mechanism

Lorentz-Invariant Auction mechanism for bandwidth allocation across heterogeneous-delay networks including LEO satellites and deep-space relays.

Ax Haotian Zong, Binze Li, Yufei Long, Sinyin Chang, Jialong Wu, Gillian K. Hadfield 27d ago

I-CALM: Incentivizing Confidence-Aware Abstention for LLM Hallucination Mitigation

I-CALM: prompt-only intervention reducing LLM hallucinations by incentivizing confidence-aware abstention through reward scheme announcements and humility principles.

Ax Saad Alqithami 27d ago

DC-Ada: Reward-Only Decentralized Observation-Interface Adaptation for Heterogeneous Multi-Robot Teams

DC-Ada: reward-only decentralized adaptation for heterogeneous multi-robot teams, adapting frozen policies to mismatched sensor configurations.

Ax Dalal Alharthi, Ivan Roberto Kawaminami Garcia 27d ago

Automating Cloud Security and Forensics Through a Secure-by-Design Generative AI Framework

Secure-by-design GenAI framework for cloud security and forensics using LLMs with defenses against prompt injection and forensic rigor requirements.

Ax Atahan Dokme, Sriram Vishwanath 27d ago

Interpreting Video Representations with Spatio-Temporal Sparse Autoencoders

Spatio-temporal sparse autoencoders for interpretable video representation learning, using contrastive objectives and hierarchical grouping to preserve temporal coherence.

Ax Xinyi Ling, Ye Liu, Reza Averly, Xia Ning 27d ago

Uncertainty as a Planning Signal: Multi-Turn Decision Making for Goal-Oriented Conversation

Multi-turn decision making framework for goal-oriented conversational systems balancing information acquisition and target commitment under user intent uncertainty.

Ax Fangzhou Lin, Peiran Li, Shuo Xing, Siyuan Yang, Qianwen Ge, Kazunori Yamada, Ziming Zhang, Haichong Zhang, Zhengzhong Tu 27d ago

AdaptFuse: Training-Free Sequential Preference Learning via Externalized Bayesian Inference

AdaptFuse: training-free framework for LLMs to perform Bayesian belief updating across multi-turn interactions without fine-tuning on user data.

Ax Indar Kumar, Girish Karhana, Sai Krishna Jasti, Ankit Hemant Lade 27d ago

Supervised Dimensionality Reduction Revisited: Why LDA on Frozen CNN Features Deserves a Second Look

Revisits supervised dimensionality reduction, arguing that LDA applied to frozen CNN features deserves a second look.

Ax Yifu Ding, Xinhao Zhang, Jinyang Guo 27d ago

Diagonal-Tiled Mixed-Precision Attention for Efficient Low-Bit MXFP Inference

Low-bit mixed-precision attention kernel using MXFP format for efficient LLM inference, reducing memory bandwidth and computational costs of transformer attention mechanisms.

Ax Hongwei Xu 27d ago

Symbolic-Vector Attention Fusion for Collective Intelligence

Symbolic-Vector Attention Fusion (SVAF): mechanism for multi-agent communication enabling agents to evaluate which signal dimensions to use in collective intelligence systems.

Ax Ravi Ranjan, Agoritsa Polyzou 27d ago

VLA-Forget: Vision-Language-Action Unlearning for Embodied Foundation Models

VLA-Forget: unlearning framework for vision-language-action embodied models in robotic manipulation, removing unsafe behaviors while preserving perception and language grounding.

Ax Khanh Linh Nguyen, Hoa Nghiem, Tu Tran 27d ago

TraceGuard: Structured Multi-Dimensional Monitoring as a Collusion-Resistant Control Protocol

TraceGuard: structured multi-dimensional monitoring protocol for detecting attacks on untrusted AI agents, addressing collusion risks through five-dimensional evaluation of agent reasoning and actions.

Ax Minglei Chen, Weilong Wang, Jiang Duan, Ye Deng 27d ago

Gram-Anchored Prompt Learning for Vision-Language Models via Second-Order Statistics

Gram-anchored prompt learning method for Vision-Language Models using second-order statistics for parameter-efficient adaptation.

Ax Shenzhi Yang, Guangcheng Zhu, Bowen Song, Sharon Li, Haobo Wang, Xing Zheng, Yingfan Ma, Zhongqi Chen, Weiqiang Wang, Gang Chen 27d ago

Can LLMs Learn to Reason Robustly under Noisy Supervision?

Analysis of noisy label robustness in Reinforcement Learning with Verifiable Rewards for training LLM reasoning models.

Ax Mohammad Hossein Chinaei 27d ago

Causality Laundering: Denial-Feedback Leakage in Tool-Calling LLM Agents

Causality laundering: security vulnerability in tool-calling LLM agents where adversaries exfiltrate information through denial-feedback patterns.

Ax Siyuan Li, Zehao Liu, Xi Lin, Qinghua Mao, Yuliang Chen, Haoyu Li, Jun Wu, Jianhua Li, Xiu Su 27d ago

CoopGuard: Stateful Cooperative Agents Safeguarding LLMs Against Evolving Multi-Round Attacks

CoopGuard: stateful cooperative multi-agent defense framework protecting LLMs against evolving adversarial attacks across multi-round interactions.

Ax Jihoon Jeong 27d ago

Extracting and Steering Emotion Representations in Small Language Models: A Methodological Comparison

First comparative analysis of emotion vector extraction methods across 9 small language models using multiple architectural families.

Ax Taiping Qu, Hongkai Zhang, Lantian Zhang, Can Zhao, Nan Zhang, Hui Wang, Zhen Zhou, Mingye Zou, Kairui Bo, Pengfei Zhao, Xingxing Jin, Zixian Su, Kun Jiang, Huan Liu, Yu Du, Maozhou Wang, Ruifang Yan, Zhongyuan Wang, Tiejun Huang, Lei Xu, Henggui Zhang 27d ago

BAAI Cardiac Agent: An intelligent multimodal agent for automated reasoning and diagnosis of cardiovascular diseases from cardiac magnetic resonance imaging

BAAI Cardiac Agent: multimodal AI agent for automated cardiovascular disease diagnosis from cardiac MRI with specialized expert models.

Ax Shkelqim Sherifi 27d ago

Intelligent Traffic Monitoring with YOLOv11: A Case Study in Real-Time Vehicle Detection

Real-time traffic monitoring system using YOLOv11 object detection with multi-object tracking in PyTorch/OpenCV.

Ax Andre Opris, Denis Antipov 27d ago

Parent Selection Mechanisms in Elitist Crossover-Based Algorithms

Theoretical analysis of parent selection mechanisms in genetic algorithms and evolutionary computation optimization.

Ax Yuanhao Liu, Zihan Zhou, Kaiying Wu, Shuo Liu, Yiyang Huang, Jiajun Guo, Aimin Zhou, Hong Qian 27d ago

Embedding Enhancement via Fine-Tuned Language Models for Learner-Item Cognitive Modeling

Fine-tuning language models to enhance embeddings for cognitive modeling in online education systems.

Ax Yi Zhou 27d ago

From Paper to Program: A Multi-Stage LLM-Assisted Workflow for Accelerating Quantum Many-Body Algorithm Development

Multi-stage LLM-assisted workflow for generating quantum many-body algorithms using LaTeX intermediate specifications.

Ax Xuelin Zhang, Hong Chen, Bin Gu, Tieliang Gong, Feng Zheng 27d ago

Fine-grained Analysis of Stability and Generalization for Stochastic Bilevel Optimization

Research on generalization guarantees for stochastic bilevel optimization in machine learning, hyperparameter optimization, and meta-learning.

Ax Mahyar T. Moghaddam, Mina Alipour, Torben Worm, Mikkel Baun Kjærgaard 27d ago

Toward a Sustainable Software Architecture Community: Evaluating ICSA's Environmental Impact

Analysis of carbon footprint from GenAI tool usage and conference activities in software architecture research.

Ax Leonardo Bitzki, Diego Kreutz, Tiago Heinrich, Douglas Fideles, Leandro Bertholdo, Silvio Quincozes, Angelo Diniz 27d ago

NetSecBed: A Container-Native Testbed for Reproducible Cybersecurity Experimentation

Container-based testbed for reproducible cybersecurity experimentation and network traffic generation.

Ax Rubén Moreno-Aguado, Alba Magallón, Victor Moreno, Yingying Fang, Guang Yang 27d ago

Learning Robust Visual Features in Computed Tomography Enables Efficient Transfer Learning for Clinical Tasks

Study on training robust vision features for CT imaging to enable transfer learning for clinical diagnostic tasks.

Ax Juhan Park, Taerim Yoon, Seungmin Kim, Joonggil Kim, Wontae Ye, Jeongeun Park, Yoonbyung Chai, Geonwoo Cho, Geunwoo Cho, Dohyeong Kim, Kyungjae Lee, Yongjae Kim, Sungjoon Choi 27d ago

Learning Dexterous Grasping from Sparse Taxonomy Guidance

Research on dexterous robotic grasping using reinforcement learning with sparse guidance for multi-finger manipulation control.

Ax Cheol Woo Kum, Jai Moondra, Roozbeh Nahavandi, Andrew Perrault, Milind Tambe, Swati Gupta 27d ago

Many Preferences, Few Policies: Towards Scalable Language Model Personalization

Method for scalable LLM personalization using portfolio selection across heterogeneous user preferences, maintaining single shared model instead of per-user instances.

Ax Sofiane Bouaziz, Adel Hafiane, Raphael Canals, Rachid Nedjai 27d ago

Uncertainty-Aware Test-Time Adaptation for Cross-Region Spatio-Temporal Fusion of Land Surface Temperature

Test-time adaptation approach for cross-region generalization in land surface temperature prediction, addressing domain shifts in remote sensing applications.

Ax Xu Yan, Jun Yin, Shiliang Sun, Minghua Wan 27d ago

Incomplete Multi-View Multi-Label Classification via Shared Codebook and Fused-Teacher Self-Distillation

Method for incomplete multi-view multi-label classification using shared codebook and fused-teacher self-distillation under dual-missing conditions.

Ax Yaohan Guan, Pristina Wang, Najim Dehak, Alan Yuille, Jieneng Chen, Daniel Khashabi 27d ago

GENFIG1: Visual Summaries of Scholarly Work as a Challenge for Vision-Language Models

GENFIG1 benchmark evaluating vision-language models on generating Figure 1 visual summaries of scholarly research, assessing conceptual richness in scientific communication.

Ax Adrienne Deganutti, Elad Hirsch, Haonan Zhu, Jaejung Seol, Purvanshi Mehta 27d ago

Graphic-Design-Bench: A Comprehensive Benchmark for Evaluating AI on Graphic Design Tasks

GraphicDesignBench: first comprehensive benchmark for evaluating AI models on professional graphic design tasks including layout, typography, and communicative intent.

Ax Kamyar Barakati, Boris N. Slautin, Utkarsh Pratiush, Hiroshi Funakubo, Sergei V. Kalinin 27d ago

PATHFINDER: Multi-objective discovery in structural and spectral spaces

Multi-objective automated discovery framework for microscopy and characterization workflows, addressing premature convergence through exploration coordination across structural and spectral spaces.

Ax Fuda van Diggelen 27d ago

Robots Need Some Education: On the complexity of learning in evolutionary robotics

Analysis of learning complexity in evolutionary robotics versus robot learning, examining optimization time scales and what is being optimized in robotic systems.

Ax Haonian Ji, Kaiwen Xiong, Siwei Han, Peng Xia, Shi Qiu, Yiyang Zhou, Jiaqi Liu, Jinlong Li, Bingzhou Li, Zeyu Zheng, Cihang Xie, Huaxiu Yao 27d ago

ClawArena: Benchmarking AI Agents in Evolving Information Environments

ClawArena benchmark evaluating AI agents' ability to maintain correct beliefs in evolving information environments with contradictory sources and changing evidence.

Ax Mir Tafseer Nayeem, Davood Rafiei 27d ago

Which English Do LLMs Prefer? Triangulating Structural Bias Towards American English in Foundation Models

Postcolonial analysis of structural bias toward American English in foundation models, examining geopolitical data curation and linguistic standardization in LLM development.

Ax Xiaohang Yu, William Knottenbelt 27d ago

LOCARD: An Agentic Framework for Blockchain Forensics

LOCARD: agentic framework modeling blockchain forensics as sequential decision-making, enabling dynamic iterative investigations instead of static inference pipelines.

Ax Aniruddh G. Puranic, Sebastian Schirmer, John S. Baras, Calin Belta 27d ago

Learning from Imperfect Demonstrations via Temporal Behavior Tree-Guided Trajectory Repair

Formal framework using Temporal Behavior Trees to repair suboptimal trajectories from imperfect demonstrations before downstream imitation and reinforcement learning.

Ax Linyao Chen, Bo Huang, Qinlao Zhao, Shuai Shao, Zhi Han, Zicai Cui, Ziheng Zhang, Guangtao Zeng, Wenzheng Tang, Yikun Wang, Yuanjian Zhou, Zimian Peng, Yong Yu, Weiwen Liu, Hiroki Kobayashi, Weinan Zhang 27d ago

Agentization of Digital Assets for the Agentic Web: Concepts, Techniques, and Benchmark

Framework and benchmark for converting web elements into autonomous agents as foundational primitives for the Agentic Web, enabling automated agent generation from digital assets.

Ax Donghuo Zeng, Hao Niu, Masato Taya 27d ago

Hierarchical Semantic Correlation-Aware Masked Autoencoder for Unsupervised Audio-Visual Representation Learning

Dual-path teacher-student framework for learning aligned multimodal embeddings from weakly paired audio-visual corpora using hierarchical semantic consistency.

Ax Charafeddine Mouzouni 27d ago

Three Phases of Expert Routing: How Load Balance Evolves During Mixture-of-Experts Training

Analysis of Mixture-of-Experts token routing across training phases using congestion game modeling, tracking three-phase trajectory in OLMoE and OpenMoE models.

Ax Sajad Ghawami 27d ago

Good Rankings, Wrong Probabilities: A Calibration Audit of Multimodal Cancer Survival Models

Systematic audit of probability calibration in multimodal deep learning models combining histopathology images and genomic data for cancer survival prediction.

Ax Mahmoud Srewa, Tianyu Zhao, Salma Elmalaki 27d ago

APPA: Adaptive Preference Pluralistic Alignment for Fair Federated RLHF of LLMs

Federated reinforcement learning from human feedback method for aligning LLMs with diverse human preferences while preserving privacy and achieving fair reward aggregation.