Isolater - Feed

Ax Keyu Li, Jin Gao, Dequan Wang 12d ago

Aligned Agents, Biased Swarm: Measuring Bias Amplification in Multi-Agent Systems

Aligned Agents, Biased Swarm: Empirical study measuring how multi-agent system topologies and feedback loops amplify bias in emergent behaviors.

Ax Avni Mittal, Shanu Kumar, Sandipan Dandapat, Monojit Choudhury 12d ago

Litmus (Re)Agent: A Benchmark and Agentic System for Predictive Evaluation of Multilingual Models

Litmus ReAgent: Benchmark and agentic system for evaluating multilingual LLM performance prediction across 1,500 questions spanning six tasks and five evidence scenarios.

Ax Yi Luo, Xu Sun, Guangchun Luo, Aiguo Chen 12d ago

Neighbourhood Transformer: Switchable Attention for Monophily-Aware Graph Learning

Neighbourhood Transformer: Graph neural network architecture using switchable attention to handle heterophilic graph learning where dissimilar nodes are frequently connected.

Ax Jihwan Oh, Soowon Oh, Murad Aghazada, Minchan Jeong, Sungnyun Kim, Se-Young Yun 12d ago

PerMix-RLVR: Preserving Persona Expressivity under Verifiable-Reward Alignment

PerMix-RLVR: Training method for aligning LLM personas with reward models while preserving output diversity, avoiding inference-time computation overhead.

Ax Zhiyu Zhou, Peilin Liu, Ruoxuan Zhang, Luyang Zhang, Cheng Zhang, Hongxia Xie, Wen-Huang Cheng 12d ago

PinpointQA: A Dataset and Benchmark for Small Object-Centric Spatial Understanding in Indoor Videos

PinpointQA dataset and benchmark for evaluating small object localization and spatial reasoning in video MLLMs.

Ax Xiaoke Guo, Songze Li, Zhiqiang Liu, Zhaoyan Gong, Yuanxiang Liu, Huajun Chen, Wen Zhang 12d ago

ASTRA: Adaptive Semantic Tree Reasoning Architecture for Complex Table Question Answering

ASTRA: adaptive semantic tree reasoning architecture for LLM-based complex table question answering.

Ax Wenxi Li, Xihao Wang, Weiwei Sun 12d ago

Towards Linguistically-informed Representations for English as a Second or Foreign Language: Review, Construction and Application

Survey and construction of linguistically-informed representations for English as a second/foreign language.

Ax Carlos Jimeno Miguel, Raul Orduna, Francesco Zola 12d ago

Identification and Anonymization of Named Entities in Unstructured Information Sources for Use in Social Engineering Detection

Named entity identification and anonymization system for cybercrime datasets using speech-to-text and image processing.

Ax Andre Bacellar 12d ago

Regime-Conditional Retrieval: Theory and a Transferable Router for Two-Hop QA

Regime-conditional retrieval with transferable router for two-hop question answering with theoretical foundations.

Ax Qixuan Huang, Khalid Zaman, Masashi Unoki 12d ago

Noise-Aware In-Context Learning for Hallucination Mitigation in ALLMs

Noise-aware in-context learning approach to mitigate hallucinations in auditory large language models.

Ax Zedian Shao, Hongbin Liu, Yuepeng Hu, Neil Zhenqiang Gong 12d ago

Leave My Images Alone: Preventing Multi-Modal Large Language Models from Analyzing Images via Visual Prompt Injection

ImageProtector prevents multi-modal LLMs from analyzing images via visual prompt injection attacks.

Ax Chenjie Yang, Yutian Jiang, Chenyu Wu 12d ago

Skill-Conditioned Visual Geolocation for Vision-Language

Vision-language models for image geolocation with structured geographic reasoning and autonomous self-evolution.

Ax Yeonjun Hwang, Sungyong Park, Minju Kim, Dongha Lee, Jinyoung Yeo 12d ago

CONDESION-BENCH: Conditional Decision-Making of Large Language Models in Compositional Action Space

CONDESION-BENCH evaluates LLM decision-making with compositional action spaces and conditional feasibility constraints.

Ax Salva R\"uhling Cachay, Duncan Watson-Parris, Rose Yu 12d ago

U-Cast: A Surprisingly Simple and Efficient Frontier Probabilistic AI Weather Forecaster

U-Cast: simple probabilistic weather forecasting using standard U-Net architecture achieving frontier performance.

Ax Mauricio Fadel Argerich, Jonathan F\"urst, Marta Pati\~no-Mart\'inez 12d ago

Watt Counts: Energy-Aware Benchmark for Sustainable LLM Inference on Heterogeneous GPU Architectures

Watt Counts: open-access energy consumption benchmark for LLM inference across 50 models and 10 GPU architectures.

Ax Min Young Baeg, Yoon-Yeong Kim 12d ago

PDE-regularized Dynamics-informed Diffusion with Uncertainty-aware Filtering for Long-Horizon Dynamics

PDYffusion combines diffusion models with physics-informed dynamics for long-horizon spatiotemporal prediction.

Ax Guoqing Wang, Pin Tang, Xiangxuan Ren, Guodongfang Zhao, Bailan Feng, Chao Ma 12d ago

Learning Vision-Language-Action World Models for Autonomous Driving

Vision-Language-Action models for autonomous driving combining perception, reasoning, and temporal dynamics modeling.

Ax Yuxi Zhou, Zhengbo Zhang, Jingyu Pan, Zhiyu Lin, Zhigang Tu 12d ago

Frequency-Enhanced Diffusion Models: Curriculum-Guided Semantic Alignment for Zero-Shot Skeleton Action Recognition

Frequency-enhanced diffusion models for zero-shot skeleton action recognition in computer vision.

Ax Parjanya Aditya Shukla, Shubham Kumar Nigam, Debtanu Datta, Balaramamahanthi Deepak Patnaik, Noel Shallum, Pradeep Reddy Vanga, Saptarshi Ghosh, Arnab Bhattacharya 12d ago

NyayaMind- A Framework for Transparent Legal Reasoning and Judgment Prediction in the Indian Legal System

NyayaMind framework for transparent legal reasoning and judgment prediction in Indian courts using LLMs.

Ax Harry Proshian, Nikita Severin, Sergey Nikolenko, Kireev Ivan, Andrey Savchenko, Ivan Sergeev, Maria Postnova, Ilya Makarov 12d ago

Beyond Isolated Clients: Integrating Graph-Based Embeddings into Event Sequence Models

Method integrating graph-based embeddings into event sequence models for improved user prediction on digital platforms.

Ax Li Huang, Zhongxin Liu, Yifan Wu, Tao Yin, Dong Li, Jichao Bi, Nankun Mu, Hongyu Zhang, Meng Yan 12d ago

DeepGuard: Secure Code Generation via Multi-Layer Semantic Aggregation

DeepGuard improves secure code generation by LLMs through multi-layer semantic aggregation to mitigate vulnerable patterns.

Ax Akshit Jindal, Saket Anand, Chetan Arora, Vikram Goyal 12d ago

CLIP-Inspector: Model-Level Backdoor Detection for Prompt-Tuned CLIP via OOD Trigger Inversion

CLIP-Inspector detects backdoor attacks in prompt-tuned vision-language models through out-of-distribution trigger inversion.

Ax Tommy Shaffer Shane, Simon Mylius, Hamish Hobbs 12d ago

Scheming in the wild: detecting real-world AI scheming incidents with open-source intelligence

Research on detecting covert misaligned AI behavior in real-world settings using open-source intelligence methods.

Ax Chenhao Ye, Huaizheng Zhang, Mingcong Han, Baoquan Zhong, Xiang Li, Qixiang Chen, Xinyi Zhang, Weidong Zhang, Kaihua Jiang, Wang Zhang, He Sun, Wencong Xiao, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau 12d ago

TensorHub: Scalable and Elastic Weight Transfer for LLM RL Training

TensorHub introduces Reference-Oriented Storage for efficient weight transfer in LLM reinforcement learning across heterogeneous computational resources.

Ax Changi Hong, Yoonah Song, Hwayoung Park, Chaewoon Bang, Dayeon Gu, Do Hyun Lee, Hong Kook Kim 12d ago

PS-TTS: Phonetic Synchronization in Text-to-Speech for Achieving Natural Automated Dubbing

PS-TTS method for phonetic synchronization in automated dubbing, addressing duration and lip-sync challenges in AI-based video translation.

Ax Peng Wang (X-LANCE Lab, Shanghai Jiao Tong University), Yanqiao Zhu (X-LANCE Lab, Shanghai Jiao Tong University), Zixuan Jiang (X-LANCE Lab, Shanghai Jiao Tong University), Qinyuan Chen (School of Computer Science, Fudan University), Xingjian Zhao (School of Computer Science, Fudan University), Xipeng Qiu (School of Computer Science, Fudan University), Wupeng Wang (Tongyi Fun Team, Alibaba Group), Zhifu Gao (Tongyi Fun Team, Alibaba Group), Xiangang Li (Tongyi Fun Team, Alibaba Group), Kai Yu (X-LANCE Lab, Shanghai Jiao Tong University), Xie Chen (X-LANCE Lab, Shanghai Jiao Tong University) 12d ago

Interactive ASR: Towards Human-Like Interaction and Semantic Coherence Evaluation for Agentic Speech Recognition

Interactive ASR system with human-like interaction and semantic coherence evaluation, replacing WER metric with agent-based correction mechanisms.

Ax Yi-Lun Liao, Alexander J. Hoffman, Sabrina C. Shen, Alexandre Duval, Sam Walton Norwood, Tess Smidt 12d ago

EquiformerV3: Scaling Efficient, Expressive, and General SE(3)-Equivariant Graph Attention Transformers

EquiformerV3: SE(3)-equivariant graph attention Transformer for 3D atomistic modeling, improving efficiency, expressivity, and physical consistency.

Ax Yushi Feng, Junye Du, Qifan Wang, Zizhan Ma, Qian Niu, Yutaka Matsuo, Long Feng, Lequan Yu 12d ago

CORA: Conformal Risk-Controlled Agents for Safeguarded Mobile GUI Automation

CORA framework for risk-controlled GUI automation agents using conformal prediction to provide formally verified, user-tunable safety guarantees for VLM-powered mobile automation.

Ax Fatma Bet\"ul G\"ure\c{s}, Tanya Nazaretsky, Seyed Parsa Neshaei, Tanja K\"aser 12d ago

Structuring versus Problematizing: How LLM-based Agents Scaffold Learning in Diagnostic Reasoning

LLM-based agents for scaffolding diagnostic reasoning in educational settings, combining scenario-based learning with learning analytics and personalized support.

Ax Yuqin Yang, Haowu Zhou, Haoran Tu, Zhiwen Hui, Shiqi Yan, HaoYang Li, Dong She, Xianrong Yao, Yang Gao, Zhanpeng Jin 12d ago

Persona-E$^2$: A Human-Grounded Dataset for Personality-Shaped Emotional Responses to Textual Events

Dataset for personality-shaped emotional responses to text events, addressing limitations of LLM role-playing and personality illusion in affective computing.

Ax Mansour Zoubeirou a Mayaki 12d ago

Generalization and Scaling Laws for Mixture-of-Experts Transformers

Theoretical analysis of generalization and scaling laws for Mixture-of-Experts Transformers, separating active capacity from routing combinatorics with covering-number bounds.

Ax Avni Mittal 12d ago

Do LLMs Follow Their Own Rules? A Reflexive Audit of Self-Stated Safety Policies

Symbolic-Neural Consistency Audit framework extracting and formalizing LLM self-stated safety policies.

Ax Francesca Fati, Felipe Coutinho, Marika Reinius, Marina Rosanu, Gabriel Funingana, Luigi De Vitis, Gabriella Schivardi, Hannah Clayton, Alice Traversa, Zeyu Gao, Guilherme Penteado, Shangqi Gao, Francesco Pastori, Ramona Woitek, Maria Cristina Ghioni, Giovanni Damiano Aletti, Mercedes Jimenez-Linan, Sarah Burge, Nicoletta Colombo, Evis Sala, Maria Francesca Spadea, Timothy L. Kline, James D. Brenton, Jaime Cardoso, Francesco Multinu, Elena De Momi, Mireia Crispin-Ortuzar, Ines P. Machado 12d ago

Vision Transformers for Preoperative CT-Based Prediction of Histopathologic Chemotherapy Response Score in High-Grade Serous Ovarian Carcinoma

Vision transformer application predicting chemotherapy response in ovarian cancer from preoperative CT scans.

Ax Anas Hattay, Fred Ngole Mboula, Eric Gascard, Zakaria Yahoun 12d ago

On the Role of DAG topology in Energy-Aware Cloud Scheduling : A GNN-Based Deep Reinforcement Learning Approach

GNN-based deep reinforcement learning scheduler for cloud workflow DAG assignment minimizing time and energy.

Ax Yunqiang Wang, Hengyuan Na, Di Wu, Miao Hu, Guocong Quan 12d ago

GRM: Utility-Aware Jailbreak Attacks on Audio LLMs via Gradient-Ratio Masking

GRM gradient-ratio masking attack on audio LLMs balancing jailbreak success with utility preservation.

Ax Esila Keskin 12d ago

The Fast Lane Hypothesis: Von Economo Neurons Implement a Biological Speed-Accuracy Tradeoff

Computational model of Von Economo neurons implementing biological speed-accuracy tradeoff in decision-making.

Ax Zizhao Li, Zhengkang Xiang, Jiayang Ao, Feng Liu, Joseph West, Kourosh Khoshelham 12d ago

Neural Distribution Prior for LiDAR Out-of-Distribution Detection

Neural distribution prior method for LiDAR out-of-distribution detection in autonomous driving.

Ax Augustin Chan 12d ago

Statistical Properties of the King Wen Sequence: An Anti-Habituation Structure That Does Not Improve Neural Network Training

Statistical analysis of I-Ching King Wen sequence showing no improvements to neural network training.

Ax Yuqin Lan, Gen Li, Yuanze Hu, Weihao Shen, Zhaoxin Fan, Faguo Wu, Xiao Zhang, Laurence T. Yang, Zhiming Zheng 12d ago

Mosaic: Multimodal Jailbreak against Closed-Source VLMs via Multi-View Ensemble Optimization

Mosaic multimodal jailbreak attack against closed-source VLMs via multi-view ensemble optimization.

Ax Jingzhi Gong, Ruizhen Gu, Zhiwei Fei, Yazhuo Cao, Lukas Twist, Alina Geiger, Shuo Han, Dominik Sobania, Federica Sarro, Jie M. Zhang 12d ago

SkillMOO: Multi-Objective Optimization of Agent Skills for Software Engineering

SkillMOO multi-objective optimization framework automatically evolving agent skill bundles for coding tasks.

Ax Zengbin Wang, Feng Xiong, Liang Lin, Xuecai Hu, Yong Wang, Yanlin Wang, Man Zhang, Xiangxiang Chu 12d ago

Visually-Guided Policy Optimization for Multimodal Reasoning

Visually-guided policy optimization improving visual faithfulness in vision-language models via reinforcement learning.

Ax Peng Ding 12d ago

LLM-Rosetta: A Hub-and-Spoke Intermediate Representation for Cross-Provider LLM API Translation

LLM-Rosetta hub-and-spoke intermediate representation for cross-provider LLM API translation and interoperability.

Ax Guiyao Tie, Jiawen Shi, Pan Zhou, Lichao Sun 12d ago

BadSkill: Backdoor Attacks on Agent Skills via Model-in-Skill Poisoning

BadSkill: backdoor attack formulation exploiting model artifacts bundled in agent skills.

Ax Andy Anderson 12d ago

The AI Codebase Maturity Model: From Assisted Coding to Self-Sustaining Systems

AI Codebase Maturity Model framework for systematic progression from assisted coding to self-sustaining systems.

Ax Wiebke Hutiri, Morgan Scheuerman, Shruti Nagpal, Austin Hoag, Alice Xiang 12d ago

Yes, But Not Always. Generative AI Needs Nuanced Opt-in

Policy proposal for nuanced consent frameworks in generative AI training data usage.

Ax Siyuan Zhou, Hejun Wang, Hu Cheng, Jinxi Li, Dongsheng Wang, Junwei Jiang, Yixiao Jin, Jiayue Huang, Shiwei Mao, Shangjia Liu, Yafei Yang, Hongkang Song, Shenxing Wei, Zihui Zhang, Peng Huang, Shijie Liu, Zhengli Hao, Hao Li, Yitian Li, Wenqi Zhou, Zhihan Zhao, Zongqi He, Hongtao Wen, Shouwang Huang, Peng Yun, Bowen Cheng, Pok Kazaf Fu, Wai Kit Lai, Jiahao Chen, Kaiyuan Wang, Zhixuan Sun, Ziqi Li, Haochen Hu, Di Zhang, Chun Ho Yuen, Bing Wang, Zhihua Wang, Chuhang Zou, Bo Yang 12d ago

PhysInOne: Visual Physics Learning and Reasoning in One Suite

PhysInOne dataset with 2M videos of physical phenomena for training physics-aware AI systems.

Ax Sanchita S. Kamath, Aziz N Zeidieh, Venkatesh Potluri, Sile O'Modhrain, Kenneth Perry, JooYoung Seo 12d ago

Three Modalities, Two Design Probes, One Prototype, and No Vision: Experience-Based Co-Design of a Multi-modal 3D Data Visualization Tool

Co-design of accessible 3D data visualization tool for blind and low-vision users.

Ax Wonbong Jang, Shikun Liu, Soubhik Sanyal, Juan Camilo Perez, Kam Woh Ng, Sanskar Agrawal, Juan-Manuel Perez-Rua, Yiannis Douratsos, Tao Xiang 12d ago