Ax Charles Ye, Jasmine Cui, Dylan Hadfield-Menell 9d ago

Prompt Injection as Role Confusion

Analysis of prompt injection attacks as role confusion where models infer text source by content style rather than origin.

Ax Yuanhe Zhang, Xinyue Wang, Zhican Chen, Weiliu Wang, Zilu Zhang, Zhengshuo Gong, Zhenhong Zhou, Kun Wang, Li Sun, Yang Liu, Sen Su 9d ago

Resource Consumption Threats in Large Language Models

Survey of resource consumption threats in LLMs including excessive generation attacks, resource efficiency requirements, and mitigation strategies.

Ax Ga\"etan Hadjeres, Marc Ferras, Khaled Koutini, Benno Weck, Alexandre Bittar, Thomas Hummel, Zineb Lahrici, Hakim Missoum, Joan Serr\`a, Yuki Mitsufuji 9d ago

Woosh: A Sound Effects Foundation Model

Open source sound effect foundation model from Sony AI with audio encoder/decoder and text-to-audio capabilities.

Ax Dun Yuan, Fuyuan Lyu, Ye Yuan, Weixu Zhang, Bowei He, Jiayi Geng, Linfeng Du, Zipeng Sun, Yankai Chen, Changjiang Han, Jikun Kang, Xi Chen, Haolun Wu, Xue Liu 9d ago

Beyond Message Passing: A Semantic View of Agent Communication Protocols

Framework analyzing agent communication protocols for LLM systems across three layers: communication, syntactic, and semantic. Systematically organizes 18 representative protocols.

Ax Daniele Solombrino, Antonio Andrea Gargiulo, Adrian Robert Minut, Luca Zhou, Alessandro Zirilli, Emanuele Rodol\`a 9d ago

Zero-Shot Quantization via Weight-Space Arithmetic

Zero-shot quantization method using weight-space arithmetic to improve post-training quantization robustness across models.

Ax Manish Bhatt, Sarthak Munshi, Vineeth Sai Narajala, Idan Habler, Ammar Al-Kahfah, Ken Huang, Joel Webb, Blake Gatto, Md Tamjidul Hoque 9d ago

The Defense Trilemma: Why Prompt Injection Defense Wrappers Fail?

Theoretical analysis proving limitations of continuous wrapper defenses against prompt injection attacks in LLMs.

Ax Xiangru Jian, Hao Xu, Wei Pang, Xinjian Zhao, Chengyu Tao, Qixin Zhang, Xikun Zhang, Chao Zhang, Guanzhi Deng, Alex Xue, Juan Du, Tianshu Yu, Garth Tarr, Linqi Song, Qiuzhuang Sun, Dacheng Tao 9d ago

FORGE: Fine-grained Multimodal Evaluation for Manufacturing Scenarios

Fine-grained benchmark evaluating multimodal LLMs on manufacturing scenarios.

Ax Peng Wang (The Chinese University of Hong Kong, Shenzhen), Yanqiao Zhu (X-LANCE Lab, Shanghai Jiao Tong University), Zixuan Jiang (Xi'an Jiaotong University), Qinyuan Chen (Fudan University), Xingjian Zhao (Fudan University), Xipeng Qiu (Fudan University), Wupeng Wang (Tongyi Fun Team, Alibaba Group), Zhifu Gao (Tongyi Fun Team, Alibaba Group), Xiangang Li (Tongyi Fun Team, Alibaba Group), Kai Yu (X-LANCE Lab, Shanghai Jiao Tong University), Xie Chen (X-LANCE Lab, Shanghai Jiao Tong University) 9d ago

Interactive ASR: Towards Human-Like Interaction and Semantic Coherence Evaluation for Agentic Speech Recognition

Interactive ASR system with semantic coherence evaluation and human-like correction mechanisms.

Ax Jingyu Zhang, Tianjian Li, William Jurayj, Hongyuan Zhan, Benjamin Van Durme, Daniel Khashabi 9d ago

Many-Tier Instruction Hierarchy in LLM Agents

Framework for managing hierarchical instruction conflicts in multi-source LLM agent environments.

Ax Julio Candanedo 9d ago

The Diffusion-Attention Connection

Theoretical connection between Transformers, diffusion maps, and magnetic Laplacians through Markov geometry.