Ax Bart{\l}omiej Starosta, S{\l}awomir T. Wierzcho\'n, Piotr Borkowski, Dariusz Czerski, Marcin Sydow, Eryk Laskowski, Mieczys{\l}aw A. K{\l}opotek 3/17/2026

Rough Sets for Explainability of Spectral Graph Clustering

Rough set theory applied to explain results of spectral graph clustering algorithms for text document analysis.

Ax Shaocheng Shen, Jianfeng Liang, Chunlei Cai, Cong Geng, Huiyu Duan, Xiaoyun Zhang, Qiang Hu, Guangtao Zhai 3/17/2026

Agentic Retoucher for Text-To-Image Generation

Agentic Retoucher: hierarchical agent for fixing distortions in text-to-image generation with spatial grounding.

Ax Toni J. B. Liu, Baran Zadeo\u{g}lu, Nicolas Boull\'e, Rapha\"el Sarfati, Christopher J. Earls 3/17/2026

Jacobian Scopes: token-level causal attributions in LLMs

Jacobian Scopes: gradient-based methods for token-level causal attribution in LLMs to identify which prior tokens influence predictions across layers and attention heads.

Ax Zhiliang Peng, Jianwei Yu, Yaoyao Chang, Zilong Wang, Li Dong, Yingbo Hao, Yujie Tu, Chenyu Yang, Wenhui Wang, Songchen Xu, Yutao Sun, Hangbo Bao, Weijiang Xu, Yi Zhu, Zehua Wang, Ting Song, Yan Xia, Zewen Chi, Shaohan Huang, Liang Wang, Chuang Ding, Shuai Wang, Xie Chen, Furu Wei 3/17/2026

VIBEVOICE-ASR Technical Report

VibeVoice-ASR framework for speech understanding in long-form audio using single-pass processing to handle context fragmentation and multi-speaker scenarios.

Ax Jiale Qian, Hao Meng, Tian Zheng, Pengcheng Zhu, Haopeng Lin, Yuhang Dai, Hanke Xie, Wenxiao Cao, Ruixuan Shang, Jun Wu, Hongmei Liu, Hanlin Wen, Jian Zhao, Zhonglin Jiang, Yong Chen, Shunshun Yin, Ming Tao, Jianguo Wei, Lei Xie, Xinsheng Wang 3/17/2026

SoulX-Singer: Towards High-Quality Zero-Shot Singing Voice Synthesis

Open-source singing voice synthesis system with zero-shot generalization and controllable generation capabilities.

Ax Maciej Besta, {\L}ukasz Jarmocik, Orest Hrycyna, Shachar Klaiman, Konrad M\k{a}czka, Robert Gerstenberger, J\"urgen M\"uller, Piotr Nyczyk, Hubert Niewiadomski, Torsten Hoefler 3/17/2026

GraphSeek: Next-Generation Graph Analytics with LLMs

System for natural language graph analytics over large property graphs using LLMs; enables querying complex heterogeneous datasets efficiently.

Ax Yunpeng Ba, Xi Lin, Changliang Zhou, Ruihao Zheng, Zhenkun Wang, Xinyan Liang, Zhichao Lu, Jianyong Sun, Yuhua Qian, Qingfu Zhang 3/17/2026

Survey on Neural Routing Solvers

Survey of neural routing solvers that use deep learning to tackle vehicle routing problems by learning implicit heuristic rules from data.

Ax Yen-Shan Chen, Shih-Yu Lai, Ying-Jung Tsou, Yi-Cheng Lin, Bing-Yu Chen, Yun-Nung Chen, Hung-yi Lee, Shang-Tse Chen 3/17/2026

Latent-Mark: An Audio Watermark Robust to Neural Resynthesis

arXiv paper proposing Latent-Mark watermarking framework for audio robust to neural resynthesis attacks using latent space embedding.