Ax Yichuan Deng, Zhao Song, Kaijun Yuan, Tianyi Zhou 3/16/2026

Why Softmax Attention Outperforms Linear Attention

Comparative analysis of softmax vs linear attention mechanisms in transformer architectures, examining computational efficiency tradeoffs.

Ax Jianwei Li, Jung-Eun Kim 3/16/2026

Superficial Safety Alignment Hypothesis

Analyzes brittleness of LLM safety alignment mechanisms, proposing the Superficial Safety Alignment Hypothesis to explain why standard alignment approaches are vulnerable.

Ax Vinod Raman, Hilal Asi, Satyen Kale 3/16/2026

AdaBoN: Adaptive Best-of-N Alignment

Prompt-adaptive Best-of-N alignment strategy using reward models to reduce computational cost of test-time alignment for language models.

Ax Thai-Hoc Vu, Ngo Hoang Tu, Thien Huynh-The, Kyungchun Lee, Sunghwan Kim, Miroslav Voznak, Quoc-Viet Pham 3/16/2026

Integration of TinyML and LargeML: A Survey of 6G and Beyond

Survey on integrating TinyML and LargeML for 6G networks, covering deep learning applications in mobile systems, autonomous vehicles, and smart services.

Ax Yinqiu Huang, Hao Ma, Wenshuai Chen, Zongwei Wang, Shuli Wang, Yongqiang Zhang, Xue Wei, Yinhua Zhu, Haitao Wang, Xingxing Wang 3/16/2026

Generative Bid Shading in Real-Time Bidding Advertising

Generative approach to bid shading in real-time bidding advertising using non-convex surplus optimization instead of traditional two-stage methods.

Ax Gabriele Digregorio, Marco Di Gennaro, Stefano Zanero, Stefano Longari, Michele Carminati 3/16/2026

On the (In)Security of Loading Machine Learning Models

Security evaluation of ML model sharing frameworks and hubs, assessing vulnerabilities in loading shared models and security awareness gaps among practitioners.

Ax Shuofei Qiao, Yanqiu Zhao, Zhisong Qiu, Xiaobin Wang, Jintian Zhang, Zhao Bin, Ningyu Zhang, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen 3/16/2026

Scaling Generalist Data-Analytic Agents

DataMind: scalable data-analytic AI agents for automated discovery. Open-source agent framework handling diverse-format data files and multi-step reasoning.

Ax Hritik Bansal, Devendra Singh Sachan, Kai-Wei Chang, Aditya Grover, Gargi Ghosh, Wen-tau Yih, Ramakanth Pasunuru 3/16/2026

HoneyBee: Data Recipes for Vision-Language Reasoners

HoneyBee: data curation approaches for vision-language reasoning datasets. Analyzes impact of context, content, and format on VLM reasoning capabilities.

Ax Yash Jangir, Yidi Zhang, Pang-Chi Lo, Kashu Yamazaki, Chenyu Zhang, Kuan-Hsun Tu, Tsung-Wei Ke, Lei Ke, Yonatan Bisk, Katerina Fragkiadaki 3/16/2026

RobotArena ∞: Scalable Robot Benchmarking via Real-to-Sim Translation

RobotArena ∞: scalable robot benchmarking via real-to-sim translation. Enables rigorous evaluation of robot policies across diverse tasks and environments.

Ax Xinwu Ye, Yicheng Mao, Jia Zhang, Yimeng Liu, Li Hao, Fang Wu, Zhiwei Li, Yuxuan Liao, Zehong Wang, Yingcheng Wu, Zhiyuan Liu, Zhenfei Yin, Li Yuan, Philip Torr, Huan Sun, Xiangxiang Zeng, Mengdi Wang, Le Cong, Shenghua Gao, Xiangru Tang 3/16/2026

LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning

LatentChem: latent reasoning interface for chemical LLMs. Decouples chemical computation from discrete tokens to improve efficiency and performance in chemical reasoning.

Ax Markus Knauer, Samuel Bustamante, Thomas Eiband, Alin Albu-Schäffer, Freek Stulp, João Silvério 3/16/2026

IROSA: Interactive Robot Skill Adaptation using Natural Language

IROSA: framework combining foundation models with imitation learning for robot skill adaptation via natural language. LLM application to robotics.

Ax Linus Folkerts, Will Payne, Simon Inman, Philippos Giavridis, Joe Skinner, Sam Deverett, James Aung, Ekin Zorer, Michael Schmatz, Mahmoud Ghanem, John Wilkinson, Alan Steer, Vy Hong, Jessica Wang 3/16/2026

Measuring AI Agents' Progress on Multi-Step Cyber Attack Scenarios

Benchmark evaluating frontier AI models on multi-step cyber attack scenarios. Agent capability measurement across extended action sequences.