Ax Hanxian Huang, Igor Fedorov, Andrey Gromov, Bernard Beckerman, Naveen Suda, David Eriksson, Maximilian Balandat, Rylan Conway, Patrick Huber, Chinnadhurai Sankar, Ayushi Dalmia, Zechun Liu, Lemeng Wu, Tarek Elgamal, Adithya Sagar, Vikas Chandra, Raghuraman Krishnamoorthi 3/18/2026

MobileLLM-Flash: Latency-Guided On-Device LLM Design for Industry Scale

MobileLLM-Flash methodology designs on-device LLMs optimized for latency constraints using hardware-in-the-loop architecture search.

Ax Callen MacPhee, Yiming Zhou, Koichiro Kishima, Bahram Jalali 3/18/2026

Standardizing Medical Images at Scale for AI

Physics-based preprocessing framework standardizes heterogeneous medical images at scale for improved model generalization.

Ax Atharva Sehgal, James Hou, Akanksha Sarkar, Ishaan Mantripragada, Swarat Chaudhuri, Jennifer J. Sun, Yisong Yue 3/18/2026

Evaluating Agentic Optimization on Large Codebases

FormulaCode benchmark evaluates LLM coding agents on repository-level codebase optimization with realistic multi-objective constraints.

Ax Yuanhe Zhang, Xinyue Wang, Zhican Chen, Weiliu Wang, Zilu Zhang, Zhengshuo Gong, Zhenhong Zhou, Li Sun, Yang Liu, Sen Su 3/18/2026

Resource Consumption Threats in Large Language Models

Survey of resource consumption threats in LLMs including excessive generation, covering efficiency challenges for providers and users.

Ax Jaechang Kim, Yotaro Shimose, Zhao Wang, Kuang-Da Wang, Jungseul Ok, Shingo Takamatsu 3/18/2026

Visual Prompt Discovery via Semantic Exploration

Visual prompt discovery method to diagnose and mitigate LVLM perception failures through semantic exploration.