Ax Hanglin Li, Shuchang Tian, Chen Lin, Zhiyong Zhao, Kun Zhan 3/25/2026

FAAR: Format-Aware Adaptive Rounding for NVFP4

FAAR quantization method for NVFP4 ultra-low-bit format that adapts rounding to non-uniform numerical grid for efficient LLM edge deployment.

Ax Yuren Cai, Guangyi Wang, Zongqing Li, Li Li, Zhihui Liu, Songzhi Su 3/25/2026

Three Creates All: You Only Sample 3 Steps

MTEO method for few-step diffusion sampling by distilling layer-wise, step-wise time embeddings to accelerate inference.

Ax Davide Bucciarelli, Evelyn Turri, Lorenzo Baraldi, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara 3/25/2026

Tiny Inference-Time Scaling with Latent Verifiers

Inference-time scaling method using small latent verifiers instead of multimodal LLMs to score and select outputs while reducing computational cost.

Ax Shoubin Yu, Lei Shu, Antoine Yang, Yao Fu, Srinivas Sunkara, Maria Wang, Jindong Chen, Mohit Bansal, Boqing Gong 3/25/2026

Ego2Web: A Web Agent Benchmark Grounded in Egocentric Videos

Ego2Web benchmark for multimodal web agents grounded in egocentric video, evaluating agents performing real-world workflows with physical context awareness.

Ax Javier Ferrando, Enrique Lopez-Cuena, Pablo Agustin Martin-Torres, Daniel Hinjos, Anna Arias-Duart, Dario Garcia-Gasulla 3/25/2026

Language Models Can Explain Visual Features via Steering

Method leveraging vision-language models to explain sparse autoencoder features in vision models through causal interventions instead of correlation-based approaches.

Ax Greg Nyilasy, Abraham Ryan Ade Putra Hito, Jennifer Overbeck, Brock Bastian, Darren W. Dahl 3/25/2026

Do Consumers Accept AIs as Moral Compliance Agents?

Survey studying consumer acceptance of AI in moral compliance roles versus moral decision-making across five studies.