Ax Yizhang Zhu, Liangwei Wang, Chenyu Yang, Xiaotian Lin, Boyan Li, Wei Zhou, Xinyu Liu, Zhangyang Peng, Tianqi Luo, Yu Li, Chengliang Chai, Chong Chen, Shimin Di, Ju Fan, Ji Sun, Nan Tang, Fugee Tsung, Jiannan Wang, Chenglin Wu, Yanwei Xu, Shaolei Zhang, Yong Zhang, Xuanhe Zhou, Guoliang Li, Yuyu Luo 2/25/2026

A Survey of Data Agents: Emerging Paradigm or Overstated Hype?

Survey examining terminology, definitions, and taxonomy of data agents—autonomous systems orchestrating data and AI for complex data tasks.

Ax Yida Zhao, Kuan Li, Xixi Wu, Liwen Zhang, Dingchu Zhang, Baixuan Li, Maojia Song, Zhuo Chen, Chenxi Wang, Xinyu Wang, Kewei Tu, Pengjun Xie, Jingren Zhou, Yong Jiang 2/25/2026

Repurposing Synthetic Data for Fine-grained Search Agent Supervision

LLM-based search agents trained on synthetic entity-centric data using improved reward mechanisms to capture informative near-miss samples.

Ax Zheng Du, Hao Kang, Song Han, Tushar Krishna, Ligeng Zhu 2/25/2026

OckBench: Measuring the Efficiency of LLM Reasoning

OckBench: Benchmark measuring LLM reasoning efficiency via token usage, revealing up to 5x differences in token length across models.

Ax Xing Li, Hui-Ling Zhen, Lihao Yin, Xianzhi Yu, Zhenhua Dong, Mingxuan Yuan 2/25/2026

What Matters For Safety Alignment?

Comprehensive empirical study evaluating factors affecting safety alignment in LLMs and LRMs across 32 recent models.

Ax Christopher Clark, Jieyu Zhang, Zixian Ma, Jae Sung Park, Mohammadreza Salehi, Rohun Tripathi, Sangho Lee, Zhongzheng Ren, Chris Dongjoo Kim, Yinuo Yang, Vincent Shao, Yue Yang, Weikai Huang, Ziqi Gao, Taira Anderson, Jianrui Zhang, Jitesh Jain, George Stoica, Winson Han, Ali Farhadi, Ranjay Krishna 2/25/2026

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

Molmo2: Open-weight vision-language model with video understanding, grounding, and disclosed training data and recipe.

Ax Ritik Raina, Abe Leite, Alexandros Graikos, Seoyoung Ahn, Dimitris Samaras, Gregory J. Zelinsky 2/25/2026

Generating metamers of human scene understanding

MetamerGen: Latent diffusion model generating visual scenes aligned with human perception using periphery gist and foveal information.

Ax Venus Team, Changlong Gao, Zhangxuan Gu, Yulin Liu, Xinyu Qiu, Shuheng Shen, Yue Wen, Tianyu Xia, Zhenyu Xu, Zhengwen Zeng, Beitong Zhou, Xingran Zhou, Weizhi Chen, Sunhao Dai, Jingya Dou, Yichen Gong, Yuan Guo, Zhenlin Guo, Feng Li, Qian Li, Jinzhen Lin, Yuqi Zhou, Linchao Zhu, Liang Chen, Zhenyu Guo, Changhua Meng, Weiqiang Wang 2/25/2026

UI-Venus-1.5 Technical Report

UI-Venus-1.5: GUI agent with 2B, 8B, and 30B-A3B variants for automating digital environment interactions with broad generality and strong task performance.

Ax Bin Cao, Qian Zhang, Zhenjie Feng, Taolue Zhang, Jiaqiang Huang, Lu-Tao Weng, Tong-Yi Zhang 2/25/2026

AI-Driven Structure Refinement of X-ray Diffraction

Applies a physics-constrained whole-pattern expectation-maximization algorithm for AI-driven refinement of crystal structures from X-ray diffraction data.

Ax Matthew Adiletta, Gu-Yeon Wei, David Brooks 2/25/2026

RPU -- A Reasoning Processing Unit

Proposes the Reasoning Processing Unit (RPU), an architecture that addresses memory bandwidth bottlenecks in LLM inference, particularly for reasoning applications with long outputs.

Ax Maijunxian Wang, Ruisi Wang, Juyi Lin, Ran Ji, Thaddäus Wiedemer, Qingying Gao, Dezhi Luo, Yaoyao Qian, Lianyu Huang, Zelong Hong, Jiahui Ge, Qianli Ma, Hang He, Yifan Zhou, Lingzi Guo, Lantao Mei, Jiachen Li, Hanwen Xing, Tianqi Zhao, Fengyuan Yu, Weihang Xiao, Yizheng Jiao, Jianheng Hou, Danyang Zhang, Pengcheng Xu, Boyang Zhong, Zehong Zhao, Gaoyun Fang, John Kitaoka, Yile Xu, Hua Xu, Kenton Blacutt, Tin Nguyen, Siyuan Song, Haoran Sun, Shaoyue Wen, Linyang He, Runming Wang, Yanzhi Wang, Mengyue Yang, Ziqiao Ma, Raphaël Millière, Freda Shi, Nuno Vasconcelos, Daniel Khashabi, Alan Yuille, Yilun Du, Ziming Liu, Bo Li, Dahua Lin, Ziwei Liu, Vikash Kumar, Yijiang Li, Lei Yang, Zhongang Cai, Hokin Deng 2/25/2026

A Very Big Video Reasoning Suite

Benchmark suite for evaluating the video reasoning capabilities of modern video models, including spatiotemporal reasoning and scaling behavior.

Ax Dongwei Wang, Jinhee Kim, Seokho Han, Denis Gudovskiy, Yohei Nakata, Tomoyuki Okuno, KhayTze Peong, Kang Eun Jeon, Jong Hwan Ko, Yiran Chen, Huanrui Yang 2/25/2026

MoBiQuant: Mixture-of-Bits Quantization for Token-Adaptive Elastic LLMs

MoBiQuant enables elastic LLM deployment with token-adaptive mixture-of-bits quantization supporting dynamic precision switching at runtime.