LB vgel.me via atharva 2/26/2026

Small Models Can Introspect, Too

Open-source 32B model demonstrates introspection capabilities through logit analysis. Improved prompting enhances performance on detecting injected concepts in activations.

Ax Thomas Kwa, Ben West, Joel Becker, Amy Deng, Katharyn Garcia, Max Hasin, Sami Jawhar, Megan Kinniment, Nate Rush, Sydney Von Arx, Ryan Bloom, Thomas Broadley, Haoxing Du, Brian Goodrich, Nikola Jurkovic, Luke Harold Miles, Seraphina Nix, Tao Lin, Neev Parikh, David Rein, Lucas Jun Koba Sato, Hjalmar Wijk, Daniel M. Ziegler, Elizabeth Barnes, Lawrence Chan 2/26/2026

Measuring AI Ability to Complete Long Software Tasks

Proposes metric measuring AI ability to complete long software tasks by comparing model performance to human domain expert completion time.

Ax Anton Selitskiy, Maitreya Kocharekar 2/26/2026

Discrete Optimal Transport and Voice Conversion

kDOT: discrete optimal transport framework for voice conversion using barycentric projection in pretrained speech embedding space instead of averaging strategies.

Ax Junxiao Yang, Jinzhe Tu, Haoran Liu, Xiaoce Wang, Chujie Zheng, Zhexin Zhang, Shiyao Cui, Caishun Chen, Tiantian He, Hongning Wang, Yew-Soon Ong, Minlie Huang 2/26/2026

BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs

BARREL identifies pathological reasoning patterns in Large Reasoning Models and improves factual reliability. Enables models to admit ignorance instead of giving confident false answers.

Ax Guodong Du, Zhuo Li, Xuanning Zhou, Junlin Li, Zesheng Shi, Wanyu Lin, Ho-Kin Tang, Xiucheng Li, Fangming Liu, Wenya Wang, Min Zhang, Jing Li 2/26/2026

Knowledge Fusion of Large Language Models Via Modular SkillPacks

Knowledge fusion method for LLMs via modular SkillPacks. Enables efficient cross-capability transfer for multi-task integration, compression, and continual learning.

Ax Rulin Shao, Shuyue Stella Li, Rui Xin, Scott Geng, Yiping Wang, Sewoong Oh, Simon Shaolei Du, Nathan Lambert, Sewon Min, Ranjay Krishna, Yulia Tsvetkov, Hannaneh Hajishirzi, Pang Wei Koh, Luke Zettlemoyer 2/26/2026

Spurious Rewards: Rethinking Training Signals in RLVR

Shows RLVR with GRPO can improve LLM mathematical reasoning using spurious rewards with little/no correlation to correct answers, challenging reward signal assumptions.

Ax Shan Jiang, Pranoy Kovuri, David Tao, Zhixun Tan 2/26/2026

CASCADE: LLM-Powered JavaScript Deobfuscator at Google

CASCADE: hybrid LLM-powered JavaScript deobfuscator at Google combining Gemini coding capabilities with compiler IR transformations for code comprehension.

Ax Lauri Suomela, Sasanka Kuruppu Arachchige, German F. Torres, Harry Edelman, Joni-Kristian Kämäräinen 2/26/2026

Synthetic vs. Real Training Data for Visual Navigation

Investigates sim-to-real gap in visual navigation by comparing simulator-trained and real-world-trained policies. Demonstrates simulator policies can match real-world performance.

Ax Kartik Hegde, Rehana Mahfuz, Yinyi Guo, Erik Visser 2/26/2026

Aligning Audio Captions with Human Preferences

Preference-aligned audio captioning framework using RLHF with CLAP-based reward model trained on human-labeled preferences. Addresses gap between supervised learning and real preferences.

Ax Advik Raj Basani, Pin-Yu Chen 2/26/2026

Diversity Boosts AI-Generated Text Detection

DivEye detector for AI-generated text using diversity metrics. Improves detection of synthetic text while providing interpretability over black-box classifiers.

Ax Raheem Karim Hashmani, Garrett W. Merz, Helen Qu, Mariel Pettee, Kyle Cranmer 2/26/2026

Multimodal Datasets with Controllable Mutual Information

Framework for generating multimodal datasets with controllable mutual information between modalities. Enables systematic study of MI estimators and multimodal self-supervised learning.

Ax NVIDIA: Arslan Ali, Junjie Bai, Maciej Bala, Yogesh Balaji, Aaron Blakeman, Tiffany Cai, Jiaxin Cao, Tianshi Cao, Elizabeth Cha, Yu-Wei Chao, Prithvijit Chattopadhyay, Mike Chen, Yongxin Chen, Yu Chen, Shuai Cheng, Yin Cui, Jenna Diamond, Yifan Ding, Jiaojiao Fan, Linxi Fan, Liang Feng, Francesco Ferroni, Sanja Fidler, Xiao Fu, Ruiyuan Gao, Yunhao Ge, Jinwei Gu, Aryaman Gupta, Siddharth Gururani, Imad El Hanafi, Ali Hassani, Zekun Hao, Jacob Huffman, Joel Jang, Pooya Jannaty, Jan Kautz, Grace Lam, Xuan Li, Zhaoshuo Li, Maosheng Liao, Chen-Hsuan Lin, Tsung-Yi Lin, Yen-Chen Lin, Huan Ling, Ming-Yu Liu, Xian Liu, Yifan Lu, Alice Luo, Qianli Ma, Hanzi Mao, Kaichun Mo, Seungjun Nah, Yashraj Narang, Abhijeet Panaskar, Lindsey Pavao, Trung Pham, Morteza Ramezanali, Fitsum Reda, Scott Reed, Xuanchi Ren, Haonan Shao, Yue Shen, Stella Shi, Shuran Song, Bartosz Stefaniak, Shangkun Sun, Shitao Tang, Sameena Tasmeen, Lyne Tchapmi, Wei-Cheng Tseng, Jibin Varghese, Andrew Z. Wang, Hao Wang, Haoxiang Wang, Heng Wang, Ting-Chun Wang, Fangyin Wei, Jiashu Xu, Dinghao Yang, Xiaodong Yang, Haotian Ye, Seonghyeon Ye, Xiaohui Zeng, Jing Zhang, Qinsheng Zhang, Kaiwen Zheng, Andrew Zhu, Yuke Zhu 2/26/2026

World Simulation with Video Foundation Models for Physical AI

Cosmos-Predict2.5 foundation model for world simulation unifying text/image/video generation. Leverages vision-language model for grounded physical AI predictions.

Ax Soufiane Hayou 2/26/2026

A Proof of Learning Rate Transfer under μP

Theoretical proof that optimal learning rates transfer across widths in MLPs under μP. Shows the optimal learning rate converges to a nonzero constant as width goes to infinity.

Ax Vincenzo Lipardi, Domenica Dibenedetto, Georgios Stamoulis, Evert van Nieuwenburg, Mark H. M. Winands 2/26/2026

Nonstabilizerness Estimation using Graph Neural Networks

Graph neural network approach for estimating nonstabilizerness in quantum circuits. Addresses quantum advantage through stabilizer Rényi entropy estimation.

Ax Aqsa Sultana, Rayan Afsar, Ahmed Rahu, Surendra P. Singh, Brian Shula, Brandon Combs, Derrick Forchetti, Vijayan K. Asari 2/26/2026

XtraLight-MedMamba for Classification of Neoplastic Tubular Adenomas

Deep learning model for classifying neoplastic tubular adenomas in colonoscopy. Uses Mamba architecture for digital pathology and colorectal cancer risk stratification.

Ax Christian Catalini, Xiang Hui, Jane Wu 2/26/2026

Some Simple Economics of AGI

Economic analysis of AGI's impact on labor and growth. Argues human verification becomes the bottleneck as AI decouples cognition from biology.