Ax Siddharth Boppana, Annabel Ma, Max Loeffler, Raphael Sarfati, Eric Bigelow, Atticus Geiger, Owen Lewis, Jack Merullo 3/11/2026

Reasoning Theater: Disentangling Model Beliefs from Chain-of-Thought

Analysis of performative chain-of-thought in reasoning models, showing models generate tokens without revealing internal beliefs via activation probing.

Ax Zijie Yan (NVIDIA), Hongxiao Bai (NVIDIA), Xin Yao (NVIDIA), Dennis Liu (NVIDIA), Tong Liu (NVIDIA), Hongbin Liu (NVIDIA), Pingtian Li (NVIDIA), Evan Wu (NVIDIA), Shiqing Fan (NVIDIA), Li Tao (NVIDIA), Robin Zhang (NVIDIA), Yuzhong Wang (NVIDIA), Shifang Xu (NVIDIA), Jack Chang (NVIDIA), Xuwen Chen (NVIDIA), Kunlun Li (NVIDIA), Yan Bai (NVIDIA), Gao Deng (NVIDIA), Nan Zheng (NVIDIA), Vijay Anand Korthikanti (NVIDIA), Abhinav Khattar (NVIDIA), Ethan He (NVIDIA), Soham Govande (NVIDIA), Sangkug Lym (NVIDIA), Zhongbo Zhu (NVIDIA), Qi Zhang (NVIDIA), Haochen Yuan (NVIDIA), Xiaowei Ren (NVIDIA), Deyu Fu (NVIDIA), Tailai Ma (NVIDIA), Shunkang Zhang (NVIDIA), Jiang Shao (NVIDIA), Ray Wang (NVIDIA), Vasudevan Rengasamy (NVIDIA), Rachit Garg (NVIDIA), Santosh Bhavani (NVIDIA), Xipeng Li (NVIDIA), Chandler Zhou (NVIDIA), David Wu (NVIDIA), Yingcan Wei (NVIDIA), Ashwath Aithal (NVIDIA), Michael Andersch (NVIDIA), Mohammad Shoeybi (NVIDIA), Jiajie Yao (NVIDIA), June Yang (NVIDIA) 3/11/2026

Scalable Training of Mixture-of-Experts Models with Megatron Core

Megatron Core system optimizations for scaling Mixture-of-Experts model training across memory, communication, and computation constraints.

Ax Peter Brodeur, Jacob M. Koshy, Anil Palepu, Khaled Saab, Ava Homiar, Roma Ruparel, Charles Wu, Ryutaro Tanno, Joseph Xu, Amy Wang, David Stutz, Hannah M. Ferrera, David Barrett, Lindsey Crowley, Jihyeon Lee, Spencer E. Rittner, Ellery Wulczyn, Selena K. Zhang, Elahe Vedadi, Christine G. Kohn, Kavita Kulkarni, Vinay Kadiyala, Sara Mahdavi, Wendy Du, Jessica Williams, David Feinbloom, Renee Wong, Tao Tu, Petar Sirkovic, Alessio Orlandi, Christopher Semturs, Yun Liu, Juraj Gottweis, Dale R. Webster, Jo\"elle Barral, Katherine Chou, Pushmeet Kohli, Avinatan Hassidim, Yossi Matias, James Manyika, Rob Fields, Jonathan X. Li, Marc L. Cohen, Vivek Natarajan, Mike Schaekermann, Alan Karthikesalingam, Adam Rodman 3/11/2026

A prospective clinical feasibility study of a conversational diagnostic AI in an ambulatory primary care clinic

Clinical feasibility study of AMIE, an LLM-based conversational AI for patient diagnostic history in real-world primary care workflows.

Ax Ben Rank, Hardik Bhatnagar, Ameya Prabhu, Shira Eisenberg, Karina Nguyen, Matthias Bethge, Maksym Andriushchenko 3/11/2026

PostTrainBench: Can LLM Agents Automate LLM Post-Training?

PostTrainBench benchmarks LLM agents' ability to automate post-training of language models, extending AI agents to AI research automation.

HN photoncat 3/11/2026

Rlclaw autonomous ML research companion

Autonomous AI agent that optimizes control systems by independently writing code, training models, and iterating on research problems with minimal human guidance via Discord.

HN sultanvaliyev 3/11/2026

Semantically search 45k+ AI skills

Open marketplace indexing 45,000+ AI agent skills with semantic search. Works with Claude Code, Cursor, Windsurf and other agents.

HN kevin1chun 3/11/2026

Robinhood Agent Integration

MCP server providing 18 structured tools for AI agents to interact with Robinhood trading platform. Compatible with Claude Code and OpenClaw.