Ax Zijie Yan (NVIDIA), Hongxiao Bai (NVIDIA), Xin Yao (NVIDIA), Dennis Liu (NVIDIA), Tong Liu (NVIDIA), Hongbin Liu (NVIDIA), Pingtian Li (NVIDIA), Evan Wu (NVIDIA), Shiqing Fan (NVIDIA), Li Tao (NVIDIA), Robin Zhang (NVIDIA), Yuzhong Wang (NVIDIA), Shifang Xu (NVIDIA), Jack Chang (NVIDIA), Xuwen Chen (NVIDIA), Kunlun Li (NVIDIA), Yan Bai (NVIDIA), Gao Deng (NVIDIA), Nan Zheng (NVIDIA), Vijay Anand Korthikanti (NVIDIA), Abhinav Khattar (NVIDIA), Ethan He (NVIDIA), Soham Govande (NVIDIA), Sangkug Lym (NVIDIA), Zhongbo Zhu (NVIDIA), Qi Zhang (NVIDIA), Haochen Yuan (NVIDIA), Xiaowei Ren (NVIDIA), Deyu Fu (NVIDIA), Tailai Ma (NVIDIA), Shunkang Zhang (NVIDIA), Jiang Shao (NVIDIA), Ray Wang (NVIDIA), Santosh Bhavani (NVIDIA), Xipeng Li (NVIDIA), Chandler Zhou (NVIDIA), David Wu (NVIDIA), Yingcan Wei (NVIDIA), Ashwath Aithal (NVIDIA), Michael Andersch (NVIDIA), Mohammad Shoeybi (NVIDIA), Jiajie Yao (NVIDIA), June Yang (NVIDIA) 3/10/2026

Scalable Training of Mixture-of-Experts Models with Megatron Core

arXiv paper on scaling Mixture-of-Experts model training using Megatron Core, addressing systems challenges in sparse model architectures across memory, communication, and computation.

LB hal.cs.princeton.edu via sanxiyn 3/10/2026

AI Agent Reliability Tracker

Study evaluating 14 AI agents across 2 benchmarks on 12 metrics across 4 reliability dimensions. Finds recent capability gains yield only small improvements in actual reliability compared to accuracy scores.

HN vismit2000 3/10/2026

A Survival Guide to a PhD (2016)

2016 retrospective guide on navigating PhD programs. Academic career advice unrelated to AI/tech development.

HN manavpanchal 3/10/2026

Phonyhuman

Open source agent framework forking OpenAI's Symphony, using Claude Code for autonomous implementation of Linear board issues. AI agents with LLM integration.

HN champagnepapi 3/10/2026

Karpathy: Autoresearch

Andrej Karpathy thought piece on autonomous AI agents conducting frontier research across compute clusters. Speculative/fictional framing of agentic research systems.

HN piotrbednarsalt 3/10/2026

LLM Integrity During Inference in Llama.cpp

Security research on model artifact integrity during local LLM inference in llama.cpp. Creates llm-inference-tampering project targeting inference-layer attacks.

HN samfierro 3/10/2026

The Requirements Layer Your AI System Is Missing

Framework for defining requirements and specifications for AI systems beyond testing/evals. Addresses gap between eval scores and actual user satisfaction in AI products.

HN alhazrod 3/10/2026

TLAi+ Benchmarks for Evaluating LLMs

TLAi+ Benchmarks: dataset and benchmark suite for evaluating LLMs on TLA+ formal specification tasks with diverse problem types.

HN Escafati 3/10/2026

I will pay you if you ship a bug

Autonoma: AI agents that automatically generate test suites and find bugs by navigating applications without manual test scripts.

HN oumua_don17 3/9/2026

Getting Started in Common Lisp

Guide on getting started with Common Lisp programming language, covering setup and IDE configuration.

HN addiefoote8 3/9/2026

Open Weights Isn't Open Training

Essay on limitations of open weights models without open training data, discussing post-training challenges for trillion parameter models.

HN cassmtnr 3/9/2026

Claude Code Starter CLI

CLI tool using Claude to analyze project codebases and generate customized Claude Code configurations. Integrates with Claude CLI for code-specific setup.

HN aboutphilippe 3/9/2026

Context plane for AI agents (Rust, S3)

Architecture and benchmarks for context plane infrastructure that provides AI agents with targeted context retrieval via S3, replacing prompt stuffing and multiple API calls.