HN o4c 2/21/2026

The Fundamental Limits of LLMs at Scale

arXivLabs framework description; little technical content is visible, and the page appears to be a platform announcement rather than a research article.

HN gcollard- 2/21/2026

Hardware LLM at 16K Tokens/s

Hardware benchmark demonstrating 17k tokens/second throughput for Llama 3.1 8B on Taalas silicon, comparing against competing inference accelerators.

HN gmays 2/21/2026

They Do Mean the Effect on Jobs

News roundup discussing job impact projections and AI timeline predictions, including the Claude Sonnet 4.6 release.

HN fagnerbrack 2/21/2026

Napkin Math

Techniques and numbers for estimating system performance from first principles, with examples like memory read speeds and storage cost calculations.
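
As a rough illustration of the kind of estimate described here, the sketch below works through a memory read time and a storage cost. The throughput and price constants are illustrative assumptions, not figures from the linked article, and the function names are hypothetical.

```python
def scan_seconds(size_bytes: float, bytes_per_second: float) -> float:
    """Time to read size_bytes sequentially at a given throughput."""
    return size_bytes / bytes_per_second


def monthly_storage_cost(size_bytes: float, dollars_per_gb_month: float) -> float:
    """Monthly cost of storing size_bytes at a flat per-GB price (1 GB = 1e9 bytes)."""
    return size_bytes / 1e9 * dollars_per_gb_month


# Assumed ballpark inputs for illustration only; substitute measured values.
ASSUMED_MEMORY_BYTES_PER_SECOND = 10e9  # assumption: ~10 GB/s sequential memory read
ASSUMED_DOLLARS_PER_GB_MONTH = 0.02     # assumption: $0.02 per GB per month

dataset_bytes = 100e9  # a hypothetical 100 GB dataset

read_time = scan_seconds(dataset_bytes, ASSUMED_MEMORY_BYTES_PER_SECOND)
cost = monthly_storage_cost(dataset_bytes, ASSUMED_DOLLARS_PER_GB_MONTH)

print(f"Sequential read: ~{read_time:.0f} s")
print(f"Storage: ~${cost:.2f} per month")
```

The point of such an estimate is the order of magnitude: if a measured system is off from the estimate by 10x or more, either the assumptions or the implementation deserve a closer look.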

HN gfortaine 2/21/2026

Cord: Coordinating Trees of AI Agents

Framework for coordinating trees of dependent AI agent tasks, supporting parallelism and context flow between tasks, aimed at multi-agent orchestration.