Ax Samuel Girard, Aurelien Bibaut, Arthur Gretton, Nathan Kallus, Houssam Zenati 4/6/2026

Fast Best-in-Class Regret for Contextual Bandits

Fast regret bounds for contextual bandits without realizability assumptions using pessimistic policy updates.

Ax Toufique Ahmed, Jatin Ganhotra, Avraham Shinnar, Martin Hirzel 4/6/2026

Investigating Test Overfitting on SWE-bench

Investigation of test overfitting in SWE-bench for code resolution, where models pass tests but miss important cases.

HN XYen0n 4/6/2026

SSH to any machine without IP

SSH tool for connecting to machines behind NAT/firewalls without port forwarding. Infrastructure utility unrelated to AI.

HN gimlids 4/6/2026

Show HN: LLMs' Favorite Colors

Analysis of LLM color generation patterns. Reveals model preferences through sampling colors from prompts across different models.

HN prolly97 4/6/2026

Show HN: hot or not for .ai websites

Hot or Not for .ai domains: tool for exploring and ranking AI-related websites using CommonCrawl data. Helps identify landscape trends.

HN gmays 4/6/2026

Can we ever trust AI to watch over itself?

Opinion piece on AI safety research funding. Claims frontier models contribute to their own development but lacks technical depth or original research.