Isolater - Feed

Ax Jacky Kwok, Shulu Li, Pranav Atreya, Yuejiang Liu, Yixing Jiang, Chelsea Finn, Marco Pavone, Ion Stoica, Azalia Mirhoseini 23d ago

LLM-as-a-Verifier: A General-Purpose Verification Framework

arXiv paper introducing LLM-as-a-Verifier, a general-purpose framework that treats verification as a scaling axis and enables fine-grained feedback for agentic tasks.

HN tt_ay 23d ago

Traceburn, a local profiler that found 69% avoidable agent spend

Traceburn profiler analyzes LLM agent token usage; it flagged 69% of spend as avoidable and recommends prompt caching as the fix.

HN salarkhannn 23d ago

Forget the GPU Shortage: The Real AI Bottleneck Was Diagnosed in 2007

Analysis of memory bandwidth as the true bottleneck limiting AI model inference at scale, examining recent HBM supply deals and packaging innovations.

HN medhovarsh 23d ago

ForkMind – Git for LLM context: branch, offload, and restore it

Developer tool treating LLM context windows as Git repositories with branching, DAG visualization, and multi-model support. Works locally with Ollama and OpenAI-compatible APIs.

HN handfuloflight 23d ago

Smolbren: Local search for your Markdown vaults

Smolbren converts markdown files into queryable knowledge graphs with full-text search, supporting AI agent queries via Cypher and BM25.

HN ericstrate 23d ago

The Answer Citation Protocol (ACP)

Proposes Answer Citation Protocol to improve web structure for LLM-based answer engines, addressing inefficient HTML crawling.

HN rishsriv 23d ago

Show HN: FactIQ – a realtime econ+finance database for AI agents

FactIQ database provides organized economic/finance data for AI agents to conduct investment research without wasting context window on data cleaning.

HN narinluangrath 23d ago

SmolMail: An AI agent for Gmail that answers in visuals, not walls of text

SmolMail: AI agent for Gmail that summarizes emails as visuals instead of walls of text.

BL 23d ago

Introducing GPT-Live

OpenAI releases GPT-Live, full-duplex voice model enabling simultaneous listening and speaking for natural conversations.

HN litppicho 23d ago

Free 100M AI tokens for Kimi and MiniMax models

Offer of 100M free AI tokens for inference with Kimi and MiniMax models on a decentralized GPU network, aimed at developers and startups.

HN jack1689 23d ago

Why the rise of open source AI isn't hurting Anthropic yet

Analysis of the apparent contradiction between enterprise adoption of lighter open-source models and sustained spending on frontier AI models.

HN BUFU 23d ago

Qualcomm acquires Nexa AI, open-sources GenAI runtime for Hexagon NPUs

Qualcomm acquires Nexa AI and open-sources GenieX runtime for on-device LLM inference on Hexagon NPUs. Supports GGUF models from Hugging Face with Python, CLI, and OpenAI-compatible APIs.

HN modinfo 23d ago

Desk-Pet – a local-first desktop pet powered by MiniCPM5

Desk-Pet: local-first desktop companion powered by MiniCPM5 LLM; fully offline with guided setup for macOS and Windows.

HN nossa-y 23d ago

Activity-frames – give your AI agent eyes on your day

Activity-frames tool gives AI agents real-time visibility into user activity and calendar.

HN Fanfulla 23d ago

Show HN: OCR Buddy: local browser OCR for code, formulas (LaTeX) and tables

OCR Buddy: local browser OCR extension for code, formulas, tables with Markdown export for LLMs; fully offline, no hallucinations.

HN herbertl 23d ago

Chinese AI models are gaining ground with U.S. companies as costs surge

Analysis of Chinese AI models gaining market share in US as costs rise and performance improves.

HN mazen160 24d ago

Show HN: Backlog – tasks and contexts manager for AI coding agents

Backlog tool for managing tasks and context for AI coding agents.

HN CrankyBear 24d ago

Agent Name Service: The universal AI Agents identity system

Linux Foundation standardizing AI agent identification through Agent Name Service (ANS) and DNS-AID protocols.

HN davidecampora 24d ago

Omnibaas, a provider-agnostic Infrastructure-as-Code compiler for BaaS services

Omnibaas: provider-agnostic Infrastructure-as-Code compiler for BaaS services reducing vendor lock-in across multiple cloud providers.

HN boulos 24d ago

Mixed-Precision SVD on GPUs via Ogita–Aishima Iterative Refinement

Research on GPU-accelerated SVD computation using mixed-precision iterative refinement for numerical stability.

HN roriau 24d ago

Show HN: Agent Bus – IRC-style message bus for AI agents (MCP)

IRC-style message bus infrastructure for AI agents using Model Context Protocol (MCP).

HN gawkdev 24d ago

Show HN: Gawk CLI – a live AI update feed in your terminal

CLI tool for curated AI news feed with source citations, alternative to LinkedIn AI coverage.

HN gukov 24d ago

Code maintainability plummets in the AI coding era

Analysis of code maintainability challenges from agentic AI coding tools violating DRY principles.

HN guga42k 24d ago

Parallel development in tmux* with Git worktrees

Workmux: workflow tool for managing git worktrees and tmux windows as isolated development environments, optimized for parallel AI agent execution.

HN zie1ony 24d ago

We charge $10k a week to delete AI-generated code

Service offering code refactoring for AI-generated codebases to improve maintainability and reduce duplication.

HN t-van 24d ago

A launch playbook your coding agent can run – it launched itself

Launch playbook framework for coding agents and agentic builders to ship products through community engagement without existing audience.

HN syumei 24d ago

A Cursor Sandbox Escape Shows Why AI Agents Need Kernel Boundaries

Analysis of a sandbox escape vulnerability in the Cursor IDE, highlighting why AI agents with system access need kernel-level boundaries.

HN legojazz 24d ago

Open Source Game Dev Agent

OpenGenie: AI-powered game development agent converting plain language descriptions into Godot 4 projects with automated testing.

HN haritha1313 24d ago

Microsoft Replaces OpenAI, Anthropic with Own AI in Some Apps

Microsoft replacing OpenAI and Anthropic models with proprietary AI in Excel and Outlook to reduce costs.

HN trekhleb 24d ago

Claude's Learning Mode

Claude's learning mode is now available to all Claude Code users, guiding them toward solutions rather than giving direct answers.

HN salnika 24d ago

Show HN: Dejavu, stop showing coding agents the same command output twice

Dejavu: PATH shim tool reducing redundant command output for coding agents like Claude Code and Cursor by returning compact deltas instead of repeated output.

HN bogdiyan 24d ago

MIRA: Multiplayer Interactive World Models with Representation Autoencoders

MIRA: 5B parameter latent diffusion world model for Rocket League supporting real-time 2v2 multiplayer gameplay at 20 FPS on single GPU with released code.

HN dilyevsky 24d ago

Show HN: CLRK, an open-source agent runtime with gVisor and MitM guardrails

CLRK: open-source agent runtime using gVisor for isolation and Kubernetes deployment. Provides fully intercepted I/O for tracing LLM and network calls with MitM guardrails.

HN rzk 24d ago

Neuronpedia, an open source platform for AI interpretability

Open source platform for AI interpretability research enabling analysis of neural network internal mechanisms.

HN thomasunise 24d ago

I built a tiny proxy that gives GLM 5.2 vision (or any text LLM) – MIT

Proxy tool that adds vision capabilities to text-only LLMs like GLM 5.2.

HN alaaalawi 24d ago

Show HN: Q a REPL for LLM inside the terminal

Q is a REPL interface for running LLMs directly within the terminal.

HN gurjeet 24d ago

Sets up your AI agent for Cloudflare

Tool that sets up an AI agent to work with Cloudflare infrastructure.

HN wscholl 24d ago

Squish, a local LLM inference server for Apple Silicon

Squish is a local LLM inference server optimized for Apple Silicon hardware.

HN betzsoftware 24d ago

Show HN: Last EHR – AI agent over a FHIR back end with human approval on writes

AI agent system for electronic health records using FHIR standard with human approval workflow for data writes.

HN surprisetalk 24d ago

Muse Image and Muse Video

Meta's Muse Image and Muse Video generative models.

HN erikbethke 24d ago

Show HN: Bike4Mind – open-core AI workbench; any model, agents, RAG, self-host

Open-core AI workbench supporting multiple models, agents, RAG, self-hosting. Developer tool for AI workflows.

HN thekiraproject 24d ago

Local AI is re-reading its own prompt

Local AI model introspection behavior. Lacks technical depth and full context.

HN anonli 24d ago

AI Job Search – An AI-powered job application framework built on Claude Code

AI-powered job application framework built on Claude Code, demonstrating LLM agent application for automating job search workflows.

HN Kosturdistan 24d ago

AI Clambake Launches AI Bubble Tracker

LLM-powered tool generating personalized interactive courses from user input. Educational LLM application.

HN hbarka 24d ago

Ask HN: What is your AI harness that lets you switch LLM models easily?

Developer question on LLM abstraction layer tools for easy model switching across providers.

HN kirillklimuk 24d ago

Show HN: Docx-CLI: agents read/edit Word docs using 1/2 the time and tokens

CLI tool for AI agents to read and edit .docx files with comments and redlines while preserving formatting, reducing token usage.

HN realdjpaulyd 24d ago

Show HN: Fork – Let users build features on top of existing applications

Chrome extension enabling users to build AI-powered features on Gmail and Google Calendar via coding interface.

HN luispa 24d ago

We learned to trust our AI code reviewer at DoorDash

DoorDash shares experience deploying AI code reviewer in production, learning to trust automated code review.

HN lukeigel 24d ago

Show HN: Cfswitch – Switch between multiple Cloudflare accounts in Wrangler

CLI tool switching between multiple Cloudflare accounts via environment variables, built for AI agents and CI workflows.

HN _pdp_ 24d ago

Show HN: An agentic CRM, built for AI agents to drive over plain HTTP

Agent-first CRM built on HTTP APIs designed for AI agents to operate directly without UI dashboards. Internal system replacement.