Isolater - Feed

HN alexbit2019 19d ago

Npx codemod AI: make your coding agent great at large migrations

Npx codemod AI tool for enabling AI agents to handle large-scale code migrations.

HN locknitpicker 19d ago

DDD Bounded Contexts: Clear Domain Boundaries for LLM Code Generation

Using DDD bounded contexts to improve LLM code generation by providing architectural clarity and reducing cognitive overload.

HN Brajeshwar 19d ago

Benchmarking LLM Tool-Use in the Wild

Research on LLM tool-use benchmarking. Limited preview content.

HN dial481 19d ago

Show HN: Proposal for a real long-term AI memory benchmark

Audit of AI memory benchmarks reveals flawed evaluation: wrong answer keys, biased LLM judges, unreliable comparisons.

HN bastscho 19d ago

Show HN: Mdpdf a 2k line C CLI to convert Markdown to tiny PDFs

CLI tool converting Markdown to PDFs. Author mentions using agentic coding during development but tool itself unrelated to AI.

HN veganmosfet 19d ago

Show HN: BrokenClaw Part 5: GPT-5.4 Edition (Prompt Injection)

Security research demonstrating prompt injection vulnerabilities in GPT-5.4 model, showing untrusted code execution risks.

HN gk1 19d ago

Agents Need a Database to Break

Infrastructure requirements for AI agents: real-time data streaming and database branching for safe sandboxed execution.

HN ashed96 19d ago

LLM-Wiki but for Early Founders

LLM-Wiki adapted for early-stage startup founders as a knowledge management tool.

HN steveharing1 19d ago

Open-source AI assistant that watches your screen and points at things

Open-source AI assistant that monitors screen content and provides contextual assistance.

HN jakehulberg 19d ago

Coding Agents Are Reading Your .env

Security research showing how coding agents can inadvertently expose secrets from .env files.

HN rida123 19d ago

Nyth AI – Local LLM Inference on iOS Using MLC-LLM and TVM

Nyth AI enables local LLM inference on iOS using MLC-LLM and TVM compiler framework.

HN onthemarkdata 19d ago

Petri, a multi-agent orchestration framework for building AI context

Petri is a multi-agent orchestration framework for coordinating AI context across agents.

HN karimfan 19d ago

Foxhound – helping LLMs better navigate codebases

Foxhound helps LLMs better navigate and understand codebases.

HN liminnnnnng 19d ago

AET: A transpiler that compresses code for LLMs (saves 30-55% tokens)

AET transpiler compresses code for LLM input, reducing token usage by 30-55%.

HN theahura 19d ago

Show HN: Ship of Theseus License

Discussion of cleanroom implementation legal theory for circumventing software licenses using AI.

HN atzeus 19d ago

AI alignment: the signal is the goal

Title only, no content. Appears to address AI alignment concepts.

HN kuberwastaken 19d ago

I Read Anthropic's 244 Page Reason to Not Release Mythos So You Don't Have To

Commentary on Anthropic's decision not to release a model, referencing historical GPT-2 release. Incomplete article with unclear relevance.

HN ddp26 19d ago

I think Anthropic is worth $100B more than last week

Opinion piece forecasting Anthropic's market valuation based on announced revenue growth.

HN himanshudongre 19d ago

Show HN: PropOps – AI agent that reads Indian govt property data nobody checks

Title only. AI agent application reading Indian government property data for insights.

HN SpaceJudas 19d ago

Datadog: We built a real-world evaluation platform for SRE agents at scale

Title only. Datadog platform for evaluating SRE AI agents in production environments at scale.

HN rarce 19d ago

An Agent Skill that implements Karpathy's LLM-wiki on personal GitHub Repo

Agent skill implementing Karpathy's LLM-wiki pattern for persistent knowledge management on GitHub repos with hybrid search.

HN philiparxist 19d ago

LLM agents shouldn't execute blindly – this one plans first and stays editable

cuddlytoddly is an LLM agent framework that generates editable task graphs before execution rather than acting blindly.

HN sohaibtariq 19d ago

Show HN: AI agents are bad at API integrations – we fixed it

Developer tool helping AI agents integrate with APIs properly by using current documentation instead of stale training data. Addresses real agent limitation.

HN Brajeshwar 19d ago

Study: Google's AI Overviews show wrong answers every hour

Report on accuracy issues in Google's AI search summaries, citing hourly hallucinations. News coverage of LLM reliability problems.

HN timssopomo 19d ago

The Model Says Walk: How Surface Heuristics Override LLM Reasoning Constraints

Research on how surface heuristics can override reasoning constraints in LLMs, published on arXiv.

HN michaelquigley 19d ago

Show HN: MCP Gateway – Zero-Trust Access to MCP Tool Servers

MCP Gateway tool for secure remote access to MCP servers using zero-trust networking with zrok/OpenZiti.

HN Jordisilvestre 19d ago

Repurposed Nvidia RT Cores for LLM routing (218x speedup)

Technique repurposing Nvidia RT cores for LLM routing achieving 218x speedup. Limited technical details.

HN meryll_dindin 19d ago

We orchestrate our internal AI agent (PydanticAI, Gemini, Jinja2 prompts)

Title only. Documentation of internal AI agent architecture using PydanticAI, Gemini, and Jinja2 templates.

HN Ms-J 19d ago

Best Open Source Offline AI Agent

Evaluation of open-source AI agents supporting local/self-hosted models with offline capability and network isolation.

HN donge 19d ago

Show HN: Qiaohu – offline multimodal voice assistant on Snapdragon 8 Gen 2

Offline Chinese voice assistant running entirely on Snapdragon 8 Gen 2 with VAD, LLM, TTS, and barge-in interrupts. Open source with code.

HN elsewhen 19d ago

Study found that young adults have grown less hopeful and more angry about AI

Survey on young adults' attitudes toward AI, showing declining optimism. Social sentiment research, not technical.

HN adcent 19d ago

Show HN: Postagent – Postman CLI, but for AI Agents

Developer tool for agents to discover and call APIs like Postman. Manages API credentials securely, keeping secrets out of LLM context.

HN Arham-Begani 19d ago

Show HN: Forze – A platoform that turns a startup idea into a product in mins

AI agent platform orchestrating sequential agents for market research, branding, landing pages. Practical multi-agent application completing startup validation in 10 minutes.

HN tinyprojects 19d ago

Show HN: Zoneless – Open-source Stripe Connect clone with $0.002 fees using USDC

Open-source Stripe Connect alternative using USDC. Payment infrastructure, not AI-focused despite bootstrapping an AI marketplace.

HN Brajeshwar 19d ago

With Claude Managed Agents, Anthropic wants to run your AI agents for you

News about Anthropic's Claude Managed Agents service offering hosted AI agent execution.

HN wslh 19d ago

ScienceClaw: Framework for Autonomous Scientific Investigation

ScienceClaw: open-source framework for autonomous scientific investigation with independent agents, 300+ interoperable tools, peer review on shared platform.

HN alxstn 19d ago

Show HN: AgentDM – Agent to agent messaging over MCP and A2A

AgentDM: hosted messaging grid enabling direct agent-to-agent communication over MCP with 5-line JSON config, no SDK required.

HN hjm1980 19d ago

Show HN: I built a desktop app for building and debugging MCP tools

Desktop application for building and debugging MCP (Model Context Protocol) tools.

HN wslh 19d ago

DeepTutor: Agent-Native Personalized Tutoring

DeepTutor v1.0.0 agent-native tutoring system with ground-up architecture rewrite, TutorBot, and flexible mode switching under Apache-2.0 license.

HN girlwponytail 19d ago

Run it for yourself: compute time dilation

OS concept claiming polynomial-time computational hardness collapse with security implications for cryptographic systems.

HN allthingsapi 19d ago

ALTK‑Evolve: On‑the‑Job Learning for AI Agents

Research on AI agents that learn and improve performance through on-the-job task execution.

HN mooreds 19d ago

Migrations Considered Helpful

Essay on database migrations as evolutionary processes managing system changes while maintaining continuity and uptime.

HN pdrgds 19d ago

Show HN: Nheengatu – CLI tool to simplify books to your language level with LLMs

Nheengatu: Rust CLI tool using LLMs to simplify EPUB books to target language proficiency levels (A1-C2), supports Groq or local Ollama.

HN x591 19d ago

Programming language designed for LLMs to write, not humans

Vera: programming language designed for LLMs to write with verification as first-class citizen, adapted to model-as-author paradigm.

HN danielhanchen 19d ago

Gemma 4 Fine-Tuning Guide

Guide for fine-tuning Google's Gemma 4 LLM model.

HN upsys 19d ago

A productivity-first Dynamic Island for macOS, built with zero dependencies

macOS app providing Dynamic Island functionality for music control, calendar, and focus tracking without subscriptions.

HN tosh 19d ago

PtrHash: Minimal Perfect Hashing at RAM Throughput

PtrHash: Research paper on minimal perfect hashing achieving RAM throughput speeds for databases and search engines.

HN hungryclaw 19d ago

Exploring Mythos: The AI Model Anthropic Won't Release

Opinion article discussing Anthropic's unreleased Mythos model and implications for model access exclusivity.

HN sarimkx 19d ago

White-Collar Workers Are Rebelling Against AI – 80% Refuse Adoption Mandates

Opinion article on worker resistance to AI adoption mandates, citing MIT study on shadow AI usage.

HN mbmproductions 19d ago

AI agents aren't abstraction, they're delegation (and why that matters)

Conceptual article distinguishing AI agents as delegation systems rather than abstractions, exploring design implications.