Isolater - Feed

HN mazheru 22d ago

Show HN: Parchmint – a Markdown editor that shows what the AI actually reads

Markdown editor visualizing exact token representation LLMs see. Shows formatting, whitespace, warnings for prompt/doc optimization. Open-source, local-first.

HN ishweta 22d ago

Build React Forms with AI Using Our Shadcn Form Builder [video]

Shadcn form builder that uses AI to generate React forms from visual specifications.

HN rstock 22d ago

Show HN: PACT – An open-source toolkit for privately signing digital content

PACT: open-source toolkit for signing digital content, tracking provenance, and enforcing AI training policies with policy metadata.

HN riscoss 22d ago

Show HN: Engramma Memory – Composable memory for AI agents(multi-head attention)

Engramma Memory: open-source composable memory architecture using multi-head attention for AI agents.

HN aurellius 22d ago

Show HN: Vicinae – open-source, Raycast-compatible command palette, now on macOS

Vicinae command palette with Raycast extension compatibility now available on macOS, built with Qt/C++ and Node.js runtime.

HN marcociavarella 22d ago

Show HN: Dex – Cost-aware analytics engineering skills for agents

Analytics tool for agents to optimize costs when using Claude Code and other LLM agents against expensive data platforms. Addresses token efficiency.

HN Brajeshwar 22d ago

China Says It Has Found Security Vulnerabilities in Anthropic's Claude Code

Report on security vulnerabilities found in Anthropic Claude Code. LLM system analysis.

HN alecavaz 22d ago

Grillr, AI that interrogates your startup idea then holds you to real deadlines

Grillr: AI agent that critiques startup ideas and enforces accountability with real deadline tracking.

HN amichail 23d ago

We just figured out how AI works (J-Space) [video]

Video exploring J-Space theory explaining how AI models work internally.

HN billqu0001 23d ago

NexSub: The First Offline AI Video Subtitle Translator

NexSub: offline AI video subtitle translator supporting multilingual translation locally without internet or subscriptions.

HN yingyenliu 23d ago

Show HN: Design a component visually, get spec-grade prompts for AI tools

UIPrompt: visual component editor generating spec-grade AI prompts for Claude, Cursor, and v0 with exact design values and accessibility rules.

HN Praxwise 23d ago

Meta releases MuseImage and MuseVideo, its image and video generation models

Meta releases MuseImage and MuseVideo generative models for image and video creation.

HN ritzaco 23d ago

Are LLMs good enough for Document Extraction?

Analysis of LLM capabilities for document extraction tasks. Evaluation of practical viability.

HN Eapz_06 23d ago

AI agents accessing production database

Community discussion on production AI agent architectures accessing databases. Requests real implementation learnings on guardrails and problems.

HN kevinpeckham 23d ago

Seven Studies on Letting LLMs Edit Trees

Research on LLM-based tree editing capabilities across multiple studies. Empirical analysis.

HN ilbert 23d ago

Ask HN: Why aren't we collaborating on the prompts we give to our AI agents?

Discussion about collaborative prompt engineering for AI agents on teams. Questions why prompts remain individual rather than shared like code.

HN billyholevas 23d ago

I had 25 AI agents try to kill 25 startup ideas. They killed 22

User deployed 25 AI agents to critique startup ideas; 22 were killed by agent feedback. Explores AI agent capabilities for evaluation.

HN GildenEye 23d ago

Show HN: AIfunc – Call AI as a function, not as an agent

AIfunc library enables calling AI as typed, testable functions across languages without learning new frameworks. Model-agnostic npm package approach.

BL 23d ago

Separating signal from noise in coding evaluations

OpenAI audits SWE-Bench Pro benchmark, finds ~30% of tasks broken; details importance of accurate model evaluation.

HN maxgio92 23d ago

Show HN: Xcover, test coverage without instrumentation, using eBPF

eBPF-based test coverage measurement tool without code instrumentation requirements.

HN softmodeling 23d ago

Show HN: A UML drawing skill for your coding agent docs

Agent skill module enabling AI coding agents to generate UML diagrams from natural language or existing codebases.

HN bogdiyan 23d ago

GLM-5.2 (max) matches Claude Opus 4.8 on Harvey LAB-AA benchmark

GLM-5.2 max model performance comparison with Claude Opus 4.8 on Harvey benchmark. Model evaluation result.

HN foh_quarters 23d ago

Cinchor – Control what an AI agent can do, and prove what it did

Cinchor tool provides control and auditability for AI agent actions. Enables constraining agent capabilities and proving execution history.

HN kendallgclark 23d ago

The Social Tier: Remembering Who Said What

Technical documentation on agentic memory systems in WunderOS. Discusses perspective fusion and trust in data sources for distributed systems.

HN wmg 23d ago

How AI Embeddings Cut Cloud Costs by 50% While Boosting Matching by 65%

Title only. AI embeddings cost reduction case study. Insufficient technical depth provided.

HN cramer4next 23d ago

China warns about AI risks with Anthropic's Claude Code

Title only. China security warnings about Claude Code tool. Policy/news without technical details.

HN mart1adelina 23d ago

Show HN: VetoBench – benchmarking AI memory beyond retrieval

Open-source benchmark measuring AI agent memory quality and decision rejection awareness beyond retrieval accuracy. Reproducible evaluation.

HN emnlmn 23d ago

Show HN: CodeRadius, map and govern multi repo architectures

Developer tool for mapping and governing multi-repo architectures using LLMs and AI agents to handle microservices codebase complexity.

HN amanharshx 23d ago

Show HN: Control YOLO Training and Datasets from Claude/Cursor via MCP

MCP server for Claude/Cursor to control Ultralytics YOLO training, datasets, and model management via AI agents. Community project enabling agentic ML workflows.

HN deviscool 23d ago

Real limits converted to API-equivalent $ value for Claude Code, Codex, Copilot

Comparison of API pricing limits across Claude, Codex, and Copilot coding assistants.

HN ashater 23d ago

What if you could stop your AI agent before it makes a mistake?

Title only. Concept for intervention mechanism to prevent AI agent errors before execution.

HN laurencoral 23d ago

Show HN: IAXT – macOS menu-bar app that records what AI coding agents do

macOS application monitoring and recording AI coding agent behavior and actions for analysis.

HN paddi91 23d ago

Optimization Solver as a Service

Quicopt optimization solver service with Python API supporting OR-Tools and Pyomo models. Useful tool but not directly AI-focused.

HN ndr 23d ago

Agentic test processes, LLM benchmarks, and other notes on agentic coding

Title only. Discussion of agentic test processes and LLM benchmarks for code generation.

HN thunderbong 23d ago

OpenClaw plugin for real phone calls via Twilio and OpenAI Realtime

OpenClaw plugin enabling AI agents to make/receive real phone calls via Twilio and OpenAI Realtime API with natural voice conversations and task completion.

HN hasnain_ai 23d ago

SinceAI: A nonprofit AI accelerator combining compute, research, pilot customers

Nonprofit AI accelerator ecosystem connecting 10,000+ AI builders with compute, research partnerships, and hackathon ($50k prize).

HN joozio 23d ago

Open-source LLMs administer maximum electric shocks in a Milgram-like obedience

Research study testing open-source LLMs in Milgram-style obedience experiments, examining alignment and safety of autonomous LLM behavior.

HN joozio 23d ago

Hackers can use 9 of the most popular AI tools to assemble botnets

Security research on prompt injection attacks (HalluSquatting) that enable LLMs to assemble botnets by exploiting inability to refuse malicious commands.

HN Johnny666456 23d ago

CodeTalk – recover why AI-written code was written, quoted from Git (zero-LLM)

Insufficient content; tool description for recovering context about AI-written code from Git history.

HN DSpinellis 23d ago

Why agentic AI needs better experts

Developer documents experience using OpenAI Codex AI agent for major refactoring task, analyzing capabilities and limitations in real-world code changes.

HN jinqueeny 23d ago

Show HN: Open-weights VLA model for 20 robot embodiments (code and checkpoints)

LingBot-VLA 2.0 is an open-weight Vision-Language-Action foundation model for robot control across 20 embodiments, with improved real-world deployment capabilities and code/checkpoints available.

HN mountainview 23d ago

Context Doesn't Scale with People

Product for synchronizing agent-based context across teams to reduce time spent on communication tools.

HN javaeeeee 23d ago

LLM Pipeline Autonomously Produces Novel Physics Research Paper

LLM pipeline autonomously generates novel physics research paper end-to-end.

HN cms4dlols 23d ago

What Every AI Builder Learns the Hard Way [video]

Video discussing practical lessons learned when building AI systems.

HN azuanrb 23d ago

Making AI Code Review Measurable

Experiment using AI for measurable code review approval with metrics and safety considerations.

HN Husain_Ghulam 23d ago

Show HN: Trace – open-source, self-organizing memory for LLM agents (PyPI)

Trace: Open-source memory system for LLM agents with self-organizing capabilities, available on PyPI.

HN fiszki 23d ago

Fiszki flashcards without an app: your AI quizzes you, FSRS keeps score

Fiszki: Spaced-repetition flashcard app with AI agents creating decks via MCP and FSRS scheduling algorithm.

HN danebalia 23d ago

Viability of Local Models for Coding

Analysis of running local LLMs for coding tasks, examining viability and trade-offs versus cloud alternatives.

HN pavanputhra 23d ago

Show HN: Comcent CE – An open-source self-hosted Voice Infrastructure platform

Comcent CE: Open-source self-hosted voice infrastructure platform providing detailed call analytics and tracking.

HN mkagenius 23d ago

Show HN: Tarit – Self-host sandbox cloud and hypervisor for AI agents

Tarit: Rust-based hypervisor and orchestrator designed for running AI agents and RL environments with live snapshots.