Isolater - Feed

Ax Ravi Raju, Mengmeng Ji, Shubhangi Upasani, Bo Li, Urmish Thakker 3/9/2026

The Limits of Long-Context Reasoning in Automated Bug Fixing

Evaluates whether LLMs can reliably perform long-context code debugging and patch generation, testing limits of agentic workflows on software engineering tasks.

Ax Ihor Kendiukhov 3/9/2026

What Topological and Geometric Structure Do Biological Foundation Models Learn? Evidence from 141 Hypotheses

Investigates geometric and topological structures learned by biological foundation models like scGPT using autonomous hypothesis screening with AI-driven workflows.

Ax Jayadev Billa 3/9/2026

Modality Collapse as Mismatched Decoding: Information-Theoretic Limits of Multimodal LLMs

Information-theoretic analysis of multimodal LLM failure modes. Frames modality collapse as a mismatched decoding problem and explains a 98% information loss.

Ax Niloofar Jazaeri, Hilmi R. Dajani, Marco Janeczek, Martin Bouchard 3/9/2026

LMU-Based Sequential Learning and Posterior Ensemble Fusion for Cross-Domain Infant Cry Classification

Infant cry classification using Legendre Memory Units and multi-branch CNN. Healthcare monitoring application with limited domain relevance.

Ax Mandip Goswami 3/9/2026

Whisper-RIR-Mega: A Paired Clean-Reverberant Speech Benchmark for ASR Robustness to Room Acoustics

Benchmark dataset for evaluating speech recognition robustness to room acoustics. Paired clean/reverberant speech utterances with acoustic metrics.

Ax Sanyam Singh, Naga Ganesh, Vineet Singh, Lakshmi Pedapudi, Ritesh Kumar, SSP Jyothi, Archana Karanam, Waseem Pasha, Ekta Kumari, C. Yashoda, Mettu Vijaya Rekha Reddy, Shesha Phani Debbesa, Chandan Dash 3/9/2026

Fine-Tuning and Evaluating Conversational AI for Agricultural Advisory

Fine-tuning conversational LLMs for agricultural advisory with domain-specific improvements. Addresses recommendation accuracy and farmer communication alignment.

Ax Ashwath Vaithinathan Aravindan, Mayank Kejriwal 3/9/2026

Fragile Thoughts: How Large Language Models Handle Chain-of-Thought Perturbations

Empirical study evaluating LLM robustness to chain-of-thought reasoning perturbations across five error types. Assesses reasoning reliability under corruption.

Ax Saad Qadeer, Panos Stinis 3/9/2026

Improving the accuracy of physics-informed neural networks via last-layer retraining

Research on improving the accuracy of physics-informed neural networks through post-processing retraining. Domain-specific ML application for solving PDEs.

Ax Anatoly Belikov, Ilya Fedotov 3/9/2026

Good-Enough LLM Obfuscation (GELO)

arXiv paper proposing an obfuscation method to protect LLM prompt privacy on shared accelerators. Addresses KV cache security against adversarial memory access.

HN kentf 3/9/2026

OpenAI's Symphony: Agent Management Layer

OpenAI's Symphony orchestrates autonomous coding agents for project work, monitoring task boards and managing PR delivery with proof-of-work artifacts.

HN mcastilho 3/9/2026

Zero Lines Written by a Human but 750 Pull Requests Later

Engineer completed production app with 750+ PRs across 4 languages in 45 days using only AI code generation, no human-written code.

HN optinghost 3/9/2026

Show HN: Upvoicy – built feedback management SaaS for sale

Feedback management SaaS tool for collecting and organizing user feedback. Business software, not AI-related.

HN wisdomcrane 3/9/2026

Show HN: Tilnote – An AI note workspace from rough ideas to publishable content

Tilnote is an AI note workspace that uses an agent to turn keywords and rough ideas into publishable content, with a web clipper and writing assistance.

HN Bspinky 3/9/2026

Show HN: Generate App Store screenshots by matching any top app's style

Tool that generates App Store screenshots matching reference app styles, with Claude/ChatGPT API integration. LLM-adjacent but design-focused.

HN mercat 3/9/2026

CLI tool for deterministic linting of LLM output

Vale is an open-source CLI linting tool for editorial style guides; it runs offline and integrates with VS Code and GitHub. Tangentially useful for LLM output processing.

HN piotrbednarsalt 3/9/2026

How to manipulate running LLM outputs via GGUF page cache poisoning

Proof-of-concept exploit demonstrating persistent manipulation of LLM outputs via GGUF page cache poisoning in running inference servers.

HN ronbenton 3/9/2026

Ask HN: Are we going to see more job postings asking for only agentic coding?

Discussion of job market shift toward agentic coding workflows. Zapier job posting requires experience directing AI agents, handling failures, and multi-agent patterns.

HN ozten 3/9/2026

Show HN: Cantrip – Agent-native GTM engine I built for solo technical founders

AI-powered GTM engine for solo founders. The user describes a product and the tool generates a customer acquisition strategy for reaching the first 100 users.

HN yash_chudasama 3/9/2026

Show HN: Ajen – Open-source platform where AI employees build your startup

Open-source platform where AI agents (CEO, CTO, CMO) collaborate to plan and build startups based on descriptions. Early-stage project seeking feedback.

HN mcastilho 3/9/2026

Show HN: Running multiple Claude Code agents in parallel Git worktrees (ChatML)

Tool for running multiple Claude Code agents in parallel using Git worktrees to avoid filesystem conflicts, enabling concurrent AI-assisted development workflows.

HN levelsofself 3/9/2026

Show HN: Nervous System v1.9 – Governance for Multi-Agent AI (MCP)

Nervous System governance framework enforces 7 rules to prevent multi-agent AI failures; battle-tested on a 13-agent system with zero bypasses across 58+ violations.

HN kaspern 3/9/2026

Show HN: Own your AI's context and memories across every model and device

Personal memory system using knowledge graph, pgvector, and MCP server to share context across multiple LLM providers and devices.

HN goranmoomin 3/9/2026

Your binary is no longer safe: LLM-assisted Decompilation

LLM-assisted decompilation technique for reverse-engineering binary programs, automating binary-to-source code conversion.

HN safteylayer 3/9/2026

I ran the same AI security test 4 times – 75% found critical bypasses

Mutation testing engine reveals GPT-4 prompt injection vulnerabilities, finding different critical bypasses in 75% of runs despite identical inputs.

HN ronbenton 3/9/2026

IRS tax withholding estimator has been open sourced

IRS open-sourced tax withholding estimator tool for Form W-4 calculations. Government software, not AI-related.

HN carnevalem 3/9/2026

What if you never had to get an API key ever again?

Val Town platform founder discusses eliminating API key friction in developer workflows. Relevant to the developer experience of building agent/LLM apps.

HN amunchkin 3/9/2026

Show HN: Spotr – Client-side fuzzy search for collections in TypeScript

TypeScript fuzzy search library for client-side collection searching with configurable scoring. Developer tool but not AI-specific.

HN mguardai 3/9/2026

Show HN: Mguard – First defense against MINJA memory poisoning attacks

Security library defending against memory poisoning attacks (MINJA, AgentPoison, MemoryGraft) on AI agents. Drop-in protection for Mem0, LangChain, custom systems.

HN jonas_kgomo 3/9/2026

Show HN: A community feed for founders to share, update and receive feedback

Founder networking platform for early-stage projects. Mentions only generic AI tools.

HN JackArnot 3/9/2026

Show HN: Volt HQ – MCP server comparing AI inference pricing across providers

MCP server for comparing AI inference pricing across providers with budget alerts and optimization recommendations.

HN munnam77 3/9/2026

Show HN: Security toolkit for OpenClaw – scanner, hardened configs, guides

Security toolkit for OpenClaw personal AI assistant including scanner, hardened configs, and vulnerability guides. Addresses exposed instances.

HN umangsehgal93 3/9/2026

Agency: Specialized Expert Agents with Personality

Collection of specialized AI agent personalities with distinct expertise, processes, and deliverables for various tasks.

HN slopinthebag 3/8/2026

The Dangerous Illusion of AI Coding [video]

Video title about AI coding concerns. No content provided.

HN devonnull 3/8/2026

AI allows hackers to identify anonymous social media accounts, study finds

Research on using LLMs to de-anonymize social media accounts and link identities across platforms.

HN prophet94 3/8/2026

Show HN: Self-hosted financial analyst – Plaid and Claude and Next.js, –$5/month

Self-hosted personal finance app integrating Plaid, Claude API, and Next.js for AI-powered investment analysis.

HN walterbell 3/8/2026

State-of-the-Art Prompting for AI Agents (2025)

Summary of prompt engineering techniques from YC founders for building AI agents.

HN noobernetes 3/8/2026

IPFS OCI Registry – now with federation policy and private swarm support

Decentralized container registry powered by IPFS with federation and private swarm support. Kubernetes-compatible.

HN CGMthrowaway 3/8/2026

WiFi-DensePose – open-source software that sees you through walls using wifi

Open-source software that uses WiFi signals to sense people and objects through walls without cameras.

HN todsacerdoti 3/8/2026

AI Assistants Are Moving the Security Goalposts

Analysis of security implications and risks introduced by autonomous AI agents with computer access.

HN ibrahimwithi 3/8/2026

Show HN: Wa-agent – Framework for building AI agents on WhatsApp

Node.js framework for autonomous AI agents on WhatsApp using YAML config, multi-step tool use, and multiple model providers.

HN mattiagaggi 3/8/2026

Claude Custom Chat – customize your Claude Code extension

VS Code/Cursor extension providing custom chat interface for Claude Code CLI. Self-modifying extension with rollback capability.

HN binwen 3/8/2026

Oly – Run AI agents, close your terminal, intervene when it needed from anywhere

Session-persistent PTY daemon for long-running CLI AI agents with intervention capabilities from anywhere.

HN paulpauper 3/8/2026

Chamath Palihapitiya Says AI Costs at Startup 8090 Could Hit $10M

Business news on startup AI infrastructure costs rising. No technical details.

HN kitasan 3/8/2026

Show HN: OxiMedia – Pure Rust Reconstruction of FFmpeg and OpenCV

Pure Rust reconstruction of FFmpeg and OpenCV. 92 crates, 1.36M LOC, forbids unsafe code, patent-free codecs, async architecture.

HN jeeybee 3/8/2026

Show HN: GYML – YAML syntax, JSON semantics, zero runtime dependencies

Strict YAML subset with JSON type semantics and zero runtime dependencies. Reduces YAML's complexity for config files.

HN zdw 3/8/2026

LLM-eliza – LLM plugin providing access to the ELIZA language model

Joke plugin for LLM tool providing access to ELIZA chatbot from 1966. Satire/novelty.

HN oldschoolai 3/8/2026

Show HN: Engram — a brain-inspired context database for AI agents

Engram: persistent context database for AI agents and LLMs that manages memory like human cognition to prevent context collapse and agent coordination issues.

HN SLHamlet 3/8/2026

Did AI Misidentify the Minab School?

Discussion about AI misidentifying a school in a photograph. Minimal technical content.

HN treetalker 3/8/2026

Agents of Chaos

Opinion article critiquing cynical AI applications in industry. Commentary without technical analysis.

HN hevalon 3/8/2026

Threat-Modeling the OWASP Top 10 for LLM Applications

Security threat modeling and case studies of LLM application vulnerabilities including data exfiltration and prompt injection.