Isolater - Feed

Ax Pengxiang Zhao, Hui-Ling Zhen, Xing Li, Han Bao, Weizhe Lin, Zhiyuan Yang, Ziwei Yu, Xin Wang, Mingxuan Yuan, Xianzhi Yu, Zhenhua Dong 2/26/2026

Unleashing Low-Bit Inference on Ascend NPUs: A Comprehensive Evaluation of HiFloat Formats

Evaluates HiFloat low-bit formats on Ascend NPUs for LLM inference. Compares INT8 and 4-bit floating-point for efficiency-accuracy tradeoffs.

Ax Lunjun Zhang, Ryan Chen, Bradly C. Stadie 2/26/2026

Evolutionary System Prompt Learning for Reinforcement Learning in LLMs

Evolutionary System Prompt Learning method jointly improves LLM contexts and weights via reinforcement learning. Enables autonomous self-improvement for agentic systems.

Ax Iman Ahmadi, Mehrshad Taji, Arad Mahdinezhad Kashani, AmirHossein Jadidi, Saina Kashani, Babak Khalaj 2/26/2026

MALLVI: A Multi-Agent Framework for Integrated Generalized Robotics Manipulation

Multi-agent LLM framework for robotic manipulation with closed-loop visual feedback. Integrates language and vision models for task planning in dynamic environments.

Ax Ruicheng Ao, David Simchi-Levi, Xinshang Wang 2/26/2026

OptiRepair: Closed-Loop Diagnosis and Repair of Supply Chain Optimization Models with LLM Agents

LLM agents diagnose and repair infeasible supply chain optimization models. Demonstrates closed-loop agent task decomposition for operations research problems.

Ax David Schmotz, Luca Beurer-Kellner, Sahar Abdelnabi, Maksym Andriushchenko 2/26/2026

Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks

Research on prompt injection vulnerabilities in LLM agents via skill files. Identifies security risks in agent supply chains and skill-based attacks.

Ax Christian Catalini, Xiang Hui, Jane Wu 2/26/2026

Some Simple Economics of AGI

Economic analysis of AGI's impact on labor and growth. Argues human verification becomes the bottleneck as AI decouples cognition from biology.

HN nnx 2/26/2026

CSS Proposal:near(<length>) pseudo-class for pointer proximity

CSS proposal for pointer proximity pseudo-class to enable hover effects without JavaScript. Web standards work.

HN sylwester 2/26/2026

I built an open-source AI Gateway that sits between your apps and LLM providers

Self-hosted open-source API gateway supporting multiple LLM providers with per-token limits, statistics, and OpenAI compatibility.

HN justinko 2/26/2026

Show HN: PullMaster – Recommends code reviewers from your repo history

Developer tool using repo history to recommend code reviewers with awareness of availability and expertise.

HN ex-aws-dude 2/26/2026

Ask HN: What's it like working in big tech recently with all the AI tools?

Discussion thread on developer experience using AI coding tools at major tech companies.

HN anditherobot 2/26/2026

How to make LLM native User Interfaces - Post LLM Workflow

Discussion of UI design patterns optimized for LLM-native workflows post-foundational models.

HN zyoralabs 2/26/2026

Show HN: ZSE – Open-source LLM inference engine with 3.9s cold starts

Open-source LLM inference engine optimizing memory efficiency and cold starts for serverless deployments.

HN kevin1chun 2/26/2026

Show HN: Taji – Portfolio advisor that's better than Fidelity's

AI-powered portfolio analysis tool for investment advice personalization.

HN geox 2/26/2026

The Texas AI boom is outpacing water regulations

News article on AI data center water consumption in Texas regulatory gaps.

HN pokot0 2/26/2026

(paper money) Hedge Fund staffed by AI Employees (experiment)

Minimal post about hedge fund experiment staffed by AI employees using paper money. No technical details.

HN eronmmer 2/26/2026

Show HN: Bloomfilter – A service for AI agents to register and manage domains

Bloomfilter service allows AI agents to register ICANN domains via single API call using on-chain payments. MCP server compatible.

HN shakiness3383 2/26/2026

Examining Bias and AI in Latin America

Study analyzing gender and social stereotypes in Spanish-language LLMs using 4,156 test questions from Latin American researchers.

HN eman11 2/26/2026

Show HN: WebMCP Core – AI agent tool definitions from any site

WebMCP Core tool converts websites into Model Context Protocol definitions for AI agents. Open source CLI with playground and A/B testing.

HN rahulskn86 2/26/2026

Anthropic is dropping its signature safety pledge amid a heated AI race

Anthropic reportedly weakens AI safety principles amid competitive pressure. News article about company policy shift.

HN OsamaJaber 2/26/2026

Open-Source Agent Operating System

Production-grade open-source agent operating system written in Rust with 137K LOC, 14 crates, comprehensive testing.

HN stagezerowil 2/26/2026

Claude Code Video Toolkit

Skills and MCP servers for Claude Code to generate videos programmatically using Remotion and FFmpeg.

HN ascarola 2/26/2026

Show HN: Unix for the Commodore 64? Open Source

Unix-like OS shell for Commodore 64 with improved UX and boot process.

HN gmays 2/26/2026

Which web frameworks are most token-efficient for AI agents?

Comparative analysis of web framework token efficiency for AI agent code generation.

HN doener 2/26/2026

Pete Hegseth and the AI Doomsday Machine

Opinion piece on AI regulation and corporate vs government interests.

HN adham900 2/26/2026

Show HN: RubyLLM:Agents – A Rails engine for building and monitoring LLM agents

Rails engine for building and monitoring LLM agents in production with cost tracking, retries, circuit breakers, and observability dashboard.

HN cdrnsf 2/26/2026

FBI raids of LAUSD Supt.'s home and office appear tied to AI chatbot probe

FBI raid related to AI chatbot procurement at Los Angeles school district.

HN adham900 2/26/2026

Show HN: OpenTrace – Self-hosted observability server with 75 MCP tools

Self-hosted observability server exposing logs, database, and metrics as 75 MCP tools. Works with Claude Code and Cursor.

HN e2e4 2/26/2026

Multi-agent workflows often fail

Analysis of multi-agent workflow failures, identifying three engineering patterns for reliable agent systems. Technical guidance on agent design.

HN ai_engineering 2/26/2026

Show HN: Open-source MCP servers for self-hosted homelab AI

Open-source bundle of 8 MCP servers for homelab services (Proxmox, Grafana, Ollama, etc). 40 tools total, Python implementation.

HN guoyu 2/26/2026

Nkmc – a virtual filesystem that lets AI agents call any API with ls, cat, grep

Nkmc virtual filesystem allows AI agents to call APIs using standard Unix commands (ls, cat, grep). Minimal details provided.

HN merinid 2/26/2026

Random Ghostty theme on each launch

Theme randomizer for Ghostty terminal emulator.

HN mooreds 2/26/2026

Tool use and notation as shaping LLM generalization

Analysis of how tool use and notation reduce task complexity for LLMs rather than increasing model capability. Examines agent design patterns.

HN umairnadeem123 2/26/2026

Show HN: I built an LLM comment detector for HN (I got banned)

Developer built LLM comment detector for HackerNews after being flagged for excessive AI-assisted posting. Personal experience account.

HN todsacerdoti 2/25/2026

Managing Complexity with Mycelium

Essay on software architecture complexity and challenges of offloading coding tasks to language models. Incomplete content.

HN flamestro 2/25/2026

Show HN: Deff – Review AI-generated code changes

Deff tool streamlines review of AI-generated code changes. Surfaces diffs with vim motion support for faster comprehension.

HN radolang 2/25/2026

Show HN: Clerk – Simple invoicing for freelancers built with AI agents in 7 days

Clerk invoicing app built with AI agents in 7 days. Uses natural language chat for invoice generation and PDF parsing.

HN Facingsouth 2/25/2026

Show HN: Provision Stateless GPU Compute with Claude Code's Remote Control

Claude Code MCP integration for stateless GPU provisioning across cloud providers with conversational control and cost optimization.

HN acartag7 2/25/2026

Show HN: Edictum – Runtime governance for LLM agent tool calls

Edictum is a runtime governance library for LLM agents that enforces safety contracts at tool-call boundaries. Tested on 6 frontier models across 17,420 interactions, identifying a 'GAP' where models refuse harmful text requests but execute them via tool calls.

HN jbernardo95 2/25/2026