Show HN: Forgeterm – Runtime security monitor for AI coding agents
Forgeterm is a runtime security monitor designed specifically for AI coding agents.
Forgeterm is a runtime security monitor designed specifically for AI coding agents.
Scindo: AI agent converting team discussions into code PRs. Agentic workspace managing conversation-to-code workflow.
Chrome extension monitoring employee data sharing with AI tools. Security/compliance tool for agent data protection.
OpenFable: open-source RAG engine implementing FABLE algorithm with tree-structured indexes for retrieval across document hierarchies.
LLM connected to 8-bit game via structured text summaries; maintains notes, develops strategies, discovers AI exploits.
Title only: Yolt tool for recovering deleted/overwritten files in safer LLM YOLO mode.
Open-source MIT-licensed tool that audits AI agents through interview-style questioning with regulatory compliance flags (SOC2, GDPR, EU AI Act).
Developer tool for estimating AI workflow costs accounting for retrieval, retries, tool use, and model choices.
Deep dive into LLM inference engine implementation in C++ explaining why output token generation costs 5x more than input processing.
Open-source MIT-licensed tool that audits AI agents through interview-style questioning with regulatory compliance flags (SOC2, GDPR, EU AI Act).
Research on full-precision training technique for 100B+ parameter LLMs on single GPU.
Opinion piece on AI coding tools. Minimal technical detail provided.
Security vulnerability disclosure about Google Support chat widget leaking call logs and agent data.
News article reporting Anthropic's Mythos model withheld from release due to vulnerability detection capabilities.
Training 163M-parameter GPT-2-style model from scratch; analysis of loss differences vs original GPT-2.
Discussion post about colleagues using LLM-generated responses. Opinion piece without technical content.
Agentic PostgreSQL DBA tool running as Go binary alongside PostgreSQL, executing diagnostic rules and optional LLM-powered fixes with trust-ramped automation.
Essay contrasting LLM capabilities in technical domains versus creative writing, citing Altman and Cowen perspectives.
Developer tool combining local speech-to-text with ripgrep code search, enabling voice-based code navigation for AI coding assistants.
Peking University AI framework autonomously discovered and formally verified solution to open problem in commutative algebra using 19,000 lines of Lean 4 code.
Hacker News sentiment tracker for Claude Code and Codex coding tools, updated daily with community opinions on performance.
Multi-model workflow orchestrator with YAML-defined chains, parallel execution, visual canvas builder, and MCP/REST/CLI interfaces supporting tool-based agents.
Safetensors joins PyTorch Foundation as hosted project to secure model distribution and prevent arbitrary code execution in agentic solutions.
Local prompt injection detection tool for AI agents using MCP and function calling, runs ONNX model offline without API calls.
Announcement of Claude Mythos Preview availability on Google Cloud Vertex AI for select customers as part of Project Glasswing.
Technical post on optimizing norm gathering assembly code for SereneDB search benchmark across x86_64 and ARM architectures.
Browser-based private AI workspace running models via WebGPU with zero server communication, local storage, and no API key requirements.
PostgreSQL MCP server providing schema awareness to AI agents and coding assistants from offline snapshots without production database credentials.
Research note on Mamba-3, a state space model architecture improving on transformers' quadratic attention costs for efficient LLM inference in agentic applications.
A2CN: open protocol for safe agent-to-agent commercial negotiation enabling procurement agents to transact deals.
COBOL-based AI agent chatbot with tool use and agentic loop, integrating with modern LLMs via OpenRouter.
Reading notes on signature method for feature engineering from sequential event data in machine learning.
Self-hosted AI assistant with voice, vision, RAG, and web search running entirely on-device without cloud.
Open-source, self-hosted form backend alternative to SaaS solutions like Formspree with email/webhook delivery.
AI video generation tool converting text and images to videos for content creation platforms.
Automated decensoring tool reduced Google's Gemma 4 refusal rate from 98% to 47% in 24 minutes on laptop.
2019 op-ed critiquing NASA's proposed lunar Gateway space station design.
Open-source infrastructure for building AI agents and internal software with managed database, auth, and RBAC.
SharpSkill is an interview preparation platform for coding assessments covering React, Node.js, and other tech stacks.
Modular data center pods offer faster deployment as alternative to hyperscale projects, relevant for AI infrastructure.
Analysis of Entire.io startup and its Checkpoints open-source CLI tool providing observability layer for AI coding workflows.
Investigation into false benchmark claims in MemPalace, an open-source AI memory project. Exposes fabricated scores and questions attribution to actress Milla Jovovich.
Claude chatbot outage with elevated error rates affecting Sonnet 4.6 model and downstream services.
Python real-time engine with sub-1ms jitter for industrial control, auto-generates REST APIs and MCP for LLM agent integration.
Anthropic provides Mythos model to major tech companies for cybersecurity testing and vulnerability discovery.
Open-source framework for AI SRE agents that integrate 40+ infrastructure tools to autonomously investigate and resolve production incidents.
Video presentation on GitOps relevance and practices in systems managed by AI agents, from FluxCon conference.
Yu is a sandboxing tool that isolates Claude Code and Codex execution to prevent credential exposure from compromised code or dependencies.
GLM-5.1 is a 754B parameter open-source LLM that demonstrates improved reasoning and multi-modal capabilities like unprompted SVG+CSS generation.
Analysis of cognitive load and limitations when managing multiple parallel AI agents, focusing on human-in-the-loop costs beyond throughput metrics.