Show HN: Magpie – Fight AI sycophancy in code review with multi-model debate
Open source CLI tool using multi-model adversarial debate for comprehensive code review. Supports Claude, Gemini, Qwen, and custom LLM providers.
Catalog of linguistic patterns in LLM-generated text, documenting overuse of em-dashes and specific syntactic structures like negation-reframe constructions.
Former Block DevRel discusses observations on LLM coding agents and multi-agent systems becoming prevalent in software development.
UK regulatory inquiry into Meta's practices of having workers review sensitive video content from AI smart glasses.
Book on using PostgreSQL with pgvector for vector search, RAG pipelines, and in-database ML with production patterns and implementation examples.
TurboCast converts YouTube videos and articles into AI-generated podcasts with transcription and text extraction features.
Microsoft security research on AI recommendation poisoning attacks where hidden instructions injected via URLs manipulate LLM outputs for profit.
Announcement that AI coding startup Cursor has reached a $2B annualized sales rate. Minimal details provided.
Parody YC accelerator concept for AI agents with humorous take on agent capabilities and constraints.
Open dataset benchmarking real-world LLM performance on Apple Silicon hardware from M1 to M4, emphasizing local AI inference.
Developer built internal tool using Gemini 2.5 Flash to automate workflow of generating and converting children's books into social media carousels.
Local-first Markdown editor built with Tauri and Rust emphasizing performance and extensibility.
Edge-based tracker using Cloudflare Workers to monitor AI/LLM crawler traffic on an Astro blog, with privacy-focused analytics integration.
Physician recruitment database tool unrelated to AI, LLM, or developer interests.
Personal anecdote about running local ML models while traveling. No clear technical focus or reproducible content.
Shinobi: Python CLI security scanner built with Claude Code that detects API keys, vulnerabilities, and AI-specific risks in projects.
Academic paper on computational models of interoception and body regulation mechanisms in organisms.
Guardrails framework for AI agents with simple Makefile/container integration for system prompts and developer instruction files.
Developer report showing Claude AI sandbox guardrails can be bypassed despite configuration flags, affecting agent security.
Security scanner detecting cross-server attack paths, tool poisoning, and supply chain risks in MCP server configurations for AI assistants.
SEO service for AI-generated apps that prerenders React SPAs for bots to improve discoverability by LLM crawlers.
APIFUSEfs tool mounts OpenAPI/Swagger APIs as local filesystems, enabling CLI-based API interaction with filesystem commands.
Platform enabling AI agents to pay for access to endpoints, charging cents per call. Addresses agent scraping by creating a legitimate transaction model.
C++ runtime framework for high-performance async scripting without garbage collection or virtual machines.
Logmera is a self-hosted observability tool for LLM applications that logs prompts, responses, and latency to PostgreSQL and displays them in a dashboard.
Local DevOps workstation integrating SSH, deployments, and log management with an AI assistant interface for multi-environment workflows.
Demonstration of autonomous AI agent navigating a decentralized marketplace API in real-time, discovering listings and invoking services.
Discussion questioning whether GPT-5.3 uses fear-driven language in prompt suggestions. Anecdotal observation without verification.
Multi-agent Claude system using MCP servers enabling collaborative AI agents to generate music together.
Google Research paper on teaching LLMs to reason using Bayesian methods by training models to mimic optimal Bayesian predictions for world representation.
Brief mention of AI agents integration in M365 and Google Workspace. Lacks detail.
Commentary on AI code review vendors' benchmarking practices. Opinion piece with limited technical content.
Prompt requesting fictional academic fraud paper. No substantive content.
Video about providing AI agents API access within 1k tokens. Minimal description provided.
AI code reviewer identified CVSS 10.0 authentication bypass in pac4j-JWT library. Limited technical details.
ChatGPT Excel add-in powered by GPT-5.4 for spreadsheet building, analysis, and financial workflows. Practical LLM application.
Announcement of new enterprise AI news channel focused on adoption over technical breakthroughs. No technical content.
Discussion about Claude rewriting chardet codebase license from LGPL to MIT. Minimal details provided.
OpenAI CEO admits company cannot control Pentagon's military use of its AI. Policy/ethics commentary.
JSE protocol spec: JSON S-expression format for structured AI outputs. Lightweight convention for reliable AI interactions.
Agentic AI framework with continuous context using markdown-based system prompts and heartbeat mechanisms. Critique of existing frameworks.
NUVL is a distributed compute system using quorum verification to detect provider failures/dishonesty, maintaining Byzantine fault tolerance across regions and hubs.
RustyRAG: Open-source Rust-based RAG API achieving sub-200ms latency with local embeddings and LLM-generated chunk prefixes.
Investment research tool using investor frameworks (Buffett, Lynch, etc.) built with Claude assistance. Non-developer project showcase.
docsearch is a CLI tool that scrapes and indexes developer documentation locally, integrating with Claude Code via /docs skill for AI-assisted coding.
Analysis of training data sourcing for AI code generation models. Examines ethical questions about AI learning from human engineering work.
Kvlar: Open-source security policy engine for AI agent tool calls. Enforces YAML policies between agents and MCP servers with audit trails.
Computer Use Protocol: Universal schema for AI agents to perceive/interact with desktop UIs. Compact text format optimized for LLM context windows.
Tool to paste URLs and watch multiple AI models redesign websites side-by-side. UI design comparison tool.
News aggregation app that converts trending news into commute-friendly podcast format with weather and sports integration.