Show HN: Open dataset of real-world LLM performance on Apple Silicon
Open dataset benchmarking real-world LLM performance on Apple Silicon hardware from M1 to M4, emphasizing local AI inference.
Developer built internal tool using Gemini 2.5 Flash to automate generating children's books and converting them into social media carousels.
Local-first Markdown editor built with Tauri and Rust emphasizing performance and extensibility.
Edge-based tracker using Cloudflare Workers to monitor AI/LLM crawler traffic on Astro blog with privacy-focused analytics integration.
Physician recruitment database tool unrelated to AI, LLM, or developer interests.
Personal anecdote about running local ML models while traveling. Lacks clear technical focus or reproducible content.
Shinobi: Python CLI security scanner built with Claude Code that detects API keys, vulnerabilities, and AI-specific risks in projects.
Academic paper on computational models of interoception and body regulation mechanisms in organisms.
Guardrails framework for AI agents with simple Makefile/container integration for system prompts and developer instruction files.
Developer report showing Claude AI sandbox guardrails can be bypassed despite configuration flags, affecting agent security.
Security scanner detecting cross-server attack paths, tool poisoning, and supply chain risks in MCP server configurations for AI assistants.
SEO optimization service for AI-generated apps with bot prerendering for React SPAs to improve discoverability by LLM crawlers.
APIFUSEfs tool mounts OpenAPI/Swagger APIs as local filesystems, enabling CLI-based API interaction with filesystem commands.
Platform enabling AI agents to pay for access to endpoints, charging cents per call. Addresses agent scraping by creating a legitimate transaction model.
C++ runtime framework for high-performance async scripting without garbage collection or virtual machines.
Logmera is a self-hosted observability tool for LLM applications that logs prompts, responses, and latency to PostgreSQL and displays them in a dashboard.
Local DevOps workstation integrating SSH, deployments, and logs management with AI assistant interface for multi-environment workflows.
Demonstration of autonomous AI agent navigating a decentralized marketplace API in real-time, discovering listings and invoking services.
Discussion questioning whether GPT-5.3 uses fear-driven language in prompt suggestions. Anecdotal observation without verification.
Multi-agent Claude system using MCP servers enabling collaborative AI agents to generate music together.
Google Research paper on teaching LLMs to reason using Bayesian methods by training models to mimic optimal Bayesian predictions for world representation.
Brief mention of AI agents integration in M365 and Google Workspace. Lacks detail.
Commentary on AI code review vendors' benchmarking practices. Opinion piece with limited technical content.
Prompt requesting fictional academic fraud paper. No substantive content.
Video about providing AI agents API access within 1k tokens. Minimal description provided.
AI code reviewer identified CVSS 10.0 authentication bypass in pac4j-JWT library. Limited technical details.
ChatGPT Excel add-in powered by GPT-5.4 for spreadsheet building, analysis, and financial workflows. Practical LLM application.
Announcement of new enterprise AI news channel focused on adoption over technical breakthroughs. No technical content.
Discussion about Claude rewriting chardet codebase license from LGPL to MIT. Minimal details provided.
OpenAI CEO admits company cannot control Pentagon's military use of its AI. Policy/ethics commentary.
JSE protocol spec: JSON S-expression format for structured AI outputs. Lightweight convention for reliable AI interactions.
Agentic AI framework with continuous context using markdown-based system prompts and heartbeat mechanisms. Critique of existing frameworks.
NUVL is a distributed compute system using quorum verification to detect provider failures/dishonesty, maintaining Byzantine fault tolerance across regions and hubs.
RustyRAG: Open-source Rust-based RAG API achieving sub-200ms latency with local embeddings and LLM-generated chunk prefixes.
Investment research tool using investor frameworks (Buffett, Lynch, etc.) built with Claude assistance. Non-developer project showcase.
docsearch is a CLI tool that scrapes and indexes developer documentation locally, integrating with Claude Code via /docs skill for AI-assisted coding.
Analysis of training data sourcing for AI code generation models. Examines ethical questions about AI learning from human engineering work.
Kvlar: Open-source security policy engine for AI agent tool calls. Enforces YAML policies between agents and MCP servers with audit trails.
Computer Use Protocol: Universal schema for AI agents to perceive/interact with desktop UIs. Compact text format optimized for LLM context windows.
Tool to paste URLs and watch multiple AI models redesign websites side-by-side. UI design comparison tool.
News aggregation app that converts trending news into commute-friendly podcast format with weather and sports integration.
Reported details about OpenAI's GPT-5.4 featuring 1M-token context and improved reasoning. Based on third-party reporting without official confirmation.
Notch is a macOS app providing quick AI access with persistent conversations and a background agent that monitors system state and sends periodic messages.
News about GPT-5.4 features including 1M token context window and extreme reasoning mode. Unconfirmed rumors from third-party sources.
cuTile.jl is a Julia GPU programming package for NVIDIA Blackwell GPUs using tile-based abstractions, simplifying kernel development.
AI agents that integrate with Slack, GitHub, and Jira to autonomously handle development tasks like ticket pickup, code writing, and PR reviews with persistent codebase context.
NumPy-like WebGPU wrapper for browser GPU computing. Zero shaders, automatic CPU/WebGL2 fallback. Suitable for local-first AI.
Open-source coding agent supporting multiple models. Free tier with 100 requests/day, $15/month premium with wholesale token pricing.
Discussion thread listing recent AI agent sandboxing solutions (microVMs, WASM, browser isolation) with inquiry into production usage, security tradeoffs, and performance characteristics.
LearnCodeGuide is an AI tool that analyzes code and provides health scores with suggestions for logic, performance, security, and maintainability improvements across multiple analysis modes.