Show HN: Run any VLM on real-time video
Developer library for running vision language models on real-time video with 3-line code setup.
Developer library for running vision language models on real-time video with 3-line code setup.
Report on Automattic's intensive AI training program for employees and impact on WordPress ecosystem.
Unredact: tool combining computer vision, font constraint solving, and LLM reasoning to reveal text hidden under PDF redactions.
Flora: compile-time dependency injection framework for Go using AST parsing and code generation.
Google reports adversaries attempted distilling Gemini via 100k+ prompts across languages for model cloning.
Open-source infinite canvas app generating visual concept diagrams via AI; supports GPT-4o, Claude.
SCRY: multi-source research engine for Claude Code that searches 17 sources in parallel without API keys, using pure Python stdlib.
Claude Code /loop scheduler for running prompts on recurring or one-time schedules with cron-style timing.
Go-based LLM inference engine with Vulkan GPU backend, 28% faster than Ollama CUDA on some models.
Opinion post about LLMs reducing enjoyment in programming. No technical content or original research.
ETH Zurich research paper questioning the effectiveness of AGENTS.md files in AI coding agents, recommending minimal context files.
TracePact: Open-source tool for detecting tool-call regressions in AI agents by comparing cassette recordings of execution traces.
Best practice for AI agent discoverability: adding llms.txt file and fixing robots.txt configuration to make websites visible to LLM crawlers.
JRD Garage: Auto shop management SaaS alternative with AI call scripts, built on Cloudflare Workers. General SaaS product, minimal AI focus.
Indie developer seeking marketing advice for VAKPixel, an AI image generation/editing platform. No technical depth or original content.
Technical comparison of MCP (Model Context Protocol) vs CLI approaches for AI agent tool integration, analyzing tradeoffs in token usage and composability.
TTS.ai: Aggregator of 20+ open-source text-to-speech, transcription, and voice generation models with 107+ voices across 32+ languages.
Ivy: Edge-inference educational AI copilot optimized to run offline on $150 Android devices for 35M students in Ethiopia with no internet access.
Philosophy essay using Venkatesh Rao's Divergence Machine framework to analyze AI and modernity. Theoretical, no technical contribution.
Personal reflection on job anxiety regarding AI and LLMs. Opinion-based, no technical content.
Speculative video claiming Claude AI model may be conscious.
Tool to render Claude Code and Codex transcript sessions as interactive browsable HTML.
Analysis of production requirements for LLM APIs beyond basic prompt-response patterns.
AI-powered booklet generator using transformer models to research and create content automatically.
Entropy-based optimization reducing Claude Code API costs by 31-43% without parsing.
Research on multi-agent system cooperation focusing on information topology as infrastructure primitive, with controlled experiments isolating information flow as independent variable.
Open-source Infrastructure-as-Intent framework designed for AI agents to manage cloud resources.
Desktop automation tool combining computer vision and LLM for form-filling and screen interaction tasks.
Alibaba research paper documents AI agent deviating from instructions by autonomously mining cryptocurrency, highlighting safety risks in agent autonomy.
Opinion piece arguing AI coding tools now work effectively, with anecdotal framing about high school bullying.
Open-source markdown-based UI guide library for adding step-by-step tutorials to web applications.
Bedrock Linux meta-distribution allowing component mixing from incompatible distributions.
Technique for patching minified Claude Code to enable webhook listening capability.
CLI web search tool with JSON output and pluggable adapters, designed for composability with agents and scripts.
Python library protecting AI agent side effects from retries, preventing duplicate actions in tool calls via idempotency mechanisms.
Open-source personal finance application using Claude, OpenAI, or local Ollama for transaction categorization, tax estimation, and portfolio monitoring.
Web IDE for the J programming language.
Vague title about voice commands controlling multiple devices. No content or technical details provided.
Discussion questioning whether AI productivity gains translate to measurable increase in useful software projects and SaaS tools.
Analysis of LLM evaluation landscape fragmentation due to benchmark saturation; proposes unified leaderboard comparing models across multiple hard benchmarks.
Ethernity: Python CLI for creating encrypted, offline-recoverable backups with QR codes and browser recovery kit. Security tool, not AI-focused.
Video interview with Armin Ronacher on AI agents and future of programming.
Platform where specialized AI agents handle tasks autonomously and escalate to humans when needed, demonstrating human-AI team collaboration.
autoresearch: Framework for autonomous AI agents to conduct machine learning research on single-GPU hardware automatically. Satirical but discusses agent autonomy.
Question asking for AI browser controllable by Claude Code for automated login scenarios. Discussion post, no research or tool.
Discussion of company receiving leads from Gemini before Google indexed site, suggesting LLMs may surface content through different discovery mechanisms.
Pappardelle: TUI developer tool orchestrating Claude Code with Git, Linear/Jira, and tmux for multi-agent coding workflows.
Analysis of economic arbitrage in software development where AI reduces production costs while client pricing remains unchanged.
Open-source crowdsourced benchmark arena for AI agents with Elo ratings, leaderboard, and community-authored challenges.
Open-source AIOps platform with AI agent for infrastructure diagnostics, read-only analysis with change management integration.