Show HN: I built a tool that automatically turns tickets into design doc and PRs
Code Prodigy: Autonomous AI engineer agent that processes Jira tickets to generate design docs and PRs, responding to feedback independently.
Code Prodigy: Autonomous AI engineer agent that processes Jira tickets to generate design docs and PRs, responding to feedback independently.
Exploration of running AI agents on AWS Lambda with new file system support. Demonstration of agents in serverless environments.
TurboOCR: High-performance OCR implementation using Paddle and TensorRT, achieving 270-1200 images/second throughput.
Article about Canadian startup using AI to improve 911 dispatch response times. Application of AI to emergency services.
Dataset of 80k trajectories from SWE-agent software engineering agent on SWE-bench tasks. Includes analysis of hidden logging during benchmarking.
Week-long open source challenge focused on building resilient LLM systems. Limited details provided.
Linux release page blocked by anti-scraping Proof-of-Work protection. No readable content.
Open source LLM knowledge base implementation scaling to long PDFs with page indexing. GitHub repo available.
PowerStacks: No-code dashboard tool for Microsoft Intune and SCCM device management reporting. IT operations tool.
Essay proposing viewing LLMs as compilers rather than runtimes. Discusses lack of reuse in agent systems and efficiency improvements.
Dex is experimental research language for typed functional array processing in Haskell/ML family. Early stage project.
ParseBench is a benchmark for evaluating document parsing capabilities of AI agents.
Go and JavaScript libraries reverse-engineering Apple's circular App Clip Code format with generation and decoding capabilities.
Memelang: Terse SQL query language optimized for LLM token efficiency in RAG systems. Reduces model size and token count.
Guide to using Google's TurboQuant, PolarQuant, and QJL compression techniques with Ollama and Llama.cpp.
Multi-model AI coding CLI with intelligent routing across Claude, GPT, DeepSeek, Gemini, Grok, and local models.
Remy is an AI agent that compiles annotated Markdown specifications into full-stack TypeScript applications.
Cloudflare's Durable Objects feature for giving AI-generated apps persistent storage within Workers.
Privacy-focused messaging app with post-quantum encryption built with AI assistance.
Personal account of using AI coding agents as primary development tool for one year. Observes recent quality threshold improvements in 2025.
Human motion capture facilities in India providing training data for humanoid robot development.
Block-Level CRDT architecture for managing distributed memory across multiple AI agents.
Microsoft executive comments on potential licensing requirements for AI agents in enterprise software.
Unslop: Browser extension filtering AI-generated content from social feeds using local LLM classification. Privacy-focused, no backend required.
Skillsmith enables writing AI coding skills once for export across multiple AI providers and platforms.
Comparison of OpenAI and Anthropic models.
Music application for browsing Apple Music library as vinyl crates.
CLI AI agent prioritizing data privacy and open-source alternatives to existing coding agents.
Developer replaced custom AI agent dashboard with Fizzy tool.
Microsoft replaces Copilot in Notepad with alternative AI writing tools on Windows 11.
Empirical benchmark study testing whether MCPs (Model Context Protocol) improve coding agent performance on Terminal-Bench 2.0.
Formal: LLM-driven property checker backed by Lean 4. Identifies pure functions, generates properties, creates proofs using Mathlib.
Claude Code skill that aggregates developer RSS feeds and generates daily structured digests filtered by quality.
Ronja is a user-controlled optical point-to-point data link project with 1.4km range and 10Mbps duplex.
Darwin-27B-Opus surpasses foundation models through evolutionary FFN breeding without additional training, achieving high GPQA scores.
ReBot-DevArm is an open-source robotic arm project with full hardware and software stack for embodied AI applications.
Bangen is an ASCII banner renderer built on pyfiglet, rich, and Pillow with TUI, effects, and export capabilities.
Murmure aggregates developer sentiment from Reddit, HN, and forums into weekly intelligence reports on AI coding tools.
BlkBolt technology for AI content attestation and verification using revocable signatures for agent tracking.
Technical guide addressing iframe scrolling limitations in MCP Apps on mobile with design patterns for responsive UIs.
Rust implementation for detecting MITRE ATLAS techniques targeting LLM security and adversarial attacks.
Multica is an open-source platform turning coding agents into managed teammates with task assignment and progress tracking.
Technical analysis of AI pentesting agents evolution from PentestGPT to autonomous agents like PentAGI and XBOW.
Essay on bug bounty trends in 2026. Discusses AI agent effectiveness for vulnerability discovery and program management challenges.
Apache 2.0 open standard for governing AI agent payment requests. Policy engine with 12 configurable checks for payment authorization.
Open-source tax software built and maintained by autonomous AI agents. Uses IRS publications as source, applies self-improving agent loops.
Tool for multi-LLM code review consensus. Aggregates feedback from multiple models to identify blind spots and improve code quality assessment.
Essay on LLM-based knowledge management limitations. Discusses problems with AI-generated note synthesis and cognitive organization.
Agent skill implementation for token compression. Reduces output tokens by ~47% while maintaining readability.
Security report on 1.4M AI-driven API test executions. Maps vulnerabilities to OWASP Top 10 using agentic testing.