OAuth Is Broken for AI Agents
Analysis of OAuth protocol limitations for AI agent authentication and external service access.
Analysis of OAuth protocol limitations for AI agent authentication and external service access.
Team collaboration tool syncing AI coding agent conversations to shared storage, linking to git commits/PRs, with searchable history and usage analytics.
WebAssembly-based backtesting platform for trading strategies, built to work with AI coding agents generating strategy code.
8-phase pipeline framework for deploying AI agents from problem definition through production release.
Open-source orchestration layer running multiple AI coding agents in parallel git worktrees with SQLite tracking, web dashboard, and search across sessions.
Article on military applications of AI in warfare.
Deterministic browser control system for AI agents achieving ~90% accuracy on Mind2Web benchmark.
Opinion article on AI alignment implications in military contexts. Lacks technical depth.
Tool detecting and preventing behavioral anti-patterns in AI coding agents. Improves agent reliability.
AI skills and tools kit for animation expertise in Claude and Cursor. Includes CSS generation and performance auditing.
Opinion on AI alignment as skill vs. state. Author's background in interpretability. Lacks technical depth or novel research.
AI-native RPG with structured agent outputs controlling game state, music, NPCs. Uses Flux for image generation. Shows LLM agent integration.
Testing Strix autonomous AI tool for web penetration testing against GPT-5.4 on Hack The Box challenges.
Tool claiming to remove safety guardrails from open-weight LLMs. Minimal content provided.
MCP server enabling AI agents to conduct real user interviews and extract structured insights from conversations.
Research on using LLMs to bootstrap fuzzers for low-resource language compilers. arXiv metadata page with limited content details.
Article about AI agent failure in financial transaction handling. Minimal content provided.
Self-hosted AI agent framework that generates web applications automatically.
Video on maximizing utility from AI/ML models. Minimal content provided.
Security article on process-level secret exposure. Minimal content provided.
AI coding agent successfully built a Python 3.14 interpreter in Rust independently in 30 days, with native and WASM versions available.
Reinforcement learning agent using PPO algorithm to optimize elevator dispatching, achieving 84% wait time reduction vs. classical methods.
Research on identity and security challenges in AI agent automation, addressing credential management for agentic systems.
Production-ready LLM inference load balancer with OpenAI-compatible API, supporting Ollama and vLLM backends.
CodeRabbit claims top performance on independent AI code review benchmark. Offers 50% reduction in code review time and bugs.
Open-source Claude Code skill linter that auto-fixes 92 SEO/GEO content violations for AI search engine optimization.
Building a planning agent using MCP and Keycard for secure authorization.
iepub: open-source platform for interactive fiction that separates narrative prose from runtime logic and state management.
Article arguing MCP (Model Context Protocol) will remain relevant despite criticism, using self-driving cars as analogy.
User discussion of LLM performance on dating advice compared to forum participants.
Discussion thread about practical methods for sourcing and licensing training data for AI models.
Open Wearables: self-hosted, open-source backend normalizing health data from multiple providers into AI-ready REST API.
Opinion piece critiquing common metaphors for AI capabilities (intern, colleague, compiler) and their implications.
Gamified arena platform for comparing products based on user voting.
Metateam is a Rust terminal UI that manages multiple AI coding agent instances (Claude Code, Codex, Gemini) side-by-side with persistent memory across sessions.
Ledgi is a UK-focused net worth tracker with CLI and Claude Code integration, allowing AI agents to query encrypted financial data via scoped API keys.
mcp-recorder is a VCR.py-like tool for MCP servers that records and replays tool calls to catch silent failures when AI agents interact with modified tool interfaces.
Git-surgeon provides hunk-level Git primitives designed specifically for AI agents to manipulate code repositories at a granular level.
DataQueryAI converts plain text to SQL using local Ollama models, supporting multiple databases and languages without cloud data transmission.
Tool predicting cloud GPU runtime from 2-min local benchmarks to estimate AI job costs and duration on AWS.
Review evaluating five security skills/tools for Claude Code AI assistant, finding limited practical value.
Tool routing system handling 3,146 apps with zero LLM runtime inference, achieving 7ms latency with self-improving capabilities.
AgentShield provides real-time monitoring and observability for AI agents running in production environments.
Kubegraf applies AI for root cause analysis in Kubernetes troubleshooting and SRE work.
Guide on skills engineering—teaching AI agents task execution through detailed instructions to improve token efficiency, reliability, and consistency.
Moltty is a native macOS app providing tabbed, persistent terminals for AI coding tools like Claude Code and Aider with session resumption.
Opinion piece arguing that enterprise software vendors face disruption from agentic AI systems that automate business logic and UIs.
Agent-vfs proposes using virtual filesystems as abstraction for AI agent memory, demonstrating how agents converge on file-based patterns.
Discussion of Python chardet library license change from LGPL to MIT and implications for open source software licensing.
Real-world testing of Qwen 3.5 9B model running locally on M1 MacBook Pro and iPhone, evaluating practical usability in actual workflows.