WebAccessBench: Digital Accessibility Reliability in LLM-Generated Websites [pdf]
Research paper on accessibility reliability benchmarks for websites generated by large language models.
Research paper on accessibility reliability benchmarks for websites generated by large language models.
Analysis of context management as the primary technical bottleneck limiting AI agent capabilities.
Tool for managing Claude Code AI agent interactions via tmux terminal multiplexer.
Real-time dashboard for geopolitical monitoring and AI-powered news aggregation with multiple view modes.
Philosophical opinion on economic policy post-AI automation.
Open-source CLI for behavioral testing using Claude vision, personas, and journeys instead of brittle selector-based scripts.
Agentic framework and MCP server for Syzkaller fuzzing integration.
Open-source browser-based TTS/STT lab using WebGPU and WASM, no API keys or external services required.
Local-first context engine reducing token waste and enabling session persistence for AI coding agents via tree-sitter parsing.
Open-hardware agricultural robot with ROS2 and RTK GPS for autonomous farming tasks.
Claude Code plugin automating Kubernetes Kubebuilder project scaffolding and operator lifecycle management via slash commands.
Evaluation infrastructure for testing web-browsing AI agents at scale.
Research on AI models reproducing verbatim training data, raising copyright and memorization concerns.
Agent skill enabling AI agents to interact with ChatGPT, Claude, Gemini via terminal using browser automation.
Pure C99 GPT-2 implementation with zero dependencies for edge training, using agentic Planner-Worker-Judge pipeline on CPU.
Security research on use-after-free vulnerabilities in context of LLMs and C code.
PostgreSQL connection pooler and sharding proxy that scales databases without application code changes.
Anthropic Education report measuring AI fluency and skill development as tools integrate into daily routines.
Comprehensive checklist for evaluating code quality in AI-generated applications across security, functionality, and deployment.
EloPhanto self-evolving local AI agent that controls Chrome browser with 47 tools. Autonomously writes, tests, and integrates new Python tools.
A2SPA cryptographic tool for signing and verifying AI agent payloads. Security infrastructure for agent communication.
TinySDLC agent orchestrator adds SDLC discipline to multi-agent AI coding. Eight roles with separation of duties, isolated workspaces, enforced handoffs.
Vram.run is a comparison tool for API providers, local GPUs, and cloud options across different LLM models.
Kwin-MCP server enables AI agents to automate Linux GUI via KWin. MCP protocol for AI-driven desktop automation.
Headline only, no content. AI agents and Go programming language.
Zendoc VS Code extension for writing with version control. Not AI/ML focused despite using Cursor.
Vibevideo unified interface for multiple AI video generation models. Aggregates text-to-video, frame-guided, and reference-based generation tools.
mindpm MCP server adds persistent memory to AI coding assistants across Claude Code, Cursor, and others. Stores tasks and decisions in SQLite.
Inconvo agent builder enables chat-with-data without LLM SQL generation. Validates structured intents against semantic layer before execution.
Opinion piece on AI hype and FOMO in tech communities. Social commentary without technical depth.
iOS productivity app connecting goals, habits, tasks and deep work into integrated workflow system.
Open-source CLI tool for Actual Budget optimized for AI agents like Claude Code, enabling programmatic budget management while maintaining web dashboard access.
Tutorial on using AI agents to automate reading Jira tickets and generating pull requests.
Guide to fine-tuning LLMs for enterprise applications, covering mechanics of adapting models like Qwen 3 and DeepSeek v3 for domain-specific use cases.
Conceptual article on requesting tool recommendations from LLMs instead of direct answers.
Discussion on LLM learning mechanisms and feedback systems for model improvement.
toktrack CLI tool monitors token spending across Claude, Codex, and Gemini. Rust-based cost tracking with usage analytics.
Lightweight 15MB Markdown file viewer built in Rust, designed for reading AI-generated documentation.
Opinion article questioning research focus shift toward AI. Meta-commentary on academic priorities, not technical content.
Tickr Slack bot for AI-driven project management. Automates task tracking and team nudges as alternative to Jira.
Personal AI usage guidelines. Limited substantive content.
CLI tool that translates natural language commands into shell commands using LLMs, enabling hands-free terminal interaction.
CLI tool designed as curl alternative for AI agents to make HTTP requests.
BasaltSurge payment API designed for AI agents to perform commerce transactions, bypassing legacy card rails with standardized checkout layer.
Bloomberg Terminal redesign integrating agentic AI capabilities for financial analysis and decision-making workflows.
Hardware and software safety standard for AI-controlled robots with dedicated safety processor on independent power rail controlling AI processor power access.
Personal data analysis of ChatGPT usage patterns combined with biometric data.
Open-source JavaScript library for adaptive media playback (DASH/HLS) in browsers.
NIST launches standards initiative for AI agents to establish consistency and safety protocols in agent development.
Testing framework for AI agents with 8-layer graduated assertions covering tool calls, cost budgets, schemas, and output validation without relying on LLM judges.