Standardized and In-Depth Benchmarking of Post-Moore Dataflow AI Accelerators
Research paper providing standardized benchmarking methodology for post-Moore dataflow AI accelerators.
Research paper providing standardized benchmarking methodology for post-Moore dataflow AI accelerators.
microgpt project. Insufficient content to assess, likely GPT-based tool or library.
Testing framework for LangChain agents covering prompt injection, tool failures, and cascading failures.
Podcast discussing historical chatbots and their relevance to modern clinical AI applications.
Developer built secure AI agent using Blink and Mac Mini as alternative to OpenClaw, addressing security concerns in agent architecture.
Analyzes prompt injection attacks as security threat to AI applications. Relevant to understanding LLM app vulnerabilities.
Tool converts visual feature boards into AI-ready coding prompts for development workflows.
Open-source field service management software for trades businesses.
User employed Claude LLM to negotiate hospital bill reduction of $163,000 through AI reasoning.
Feature flag evaluation library for Scala using ZIO framework.
Pure-Ruby GIS rendering engine generates maps and geospatial visualizations from GeoJSON without external services.
Guide on detecting prompt injection attacks in AI agents with practical implementation techniques. Directly relevant to securing LLM applications.
Deep learning method for segmenting individual tree crowns in aerial imagery using pseudo-labels derived from LiDAR data for forestry and environmental monitoring.
Developer built AI agent that autonomously drives app functionality and self-checks in 50ms loops for solo sports league platform.
ASID-1M: open-source collection of one million structured audiovisual instructions for training universal video understanding models with fine-grained annotations.
Static site with 75 free developer and AI tools including token counter, model comparison, cost estimator. Browser-based, no tracking.
Examines vulnerabilities in applications built on GPT/Claude/Llama. Covers attack vectors against LLM-based systems in production.
Part 4 of CNN image classification series covering input variation handling. Educational but narrow ML topic.
Open source runtime safety firewall for AI coding agents using bash and jq with zero dependencies.
SciAgentGym: benchmark with 1,780 domain-specific tools across sciences to evaluate multi-step tool-use in LLM agents, includes SciAgentBench evaluation suite.
API proxy providing unified interface to OpenAI, Anthropic, and compatible LLM providers.
Blip: Open-source ephemeral chat application with no data persistence.
Opinion piece on economic potential of generative AI as productivity tool.
Developer tool for sharing tmux terminal sessions over network, supports Claude Code collaboration.
Distributed mutex (Redis/file-backed) for multi-container/process AI agent swarms preventing race conditions.
2018 philosophy essay on open source motivations and community, not technical content.
Minimal GPT-style language model implementation for character-level prediction, educational machine learning code.
Research finding that increased effort reduces accuracy for Gemini Flash 3 and GPT-5 deep research.
Browser-based image toolkit with server-side AI processing.
CNN-based deep learning for visual classification at enterprise scale, including bird identification example. Practical ML architecture.
MCP server enabling team collaboration and review of coding agent plans in shared workspace.
Open-source MCP proxy enforcing per-tool budget controls on AI agent tool calls using L402/macaroons.
Open-source macOS AI agent accessing Mail, Calendar, Reminders via AppleScript without OAuth.
Minimal title-only entry without substantive content.
Discussion of operational shutdown mechanisms for misbehaving AI/LLM systems in production, covering cost, latency, and safety issues.
Open-source contact center platform for deploying AI agents across stacks with built-in infrastructure.
CCClub: Open-source leaderboard tool for tracking Claude Code token usage and costs among users. Developer tool for LLM application monitoring.
Research comparing LLM performance to physicians on medical differential diagnosis tasks. Benchmark study of LLM capabilities on specialized text analysis.
Nginx/iptables/UFW configuration repository for cloud providers. Infrastructure tooling unrelated to AI/ML interests.
Technique for safely running LLM agents in isolated VMs using Libvirt and Virsh. Infrastructure approach for agent safety and containment.
Personal account of AI-generated criticism. Narrative-focused without technical analysis or reproducible insights.
MCP server enabling coding agents to query GitHub repos with actual source code instead of relying on training data, reducing hallucinations.
User reporting SSL connection error to ChatGPT website.
Release announcement for open-source Angular diagramming library with interactive features.
Fuzzy-matching tool for matching job board data against UK visa sponsor registry. Data processing application with practical developer utility.
Web app for AI-powered video generation with intuitive UI. LLM/generative AI application but limited developer tooling focus.
Collection of prompts and responses from DeepMind's Aletheia model on advanced mathematics problems.
Amtrak train modernization announcement. Off-topic.
iOS app converting text prompts to complete songs using AI models. LLM application but music-focused, limited developer relevance.
Guide on using Claude Code for fullstack development, covering practical applications of AI-assisted coding with focus on effective usage patterns.