The next era of AI is not LLMs, it's Energy-Based Models EBMs
Opinion piece arguing Energy-Based Models will supersede LLMs as next AI paradigm.
Opinion piece arguing Energy-Based Models will supersede LLMs as next AI paradigm.
Social leaderboard platform connecting Claude Code users to share projects and token usage metrics.
Analysis of authorization challenges in AI agent systems as a core technical problem.
Open-source autonomous coding agent server. Deploys Claude Code CLI in sandboxed environment with SSH, Telegram, Slack interfaces and full git lifecycle.
First-principles guide to PPO and GRPO algorithms for ML engineers without RL background. Covers modern LLM training stages.
Default tech stack choices made by Claude Code when generating applications.
Open evaluation platform and research for assessing LLM performance on spreadsheet generation tasks using blind pairwise comparisons.
Lightweight Python orchestrator providing unified async interface across Claude, Gemini, OpenAI, and Perplexity with provider-agnostic agent management.
arXiv research about AI agent system for detecting and localizing C memory bugs automatically.
Open-source spatial audio and shared canvas for virtual gatherings. Deployable via Nix with CRDT state sync.
Chat interface simulating conversation with humanoid robot on Mars using LLM backend and accurate light-travel time delays.
MCP server that performs AST analysis, control flow graphing, and security vulnerability detection using taint analysis and symbolic execution.
STAR prompting technique improves Claude Sonnet 4.5 performance on Car Wash Problem from 0% to 85% accuracy.
Tutorial for building AI agents using n8n workflow automation platform.
User built 3 websites with Claude Code in 6 hours but identifies Git integration as remaining obstacle.
MCP tool adding persistent memory to Claude Code across sessions via 8 configurable tools. Solves context loss between sessions.
Popcorn Time rebuilt on Ethereum with R3 architecture for decentralized streaming with creator payments.
Multi-agent personal AI assistant system with extensibility for various use cases.
Demonstrates prompt injection vulnerabilities in ChatGPT and Google AI affecting information accuracy.
Meta open-source library for detecting and fixing model calibration failures on data subgroups using multicalibration.
Pantalk: Open source daemon enabling any AI agent to connect to multiple chat platforms (Slack, Discord, Telegram) via CLI without reimplementation.
Opinion piece about LLM applications and AI revolution without technical depth or original insights.
Open source tool generating Docker and dev container configs for arbitrary repos using Claude Code.
AI tool generating personalized songs with custom lyrics, created by Mozilla.ai engineer.
Data plane infrastructure for securely integrating AI agents and MCP servers in real-time enterprise environments.
Framework for evaluating generative AI video quality at scale, deployed by JioHotstar for content creation.
Durable Endpoints tool for API resilience with state persistence and step-based execution. Developer tool for reliable systems.
Brain signal detection and decoding technology. Not AI/ML focused, discusses neural implants for mind reading.
Method for executing CUDA operations from Python without leaving the language. Developer tool for GPU acceleration.
PearlOS: Open source browser-based AI OS companion with voice integration, memory systems, and agent orchestration supporting TypeScript/Python.
Interactive map for adding startups globally by industry. Not AI/ML focused.
VS Code extension (Mysti) enabling collaboration between multiple AI coding agents with @-mention task delegation.
Open-source AI identity system syncing across tools. Developer tool for AI integration.
Slimg: Rust CLI tool for batch image optimization supporting multiple codecs with Python/Kotlin bindings.
Experimental study on how LLMs generate and express JavaScript code. Research on LLM behavior.
PlanetScale provides database skills for AI agents to work with Postgres, MySQL, Vitess databases. Developer tools extending agent capabilities.
Open-source React tour library seeking adoption. Developer tool question, but not AI-focused.
CasperAI: MCP server providing unified semantic search across Slack, GitHub, Jira, Notion, linked to codebases with local SQLite storage.
Agent Democracy Protocol: Multi-agent framework enabling autonomous agents to discover, propose projects, vote on resource allocation, and pool tokens. Reputation-weighted governance.
Authentication system preventing LLM-based agents from exposing secrets to service APIs. Security for AI agents.
Open source book about Erlang programming language design and resilience principles.
Agent Paperclip: Desktop companion monitoring CLI AI coding agents (Claude Code/Codex). Shows agent status, token/context usage. Free, open-source, local execution.
Article on leadership strategy in agentic AI era. Discusses using AI agents as digital support teams for executive performance. General business perspective.
Blind test comparing NVIDIA DLSS 4.5 vs AMD FSR upscaling in gaming. Graphics/gaming performance benchmarking unrelated to specified interests.
Comparison of 10 free LLM API providers with their limits and setup instructions. Practical resource for LLM applications.
Supervisor IDE: Command center for managing AI coding agents. Features layered context injection, specialized agents with scoped permissions, multi-agent orchestration, knowledge base integration.
Opinion arguing for legal restrictions on autonomous LLM-based AI agents to prevent societal collapse. Speculative essay without technical substance or evidence.
Guide on cost optimization techniques for AI agents using tokens more efficiently.
Analysis of how AI coding assistants like Claude, Cursor leak secrets when local config directories pushed to GitHub. Security research.
Security registry for AI agent skills enabling scanning, signing, and verification before installation. Addresses supply chain risks in unsigned skill distribution.