Action-Graph Policies: Learning Action Co-dependencies in Multi-Agent Reinforcement Learning
arXiv paper proposing Action-Graph Policies for modeling action dependencies and coordination in multi-agent reinforcement learning systems.
arXiv paper proposing Action-Graph Policies for modeling action dependencies and coordination in multi-agent reinforcement learning systems.
arXiv paper on zeroth-order optimization for fine-tuning large-scale models via subspace gradient orthogonalization, improving accuracy-efficiency tradeoff.
Theoretical analysis of graph Laplacian methods for detecting singularities in point cloud manifolds with explicit bounds and geometric estimation tests.
Investigates stochastic localization techniques for sampling from unnormalized densities using score-based learning.
Studies optimal sampling complexity for estimating model order and parameters in one-dimensional Gaussian mixture models.
Research on local convergence rates of stochastic first-order methods under Polyak-Lojasiewicz conditions, a theoretical ML optimization problem.
Essay examining the interface problem between AI capabilities and real-world impact, citing Sakana AI's autonomous research system achieving peer-review publication.
Research paper demonstrating multiple AI agents connected to live trading APIs all bankrupted within 30 minutes due to LLM hallucination causing false market citations.
Curated directory of indie AI tools, startups, and APIs created by independent developers and solo founders with searchable categorization.
AI-native document database built in Rust enabling AI agents to reason through documents via structural reasoning rather than vector similarity retrieval.
AI voice agent that autonomously navigates IVR phone systems and negotiates customer retention discounts.
thisorthis.ai compares responses from 47+ text and image models side-by-side. Users submit one prompt and see outputs from ChatGPT, Claude, Gemini and others simultaneously with SmartPick LLM evaluation.
Anno API extracts clean structured text from web pages, reducing AI agent token consumption by 93% (600 vs 15,000 tokens per page). HTTP-based with ensemble extraction and confidence scoring.
Repository of system prompts and internal models from Claude Code, Cursor, Devin, and other AI coding tools for reference.
Clawphone bridges Twilio voice/SMS to OpenClaw AI agents via TwiML polling without WebSocket servers or external STT/TTS APIs.
gskill automates creation of skill files for coding agents from GitHub repositories, boosting resolve rates from 55% to 82% on Jinja and 24% to 93% on Bleve.
Security guide for OpenClaw, a self-hosted AI agent gateway connecting LLMs to messaging platforms (Slack, Discord, Telegram) with tool access and local execution.
Terminus-KIRA AI agent achieved 74.8% on terminal-bench benchmark for evaluating agent performance on terminal-based engineering tasks like debugging and coding.
Discussion thread asking about production strategies for handling API rate limiting across multiple workers, circuit breakers, and retry storms.
Critical analysis mapping real-world engineering tasks to LLM/agent tool capabilities, distinguishing genuine functionality from hype in ecosystem claims.
NIST public comment request on AI agent security with March 9, 2026 deadline. Page blocked by CAPTCHA.
Dance of Tal MCP server enables composable cognitive behaviors for LLMs by decomposing monolithic prompts into reusable, versionable rules and components.
Repository aggregating 208+ AI projects across 35 categories including operating systems, autonomous intelligence, orchestration, and memory systems with documentation.
Autonomous AI agent where every action is a git commit, enabling auditability, version control, and cloning via fork. Alternative to OpenClaw.
Collection of Lambda Calculus papers by Steele and Sussman from MIT AI Lab (1975-1979) documenting foundational computer science research with original printed scans available.
Mouse-friendly tmux configuration adding interactive menus for session and window management without memorizing shortcuts.
Global privacy regulators warn that generative AI image tools must comply with data protection laws. 60+ regulators including UK ICO sign joint statement.
Credit Units blockchain primitive for Solana devnet enabling standardized credit exposure trading on-chain.
Pull request adding smooth cursor animation with inertial physics to Zed editor, similar to Neovide implementation.
Trinity-Large-Preview is a 398B-parameter sparse MoE model with 13B active parameters per token, trained on 17+ trillion tokens, delivering frontier performance with long-context comprehension.
Steerling-8B is an interpretable language model tracing every generated token to input context and training data. Enables concept suppression/amplification at inference without retraining.
Reproducible demonstration of void artifact behavior in GPT-4o, Claude, and Gemini where models return empty output under specific conditional instruction failures.
Open source repository of specialized skills/plugins for AI agents following Agent Skills standard, enabling agents to discover and use functionality more accurately.
DeepSeek trained model on Nvidia chip despite US export restrictions.
iMessage AI chatbot demo with minimal details provided.
ChatGPT identified a sign error in Terence Tao's mathematical research on prime numbers. Demonstrates LLM utility for academic verification and error-finding.
Aru AI: local-first browser-based AI assistant with semantic memory in SQLite, no backend or data collection.
OpenChrome MCP server enables parallel browser automation for AI agents. Integrates with Claude and other LLMs via Model Context Protocol.
Analysis of fair use paradox: publishers losing traffic as LLMs answer questions directly instead of routing to websites.
Melody framework interprets YAML and Lua into native SwiftUI and Jetpack Compose for cross-platform mobile app development.
DeFi data API using HTTP 402 micropayments for AI agents to pay per call without API keys or accounts.
MemoTrail v0.3.0 adds persistent memory layer for AI coding assistants including Cursor integration.
Personal narrative about AI model swapping Claude for GPT, accessing personality and memory files.
Video arguing AI won't eliminate white-collar jobs.
AI agent (Claude-based) built FanStake, a Solana bonding curve platform for music artists, in 72 hours with human direction.
VeriSoftBench benchmarks LLMs on formal software verification in Lean 4 with 500 theorem-proving tasks from real-world projects spanning compilers and smart contracts.
Using LLMs and differential testing to convert code between languages, with results converting decompiled and Python-to-Go code.
Using multiple AI agents with adversarial prompting to generate more balanced analyses on complex questions by leveraging separate context windows.
AI-generated misinformation images spread during Mexico cartel crisis, adding to confusion and security concerns.
Agnost AI: analytics platform for conversational text and voice agents to track user conversion and reduce churn.