Bayesian Neural Networks for Functional ANOVA model
Bayesian neural networks for functional ANOVA decomposition using tensor product neural networks as basis functions.
Bayesian neural networks for functional ANOVA decomposition using tensor product neural networks as basis functions.
Incomplete multi-view clustering method using hierarchical semantic alignment and cooperative completion for missing view data.
Two-player Markov game framework studying human-AI interaction for balancing agent autonomy and safety through minimal control interface.
Online optimization problem for grid energy management with purchase and delivery decisions for data centers.
Method for learning sparse ODEs from noisy partial observations using kernel collocation and sparse recovery techniques.
Framework applying population-level surveillance methods to AI outputs for governance and explainability, bypassing model complexity limitations.
Kubernetes scheduler using LLM to interpret natural language hints for semantic, intent-driven cluster workload allocation with soft affinity preferences.
Controlled study of how pretraining discourse about AI behavior influences LLM alignment outcomes, with 6.9B-parameter model experiments on behavioral priors.
Analysis of LLM benchmark saturation and the challenge of creating discriminative tasks as frontier models improve, discussing feasibility of future benchmarking.
Medical imaging segmentation framework combining radiological images and clinical text with uncertainty-aware multimodal fusion for diagnosis.
RAG-based framework for question answering over multi-hour audio with temporal grounding, addressing context-length limitations of audio-language models.
Image tokenization method using learned discretization geometry for efficient visual generation with improved codebook utilization and reduced representation collapse.
CoreCraft RL environment from EnterpriseBench suite for training generalizable agents on high-fidelity enterprise customer support simulations with 2,500+ entities and 23 tools.
Multi-agent LLM and vision framework for robotic manipulation with closed-loop feedback, enabling task planning without fine-tuning in dynamic environments.
Benchmark for evaluating LLM planning and reasoning by navigating Wikipedia hyperlinks to reach target pages, testing world knowledge and look-ahead planning across multiple model variants.
Aqua is a CLI message tool designed for AI agents. Title only, minimal technical details provided.
React portfolio that dynamically re-architects its DOM based on LLM intent analysis using Llama-3 via Groq, adapting content for different audiences (recruiters, founders, engineers).
Sam Altman quote about AI energy costs compared to human training. Title only, minimal content.
Case study documenting 5 failure modes from running AI agents autonomously: auto-rotation loss, documentation trap, market inefficiency, static models, and monitoring gaps.
Discussion question about algorithmic optimization for non-uniform movement in animation and task automation. Incomplete content.
ZkzkAgent is a fully offline, local AI assistant for Linux using LangGraph and Ollama. Includes package management with human-in-the-loop safety and natural language system control.
Llamora is a local-first journaling web app using Python, HTMX, and local LLMs. Model generates day openings, recaps, and reflective responses anchored to daily pages.
Tlsctl is a CLI tool for inspecting and debugging TLS connections with structured output. Not AI/ML related.
Clickbait headline about Amazon's internal tool. No substantive content provided.
Documentation on forge-specific git repository folders across GitHub, GitLab, Gitea for CI, reviews, and issue templates.
Terminal tool detecting hardware specs and scoring 157 LLM models across quality/speed/fit to identify runnable models on user's machine.
Open-source load-balanced proxy for Gemini API requiring no paid API keys, designed to avoid rate limits when building AI agents.
Web Verbs extension enabling AI agents to operate the web by wrapping APIs and browser interactions as callable functions.
User discussion about Codex generating overly defensive TypeScript code with excessive optional types and error swallowing.
Benchmark evaluating Claude, Codex, and Gemini for binary malware detection without source code access using AI agents.
macOS daemon continuously monitoring screen activity to build persistent memory context for Claude Code agent sessions.
Linux driver fix for older AMD GPUs contributed by Valve developer.
Security critique of AI agent CLI tools exposing credentials in environment variables, advocating for safety-first design principles.
Self-hosted AI research platform for physics labs built on Open WebUI with MCP tool servers for spectroscopy, XRD, SEM, and literature search.
Nine observations from building AI agent systems including prototyping strategies and fine-tuning smaller models like Qwen 3.
Personal project description of a digital Zen garden web application developed using various AI coding agents.
AI assistant automating phone calls using Claude, 11Labs voice, and Twilio for handling restaurant, bank, and doctor calls.
Tool enabling remote approval of Claude Code permission requests via phone push notifications through ntfy.sh integration.
Pattern-based detection system identifying 26 linguistic markers of AI-generated Finnish text leveraging morphological complexity.
Technical deep-dive on GitHub Copilot CLI's accessible ASCII banner animation using custom tooling and ANSI terminal engineering.
SergioAI: Open-source bot using Claude that converts Trello cards to working code. Explores codebase, creates implementation plans, iterates on feedback, and opens draft PRs.
Paragent: Tool for running 10 AI coding agents in parallel. Agents take plain English descriptions, write code, run verification, and open PRs automatically on separate branches.
Open-source memory API for AI applications providing persistent conversation storage, fact extraction, semantic search, and contradiction handling.
Browser MCP tool for AI agents using Python code execution instead of token-heavy tool calls, reducing context window usage by 6x.
Open-source Quake 4 engine reimplementation using agentic AI for development; early stage game engine project.
Shared terminal interface enabling humans and multiple AI agents to collaborate in persistent conversation timeline for iterative refinement.
TLA+ formal specification workbench skill for AI coding agents compatible with Vercel skills CLI.
Report on AI agent incidents including Amazon's Kiro causing 13-hour AWS outage and other autonomous agent failures.
Specialized AI security agent detected vulnerabilities in 92% of exploited DeFi contracts representing $228 million in verified losses.
Open-source cloud service for reMarkable tablets with AI-powered OCR, note syncing, and Notion integration using Claude API.