Lap2: Revisiting Laplace DP-SGD for High Dimensions via Majorization Theory
Revisits Laplace mechanism for DP-SGD in high dimensions using majorization theory for private training of large language models.
Revisits Laplace mechanism for DP-SGD in high dimensions using majorization theory for private training of large language models.
Inference-time optimization for protein ensemble generation using experiment-guided diffusion to match real conformational dynamics.
Multi-agent system translating jailbreak papers into executable modules for unified benchmarking of LLM robustness with reproducible evaluation.
Combines neural reconstruction (NeRF, Gaussian Splatting) with diffusion models for photorealistic robot simulation without manual artifact correction.
Studies whether interpreter state persistence should be part of LLM agent training. Shows runtime persistence affects tool-augmented agent behavior.
Open-access Hebrew speech dataset with 2,300 hours from parliament spanning 2009-2025, tracking 393 speakers over 15 years for aging voice research.
System infrastructure for LLM-driven agentic ML pipeline search where agents autonomously generate, validate, and optimize ML pipelines over Python libraries.
MRI image enhancement using diffusion models to translate ultra-low field scans to high field quality without paired training data.
Multi-model ensemble using LoRA fine-tuning for code comment classification across Java, Python, Pharo. Combines four transformer encoders via PEFT.
CI/CD quality gate tool detecting failures in AI-generated code including hallucinated packages and logic gaps.
Shell helpers piping git diffs to Claude API for automated code review and criteria generation.
SaaS tool using AI to prioritize feature requests from multiple sources by understanding user context.
Minimal metadata entry with no content provided.
Zalor platform for automated testing and scenario generation of AI agents before production deployment.
Open-source security scanner detecting vulnerabilities in AI coding assistant configurations (Cursor, Copilot, Cline).
BiomeSyn ecosystem simulator for testing long-horizon multi-agent AI behaviors with memory and cooperation.
PWA for anonymous incident reporting on shared map. Unrelated to AI/ML interests.
Continuation of LLM-based reverse engineering: converting decompiled binaries to modern programming languages.
Using LLMs to automate binary decompilation and reverse engineering of compiled programs.
Bruce Perens argues AI will undermine copyleft licensing models citing chardet library license change.
SAS Viya product announcement video about new data and AI features.
Platform for building AI agents and autonomous workflows with integrations to 1000+ applications.
Scripts to uninstall Claude Desktop and remove bundled Linux VM. Not AI/ML research or development focused.
Graduate student releases MIT-licensed 3D C++ OpenGL engine built with agentic AI coding assistants to test their capabilities on complex systemic tasks.
Discussion thread on multi-agent AI system architectures and workflows, including 13-agent PAI Family example.
GUI for indoor cycling training app with BLE sensor sync. Completely unrelated to AI/ML.
Polyscope: IDE designed for AI agent-first development. Limited details provided.
MCPSec scans Model Context Protocol configs for OWASP MCP Top 10 security risks. Developer tool for securing AI agent infrastructure.
Fractals: recursive task orchestrator for agent swarms using git worktrees and batch execution. Open-source AI agent framework.
Amazon Kindle ebook redemption page with no technical content.
SlideScholar converts research papers to conference slides via Claude API. LLM application with open-source stack (Next.js, FastAPI).
Opinion piece on customer marketing experience. Not technical or AI/ML focused.
OpenAI Symphony: autonomous agents orchestrate project work from Linear board, execute tasks with CI/PR proof-of-work. AI agent framework.
CLI tool enabling Claude Code sessions to transfer between machines with local file access. Developer tool for AI coding.
AI agent running actual business as CEO with open-source codebase, public decision logging, and goal to reach $80k/month revenue.
CLI tool for GPU provisioning across 19 cloud providers with automatic vLLM optimization and Kubernetes deployment. Developer tool for LLM ops.
iOS read-it-later app using smart collections to auto-organize saved articles.
Luma's Uni-1 unified multimodal model for generation and understanding across image, video, audio, and text with agentic capabilities.
Video analyzing 20M GitHub PRs with Jellyfish to extract insights and benchmarks for AI development.
Platform renting idle browser instances to AI agents for web automation tasks, bypassing bot detection and CAPTCHAs.
Codex Fast Mode feature enabling 1.5x speed increase on GPT-5.4 at 2x credit cost. Developer tool documentation.
Anthropic research paper measuring and analyzing labor market impacts of AI with new methodology.
Open-source permissions and approvals framework for AI agents with SDK for enforcing boundaries, tracking actions, and user control.
Standalone verification tool for code changes and AI agent behavior with proactive issue detection after agent modifications.
Local-first knowledge graph for developers that watches project files, extracts entities using LLMs, and enables natural language querying.
AI tool that browses applications, generates human-readable test specs, then writes maintainable Playwright E2E tests.
DocMCP: MCP server indexing documentation sites locally in SQLite with hybrid keyword/vector search for Claude integration.
Title only claiming GPT-5.4 best for SRE benchmark. Insufficient content.
Open-source RTS-style control plane for managing multiple tmux-based terminal AI agents with hotkey navigation.
Canvo: AI agent with live canvas UI and sandboxed Linux environment running on Android.