SERNF: Sample-Efficient Real-World Dexterous Policy Fine-Tuning via Action-Chunked Critics and Normalizing Flows
Dexterous robotic manipulation policy fine-tuning using diffusion models and normalizing flows for real-world scenarios.
Research on predicting LLM success from internal pre-generation activations to optimize inference efficiency in reasoning tasks.
SSLogic agentic meta-synthesis framework where LLM agents iteratively generate and refine task specifications for logic reasoning.
Training-free few-shot anomaly detection using subspace modeling of vision foundation model features.
Analysis of noise models and mitigation strategies in photonic quantum machine learning systems.
Training framework for geometric and neuromorphic AI using alternative arithmetic substrates.
SwiftGS system for rapid 3D satellite surface reconstruction via meta-learned Gaussian primitives.
Canonical Security Telemetry Substrate for standardizing cybersecurity data formats for AI-driven detection.
Firefly algorithm adaptation for mixed-variable optimization problems.
Weakly convex ridge regularizer for 3D non-Cartesian MRI reconstruction.
Early warning system for GPU hardware failures using structural observability beyond numeric telemetry.
OptiMer framework for optimizing data mixture ratios during continual LLM pre-training without manual tuning.
Brain tissue segmentation from MRI using deep learning and foundation models.
Multimodal dataset of 601k text annotations and 385k audio recordings across 10 African languages.
Realistic backdoor attack methods for federated learning using semantically meaningful triggers.
Novel neural architecture primitive based on field theory and metriplectic dynamics.
S0 tuning method for efficient LLM adaptation via state matrix optimization, outperforming LoRA on code generation tasks.
Neural architecture for generating online handwriting with stroke continuity and stylistic consistency.
Stub article about mempalace AI memory system benchmark.
OpenClaw provider plugin routes LLM requests through Claude Code CLI with persistent worker pool and OAuth, enabling Claude Pro/Max access without API credentials.
LLM-based workout plan generator for personal trainers in India with WhatsApp integration and exercise library.
Ship Safe v7.0.0: AI-powered security platform running 19 specialized agents to scan code for vulnerabilities including LLM/agentic AI security risks.
Open-source AI research agent that reads papers, searches the web, writes drafts with cited claims, and runs experiments locally.
Fujitsu One Compression is an open-source Python library for LLM post-training quantization implementing GPTQ, DBF, RTN and novel QEP methods.
Claude Profile wrapper enables multiple concurrent Claude Code sessions with isolated configs per subscription (work/personal separation).
Knowledge base and retrieval tooling for AI-human interaction and shared memory systems.
Preprint research examining judgment consistency vs reasoning quality in ChatGPT, Claude, and Gemini across 1,800 judgments.
OpenAI policy proposals for AI economy including public wealth funds, robot taxes, and wealth redistribution mechanisms.
User experience comparing Pollinations as a lightweight LLM backend alternative to Hugging Face.
Node.js tool that patches Claude Code system prompts to reduce corner-cutting by rebalancing laziness/thoroughness instructions to a 5:1 ratio.
User complaint about Claude Code subscription limits and lack of transparency in usage metrics.
SmolVM: Open-source lightweight sandboxes for running AI agent code and browser automation safely with instant boot/teardown.
Prompt technique using job titles to orchestrate multi-specialist AI team workflows. MIT licensed, two-command setup.
Tool enabling team-based AI agents with specialized job titles instead of single prompts, supporting 125+ skills across disciplines.
Neuro-symbolic AI proof-of-concept reduces energy consumption by 100x while improving accuracy for sustainable AI systems.
Cryptographic delegation protocol for AI agents that establishes user-to-operator trust through delegation receipts, closing gaps in IETF frameworks.
arXiv research framework on test-time scaling, showing that overtraining is compute-optimal for LLM inference.
Luigis-meter is a terminal statusline indicator for Claude Code Max users that tracks session and weekly quota usage without API calls.
A2A is a hub that connects Claude Code agents, enabling real-time collaboration where one agent can watch and debug another's work using MCP.
Benchmark comparison of open-source and commercial LLMs on code generation tasks; Claude Opus, Sonnet, and GLM models showed highest success rates.
Zopaf is a negotiation math engine exposed as an MCP server requiring zero LLM tokens.
RSC Boundary is a Next.js devtool that visualizes server/client component boundaries in the browser for React Server Components.
Kyoo is a self-hosted media server alternative to Jellyfin/Plex with automatic metadata handling and built-in features.
MCP and API tool for feeding SEC filing data into AI agent workflows with source attribution for equity research.
Graphify is a Claude Code skill that converts folders into queryable knowledge graphs using multimodal AI to understand codebases.
arXiv research on reverse address translation overheads in multi-GPU systems. Machine learning infrastructure paper.
ACP is a governance layer for AI coding agents that handles authentication, permissions, limits, and audit logs for LLM-based tools.
Analysis of 30M sources shows Reddit is the most-cited source across major AI search platforms (24% of Perplexity citations), followed by YouTube.
Next Moca is an enterprise control plane for AI agents that provides intuitive specification and orchestration of agent behavior with governance, security, and integration capabilities.
GitHub Copilot CLI adds a Rubber Duck feature that uses a second model to review agent-generated plans and provide independent feedback.