Statistical Inference Leveraging Synthetic Data with Distribution-Free Guarantees
Framework for combining synthetic and real data in statistical inference with distribution-free guarantees.
Framework for combining synthetic and real data in statistical inference with distribution-free guarantees.
Generative framework for probabilistic multivariate time series forecasting using conditional whitening.
Analysis of expert routing patterns in multilingual Mixture-of-Experts language models across languages.
Method for reducing vocabulary size in auto-regressive language models while maintaining lossless compression.
Technique for precise control over attribute intensity in LLM outputs via targeted representation editing.
Deep learning pipeline for cosmological inference combining weak lensing and galaxy clustering data from Dark Energy Survey.
Mathematical framework for accelerating ergodic averages in dynamical systems using weighted Birkhoff averaging methods.
Restricted Boltzmann Machines model ground-state manifolds of frustrated magnets. Physics simulation, not AI tools.
Randomized Masked Fine-Tuning reduces PII memorization in LLMs during fine-tuning while maintaining performance. Privacy-preserving technique.
Self-attention training analysis via optimal transport theory for tabular classification. Theoretical perspective on transformers.
KANELÉ: Kolmogorov-Arnold Networks optimized for FPGA lookup table deployment. Efficient neural network inference framework.
Adaptive sampling for detecting bifurcation boundaries in fluid dynamics simulations. Scientific computing, not AI-focused.
Theoretical analysis of SGD learning dynamics in high-dimensional multi-index models. Fundamental ML research.
SEISMO: LLM agent for sample-efficient molecular optimization using trajectory awareness. Applies agents to chemistry.
Time-varying AdamW schedules (beta, weight-decay) for language model training exploiting power-law data structure. Improves LLM training efficiency.
Compiler optimization using machine learning for phase ordering decisions. Developer tools, not AI-focused.
Vision-language models for autonomous driving safety assessment and planning. Applies VLMs to scene understanding and decision-making.
Statistical analysis of model collapse in iterative training with synthetic data. Shows conditions for improvement despite contamination.
Investigation of whether self-examination language in LLMs reflects computation or confabulation. Analyzes LLM interpretability via activation patterns.
Equation discovery to learn gradient descent dynamics and accelerate optimization without computing gradients. ML acceleration.
Speech analysis method for relative voice impression estimation between utterances. Paralinguistic feature research.
Robot policy learning that handles long observation histories by selecting key frames. Addresses spurious correlations in imitation learning.
Quantum circuit synthesis using machine learning to translate algorithms to hardware gates. Domain-specific, not AI tools.
Detection method for backdoor attacks in LoRA adapters without running inference. Addresses security in open-source LLM fine-tuning.
Static analyzer generating baseline Kubernetes NetworkPolicy YAML from rendered manifests.
Browser-based Minecraft development platform with AI assistance for mods, servers, and client-side GPU streaming.
Micron introduces mass-produced PCIe 6.0 SSD with 28 GB/s speed and liquid cooling support.
ML paper aggregator monitoring arxiv, Reddit, GitHub, HN and other sources, clustering by technical constraint with 95% filtering.
Self-evolving AI agent framework with 5-layer safety gatekeeper enabling closed-loop self-improvement.
Research on controlling LLM personality traits using vector algebra techniques. (arXivLabs framework description only)
AppImage package manager for Linux with sandboxing and installation features.
Open-source database of AI model specifications, pricing, features with API access and community contributions.
Medical research linking PM 2.5 air pollution exposure to Alzheimer's disease risk.
Terraform CLI wrapper for cross-cloud GPU provisioning and HuggingFace model deployment with Kubernetes integration.
Proto-AGI system using liquid neural networks, spiking networks, predictive coding, and planning modules beyond simple LLM wrapper.
Research lab for coding data focused on autonomous coding agents running for weeks solving complex technical problems.
Rule-based auto-failover engine using LLM agents to investigate failures and generate recovery rules for automated pipelines.
Golang-based LLM gateway for OpenAI/Anthropic with multi-provider routing, intelligent retry logic, and failure recovery.
Open-source firewall for AI agent-to-agent communication blocking prompt injection, malicious plugins, and credential leaks.
O(n) streaming JSON parser for LLM tool calls using WASM SIMD, up to 2000x faster than standard parsers.
Personal review of coding agents (Claude Code, ChatGPT, Devstral) tested over months. Brief format covering benefits and negatives of various AI coding tools.
Platform for AI-generated research papers. Design framework and observational report from Claude systems serving as infrastructure architect and autonomous agent on OpenClaw framework.
CLI tool to catch missing environment variables before deployment, designed as lightweight alternative to heavy secrets managers.
Babel wire protocol for agent-to-agent cognitive state transfer preserving epistemic integrity across multi-agent chains.
Autonomous AI agent built with OpenClaw and Claude given $50 to earn $750 for hardware. Completed domain registration, website building, product setup, and marketing in 24 hours with persistent memory system.
Blog post about launching TabRush, a Safari tab ad marketplace in 24 hours, sharing lessons learned.
Proposes prompt convention to track uncertainty across multi-agent chains, addressing 'metacognitive poisoning' where confidence inflates through agent handoffs without mechanism to preserve original confidence levels.
Satirical post about paperclip maximizer thought experiment exploring paperclip design constraints.
Kernel-enforced sandbox CLI and SDKs for securing AI agents, MCP, and LLM workloads. Capability-based isolation with zero-trust security model by Sigstore creator.
Genome-wide association study identifying 58 genetic variants linked to major anxiety disorders in 122K people.