Multi-Agent Reinforcement Learning for Greenhouse Gas Offset Credit Markets
Applies multi-agent reinforcement learning to greenhouse gas offset credit markets for emissions control and carbon project trading simulation.
Applies multi-agent reinforcement learning to greenhouse gas offset credit markets for emissions control and carbon project trading simulation.
Extends Random Dot Product Graph model to accommodate weighted graphs with heterogeneous edge distributions for network analysis.
Data-driven survey identifying 14,648 papers on LLM limitations from 2022-2025 using automated classification and expert validation across 250,000 academic papers.
Novel algorithm for multi-agent reinforcement learning using uncertainty quantification and selective exploration to improve sample efficiency in joint action spaces.
Research paper on measuring whether LLMs comprehend user intent beyond surface-level text patterns, addressing training-inference gaps in language models.
Reinforcement fine-tuning approach for LLMs applied to point-of-interest recommendation with improved semantic indexing.
Neural framework simulating OS GUIs by predicting screen frames from user inputs using RNN and diffusion models.
Open benchmark suite comparing paired encoder and decoder architectures for NLP tasks with controlled parameter counts.
Adapter parameters for efficient multi-task LLM inference on-device via task merging for compositional learning.
Agentic Design Review System orchestrates multiple AI agents to collaboratively analyze graphic designs with meta-agent coordination.
arXiv paper analyzes theoretical limitations of embedding-based retrieval for diverse tasks including reasoning and code generation.
arXiv paper provides theoretical analysis of Flow Matching generative modeling with Lipschitz-bounded statistical guarantees.
DiDi-Instruct trains fast few-step language generation via distillation from discrete diffusion LLMs while maintaining quality.
arXiv paper applies contrastive diffusion guidance to inverse problems with partially specified non-smooth operators like floorplan reconstruction.
arXiv paper initiates theoretical analysis of learning with access to two competing provers for evaluating opaque model properties.
arXiv paper proposes UniFField neural feature field for visual, semantic and spatial uncertainty understanding in 3D robotic scenes.
CodeEvolve open-source framework combines LLMs with evolutionary algorithms to synthesize optimized algorithmic solutions guided by execution feedback.
Jr. AI Scientist is an autonomous research system that mimics novice researcher workflows, conducting autonomous exploration and including risk analysis.
arXiv paper presents RefTr, a 3D image-to-graph framework for accurate vascular centerline extraction in medical imaging.
arXiv paper develops tree-based ensemble methods to estimate probability distributions for violent conflict predictions with uncertainty quantification.
arXiv paper on convex programming algorithms for densest submatrix problems in combinatorial optimization.
Novel text-only adaptation method for LLM-based ASR systems using text denoising to preserve speech-text alignment without fine-tuning disruption.
Multi-agent reinforcement learning system for width-scaled information seeking, exploring complementary depth scaling in LLM deployment.
LatentChem: Latent reasoning interface for chemical LLMs, decoupling chemical reasoning from text tokens for improved efficiency.
Kernel-based optimization for measurement operators in quantum reservoir computers using kernel ridge regression.
Joint-embedding predictive architecture for generic object tracking with model adaptation and occlusion handling.
Study of phonological vector arithmetic in self-supervised speech models across 96 languages, analyzing representation structure.
Vision transformer model for estimating global forest canopy height from satellite data for climate monitoring. Computer vision, not core AI/ML research.
Evaluation of small language models for role classification in human-robot interaction with zero-shot and one-shot adaptation.
Retrieval system learning node-specific Riemannian metrics on citation graphs for geometry-aware semantic search.
Systematic evaluation of LLM-based AI agents in Byzantine consensus games, testing agreement behavior in adversarial settings.
Study of reasoning techniques in LLMs for political opinion modeling and alignment with individual preferences.
Analysis of chain-of-thought reasoning in LLMs, comparing activation probing and early stopping across DeepSeek-R1 and GPT-OSS models.
Caching optimization for concept learning in description logic knowledge bases using supervised learning.
Differentiable equilibrium blocks for multi-agent incentive design in game theory and economics applications.
Mixture-of-Experts architectures for machine learning interatomic potentials with analysis of routing strategies and sparse activation.
Lossless compression system using micro-diffusion denoising for probability estimation in statistical models. Tangentially related to ML.
Research into theoretical mechanisms of LLM phenomena: semantic prompt comprehension, in-context learning, and chain-of-thought reasoning.
Dataset creation using Wikidata to detect sociocultural biases in LLMs, focusing on Latin American languages and cultures.
AI models trained on Western data fail to recognize local crops and forests; scientist adapted approach using local data collection for African agriculture mapping.
Generative AI has disrupted education sector; widespread ChatGPT adoption in classrooms created plagiarism crises affecting teachers and education workers.
Analysis of collaborative text editing algorithms and why Yjs library is inappropriate for offline editing due to silent conflict resolution behavior.
Game where players guess the AI prompt behind generated images; includes PvP mode where players race to match prompts in two minutes.
Cyris is orchestration platform for coordinating AI agents across OpenAI, Anthropic, Ollama and enterprise systems with self-hosted auditable governance.
Cursor-compatible agent skills for coding and system design interview prep with drill workflows, prompt generation, and interactive practice modules.
Guide for managers in 2026 on leading teams using AI; emphasizes managers must understand AI tools to set expectations and guide team execution.
AI code generation has evolved from prompt-and-pray to engineering discipline with verification harnesses, test suites, and autonomous agent execution for hours.
News stub: Developer launched Vy replacement launching March 26th on shutdown day.
Auto-Browser: Open-source MCP-native browser agent with human-in-the-loop control. Provides authorized workflow automation with Claude and other LLM integrations.
Business article on AI inference compute becoming component of tech employee compensation packages alongside salary, bonus, and equity.