Generative Drifting is Secretly Score Matching: a Spectral and Variational Perspective
Theoretical analysis showing generative drifting for one-step image generation is equivalent to score matching under Gaussian kernels.
Theoretical analysis showing generative drifting for one-step image generation is equivalent to score matching under Gaussian kernels.
SignalMC-MED benchmark for evaluating biosignal foundation models on synchronized ECG and PPG data from clinical datasets.
Study of learning rate sensitivity in PPO actor-critic methods, analyzing early structural signals to predict training stability.
Neural debugger for Python that trains LLMs on execution traces to predict line-by-line program execution for debugging workflows.
Analysis of neural network optimizers (AdamW, Muon) as steepest descent under matrix norms, addressing width scaling stability.
Mechanistic interpretability study on how feature correlations affect superposition in neural networks beyond idealized uncorrelated settings.
TAM-RL framework uses representation learning to improve upscaling of terrestrial carbon flux measurements for climate modeling.
Layer-wise representational analysis comparing diffusion language models and autoregressive LLMs, examining layer-skipping capabilities.
Novel techniques (OAS, MBS) improve MXFP4 quantization accuracy for efficient LLM inference, addressing gaps versus NVIDIA's NVFP4.
KernelCraft benchmarks agentic LLM systems for generating low-level kernels for novel AI accelerator instruction set architectures.
ALADIN framework for design-space analysis of mixed-precision quantized neural networks on resource-constrained embedded AI accelerators.
Review of ultra-low-power edge AI processors including SoCs, neural accelerators, and in-sensor architectures for embedded inference.
Research on dataflow-based CNN accelerators on FPGAs addressing data-rate inefficiencies in layers with reduced output dimensions.
Ablation study integrating Leaky Integrate-and-Fire neurons, contrastive learning, and Hopfield networks in spiking neural networks for neuromorphic vision.
Hebbian-Oscillatory Co-Learning (HOC-L) combines structural plasticity and phase synchronization in sparse neural architectures using hyperbolic geometry and Hebbian learning.
Auralink SDC deploys autonomous edge AI agents for electric vehicle charging infrastructure management, achieving autonomous operation with edge computing latency requirements.
Sensitivity-guided compression framework for reservoir computing enabling design-space exploration of quantization, pruning, and hardware efficiency trade-offs.
AetherFloat family proposes block-scale-free quad-radix floating-point architectures reducing silicon area and power overhead in AI accelerators.
Permutation-equivariant 2D state space models for multivariate time series, formalizing permutation symmetry principle for exchangeable variables.
Formal analysis proving that no verification procedure can simultaneously satisfy soundness, completeness, and decidability for AI alignment certification.
Midicoth introduces micro-diffusion denoising for lossless compression to improve probability estimates in adaptive statistical models like PPM.
MASEval extends multi-agent evaluation beyond model-centric benchmarks to evaluate LLM-based agentic system components including topology, orchestration, and error handling.
APPLV automates parameter tuning for autonomous navigation by learning from vision-language-action models, balancing safety assurances with learning flexibility.
Measurement campaign evaluating throughput predictions in private 5G networks, challenging channel-centric models for communication-aware robot planning.
FedLECC proposes cluster and loss-guided client selection for federated learning under non-IID data, improving convergence in distributed AI systems.
Vision-language models encode clinical guidelines for interpretable medical reasoning in Concept Bottleneck Models, enabling transparent AI in medical imaging.
Deep learning framework using digital network twin for optimizing reinforcement learning training in multi-fidelity 5G networks with antenna tilt adjustment.
Guardian system combines reinforcement learning with LLM-based quality assurance to create spatiotemporal risk surfaces for missing-child search planning from unstructured case documents.
BiCLIP adapts vision-language models to specialized domains via structured geometric transformation, extending canonical transformation theory to domain adaptation.
Kernel debiased plug-in estimator (ULFS-KDPE) for nonparametric statistical models achieving semiparametric efficiency without explicit influence function derivation.
Research on using machine learning for statistical inference with scientific simulators, focusing on hypothesis testing and model refinement.
Guardian system uses multi-LLM pipeline for intelligent information extraction in missing-person investigations, coordinating end-to-end execution across tasks.
Survey introducing reinforcement learning methods to economists, addressing curse of dimensionality in complex economic models.
Reinterprets generative AI through statistical lens using flow matching, connecting generative models to causal inference and interpretability.
LLM serving system for mobile devices with hardware-based isolation using ARM TrustZone to protect model weights and user data from kernel attacks.
Autonomous AI agent for clinical triage in remote patient monitoring, using 21 medical tools to process vitals data 24/7 without physician bottleneck.
Data curation method for robot learning using influence functions to select high-quality demonstrations from noisy human teleoperation data.
Online reliability prediction for satellite electronics using degradation models and active learning under limited data.
Theoretical work on observers in hypergraph physics frameworks applying Conant-Ashby Good Regulator Theorem.
Taxonomy and evaluation framework for latent world models and vision-language-action systems in autonomous driving.
Reinforcement learning approach for dense image captioning using rubric-guided optimization to improve diversity and generalization.
Replication study of Band-Split RNN for music source separation, documenting reproducibility challenges and training complexity.
Examination of logical reasoning as mechanistic pathway to situational awareness in advanced AI systems, exploring emergent capabilities risks.
Study of emotion as latent representational factor in LLM reasoning and text processing, beyond sentiment classification tasks.
Vision-language models that self-evolve from zero data without seed images, extending self-improvement paradigms from LLMs to multimodal systems.
Generative sampling framework for complex distributions with discrete parameters, avoiding gradient computation issues.
Theoretical analysis of Thompson sampling for Bayesian optimization, comparing regret bounds with GP-UCB methods.
Medical image segmentation framework handling missing modalities through consistency learning between expert models.
Physics-informed neural networks for flow field reconstruction with optimized sensor placement, addressing sparse measurement challenges in fluid dynamics.
Research on convolutional neural network backbones for diffusion models, exploring parameter efficiency and hardware friendliness alternatives to Transformers.