Sven: Singular Value Descent as a Computationally Efficient Natural Gradient Method
Sven optimization algorithm exploiting natural loss decomposition using Moore-Penrose pseudoinverse for efficient neural network training.
Sven optimization algorithm exploiting natural loss decomposition using Moore-Penrose pseudoinverse for efficient neural network training.
Framework training LLMs to forecast supply chain disruptions using calibrated probabilistic forecasts from disruption outcomes.
UQ-SHRED adds uncertainty quantification to shallow recurrent decoder networks for sparse spatiotemporal reconstruction.
Online machine learning framework for multi-resolution energy system design optimization and performance analysis.
JetPrism diagnoses convergence issues in Conditional Flow Matching for physics simulations and inverse problems.
Distributed graph modeling approach for detecting money laundering transaction patterns at scale.
Tutorial on Bayesian Optimization as a principled framework for automating scientific discovery using surrogate models.
Principled layer-wise optimization approach for model merging via data-free covariance estimation without task-specific training.
SECURE framework addressing robustness issues in deep learning models for autonomous driving collision prediction.
GPU-accelerated inference algorithm for multivariate Hawkes processes achieving O(N) complexity with parallelization.
Novel Langevin-based algorithm for adaptive inverse reinforcement learning using Malliavin calculus for gradient estimation.
PI-JEPA: Physics-informed surrogate model for multiphysics simulation exploiting unlabeled parameter fields via latent prediction.
Residuals-based offline reinforcement learning approach for high-stakes applications with restrictive data coverage assumptions.
Benchmark datasets and evaluation protocols for machine learning methods on photoplethysmography medical signals.
Train-to-Test scaling laws optimizing model size, training tokens, and inference samples jointly for compute-optimal LLM deployment.
Study of reward hacking in LLM RL showing reproducible failure patterns and mitigation strategies using representation-level signals.
Hierarchical RL framework for privacy-preserving synthetic clinical data generation combining LLMs with structured learning.
CuTeGen: LLM-based agentic framework for automated generation and optimization of high-performance GPU kernels using CuTe abstraction.
Comparative study of Evolution Strategies vs GRPO for LLM post-training showing ES achieves comparable accuracy with different parameter geometry.
Residual decomposition framework for improving classifier performance on long-tailed datasets beyond standard logit adjustment.
Self-supervised framework for learning clinical ECG image representations without access to raw signal recordings.
ZEUS: Training-free acceleration method for diffusion models using second-order predictors to reduce sampling steps.
Care-Conditioned Neuromodulation framework for LLM-based dialogue agents that balances helpfulness with user autonomy preservation.
EEG seizure detection method using graph neural networks with self-supervised learning and information bottleneck principles.
Influence-Guided PPO framework for LLM post-training that filters noisy rollouts using data attribution to improve training efficiency.
Research on training LLMs to develop both in-context and in-weights learning capabilities simultaneously via contrastive context sampling.
Novel reinforcement learning algorithm addressing noisy temporal difference errors in deep RL through pseudo-quantization methods.
arXiv paper on expert-choice routing for diffusion language models. Deterministic load balancing improves throughput and convergence vs token-choice.
arXiv paper on CRIT, graph-based automatic data synthesis for cross-modal multi-hop reasoning. Generates complementary image-text data.
arXiv paper on label shift estimation with incremental prior updates. Addresses distribution mismatch between training and deployment.
arXiv paper on coupled query-key dynamics for scaled dot-product attention. Improves language modeling perplexity by 6-7% on WikiText-103.
arXiv paper introducing MiCA, parameter-efficient LLM fine-tuning method adapting minor singular vector subspaces. Outperforms LoRA on knowledge retention.
arXiv paper on transformer encoder-decoder with multimodal learning for wind structural health monitoring and digital twins.
arXiv paper on MATA-Former for ICU risk prediction using semantic-aware temporal alignment. Clinical-logic-aligned transformer architecture.
arXiv paper applying Koopman operator methods for multivariable control of turbofan engines. Meta-heuristic extended dynamic mode decomposition.
arXiv paper on DDCL, differentiable end-to-end framework for unsupervised prototype-based representation learning. Integrates feature learning with clustering.
arXiv paper proposing FourierMoE for parameter-efficient LLM fine-tuning in multi-task settings. Mixture-of-experts approach addressing task interference.
arXiv paper integrating deep learning forecasting with integer linear programming for supply chain analytics. Three-stage framework on 180K transactions.
arXiv paper on graph neural operators for real-time virtual sensing on irregular grids with edge deployment. Addresses sparse-to-dense field inference.
arXiv paper combining reinforcement learning with physics-informed priors (Gibbs) for power grid topology control. Semi-Markov framework.
arXiv paper on test-time adaptation for multivariate time-series anomaly detection under distribution shift. Curated adaptation framework (CANDI).
arXiv paper introducing diffusion-based posterior sampling for uncertainty quantification in industrial data-driven models. Addresses safety-critical applications.
arXiv paper on spectral graph contrastive learning addressing high-frequency signal variance in heterophilic graphs. Theoretical analysis with regret bounds.
Self-organising transformer with hierarchical prototype structure that automatically determines optimal architecture dimensions (heads, depth, width) during training.
Layer-wise Interactive Dual-Stream Network architecture for EEG decoding in brain-computer interfaces with improved temporal-spatial feature fusion.
Expert-guided uncertainty modeling for medical AI systems to improve reliability and enable human experts to prioritize high-risk diagnostic cases.
Theoretical analysis of plasticity loss in deep reinforcement learning due to non-stationarity, proposing sample weight decay mitigation technique.
PAC-Bayesian framework for outcome weighted learning that incorporates reward uncertainty into policy selection with finite-sample guarantees.
annbatch: Mini-batch loader for terabyte-scale biological data training in anndata format, addressing memory bottlenecks in ML pipelines for bioinformatics.
LSCP: Self-gated post-training framework for autonomous knowledge acquisition using self-generated Q&A chains and adaptive learning rates based on model conviction.