Multi-dimensional Persistent Sheaf Laplacians for Image Analysis
Multi-dimensional persistent sheaf Laplacian framework for image analysis addressing dimensionality reduction sensitivity.
Multi-dimensional persistent sheaf Laplacian framework for image analysis addressing dimensionality reduction sensitivity.
Theoretical analysis of temperature scaling properties for classifier calibration and LLM stochasticity control.
Drift-diffusion matching framework embedding asymmetric RNN dynamics in latent manifolds for biological neural computation.
Adjoint-based ship hull optimization using ML surrogate models and CVAEs for complex propulsion systems.
GAPA: post-hoc Gaussian Process method for uncertainty quantification in pretrained networks via activation-space modeling.
Autoencoder-based fault detection in electrical power distribution systems using probabilistic decision methods.
Physics-informed neural networks with gradient networks for dynamic modeling of synchronous electrical machines.
ThermEval benchmark for evaluating vision-language model generalization on thermal imagery across surveillance and medical applications.
Quantum-enhanced Gaussian processes with distributed training for multi-agent systems using quantum kernel embeddings.
BPP method for robot imitation learning attending to relevant historical frames via key frame selection in long-context tasks.
Cold-start personalization using structured world models and RL for efficient user preference inference with limited interaction budget.
Neural network transfer learning from low-resolution LAMOST to medium-resolution DESI stellar spectra for parameter estimation.
Regression algorithm using subset stacking for heterogeneous data with local predictors trained on random input-space subsets.
Analysis of sample efficiency in offline policy selection for reinforcement learning via off-policy evaluation and Bellman error.
PIVID method for inferring Bayesian network structure from observational data using permutation-based variational inference.
Memory-efficient zeroth-order optimizer (MeZO) for LLM fine-tuning using only forward passes, with sparse parameter improvements.
Optimal design methods for efficient human preference elicitation to reduce annotation costs in preference learning models.
Gradient-based methods for solving data-driven inverse optimization problems with mixed integer linear programs.
Survey of federated learning combined with foundation models for privacy-preserving distributed training.
Multi-graph pretraining approach enabling graph transformers to learn transferable representations across diverse domains.
Benchmark for evaluating AI-based data assimilation methods using real-world observations for weather forecasting.
Lightweight multivariate time series forecasting model with 0.1K parameters for resource-constrained settings.
Posterior sampling algorithms for contextual bandits using diffusion model priors.
Privacy-preserving LLM customization service using differentially private synthesis for domain-specific fine-tuning.
Theoretical analysis showing denoising diffusion models adapt optimally to unknown low-dimensional data.
Consensus-driven framework improving robustness of time series causal discovery methods.
Dynamic batch-aware expert selection for efficient Mixture-of-Expert model inference during serving.
One-line modification to momentum-based optimizers improving stability and speed for transformer pretraining.
Bayesian flow networks for out-of-distribution molecular generation in drug design.
Bayesian framework for gradient sparsification in distributed training using error accumulation.
Technique to learn unbounded neural network layer width during training without manual hyperparameter selection.
Lightweight time series forecasting model using wavelet decomposition for resource-constrained environments.
Studies rotational equivariance in adaptive optimizers and proposes reparameterization for improved convergence.
Autoregressive framework for fast graph generation using topological data analysis concepts.
Block clustered quantization technique for 4-bit weight and activation LLM inference without quantization-aware training.
Introduces Fenchel-Young losses as generalizations of KL divergence for variational learning methods.
Novel data selection approach for Direct Preference Optimization to improve LLM alignment by addressing parameter shrinkage from noisy preference data.
Contextual quantum neural networks for stock price prediction using quantum superposition and new training techniques.
RMOD inference algorithm aligns LLMs to multiple objectives via robust maximin game formulation achieving Nash equilibrium.
Learning rate annealing schedules improve robustness to hyperparameter tuning in stochastic gradient optimization.
Knowledge distillation of GNN teachers into MLP students for graph link prediction using heuristic methods.
Riemannian Denoising Diffusion Probabilistic Models for learning distributions on submanifolds without requiring explicit geodesic information.
Sparse Latent Factor Forecaster with iterative inference for commodity futures prediction addressing amortized inference deployment gap.
Multi-layer hierarchical federated learning framework generalizing to arbitrary network architectures with nested aggregation.
Deep learning model for 8-hour probabilistic precipitation forecasting integrating radar, satellite, and NWP data.
Heterogeneity-aware client sampling for federated learning with diverse communication and computational capabilities.
Residual feature integration prevents negative transfer in transfer learning through simple architectural modifications.
MoESD applies speculative decoding to accelerate Mixture of Experts LLM inference without accuracy loss.
Theoretical analysis of connections between rectified flows, flow matching, and optimal transport with invariance properties.
Difficulty-targeted online data selection and rollout replay techniques improve data efficiency in LLM reinforcement learning fine-tuning.