Out of Distribution Detection for Efficient Continual Learning in Quality Prediction for Arc Welding
Out-of-distribution detection for continual learning in arc welding quality prediction using VQ-VAE Transformer architecture.
Out-of-distribution detection for continual learning in arc welding quality prediction using VQ-VAE Transformer architecture.
Morephy-Net uses multi-objective evolutionary optimization for physics-informed neural operators on parametric PDEs in noisy regimes.
Analyzes signal propagation and weight randomness in physics-informed neural networks using spectral/statistical properties.
GenFacts generates valid counterfactual explanations for multivariate time series using class-discriminative VAE.
Learns admissible heuristics for A* search algorithms using constrained optimization to guarantee solution optimality.
Flock: knowledge graph foundation model using random walk learning for zero-shot link prediction on novel entities and relations.
TabImpute: zero-shot universal imputation for tabular data with missing values using language model approach.
General exploratory bonus method for optimistic exploration in RLHF that avoids bias toward reference model high-probability regions.
Transformer models learn permuted congruential generator sequences via in-context prediction with curriculum learning and interpretability analysis.
Terminal Velocity Matching generalizes flow matching for one/few-step generative modeling with Wasserstein distance bounds.
Applies consistency models to error correction codes for one-step neural decoding in low-latency communication settings.
BEP algorithm trains binary neural networks with constrained weights/activations via error propagation for resource-constrained deployment.
Improves VAE and autoencoder training using random Fourier transformation with frequency principle analysis for aviation safety anomaly detection.
ARGUS detects distributional drift in high-dimensional data streams using local statistics over fixed spatial partitions of data manifold.
Stratified hazard sampling reduces variance in discrete diffusion/flow models by optimizing event scheduling in CTMC/DTMC processes.
PROMA: reference-free proximal policy method for LLM training that controls KL divergence via gradient projection without reference model.
GenDA reconstructs high-resolution urban wind fields from sparse sensor data using graph-based diffusion and classifier-free guidance.
OPO: theoretical framework for LLM alignment using constrained proximal policy optimization with work-dissipation principle and chi-square geometry.
Instant Retrospect Action algorithm improves policy exploitation in online RL through Q-network representation learning.
Green-NAS multi-objective neural architecture search optimizes weather forecasting models for efficiency and carbon footprint.
MinPV Principle minimizes path variance in score-based models to improve accuracy and stability.
Analyzes role of iterative computation in RL, showing policies benefit from additional compute beyond fixed parameters.
Algorithm achieving simultaneous optimal static and dynamic regret in adversarial multi-armed bandits.
Horizon Imagination improves efficiency of diffusion-based world models for RL by denoising multiple future observations.
Systematic evaluation of chemical language model scaling on molecular property prediction downstream tasks.
Recovery-based shielding framework integrates Gaussian process models with RL for provably safe control in continuous systems.
Online GPU energy optimization using bandit algorithms to reduce power consumption in HPC systems.
Extends linear bandits theory beyond inner product spaces using optimal transport for recommendation and clinical systems.
Evaluates multimodal LLMs and vision-language models for time series anomaly detection in systems monitoring.
Mathematical analysis of curved Bregman divergences and their applications in statistical learning.
MINT framework aligns LLMs with biomedical knowledge using preference optimization on multimodal data.
MMS-VPR introduces a large-scale multimodal dataset for street-level visual place recognition in pedestrian environments.
Cadrille uses reinforcement learning for multi-modal CAD reconstruction from point clouds, images, and text inputs.
TQml Simulator benchmarks numerical simulation techniques for quantum machine learning circuits to accelerate research.
AI agent framework integrating LLMs with Lean formal proof assistant for automated theorem proving.
Gaussian splatting technique with external memory for large-scale 3D reconstruction rendering.
Discrete diffusion method for audio inpainting using pre-trained tokenized representations.
Reinforcement learning approach for quantum circuit synthesis optimization using Q-learning.
Reinforcement learning framework for dynamic treatment regime estimation from censored survival data.
Explainable AI methodology for cough sound spectral analysis to characterize respiratory diseases.
Scaling long chain-of-thought reasoning in LLMs using NP-hard graph problems for cost-effective training.
Autonomous AI agents for physics data analysis using machine learning in particle physics research.
Quantum machine learning method for learning phase states via agnostic boosting.
RLHF approaches for improving LLM-based UI generation using designer feedback and rationale.
Vision-language model enhancement framework for improved disaster assessment image descriptions.
Flow matching trajectory planning for autonomous driving with data balancing strategies.
Safety vulnerability analysis of diffusion language models and mitigation strategies for jailbreak attacks.
Method for scaling parallel LLM inference by enabling interdependent token generation across multiple responses.
Efficient linear transformation method for aligning text embedding spaces without parallel data.
Convergence analysis of regret matching in zero-sum games bridging theoretical and practical game-solving methods.