Explainability for Fault Detection System in Chemical Processes
Comparison of SHAP and Integrated Gradients for explaining LSTM fault detection in chemical processes.
Comparison of SHAP and Integrated Gradients for explaining LSTM fault detection in chemical processes.
Physics-informed neural networks for optical inversion and spectral unmixing in photoacoustic imaging.
Theoretical analysis of reward-free and reward-agnostic exploration in MDPs with improved regret bounds.
Unified benchmarking suite for evaluating machine unlearning algorithms with KLoM metric and precomputed model ensembles.
Novel bias-correction method using Weierstrass transform for data released under local differential privacy in binary classification.
Studies bias spillover in LLM fairness alignment, showing how single-attribute debiasing can worsen disparities in other dimensions.
FPGA implementation of event-graph neural networks for neuromorphic audio processing with low latency and energy efficiency.
GICDM corrects hubness bias in high-dimensional embedding spaces for more reliable generative model evaluation metrics.
LoRSum improves LoRA fine-tuning efficiency using proximal subspace iteration with diagonal K-FAC, reducing memory overhead.
Proposes HPMixer for multivariate time series forecasting using hierarchical patching to model periodic patterns and residuals.
Develops verification techniques for C-RASP language capturing transformer-expressible concepts using model checkers and SMT-solvers.
Addresses scalability of analytical diffusion models by reducing dataset scanning overhead at each timestep in generative modeling.
Proposes conditionally additive local models balancing interpretability of GAMs with accuracy by adding conditional feature interactions.
Examines deep learning approaches for identifying small molecules from tandem mass spectrometry using molecular fingerprint prediction.
Compares reinforcement learning training regimes for parameterized quantum state preparation with continuous rotations.
Applies deep reinforcement learning to capacity-constrained demand response for smart grid management and congestion prevention.
Introduces FEKAN, feature-enriched Kolmogorov-Arnold networks with improved scalability and convergence over existing KAN variants.
Studies transfer learning for linear regression using multiple overparameterized pretrained models with debiasing.
Analyzes vulnerabilities of safe reinforcement learning via inverse constrained RL without requiring policy gradient access.
Proposes RIDER, reinforcement learning-guided diffusion approach for inverse design of 3D RNA structures.
Investigates barren plateau phenomenon in variational quantum circuits with parameter sharing for quantum machine learning.
Introduces MetaDOAR meta-controller augmenting double oracle paradigm with filtering and caching for multi-agent RL on cyber networks.
Provides fine-grained analysis of steering diffusion models with quadratic rewards at inference time for downstream tasks.
Develops gradient boosting models for urban demand forecasting in intelligent transportation systems.
Proposes LSTM model pretrained on ERA5-Land reanalysis and finetuned on IFS for global streamflow forecasting.
Develops sequential membership inference attacks exploiting model dynamics across multiple updates for privacy auditing.
Applies machine learning to predict cop number in pursuit evasion graph game, addressing computational complexity.
Systematically evaluates tokenization strategies for foundation models applied to MEG neuroimaging time series data.
Proves almost sure convergence of differential temporal difference learning for average reward reinforcement learning.
Analyzes role of optimizer choice in emergence of neural collapse patterns during deep neural network training.
Applies factorization machines with optimization annealing to RNA inverse folding problem for identifying nucleotide sequences.
Applies retrieval-augmented foundation models to matched molecular pair transformations for drug discovery and medicinal chemistry analog generation.
Demonstrates privacy risks in machine unlearning: reconstruction attacks show remaining data can be exposed when perfect retraining is pursued.
Argues causal inference is necessary for valid interpretability claims in LLMs, critiquing non-generalizable findings and unsupported causal interpretations.
Proposes latent space methods for analyzing high-dimensional discrete data in EHR applications with imbalanced matrix dimensions.
Studies geometric constraints on personality trait steering in LLMs (LLaMA, Mistral), examining whether Big Five traits can be independently controlled via steering vectors.
Clinical NLP safety analysis addressing temporal leakage in discharge planning models and deployment risk mitigation.
MARVL uses vision-language models for multi-stage reward design in robotic manipulation RL, addressing spatial grounding and task semantics.
P-RAG combines parametric and retrieval-augmented generation with LoRA and selective chain-of-thought for biomedical QA.
Event-driven neuromorphic system for energy-efficient EEG-based sleep staging on edge devices.
Neural operator surrogates (DeepONet, FNO) for accelerating neutron transport computations in physics simulations.
Self-play multi-agent reinforcement learning for autonomous driving policy adaptation to new cities without human demonstrations.
Quality-constrained entropy maximization framework for LLM fine-tuning balancing output diversity and alignment quality.
Non-reversible diffusion process acceleration by breaking detailed balance in generative models while preserving stationary distribution.
Leverage score methodology for assessing privacy vulnerability and membership inference attack risk without model retraining.
Graph learning approach for Gaussian graphical models incorporating node textual metadata via Laplacian constraints.
DreamZero world action model using video diffusion for zero-shot robotic control by learning physical dynamics and generalizing to unseen environments.
Differentiable framework integrating ML models into density functional theory for exchange-correlation functionals in periodic systems.
Survey of real-time object detection using deep learning across applications like surveillance, AR/VR, and traffic monitoring.
Security analysis of vision-language models in multi-turn conversations, exploring injection attacks via manipulated images.