Principal Prototype Analysis on Manifold for Interpretable Reinforcement Learning
Principal Prototype Analysis on Manifold: interpretability method for reinforcement learning agents using prototype-based explanations.
Principal Prototype Analysis on Manifold: interpretability method for reinforcement learning agents using prototype-based explanations.
Generalization of diffusion models to correlated stochastic sampling using probabilistic computers beyond standard neural network implementations.
FedDES: graph-based dynamic ensemble selection for personalized federated learning that addresses negative transfer through selective peer integration.
Analysis of diffusion maps showing they provide spectral representation of geometry rather than dimensionality reduction, compared with Isomap and UMAP.
Framework for deterministic deep learning training in medical applications via structured orthogonal initialization to eliminate randomness sources.
ROVED: hybrid reinforcement learning framework combining vision-language embeddings with oracle feedback to reduce annotation costs for reward learning.
Physics-informed deep learning framework (PhysNet) for medical imaging that embeds tumor growth dynamics into feature learning for improved interpretability.
Generative AI framework for creating synthetic safety-critical scenarios in autonomous ship navigation testing using vessel trajectory data.
Koopman-based surrogate models for RL control of fluid dynamics with mitigation of distribution shifts.
InkDrop demonstrates backdoor attacks against dataset condensation methods through invisible trigger implantation.
Heddle is a distributed system for orchestrating agentic RL rollouts with LLMs to address trajectory generation bottlenecks.
Training methodology to verify and bound Lipschitz constants of neural networks for adversarial robustness and generalization.
GVF framework models health risk as vector fields on simplicial complexes from multimodal wearable and environmental data.
Federated learning approach for livestock growth prediction addressing privacy concerns and limited data availability.
ORACAL is a multimodal GNN framework for smart contract vulnerability detection with causal graphs and explainability.
Automated methodology for early disease prediction from structured and unstructured clinical data using NLP.
Global-regional coupling framework using Transformers for kilometer-scale regional weather forecasting.
PCGS framework for strictly online prediction under non-stationarity with Transformer instantiation for expert switching.
Perturbation-based approach for unconstrained bandit linear optimization with improved regret guarantees.
ERPO uses token-level entropy-regulated policy optimization to improve credit assignment in reinforcement learning for language models.
Variational neurons in Transformer feed-forward layers to incorporate uncertainty into internal computation for language modeling.
AI-driven anomaly detection system for smart bridge monitoring using infrastructure data.
MR-CDM framework for multi-resolution time series generation using hierarchical decomposition and diffusion models.
Study on robustness against data corruption in offline multi-agent reinforcement learning from human feedback.
Framework for estimating learning complexity and communication costs in federated learning systems before deployment.
FI-KAN introduces fractal interpolation function bases into Kolmogorov-Arnold Networks for improved multi-scale function approximation.
arXiv paper proposing optical in-network computing to reduce communication overhead in distributed machine learning systems.
arXiv paper introducing LIBERO-Para benchmark to evaluate robustness of Vision-Language-Action models to paraphrased instructions in robotic tasks.
arXiv paper proposing FedRCO, a second-order optimization framework for federated learning with improved stability under non-IID data.
arXiv paper addressing fairness issues in graph condensation, preventing amplification of demographic biases during dataset compression.
arXiv paper using physics-informed neural networks to predict hydrogen sorption in geological formations with thermodynamic constraints.
arXiv paper integrating learning-based optimization with classical statistical methods for efficient high-dimensional matrix estimation.
arXiv paper applying deep reinforcement learning to maritime coverage path planning on irregular hexagonal grids.
arXiv paper addressing label-efficient retraining of malware detection models under distribution drift in real-world settings.
arXiv paper on Bayesian framework for preference learning in many-objective optimization using mixture models of latent preference archetypes.
arXiv paper presenting evolutionary framework using LLMs to discover novel reinforcement learning algorithms by searching over executable update rules.
arXiv paper introducing KGroups, a feature selection algorithm for high-dimensional biological data using max-relevance min-redundancy criteria.
IsoQuant uses quaternion algebra and isoclinic rotations for efficient LLM KV cache compression with hardware-aligned blockwise operations.
FeDMRA addresses federated class-incremental learning with dynamic memory replay for non-IID distributed healthcare data.
HISA improves efficiency of token-level sparse attention mechanisms through hierarchical indexing, reducing O(L²) bottleneck.
Analysis of scaling laws in AI across model families, explaining their predictive power and universal effectiveness in training loss reduction.
Interpretable machine learning framework for detecting low left ventricular ejection fraction from ECG data using predictor-driven approach.
CirrusBench evaluates LLM-based agents in real-world cloud service environments beyond correctness, measuring robustness and efficiency.
Simplex denoising framework for discrete generative modeling using non-Markovian noising scheme, applied to graph generation.
Offline multi-agent reinforcement learning approach using Partial Action Replacement to handle exponential joint action space growth.
ChemCLIP uses contrastive learning to bridge organic and inorganic anticancer compound discovery by enabling knowledge transfer across chemical domains.
Physics-informed impact identification framework combining observational and inductive biases for aerospace composite damage detection.
Position paper arguing explainable AI fundamentally relies on causal inference rather than diverse disconnected XAI methods.
LACE mechanism for continual learning that adaptively expands model capacity during training based on loss signal monitoring.
Information-theoretic analysis of safety verification impossibility for self-improving systems balancing bounded risk with unbounded utility.