A Deep Reinforcement Learning Framework for Closed-loop Guidance of Fish Schools via Virtual Agents
Deep reinforcement learning framework using PPO to train virtual agents for guiding fish school collective motion.
Deep reinforcement learning framework using PPO to train virtual agents for guiding fish school collective motion.
Differentiable power-flow optimization using neural networks to replace Newton-Raphson methods for scalable grid simulation.
Domain adaptation method for quantum machine learning using classical shadows to handle distribution shift in quantum data.
Multi-view learning approach with prototypes for thyroid ultrasound classification with improved robustness across devices.
Multi-agent pipeline for literature analysis using Deleuzian ontology to identify non-linear patterns in research landscapes.
Framework for evolutionary GPU kernel optimization using evaluation-driven agent and evolutionary techniques for operator generation.
Ensemble learning system combining deep learning and traditional classifiers for brain tumor MRI classification.
Analysis of prompt framing artifacts in vision-language model evaluation on clinical neuroimaging tasks.
Survey of network performance modeling approaches comparing traditional simulation with deep learning methods.
Novel geometric optimization framework using affine normal descent with invariance under volume-preserving transformations.
Diffusion distillation approach using reinforcement learning to improve student model performance beyond teacher anchoring.
Study of coordinated user manipulation tactics in recommendation systems through collective rating and review mechanisms.
Analysis of instructional alignment in cybersecurity exercises using multimodal data and Bloom's taxonomy coding.
Medical AI Scientist system that autonomously generates hypotheses, conducts experiments, and writes manuscripts in clinical medicine.
Theoretical analysis of approximation constraints in narrow ResNets, proving inability to represent critical points.
Automated pipeline using diffusion models to detect and anonymize sensitive regions in images while preserving utility.
Genetic programming pipeline for automatically evolving interpretable composite features for music tagging tasks.
Study of demographic bias and fairness issues in facial recognition systems used by law enforcement.
Method for improving diversity in text-to-image diffusion transformers through contextual space repulsion to address typicality bias.
Paper examining financial insurance mechanisms for AI system failures and algorithmic liability in high-stakes domains.
Study on few-shot learning and RNNs applying asymptotic equipartition property from information theory to machine learning.
Theoretical analysis of inexact Langevin algorithm convergence for score-based generative models with KL divergence guarantees.
Survey on continual graph learning covering incremental learning from streaming graph data with experience and generative replay approaches.
Mathematical analysis of auto-differentiation reliability in neural-ODE training with high-order numerical methods.
Novel prior learning method for neural networks using structured posteriors to improve generalization and uncertainty estimation.
Proves asymptotic optimality of new restless bandit policies with O(1/√N) gap under unichain and aperiodicity conditions.
Theoretical analysis of sample complexity for model-based Q-learning, establishing finite-time convergence bounds for model-learning algorithms.
Paper proposing Explaining-Away Variational Autoencoders to improve uncertainty representations in deep generative models for visual inference tasks.
Survey of multimodal continual learning methods that enable models to learn from new data across multiple modalities while retaining previous knowledge without catastrophic forgetting.
Transformers learn variable-order Markov chains in-context with finite-sample accuracy analysis using context-tree weighting.
Analysis of Sharpness-Aware Minimization robustness to label noise through gradient down-weighting at element-wise level.
Scalable neural network verification using branch-and-bound with inferred cutting planes instead of external MIP solvers.
Wavelet subspace compression for optimizer states reduces memory during LLM training, improving upon low-rank approaches.
Dataset distillation for credit scoring models addressing class imbalance in pretrained models on tabular financial data.
Binned spectral power loss function for improved deep learning predictions of chaotic multiscale dynamical systems.
Multimodal drug-aware diffusion model for ECG generation in virtual clinical trials with demographic disentanglement.
Steering vectors applied to LLM activations for bias mitigation across social dimensions like age, gender, and race.
Large graph dataset and measurement framework for evaluating long-range interactions in graph representation learning.
Methods to measure faithfulness of concept-based explanations in deep vision models using surrogate models.
Training-free audio-visual segmentation using foundational models for open-vocabulary pixel-level mask prediction.
Survey of LLM integration with Computer-Aided Design tools, covering applications in 3D modeling and design workflows.
Birch SGD framework represents distributed SGD methods as computation trees to unify analysis and design of optimization algorithms.
Deep latent variable models for vertical federated learning with flexible alignment and labeling across feature-partitioned data.
Foundation models for time-series prediction often use simple parroting strategies rather than learning physics, revealing shared failure modes.
FlowPure uses continuous normalizing flows for adversarial purification to remove perturbations from ML model inputs at inference time.
Structured Agent Distillation compresses large LLM-based ReAct agents into smaller models while preserving reasoning and action consistency.
CoDec kernel optimizes LLM decoding by sharing prefix computation across multiple prompts to reduce memory-intensive KV cache access.
Decentralized multi-player multi-armed bandits problem with unknown arm capacities and no collision sensing.
Physics-informed neural networks compute 3D magnetohydrodynamic equilibria by parametrizing Fourier modes and minimizing force residuals.
User-centric evaluation metrics for counterfactual explanations in ML models, focusing on actionability and end-user preferences.