Learning-based agricultural management in partially observable environments subject to climate variability
Provable Emergence of Deep Neural Collapse and Low-Rank Bias in $L^2$-Regularized Nonlinear Networks
Goal-Conditioned Reinforcement Learning from Sub-Optimal Data on Metric Spaces
Kernel-based Optimally Weighted Conformal Time-Series Prediction
IGC-Net for conditional average potential outcome estimation over time
Proficient Graph Neural Network Design by Accumulating Knowledge on Large Language Models
Measuring Orthogonality as the Blind-Spot of Uncertainty Disentanglement
Enhancing Inverse Reinforcement Learning through Encoding Dynamic Information in Reward Shaping
Towards Privacy-Guaranteed Label Unlearning in Vertical Federated Learning: Few-Shot Forgetting without Disclosure
Hypercube Policy Regularization Framework for Offline Reinforcement Learning
HypeRL: Hypernetwork-Based Reinforcement Learning for Control of Parametrized Dynamical Systems
Rethinking Approximate Gaussian Inference in Classification
Iterative Importance Fine-tuning of Diffusion Models
From Belief Entrenchment to Robust Reasoning in LLM Agents
ItDPDM: Information-Theoretic Discrete Poisson Diffusion Model
Spatiotemporal Field Generation Based on Hybrid Mamba-Transformer with Physics-informed Fine-tuning
Directional Convergence, Benign Overfitting of Gradient Descent in leaky ReLU two-layer Neural Networks
On Transferring Transferability: Towards a Theory for Size Generalization
Can LLMs Reason Structurally? Benchmarking via the Lens of Data Structures
Belief-Based Offline Reinforcement Learning for Delay-Robust Policy Optimization
Adapt before Continual Learning
Tailored Behavior-Change Messaging for Physical Activity: Integrating Contextual Bandits and Large Language Models
Uni-DPO: A Unified Paradigm for Dynamic Preference Optimization of LLMs
Little By Little: Continual Learning via Incremental Mixture of Rank-1 Associative Memory Experts
Uncertainty-driven Embedding Convolution
Deep Network Trainability via Persistent Subspace Orthogonality
Conformal Unlearning: A New Paradigm for Unlearning in Conformal Predictors
Shuffle-R1: Efficient RL framework for Multimodal Large Language Models via Data-centric Dynamic Shuffle
MOTGNN: Interpretable Graph Neural Networks for Multi-Omics Disease Classification
Semantic-Enhanced Time-Series Forecasting via Large Language Models
Energy Injection Identification enabled Disaggregation with Deep Multi-Task Learning
Dimensional Collapse in Transformer Attention Outputs: A Challenge for Sparse Dictionary Learning
The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
ButterflyQuant: Ultra-low-bit LLM Quantization through Learnable Orthogonal Butterfly Transforms
A Law of Data Reconstruction for Random Features (and Beyond)
Beyond Aggregation: Guiding Clients in Heterogeneous Federated Learning
Understanding Language Prior of LVLMs by Contrasting Chain-of-Embedding
Discrete Variational Autoencoding via Policy Search
GLASS Flows: Transition Sampling for Alignment of Flow and Diffusion Models
ACT: Agentic Classification Tree
Lipschitz Bandits with Stochastic Delayed Feedback
Improving Reasoning for Diffusion Language Models via Group Diffusion Policy Optimization
LLM Priors for ERM over Programs
Provably Optimal Reinforcement Learning under Safety Filtering
Position: Many generalization measures for deep learning are fragile
Overlap-weighted orthogonal meta-learner for treatment effect estimation over time
SeeDNorm: Self-Rescaled Dynamic Normalization
Improving Long-Range Interactions in Graph Neural Simulators via Hamiltonian Dynamics
Finding Kissing Numbers with Game-theoretic Reinforcement Learning
KernelBand: Steering LLM-based Kernel Optimization via Hardware-Aware Multi-Armed Bandits