Mode Seeking meets Mean Seeking for Fast Long Video Generation
Proposes training paradigm decoupling local fidelity from long-term coherence for scaling video generation from seconds to minutes.
Proposes training paradigm decoupling local fidelity from long-term coherence for scaling video generation from seconds to minutes.
TimeMAE: self-supervised framework for learning transferable time series representations using decoupled masked autoencoders.
Interprets consensus-based optimization as stochastic relaxation of gradient descent, providing theoretical understanding of derivative-free methods.
Analyzes geometric structure of shallow ReLU networks and constructs upper bounds for L2 cost minimization without gradient descent.
Framework for distributional reinforcement learning with online risk adaptation for safer decision-making in uncertain environments.
Proposes dispatcher/executor principle for multi-task reinforcement learning that abstracts unnecessary details for better generalization.
Addresses long-tail recognition with unknown test distributions using hierarchical label variation decomposition.
Introduces COLA framework for generating sparse counterfactual explanations using optimal transport and Shapley-based attribution methods.
Analyzes matrix sketching for linear bandits to reduce computational complexity in high-dimensional online learning problems.
Proposes label unlearning method for vertical federated learning using representation-level manifold mixup. Addresses privacy in distributed ML.
Super-resolution recurrent diffusion model for renewable energy generation under climate change impacts.
Single-sequence uncertainty estimation method for LLMs addressing computational expense of multi-sequence approaches.
Sample complexity analysis for online reinforcement learning in nonlinear continuous state/action spaces.
Mutual information estimation method using diffusion bridge models for improved domain transfer problems.
Novel semantic parallelism approach for efficient MoE model inference via co-scheduling of model and data placement.
Probabilistic neural networks using t-distribution outputs for improved prediction intervals beyond Gaussian assumptions.
Optimization perspective on reward model quality for RLHF, analyzing factors beyond accuracy that make effective teachers.
Domain decomposition approach for neural operators solving PDEs with improved geometry generalization capabilities.
Research on enforcing token sparsity in multimodal LLMs to reduce computational overhead while maintaining accuracy.
Novel attention mechanism and pointer network for parcel pickup route prediction in logistics optimization.
Theoretical research on manifold learning with normalizing flows for Riemannian geometry in high-dimensional data.
Research on feature selection using permutation-invariant embeddings and policy-guided search with generative models.
Lightweight prediction model for LLM-based agentic workflow performance across agent configurations and prompting strategies.
Framework connecting inverse optimization and inverse reinforcement learning for apprenticeship learning with prior cost function beliefs.
Analysis of Lipschitz continuity properties for neural networks operating on set-valued data.
Method bridging target-free and target-based reinforcement learning to reduce memory overhead while maintaining learning stability.
Framework converting multimodal LLM generative capabilities into zero-shot discriminative embedding models without extensive pre-training.
Model merging technique using task vector distillation to improve robustness of multi-task learning across diverse settings.
Physics-informed DeepONet operator learning combining full rollout and autoregressive approaches for PDE inference.
Theoretically motivated improvement to supervised fine-tuning for LLMs by rectifying reward structure to match RL generalization.
Offline multi-agent reinforcement learning using efficient flow-based policies for time-sensitive deployment.
Framework of strategies for improving LLM-based forecasting by integrating historical data and textual context with reduced computational cost.
Federated learning approach for nonlinear system identification with theoretical convergence guarantees.
Ensemble framework for conditional independence testing to improve computational efficiency in constraint-based causal discovery.
Investigation of in-context learning in world models for embodied AI to adapt to novel environmental configurations.
Study of how activation function design affects plasticity in continual learning to prevent loss of model adaptation.
Interpretable time series forecasting method using hierarchical prototypes to explain model decision-making.
Deep learning approach for reduced order modeling via parameter-dependent linear subspace regression.
Foundation inference model using in-context learning to predict marked temporal point process event sequences across different systems.
Process Reward Models that capture step-by-step reasoning dependencies in LLMs to improve reasoning alignment with final outcomes.
Benchmark dataset of 50 condensed matter theory problems for evaluating LLMs on advanced research-level physics problem-solving.
Permutation-invariant representation learning for privacy-preserving feature selection using generative intelligence.
Carré du champ flow matching: geometry-aware regularization technique for generative models improving quality-generalization tradeoff.
Hybrid tensor-EM method for learning mixtures of linear dynamical systems with improved performance on noisy time-series data.
Evaluation of zero-shot super-resolution capabilities in machine-learned operators for modeling continuous physical phenomena.
ToSFiT: Thompson sampling via LLM fine-tuning for Bayesian optimization in large discrete spaces without acquisition function maximization.
Quaternion-valued Hopfield neural network with supervised learning rules for representing rotations in neural computation.
Variational inference approach for data assimilation combining dynamical models with noisy observations using Gaussian distributions.
Differential privacy framework for decentralized learning using matrix factorization to enable collaborative training while preserving privacy.
FAPO method for LLM reasoning via reinforcement learning that filters flawed positive rollouts to improve policy optimization with verifiable rewards.