Directional Textual Inversion for Personalized Text-to-Image Generation
Improves textual inversion for personalized text-to-image generation by addressing embedding norm inflation issues in CLIP.
Improves textual inversion for personalized text-to-image generation by addressing embedding norm inflation issues in CLIP.
Conditional diffusion framework for forecasting electromagnetic field levels in wireless networks.
Proposes affine divergence method to align activation updates during gradient descent optimization in neural networks.
Automates generation and resolution of diverse forecasting questions for evaluating AI systems' general intelligence capabilities.
Identifies gradient issues in RLVR methods like GRPO used for LLM reasoning tasks, proposing improvements to reward-based supervision.
Framework using influence functions to edit training data and induce targeted model behavior changes for data poisoning robustness evaluation.
Diffusion model distillation technique using branching ensemble networks with dense supervision for accelerated sampling.
Flow-matching generative model for predicting periodic molecular crystal structures handling large molecules and complex interactions.
Deep reinforcement learning approach for robust control under multiple uncertainties combining domain randomization and uncertainty learning.
Addresses factorization barrier in diffusion language models through architectural changes enabling efficient parallel token generation.
Transformer-based model (SpeedTransformer) for transportation mode detection from smartphone GPS trajectories outperforming LSTM.
Framework (DUEL) for computing exact likelihood in masked diffusion models by addressing test-time distribution mismatch in ELBO.
LLM-based agent (GOME) for machine learning engineering using gradient-based reasoning instead of tree search for efficient optimization.
Mechanistic interpretability study using architectural topology modifications to investigate grokking and delayed generalization in Transformers.
Study on eliciting truthful responses from censored LLMs and detecting deception using naturally-occurring dishonesty patterns.
Memory-efficient optimization method for training large language models with mask traversal and convergence guarantees for nonconvex settings.
Clustering algorithm for data summarization using Khatri-Rao product to reduce redundancies in centroid-based cluster prototypes.
Meta-optimizer that dynamically selects update rules during training with PyTorch compatibility, achieving 5.3x faster convergence.
Multi-objective protein sequence design balancing designability with competing properties like solubility and thermostability using preference alignment.
Reinforcement learning approach for robust policies under latent distribution shift in partially observable domains using adversarial training.
Generative models with tunable complexity for inverse problems, extending diffusion models and normalizing flows with adaptive dimensionality.
Decision-theory framework for designing probabilistic weather forecasts tailored to heterogeneous farmers' agricultural decision-making.
Federated learning framework addressing non-IID data distribution through personalization strategies for heterogeneous client data.
Benchmark for evaluating time-series forecasting foundation models with temporal generalization, addressing train-test contamination issues in static splits.
Deep learning method for learning coordinates and flow maps to enhance computational efficiency in multiscale dynamical systems.
DRUPI: dataset reduction method using privileged information beyond input-label pairs for dataset condensation tasks.
Differentiable optimization approach using control barrier functions to learn safe responsibility allocations in multi-agent autonomous systems.
Machine learning techniques for computing Calabi-Yau metrics and Ricci-flat approximations using gradient descent and algebraic ansatz methods.
Adaptive importance sampling and stratified subsampling estimators for robust high-dimensional sparse regression with heavy-tailed noise.
Source-free unsupervised domain adaptation framework for EEG-based applications addressing distribution shifts across subjects and sessions.
Prognostics system for autonomous deep-space habitat health monitoring and remaining useful life prediction under multiple failure modes.
MS-HGNN: morphological-symmetry-equivariant heterogeneous graph neural network for robotic dynamics learning with structural priors.
CuriousBot: mobile robotic exploration system using actionable 3D relational object graphs for interactive environment exploration.
Molecular fingerprints show competitive performance with complex neural networks for peptide function prediction.
Real2Sim2Real framework using likelihood-free inference for visual robotic manipulation of deformable linear objects.
LayerNorm tuning method using concept drift for efficient multimodal metaphor identification in internet memes.
UltraEdit: training-free model editing approach for lifelong knowledge updates in LLMs without retraining or subject-specific data.
CORA: cooperative game-theoretic credit assignment method for multi-agent reinforcement learning using core concepts for advantage allocation.
Regret-optimal Q-learning algorithms minimizing sample collection and policy switching costs in single-agent and federated reinforcement learning.
Analysis of user and topic interaction dynamics in online social networks and information cascade spreading.
Supervised contrastive learning approach for low-resource language identification to improve multilingual LLM pretraining corpus curation.
Theoretical analysis of Iteratively Reweighted Least Squares algorithm for robust subspace recovery with linear convergence guarantees.
Research on convergence rates for stochastic gradient descent and heavy ball methods under convex and non-convex objectives using discrete Gronwall's inequality.
Latent policy steering approach using embodiment-agnostic pretrained world models to leverage cross-embodiment datasets for robot learning.
AI-generated singing avatars presenting course syllabi to improve student engagement and information comprehension in education.
Monte Carlo sampling method for computing integrals on unit spheres with applications to sliced Wasserstein distance computation.
Robot Control Stack ecosystem for scalable vision-language-action model training and deployment, replacing traditional robotics frameworks.
Test-time composition method for enhancing diffusion-based robot control policies without additional training data or model fine-tuning.
Latent Speech-Text Transformer improving compute efficiency of speech-text models by reducing token sequence length through latent representations.
AlphaApollo agentic reasoning system addressing reasoning capacity and verification bottlenecks through multi-turn reasoning and trustworthy tool integration.