VecMol: Vector-Field Representations for 3D Molecule Generation
Novel 3D molecule generation framework using vector-field representations to address modality entanglement and geometry-chemistry constraints.
Novel 3D molecule generation framework using vector-field representations to address modality entanglement and geometry-chemistry constraints.
Watermarking technique for diffusion models using semantic-aware embedding injection to improve robustness against semantic editing attacks.
Self-supervised system for robots to detect and recognize novel objects from human video demonstrations without prompt engineering.
Efficient optimization technique addressing long-tail distribution problem in LLM-based sequential recommender systems.
Multimodal foundation model for Earth observation using temporal training objectives robust to variable-length satellite and sensor data.
Method for injecting auxiliary visual features into vision-language-action models to improve geometric understanding and temporal reasoning for robotic manipulation.
Model distillation approach compressing 2B vision-language retriever into 70M text-only encoder for efficient document retrieval.
AI-based framework transforming global weather forecasts into fine-grained wind field predictions and infrastructure failure probabilities for tropical cyclones.
Hierarchical deep learning framework for automated wear classification of abrasive flap wheels in manufacturing.
Review of surrogate models for parametric system modeling in optimization, control, and digital twin applications.
Machine learning approach for epileptic seizure forecasting from video using cross-species transfer learning instead of EEG signals.
RL method for post-training diffusion-based text-to-image models using paired trajectory sampling to improve image quality and prompt alignment.
Theoretical analysis of how diffusion models learn data statistics, showing they learn simple pairwise statistics before higher-order correlations.
Research on using vision-language models for detecting and localizing forged images, studying how VLM priors affect forgery detection performance.
Data-efficient MRI reconstruction strategy using diffusion probabilistic models with pre-training and fine-tuning.
Adaptable fraud detection system handling adversarial attacks in resource-constrained environments with multiple risk modules.
Framework (ARL-Tangram) optimizing resource efficiency in agentic RL by dynamically allocating external compute resources.
Surgical world model using controllable video generation for simulating surgical actions with precise tool-tissue control.
Dataset and baseline for real-time screw classification in industrial automation and robotic systems.
Diagnostic benchmark (ESPIRE) for evaluating vision-language models on embodied spatial reasoning tasks.
GNN approach for precoder learning in cell-free wireless systems accounting for dynamic user-access point associations.
Empirical study of federated few-shot learning on neuromorphic hardware using spike-timing-dependent plasticity.
Convergence analysis of functional learning methods for contextual stochastic optimization problems.
Research on interpretable multimodal concept bottleneck models ensuring faithful explanations through proper concept detection.
Provable multi-agent reinforcement learning in partially observable stochastic games leveraging information sharing among agents.
Introduces diffusion models as expressive variational posteriors for black-box inference in latent variable models.
Graph signal processing research extending sampling theory to graphon signals using limits of large graphs.
Research on offline reinforcement learning combining return-conditioned supervised learning with Q-functions to improve stitching capability and stability.
Introduces Walk Profile method and explores positional encodings for directed graphs in graph neural networks and graph transformers.
Develops DNN model using topological and electrostatic features to predict protein biophysics properties like Coulomb and solvation energies.
Explores polynomial attention alternatives to softmax in transformers, arguing regularization rather than probability distribution drives performance.
Open-source Geant4-based tool and data format for generating 3D radiation field datasets for dosimetry deep learning research.
Proposes first computationally efficient algorithm with optimal regret for infinite-horizon discounted reinforcement and imitation learning.
Advocates integrating causal methods into ML to balance trustworthiness objectives like fairness, privacy, robustness, and explainability.
Proposes Dual Filter framework connecting Hidden Markov Models to transformer decoder architecture for causal nonlinear prediction.
Applies unsupervised anomaly detection to ultrafast electron diffraction data to identify beam instabilities in materials science experiments.
Introduces Guided Policy Optimization framework for RL in partially observable environments using privileged information from simulators.
Analyzes oversmoothing problem in deep Graph Neural Networks and explores why networks fail to learn non-oversmoothed representations.
Proposes uncertainty estimation improvements to Residual Reinforcement Learning for faster adaptation of pretrained policies with sparse rewards.
Data condensation approach for training diffusion models with minimal computational budget by constructing smaller synthetic training datasets.
Graph transformer architecture designed for invariant learning to improve out-of-distribution generalization on graph-structured data.
Theoretical study of implicit bias in deep neural network training showing gradient flow induces learning of lower-dimensional parameter structures.
Continual learning framework with unified prompt pools for medical imaging tasks, addressing domain-specific challenges in adaptive AI.
Analysis of compositional generalization mechanisms in conditional diffusion models, studying length generalization on controlled image generation tasks.
Low-rank approximation technique for accelerating machine learning models predicting mechanical properties of heterogeneous materials.
Lightweight meta-learning method using three parameters to dynamically adjust sample loss weights for noisy training, fairness, and synthetic data utilization.
Hybrid pre-training approach using low-rank adapters alongside full training to reduce computational cost for vision transformer training.
Graph-based method for forecasting irregular multivariate time series in healthcare and finance with adaptive spatio-temporal interactions.
Method for robust fine-tuning non-robust pretrained models using epsilon-scheduling to achieve adversarial robustness and task adaptation simultaneously.
Neural operator architecture combining spectral and coupling methods for efficiently learning partial differential equation dynamics.