AdaGrad-Diff: A New Version of the Adaptive Gradient Algorithm
AdaGrad variant using cumulative squared gradient differences instead of norms for adaptive stepsize learning.
AdaGrad variant using cumulative squared gradient differences instead of norms for adaptive stepsize learning.
Reconstructing high-resolution face images from facial embeddings using diffusion models to test privacy risks in face recognition systems.
Climate model selection for precipitation projection in river basins using CMIP6 and machine learning techniques.
Learning robot manipulation skills from human videos using simulation filtering and modular policies for grasping and post-grasp motions.
DART algorithm for non-linear top-K arm selection in combinatorial bandits with adaptive accept-reject sampling.
Method leveraging low-rank structures and compressible parameter dynamics to reduce computational costs in overparameterized neural network learning.
LTSM-Bundle: toolbox and benchmark for using large language models as universal time series forecasting models on heterogeneous datasets.
Framework for incorporating physical priors into diffusion-based generative models to generate physically feasible dynamics.
Analysis of the modality gap in contrastive multimodal learning (CLIP-like models) and methods to explain and reduce representation misalignment between modalities.
B3C: minimalist offline multi-agent reinforcement learning approach using behavior cloning regularization to address overestimation in action selection.
MINJA attack: adversarial method for injecting malicious data into LLM agent memory banks via query-only interactions without direct memory access.
System for recognizing worker activities and estimating picker efficiency in commercial strawberry harvesting using instrumented equipment.
Method using noisy manual labels as information sources to enhance variable selection in penalized logistic regression.
Theoretical analysis of identifiability and singularity in polynomial neural networks and their neuromanifolds.
Framework for systematic design and analysis of weight quantization formats for efficient deep learning model training and deployment.
Transformer model with dynamic structure learning for multivariate time series forecasting using causal relationships instead of all-to-all connections.
Research on learning verifiers for chain-of-thought reasoning in LLMs using formal verification to improve mathematical problem-solving.
N², a unified Python package for nearest neighbor-based matrix completion with theoretical guarantees and empirical benchmarks.
Variational autoencoder method for extracting quasiparticle interference patterns from quantum material imaging data.
PeakWeather dataset of Swiss weather station measurements for training spatiotemporal deep learning models for weather forecasting.
Framework for continual learning that handles concept drift in real-world data streams while preventing catastrophic forgetting.
Research on improving AI vision robustness by training on human developmental visual patterns to reduce reliance on texture and increase shape recognition.
Efficient 3D molecular generation method using Gaussian probability paths with reduced sampling steps for drug discovery applications.
Foundation model for analyzing industrial signals from SCADA systems across multiple modalities for anomaly detection.
Instruction-based diffusion model for editing time series properties while preserving specified conditions, replacing rigid attribute vectors.
Federated learning approach using generative models to address data heterogeneity across distributed clients and improve model generalization.
Self-evolving LLM that autonomously generates training data and improves reasoning without relying on human-curated tasks or labels.
Semantic caching system for reducing LLM inference costs by retrieving cached responses based on query similarity rather than exact matches.
Generative adversarial transformer model for upsampling energy time series data with improved fidelity over conventional methods.
Hyperdimensional refinement method for LLM-generated reasoning graphs in video anomaly detection handling distribution-deficient structures.
Neural network architecture learning nonharmonic Fourier series with cosine activations for scientific machine learning tasks.
Fast approximate softmax attention mechanism clustering queries and keys for 36% faster transformer pretraining on long sequences.
Online reinforcement learning framework using sparse Gaussian mixture model Q-functions with interpretable policy iteration.
Diffusion-based scenario tree generation for stochastic forecasting and multistage optimization in energy and finance domains.
Adaptive time series foundation model with parameter-efficient design handling temporal heterogeneity and varying sampling rates.
Multi-agent linear bandit algorithm with stage-wise safety constraints for collaborative learning in networked settings.
Vector diffusion wavelets integrated into geometric graph neural networks for point cloud and manifold data representation.
Investigation into what temporal graph learning models actually learn despite benchmark claims, revealing evaluation protocol issues.
Statistical set-level inference framework to identify training data in large language models with controlled error rates for legal evidence.
Reinforcement learning method to fine-tune generative models avoiding decision boundary mode-seeking when paired with safety classifiers.
Deep learning method analyzing extended ECG windows to improve arrhythmia classification specificity and cross-dataset generalization.
Weight decay regularization has greater impact than maximal update parameterization for transferring learning rates across model scales.
Active learning approach using task-driven representations to select relevant samples from uncurated, messy data pools.
Large language models used as in-context meta-learners to recommend model families and hyperparameters from dataset metadata.
Federated learning framework over wireless networks handling heterogeneous device conditions through bias-variance optimization.
Causal graph discovery method for spatially gridded time series data with non-stationary regimes and varying patterns.
Quantum temporal convolutional neural network applied to stock market prediction with classical temporal encoder component.
Methods to detect if specific audio data was used in training generative audio models through membership and dataset inference attacks.
Graph neural network framework for handling heterophilic graphs without iterative message passing using multi-resolution community features.
Imitation learning framework for combinatorial optimization under uncertainty. Explores role of expert demonstrations in training policies for sequential decision problems.