Atomic Trajectory Modeling with State Space Models for Biomolecular Dynamics
State space models for biomolecular dynamics modeling, accelerating MD simulations while preserving temporal relationships for drug discovery applications.
State space models for biomolecular dynamics modeling, accelerating MD simulations while preserving temporal relationships for drug discovery applications.
Inhibitory normalization in neural circuits improves learning when handling complex input distributions, bridging biological and artificial neural networks.
Adaptive guidance mechanism for RAG-enhanced masked diffusion models to handle retrieval-prior conflicts when context is noisy or inconsistent.
Sensi: LLM agent architecture for ARC-AGI-3 using curriculum-based test-time learning with perception-action separation and active hypothesis testing.
Stochastic set-valued optimization framework for robust machine learning using multi-objective optimization with hyperbox set representations.
Uses shuffle products and finite-state automata to model overlapped speech for alignment and speaker-attributed transcription via marginalizing serializations.
CoVerRL framework escapes consensus trap in label-free LLM reasoning by using generator-verifier co-evolution to maintain output diversity and avoid reinforced systematic errors.
ResNet-50 pipeline for gastrointestinal video analysis using class reweighting and temporal decoding to handle severe class imbalance in pathology detection.
Framework integrating HPC, ML, and quantum computing for drug discovery, replacing trial-and-error with quantitative precision in molecular dynamics.
ChopGrad reduces memory costs in video diffusion model training by using truncated backpropagation with pixel-wise losses instead of accumulating activations.
Language model infers stage-play layouts (scenes, positions, movements) from narrative text without explicit spatial cues, testing spatial reasoning capabilities.
Text-to-Stage: LLM-based spatial reasoning from narratives to generate stage-play layouts. Probes compositional reasoning in language models.
Physics-informed ML surrogates for power grid simulation validation. Application of ML to scientific computing.
Research on how LLMs generate verbal confidence scores. Investigates timing and computation of uncertainty estimates in black-box models.
scicode-lint: LLM-generated patterns for detecting methodology bugs in scientific Python code. Addresses sustainability of ML-specific linters.
LoST: level-of-semantics tokenization method for 3D shape generation improving autoregressive 3D generative models.
Token pruning framework for efficient video vision-language models reducing computational cost via temporal token scoring.
Graph neural network method for air quality forecasting modeling pollution diffusion between cities and monitoring stations.
Aergia: federated learning system leveraging client heterogeneity in computing power to reduce training time.
Feature space renormalization mechanism for semi-supervised learning improving consistency regularization on unlabeled data.
Soft Dice Confidence: confidence estimator for selective prediction in semantic segmentation enabling model abstention.
Hi-GMAE: hierarchical graph masked autoencoders for multi-scale self-supervised learning on graph-structured data.
Analyzes transformer-based amortized causal discovery on observational data, bridging supervised learning with identifiability theory.
Den-TP: data curation framework addressing long-tail distribution in trajectory prediction datasets for autonomous driving.
ACT-JEPA: joint-embedding predictive architecture for efficient policy representation learning via self-supervised learning from unlabeled data.
SALSA-RL: stability analysis method for deep reinforcement learning agents enabling interpretability and safety assessments in continuous control.
Studies regret minimization in repeated first-price auctions with causal inference for online advertising scenarios.
Offline reinforcement learning algorithm leveraging inverse optimization and sub-optimality loss for continuous state/action spaces.
Hierarchical federated learning framework using UAVs as mobile aggregators for distributed IoT systems with limited connectivity.
Proposes minimal repair concept showing imputing all missing values unnecessary; identifies critical missing data subsets for accurate ML models.
SocialJax: evaluation suite for multi-agent reinforcement learning in sequential social dilemmas, measuring agent generalization.
Proposes Arch-VQ for learning discrete neural architecture representations using autoregressive priors instead of continuous VAE mapping.
Studies impact of duplicated training data on deep neural network image classifiers, comparing robust vs. standard models against adversarial attacks.
arXiv paper: ILLUME method for post-hoc explainability of tabular ML models with interpretable meta-encoding.
arXiv paper: Clust-Splitter algorithm for efficient clustering on large datasets using nonsmooth optimization.
arXiv paper: Bi-level policy optimization with Nyström hypergradients for actor-critic reinforcement learning algorithms.
arXiv paper: Restoration Score Distillation framework for learning generative models from corrupted data. Novel ML research.
Foundation model for time series forecasting using mixture-of-experts architecture with decoupled training to handle diverse temporal patterns and multi-variable correlations.
Tutorial on diffusion and flow-based generative models covering mathematical foundations, ODEs, SDEs, and core algorithms for image, video, and multi-modal generation.
Offline reinforcement learning method for mismatched dynamics leveraging model-based approaches to explore high-reward states.
Self-organizing map extension addressing catastrophic forgetting in continual learning scenarios.
Privacy-preserving neural network framework integrating epidemiological modeling with differential privacy guarantees.
Learning to Reject framework extending ML models to abstain from predictions and explanations with low quality.
Method predicting better pre-trained weights by leveraging structural properties and retrodiction of forgetting.
Fast weight programmers with 2D matrix hidden states connecting RNNs, language modeling, and neurobiology.
Inverse game theory algorithm learning constraints from Nash equilibrium demonstrations using MILP formulations.
Transfer learning framework for EEG-based emotion recognition using domain-class prototypes.
Theoretical analysis of implicit regularization in diagonal linear networks via Lasso regularization path.
Tree-based group relative policy optimization for LLM agents addressing sparse supervision in multi-turn tasks.
Method aligning supervised fine-tuning with in-context learning activations to improve LLM generalization and calibration.