Transforming Behavioral Neuroscience Discovery with In-Context Learning and AI-Enhanced Tensor Methods
Using in-context learning and AI-enhanced tensor methods to automate behavioral neuroscience discovery pipelines.
Using in-context learning and AI-enhanced tensor methods to automate behavioral neuroscience discovery pipelines.
Uncertainty-aware time-series ensemble method for proactive anomaly prediction providing early warning signals before anomalies occur.
Hash-based indexing mechanism for embedding tables in recommendation systems reducing collisions and improving model freshness.
Multi-agent reinforcement learning framework using spatio-temporal hypergraphs for human-centric multimodal traffic signal control.
Graph neural network architecture with adversarial synthesis and contrastive learning for resilient node classification under structural noise.
NAMO optimizer combining Adam's adaptive moments with Muon's orthogonalized momentum for efficient large language model training.
Machine unlearning approach using target feature disentanglement to balance privacy and model utility for right to be forgotten compliance.
Formal framework defining locality radius to determine when multi-hop reasoning is necessary for foreign key discovery in relational databases.
Federated fine-tuning approach for LLMs using low-rank Gram matrices and Procrustes alignment to enable collaborative adaptation without data sharing.
Serverless MLOps framework orchestrating complete ML lifecycle with event-driven pipelines for model-agnostic inference and rapid deployment.
Online learning framework for multiclass settings where agents can improve features to achieve better labels with budgeted costs.
3D Gaussian splatting with implicit physical simulation for accurate prediction of object deformation and movement.
Time-invariant frequency operator for handling distribution shift in nonstationary time series forecasting.
Vector Perturbation VAE decouples representation learning from discretization by eliminating explicit codebook dependency in generative modeling.
Analysis of underfitting challenges in multi-expert learning to defer systems where classifiers abstain and defer to experts.
Unified multimodal model for time series understanding and generation combining numerical forecasting with semantic interpretation.
Gradient-free zeroth-order optimization method for memory-efficient fine-tuning of large-scale models via subspace gradient orthogonalization.
Empirical study comparing how transformers and linear attention models perform in-context learning on regression tasks, analyzing MSE, convergence, and generalization.
Deep reinforcement learning with domain randomization for robust control of mechanical systems with multiple uncertainties and nonlinear dynamics.
Open-source PyTorch library implementing GPU-accelerated Soft Dynamic Time Warping with improved memory efficiency and numerical stability.
ArXiv paper on CounterFlowNet for generating multiple minimal counterfactual explanations for tabular data with heterogeneous features.
ArXiv research on adapting EEG foundation models with limited supervision using prototype-guided fine-tuning for clinical settings.
ArXiv paper applying Wasserstein Autoencoders to control laser pulse shapes in Free-Electron Lasers via differentiable latent interface.
ArXiv research on Unified Latents framework combining diffusion priors with diffusion decoders for efficient latent representation learning.
ArXiv paper proposing RL framework for extremal graph theory problems, extending Deep Cross-Entropy methods to combinatorial optimization.
LexiSafe: Offline safe reinforcement learning method using lexicographic hierarchy to prevent safety violations in cyber-physical systems.
Research paper introducing Flickering Multi-Armed Bandits framework where available actions change dynamically, modeled via random graph processes.
Machine learning framework using deep learning on carotid ultrasound videos to detect vascular damage for cardiovascular disease risk assessment.
Subquadratic structure inference pipeline for immune repertoire analysis combining retrieval, affinity kernels, and multimodal fusion.
Test-time training method using prompts for Graph Neural Network out-of-distribution detection without accessing training data.
Study of shortcut learning in neural networks applied to knot topology classification with applications to protein folding and polymer physics.
ML framework for biomedical data emphasizing feature stability and interpretability under incomplete/missing data for trustworthy clinical decision-making.
Formulates MDP planning as Bayesian policy inference with return-based optimality prior for discrete action domains.
Analyzes convergence of weighted optimistic gradient descent-ascent in bilinear games with feedback delays via prediction.
Develops training method for neural networks with strictly ±1 node values using Boolean threshold functions and constraint formulation.
Uses LLMs for temporal credit assignment in self-evolving agents via retrospective in-context learning from sparse feedback.
Introduces CRAFT, parameter-efficient fine-tuning method using Tucker decomposition on frozen pre-trained transformer attention weights.
Proposes radVI algorithm improving variational inference by optimizing radial profiles beyond standard Gaussian approximations.
Identifies transformer attention heads functioning as membership testers/bloom filters across GPT-2 and Pythia models.
Position paper arguing for standardized evaluation benchmarks in 12-lead ECG representation learning beyond current arrhythmia-focused datasets.
Proposes MASPO algorithm improving gradient utilization and sample efficiency in RL for LLM reasoning tasks beyond GRPO limitations.
Theoretical framework for training modular LLMs by combining domain-specific expert models robustly without heuristic dataset weighting.
Investigates weight regularization techniques in parameter-efficient continual learning with low-rank adapters for pre-trained models.
Theoretical analysis of normalization strategies and their impact on expressivity of Transformer-based time series models.
Studies canonicalization of representation spaces across independently trained multimodal contrastive models for consistency.
Theoretical analysis of loss landscape topology in overparameterized one-hidden-layer ReLU networks with Lipschitz losses.
Develops anytime-valid statistical watermarking method to distinguish machine-generated text from human content in LLMs.
Proposes privacy-preserving federated split learning with intermediate representation protection for distributed ML training.
Addresses variance issues in asynchronous RL training for LLMs using policy-gradient methods like REINFORCE and GRPO on reasoning tasks.
arXiv paper on federated learning with incremental data under limited communication. Addresses catastrophic forgetting in privacy-preserving distributed learning scenarios.