Exact Discrete Stochastic Simulation with Deep-Learning-Scale Gradient Optimization
Enables gradient-based learning for continuous-time Markov chains by decoupling simulation from differentiation for optimization.
Enables gradient-based learning for continuous-time Markov chains by decoupling simulation from differentiation for optimization.
Automatic chord recognition via pseudo-labeling and knowledge distillation using pre-trained models with unlabeled audio.
Path-conditioned training approach for exploiting rescaling symmetries in ReLU neural network parameters.
Depth-Structured Music Recurrence model for long-context symbolic music generation on resource-limited devices.
Survey on meta-learning and meta-reinforcement learning techniques for rapid adaptation to novel tasks with minimal data.
Uplift learning method for estimating causal effects of combinatorial treatments using permutation-invariant representations.
Causal discovery method using signal processing for time-series observations with misaligned sampling rates in biological/physical systems.
arXiv paper proposing gradient-based severity labeling for contrastive learning in medical image biomarker classification.
arXiv paper on private inference protocols resilient to malicious client behavior in ML deployment.
arXiv paper on event-driven trading using LLMs and hierarchical-gated reward modeling with textual market signals.
arXiv paper proposing lifelong adaptability in imitation learning to improve compositional generalization beyond memorization.
arXiv paper on normal behavior modeling for time-series forecasting of telescope monitoring data.
arXiv paper on feature importance estimation and selection biases in large-scale recommender systems.
arXiv paper addressing modality gap in CLIP-based multimodal learning for medical image representation alignment.
arXiv paper on out-of-distribution detection in deep neural networks and how regional focus affects OOD detection performance.
Research on descent-guided policy gradients for multi-agent reinforcement learning addressing cross-agent noise scaling issues.
Human-AI collaboration framework using adaptive ensembles that align with human judgment when beneficial and complement when needed.
Autonomous scaling method for synthetic training environments using verifier-based reinforcement learning for reasoning language models.
Investigation of LLM knowledge encoding using nanochat family with fully open pre-training data to understand where parametric knowledge originates.
Post-calibration uncertainty quantification method balancing aleatoric and epistemic uncertainty in classifier ensembles.
Security vulnerability analysis of LLM agents exploiting skill files for prompt injection attacks through the agent skills supply chain.
Benchmark suite for evaluating video reasoning capabilities of models, focusing on spatiotemporal consistency and reasoning beyond visual quality.
Investigation of loss surface sharpness relationship to neural representation compression through local volumetric ratio and activation concentration measures.
Study demonstrating unstructured neural network pruning can induce structural effects and layer collapse, shortening computational paths.
Mathematical framework for resampling from target probability measures using learned maps from prior distributions without explicit density knowledge.
Improved straight-through estimator for discrete optimization decoupling forward-pass stochasticity from backward-pass gradient dispersion in neural networks.
Robust causal discovery method for agent-based model validation using cross-validation approach to improve accuracy on complex time series data.
Threshold-independent fairness assessment for record matching models, introducing score bias measures beyond binary decision evaluation.
Spectral mixture representation framework for isotropic kernels extending Random Fourier Features beyond Gaussian kernels for machine learning applications.
Comprehensive taxonomy and empirical study of Graph Neural Networks for graph-level prediction tasks in molecular and biological domains.
Unsupervised clustering algorithm for high-dimensional biomedical datasets using two-phase K-Means and graph-based approach without requiring prior cluster count.
Mamba-based graph convolutional network addressing over-smoothing in GNNs using selective state space mechanisms for improved depth handling.
Semantic prompt caching system using variable similarity thresholds in vector databases to reduce LLM inference latency and cost beyond static threshold approaches.
Critical examination of oversmoothing metrics in Graph Neural Networks, arguing that Dirichlet energy and similarity measures fail to reliably capture the phenomenon.
Research on 'Curse of Depth' phenomenon where ~50% of LLM layers underperform, analyzing root causes across Llama, Mistral, DeepSeek, Qwen families with theoretical and empirical analysis.
Multi-step TD-learning analysis under linear function approximation, off-policy learning, and bootstrapping.
Contextual combinatorial semi-bandits with sparse rewards for recommendation systems.
AdaGC: gradient clipping method stabilizing LLM pretraining by addressing multi-factor loss spikes.
Test-time training provably enhances transformer in-context learning through gradient-based weight updates.
High-dimensional vector computing architecture for modeling human and animal learning.
Noise-Aware Generalization: training methods handling label noise and domain shifts simultaneously.
Universal graph encoder for learning transferable structural representations across diverse graph domains.
GraphOmni: comprehensive benchmark for evaluating LLM reasoning on graph-theoretic tasks in natural language.
Think2SQL: reinforcement learning approach improving LLM reasoning for complex text-to-SQL generation.
HetGL2R: heterogeneous graph framework for ranking critical road segments using origin-destination flows.
Federated domain generalization leveraging source domain features for improved generalization.
GRILL: adversarial attack method for autoencoders addressing ill-conditioned gradient propagation.
Performance estimation for binary classifiers without ground truth labels using calibrated confidence.
CodePDE: LLM-based inference framework generating code for solving partial differential equations.
FairSHAP: preprocessing framework using Shapley values for identifying and mitigating bias in ML models.