High-Dimensional Robust Mean Estimation with Untrusted Batches
Addresses high-dimensional mean estimation in collaborative learning with statistically heterogeneous and adversarial data sources.
Addresses high-dimensional mean estimation in collaborative learning with statistically heterogeneous and adversarial data sources.
Large-scale 3D CFD benchmark dataset (WeirNet) for surrogate modeling of piano key weir hydraulic performance.
Introduces Fuz-RL, a fuzzy-guided framework for safe reinforcement learning under multiple uncertainty sources with interpretable risk assessment.
Proposes ECO, an offline self-play learning paradigm using Direct Preference Optimization and Mamba architecture for neural combinatorial optimization.
Applies federated learning to electric vehicle energy demand forecasting for smart grid optimization.
Studies how rehearsal scale affects learning dynamics in continual learning algorithms across varying model capacities.
Characterizes catastrophic forgetting in continual learning through parameter update magnitude, formalizing knowledge degradation from task-specific drift.
Analyzes reasoning capabilities in cooperative multi-agent reinforcement learning under Dec-POMDP framework with partial observability and decentralized coordination.
Proposes regret-guided search control for AlphaZero to improve learning efficiency through targeted state revisitation in reinforcement learning.
Introduces transcoder adapters to interpret MLP computation differences in reasoning models, comparing Qwen2.5-Math-7B and DeepSeek-R1-Distilled variants.
Semantic-guided expert forest method for class-incremental learning that organizes adapter knowledge through task relationships.
EEG-to-text decoding system assessing hierarchical abstraction levels for fine-grained brain signal interpretation across multiple classes.
Extension of maximal update parameterization for hyperparameter transfer across different optimizers in large language model training.
Method for estimating confidence bounds in binary classifiers using Wilson Score kernel density estimation for critical applications.
Theoretical work connecting law of robustness to robust generalization in neural networks through Lipschitz constraints.
Multi-fidelity surrogate modeling technique combining low and high-fidelity simulation data with spatial trust-weighting.
Theoretical analysis of multi-agent imitation learning showing impossibility and hardness results for learning low-exploitable policies offline.
Neural network architecture for multivariate time series imputation using channel-head binding to transfer information across variables.
Interpretable framework for fMRI-based brain disorder diagnosis combining prototype learning with Monte Carlo tree search.
Domain adaptation method for offline reinforcement learning addressing dynamics mismatch through localized similarity matching.
Federated semi-supervised learning framework using proxy guidance to handle data heterogeneity across and within distributed clients.
Evaluation of DeepSpeed distributed training framework for scaling Vision Transformer models to address computational and memory demands.
Analysis of Graph Neural Network activation patterns through graph topology and curvature to understand oversmoothing and oversquashing artifacts.
Tokenization method combining vector quantization with Self-Organizing Maps to create structured discrete codebooks for interactive generative models.
Self-evolving LLM agent framework using uncertainty-aware rewards to guide multi-step decision-making and improve learning signal for agent training.
Framework for counterfactual inference on temporal clinical data distinguishing immutable from controllable features in electronic health records.
Study on Predictor-Corrector samplers for discrete diffusion models to improve multi-step generation quality beyond ancestral sampling methods.
Research on Pass@k metric optimization for LLMs showing trade-offs between multi-sample and single-sample inference performance in code generation and reasoning tasks.
Statistical query lower bounds for learning halfspaces under Gaussian perturbations in the smoothed agnostic learning model.
Untied Ulysses: memory-efficient context parallelism technique for long sequences via headwise chunking in Transformers.
Reflective Test-Time Planning for embodied LLMs combining in-action and post-action reflection with test-time scaling for robot task learning.
Analysis revealing that test-time training with KV binding can be expressed as learned linear attention mechanism.
Online resource allocation algorithm for assigning mediators to judicial cases with quality learning.
Benchmark comparing knowledge-distilled small language models against vanilla and proprietary models for resource-constrained environments.
AI framework using neural networks on intracardiac echocardiography video to localize cardiac arrhythmia origins.
Leakage-aware benchmarking framework for early patient deterioration prediction under realistic emergency triage sensing constraints.
Machine learning approach using axial vectors in equivariant networks for designing D-peptide binders targeting L-proteins.
Physics-informed neural networks for estimating coolant velocity in MOSFET heat sinks given thermal conditions.
Deep learning approach for MIMO symbol detection using soft interference cancellation to reduce computational complexity.
Flow-matching generative model for predicting organic crystal structures from molecular graphs, addressing gaps in materials science.
KEMP-PIP: hybrid machine learning framework combining protein language models with handcrafted features for pro-inflammatory peptide prediction.
Deep learning model for de novo peptide sequencing with explicit mass consistency constraints using regressor-guided diffusion.
Deep learning validation for quantifying GABA metabolites from magnetic resonance spectroscopy signals.
Theoretical analysis of gap-dependent regret bounds for reinforcement learning algorithms with linear function approximation.
Comparison of statistical and machine learning approaches for predicting childhood obesity using survey data.
LLM-based framework for rare disease phenotyping from clinical notes, extracting and standardizing features to Human Phenotype Ontology terms.
Circuit tracing framework for vision-language models using transcoders and attribution graphs to analyze multimodal reasoning mechanisms.
QueryBandits: model-agnostic contextual bandit framework to mitigate hallucinations in closed-source LLMs through post-hoc detection and mitigation.
Research: Zero-overhead online continual learning framework for deep neural network OFDM receivers adapting to time-varying communication channels.
Research on detecting and mitigating group bias in heterogeneous treatment effect predictions when aggregating ML model outputs to subgroups.