Noise-Aware Generalization: Robustness to In-Domain Noise and Out-of-Domain Generalization
Noise-Aware Generalization: training methods handling label noise and domain shifts simultaneously.
Noise-Aware Generalization: training methods handling label noise and domain shifts simultaneously.
Universal graph encoder for learning transferable structural representations across diverse graph domains.
GraphOmni: comprehensive benchmark for evaluating LLM reasoning on graph-theoretic tasks in natural language.
Think2SQL: reinforcement learning approach improving LLM reasoning for complex text-to-SQL generation.
HetGL2R: heterogeneous graph framework for ranking critical road segments using origin-destination flows.
Federated domain generalization leveraging source domain features for improved generalization.
GRILL: adversarial attack method for autoencoders addressing ill-conditioned gradient propagation.
Performance estimation for binary classifiers without ground truth labels using calibrated confidence.
CodePDE: LLM-based inference framework generating code for solving partial differential equations.
FairSHAP: preprocessing framework using Shapley values for identifying and mitigating bias in ML models.
Federated learning client sampling strategy addressing heterogeneous communication and computational capabilities.
Information geometry analysis of diffusion model latent spaces, examining geodesic decoding and stochastic decoders.
QiMeng-CodeV-R1 applies RLVR to Verilog generation from natural language, introducing verifiable reward signal for hardware design automation.
Shows LLM weights follow generalized Gaussian distributions and proposes end-to-end optimization framework for improved training efficiency.
E-BATS enables efficient test-time adaptation for speech models under domain shift without backpropagation, reducing memory overhead.
Physics vs Distributions framework for flow matching that balances physical constraints with distributional accuracy in generative modeling.
Uses LLMs and concept graphs to extract scientific concepts from materials science abstracts and discover novel research connections.
MoNE prunes MoE models by replacing redundant experts with lightweight novices, reducing memory overhead while maintaining performance.
Symbolic Branch Networks inherit architecture from decision tree ensembles to create interpretable neural models for multiclass classification.
Demonstrates sampling-aware adversarial attacks against LLMs that leverage stochastic sampling to more accurately assess robustness.
Uses BioEmu with time-lagged generation to learn collective variables for enhanced molecular dynamics simulations of rare events.
Improves GNN node learning performance using statistically grounded graph encoder embedding initialization for faster convergence.
MDM-OC enables scalable, reversible model composition with orthogonal constraints to prevent task interference and catastrophic forgetting.
MolReasoner enhances LLM reasoning for molecular tasks with domain-specific semantics, reducing hallucinations and improving interpretability.
GEDAN learns optimal edit costs for graph edit distance computation, improving upon NP-hard approximation methods with learnable cost functions.
Shuffle-R1 framework improves RL efficiency for multimodal LLMs through data-centric dynamic shuffling to address advantage collapsing and rollout silencing.
Comprehensive evaluation of 25 pretrained molecular embedding models across 25 datasets for molecular property prediction and drug design.
Proposes biased local SGD for efficient parallel neural network training on heterogeneous computing systems with varying resource availability.
AFABench framework for benchmarking active feature acquisition methods that dynamically select informative features under acquisition cost constraints.
Presents Exploratory Iteration (ExIt), RL methods enabling agents to self-improve through iterative refinement without fixed iteration limits.
Explores exogenous variable modeling in spatio-temporal forecasting systems to improve prediction accuracy.
Data augmentation strategies for generative recommendation systems improving generalization in sequential user behavior prediction.
Privacy-aware Bayesian network approach using credal sets for secure public release of probabilistic graphical models.
Multi-agent RL with curiosity-driven exploration using contextual calibration to distinguish novelty from environmental stochasticity.
DriftLite: Training-free particle-based approach for inference-time diffusion model adaptation to new distributions.
Error mitigation methods for post-training N:M activation sparsity in LLMs enabling dynamic input-adaptive compression.
Aurora: Multimodal foundation model for cross-domain time series forecasting integrating text and temporal data.
SpinGPT applies LLM approach to poker strategy, addressing CFR computational limits in multi-player game settings.
8-bit blockwise quantization of Muon optimizer states reducing memory overhead for large-scale LLM pretraining.
Framework for standardizing evaluation of positive-unlabeled learning algorithms under consistent experimental settings.
Weather forecasting method using adaptive boundary alignment for regional and global predictions with spatial-temporal modeling.
Polychromic objectives for RL fine-tuning preventing policy collapse and preserving diversity in pretrained model behaviors.
Diffusion Alignment as Variational EM framework addressing reward over-optimization and mode collapse in diffusion model alignment.
Analysis of RL-induced parameter dynamics in LLMs revealing rank-1 dominance in reasoning improvements and predictability of training trajectories.
Surrogate-free ADMM method for LLM pruning achieving >50% sparsity without accuracy degradation, breaking through conventional compression limits.
Scaling law formalization incorporating data quality parameter for language model pretraining, extending traditional model/dataset size relationships.
KVComm: Communication framework for multi-agent LLM systems using selective key-value sharing instead of natural language or hidden states.
Fairness auditing framework for classifiers with partial feedback using cost-aware data acquisition strategies.
TROLL: Trust region-based RL method improving upon PPO clipping for LLM fine-tuning, achieving more stable and optimal reward-based training.
Novel RL algorithm for diffusion LLMs using distribution matching policy optimization to improve reasoning capabilities and match autoregressive LLM performance.