Enhancing the Reliability of Medical AI through Expert-guided Uncertainty Modeling
Expert-guided uncertainty modeling for medical AI systems to improve reliability and enable human experts to prioritize high-risk diagnostic cases.
Expert-guided uncertainty modeling for medical AI systems to improve reliability and enable human experts to prioritize high-risk diagnostic cases.
Theoretical analysis of plasticity loss in deep reinforcement learning due to non-stationarity, proposing sample weight decay mitigation technique.
PAC-Bayesian framework for outcome weighted learning that incorporates reward uncertainty into policy selection with finite-sample guarantees.
annbatch: Mini-batch loader for terabyte-scale biological data training in anndata format, addressing memory bottlenecks in ML pipelines for bioinformatics.
LSCP: Self-gated post-training framework for autonomous knowledge acquisition using self-generated Q&A chains and adaptive learning rates based on model conviction.
Statistical analysis of multi-task and multiple operator learning architectures with generalization bounds and theoretical guarantees.
Self-improving world models using forward-inverse asymmetry to improve robustness across suboptimal actions for policy evaluation and planning.
RL post-training framework for building general-purpose reasoning models across diverse domains with verifiable rewards, addressing multi-domain optimization challenges.
Active learning method using feature weighting for regression tasks to optimize sample selection from unlabeled pools under budget constraints.
ArXiv paper on dynamic weight generation for recursive transformers using input-conditioned LoRA modulation controller.
ArXiv paper on fast SVD-based compression for large language models without retraining, addressing distribution shifts.
ArXiv paper applying shallow recurrent decoder networks to magnetohydrodynamic flow modeling in fusion reactors.
ArXiv paper using neural networks to solve inverse problem of designing reflectors for light distribution.
ArXiv paper on graph attention network for multi-sensor object fusion and tracking in autonomous driving.
ArXiv paper introducing Universal Hypernetworks that generate weights for arbitrary model architectures using descriptors.
ArXiv paper applying diffusion denoising objectives to causal structure learning from observational data.
ArXiv paper on model-based reinforcement learning for control systems with time-varying dynamics.
ArXiv paper on in-context agentic reinforcement learning enabling LLM agents to internalize skills at inference time.
ArXiv paper on lightweight diffusion transformer for crystal structure generation using subatomic tokenization.
ArXiv paper unifying group-relative and self-distillation policy optimization for LLM post-training with improved credit assignment.
ArXiv paper proposing Head-Calibrated Clipped-Linear Softmax as efficient surrogate for attention softmax in edge inference.
ArXiv paper on exact parameterization of doubly stochastic matrices for learned mixing in neural networks.
Single-stage training paradigm for efficient LLM reasoning that reduces token consumption in chain-of-thought without degrading quality.
Neural-assisted physics-based model for interpretable battery aging prediction via 2D aging fingerprints without additional diagnostics.
Learning-based cooperative coevolution framework addressing heterogeneous large-scale global optimization via adaptive low-dimensional optimizers.
Lightweight deep learning architecture for brain tumor classification from MRI images with comparative analysis of different approaches.
Machine learning optimization applied to solve modular bootstrap equations for exploring 2D conformal field theories.
Physics-informed neural network and finite volume hybrid approach for modeling UAV traffic patterns in 3D anisotropic wind fields.
Evolutionary multi-objective optimization framework for fusing deepfake speech detectors to balance accuracy and system complexity using NSGA-II.
Neural-symbolic framework for discovering constitutive closures in nonlinear PDEs from spatiotemporal data while avoiding spurious physical recovery.
Research on regularizing attention scores in vision transformers using bootstrapping to improve interpretability and reduce noisy attention maps.
Analysis of safety, security, and cognitive risks in world models used for autonomous decision-making in robotics, autonomous vehicles, and agentic AI systems.
Neural architecture for predicting odorant intensity perception by combining graph convolutional networks with domain-informed design for molecular structure analysis.
Mathematical study of optimal coupling in causal dynamical systems using Schrödinger Bridge framework for input-output distributional data.
Framework for conceptualizing and generating intentional event streams to evaluate stream processing and mining algorithms.
Generative approach for characterizing task timing in real-time systems across varying hardware resource contexts.
Study of reliability gaps in AI-assisted medication systems, highlighting risks in healthcare decision support.
Framework combining LLMs with infeasibility detection for NP-hard combinatorial optimization problems.
Equivariant transformer architecture for modeling agent behaviors in autonomous driving with SE(2) symmetry.
New optimizer deriving design principles from Muon, improving LLM training efficiency through surrogate model analysis.
Method for efficiently adapting closed-box LLM APIs to target tasks by priming followed by local optimization.
Theoretical physics study of topological gaps in spin models and critical phenomena using persistent homology.
Research on conformal risk control under non-monotonic loss functions for distribution-free prediction guarantees.
Benchmark dataset for evaluating AI coding agents based on production workloads, addressing language distribution and codebase structure gaps.
EXHIB benchmark for binary function similarity detection supporting vulnerability analysis and malware classification.
Study of interactions between normalization methods and optimizers in LLM training at 1B parameters.
LiteInception: lightweight interpretable deep learning framework for fault diagnosis on edge devices.
LiveMathematicianBench: benchmark for evaluating LLM mathematical reasoning capabilities with proof sketches.
Prophet inequality problem with noisy observations and unknown reward distributions using linear models.
Study on language pre-training bias improving performance on general vision tasks through cross-modality transfer.