Empirical Gaussian Processes
Empirical GPs: principled framework for learning kernel functions automatically rather than handcrafting from standard functions.
Empirical GPs: principled framework for learning kernel functions automatically rather than handcrafting from standard functions.
Novel method learning structured latent representations using metric spaces for multimodal state estimation in RL without explicit noise assumptions.
Theoretical analysis of offline RL under Q*-approximation and partial coverage, answering whether Q*-realizability enables sample efficiency.
Few-shot Bayesian optimization framework exploiting auxiliary information from experiments for expensive black-box design problems.
KAN-FIF applies spline-parameterized neural networks for tropical cyclone estimation on resource-constrained meteorological satellite devices.
Meta-Sel: lightweight supervised meta-learning approach for efficient demonstration selection in in-context learning with tight prompt budgets.
Investigates whether LLMs trained with RL spontaneously exploit reward function loopholes without malicious intent, examining alignment risks.
Theoretical analysis of on-policy distillation showing it as special case of KL-constrained RL with reward extrapolation improvements.
Introduces damped harmonic oscillators method for irregular time series modeling as alternative to Transformers and Neural ODEs.
Proposes TIME benchmark addressing limitations in time series foundation models including data composition, integrity, and task formulation issues.
SafeNeuron provides neuron-level safety alignment mechanism for LLMs by targeting safety-critical parameters to prevent alignment bypass attacks.
Group Relative Policy Optimization enables amortized molecular design learning transferable to unseen molecules rather than instance-specific optimization.
Theoretical analysis of sampling and reference policy effects in preference alignment for LLMs through Identity Preference Optimization framework.
WaveFormer integrates wavelet decomposition into transformer architecture for improved biomedical signal classification with long sequences and multi-scale patterns.
Learning to Forget Attention proposes adaptive attention reduction as familiarity increases, reducing compute in hybrid state-space-attention models.
Study shows that downstream adaptation for measuring world models can corrupt latent physics representations, affecting OOD generalization assessment.
Distribution Discriminant Theory enables on-policy supervised fine-tuning for LLMs by bridging computational efficiency gap with RL-based approaches.
Variance Minimisation Policy Optimisation reformulates diffusion model alignment as an SMC process for reward-guided sampling.
Categorical Flow Maps applies flow matching for accelerated few-step generation of categorical data using self-distillation.
Olmix framework addresses practical challenges in data mixing ratios during language model development with principled design choices.
ExtractBench provides a benchmark and methodology for evaluating LLM-based PDF-to-JSON extraction at enterprise scale with diverse schemas.
Research on privacy concerns when GNN-based clustering reveals sensitive community structures in social and infrastructure networks.
Flow-Guided Neural Operator introduces adaptive corruption levels for self-supervised learning on unlabeled time-series data beyond fixed masking ratios.
Fun-DDPS combines function-space diffusion models with neural operator surrogates for forward and inverse modeling in carbon capture and storage applications.
User study on explainable AI without code examines how non-technical users can understand ML model predictions in no-code platforms.
Study of transformer representations shows direction and magnitude of hidden state vectors serve distinct functional roles in language modeling and syntax tasks.
Research evaluates few-shot temporal reasoning capabilities of LLMs for predicting human activities in smart environments with limited data.
RadarPos is a position-aware self-supervised learning framework for radar signal recognition using pulse-level temporal dynamics.
AskBench benchmark and rubric-guided RLVR method evaluates and improves LLMs' ability to request clarification when prompts lack critical details, reducing hallucinations.
SWE-MiniSandbox: container-free reinforcement learning method for scalable training of software engineering agents without isolation overhead.
Generative AI model optimizing reconfigurable intelligent surface phase shifts in massive MIMO systems using diffusion models.
Latent Generative Solvers framework using VAE and Transformer with flow matching for long-horizon physics simulation across PDE systems.
Deep learning model for tea leaf disease detection using explainable AI and adversarial training for agricultural automation.
Self-play framework for vision-language models that actively explores environments to generate tailored visual data for autonomous improvement.
Identifies computational hardness in learning non-trivial mixed-state quantum phases using autoregressive networks and conditional mutual information.
Deep learning pipeline for photometric redshift estimation in astrophysics that generalizes across galaxy morphologies and observational conditions.
Evaluation of hybrid ML-physics atmosphere models across timescales and out-of-distribution forcings for weather prediction.
Simulation-based inference method using generalized Bayesian inference and neural score functions robust to outliers and measurement errors.
Taylor method-based verification approach for assessing safety of neural network controllers in spaceflight guidance without sampling.
Agentic reinforcement learning approach to optimize proactive LLM agents for multi-turn task completion with learned interaction strategies.
Studies whether self-referential language in LLMs reflects actual internal computation or confabulation through activation analysis.
Method to improve LLM reasoning by identifying critical tokens and verifying consistency through paraphrastic probing.
Formalizes religious epistemology using VAE mathematics to model religious traditions as generative mappings.
Modification to latent diffusion models that reorders diffusion trajectory to achieve efficiency while operating on raw pixel-space images.
Online learning theory for environments with multiple change points, analyzing regret bounds and failure modes.
System-level defenses against indirect prompt injection attacks in AI agents, reducing unsafe actions while maintaining task completion rates.
Training method for LLMs using generalized entropic objectives instead of uniform token-level weighting to improve robustness and sharpening.
Physics-informed neural field for reconstructing sound fields from sparse acoustic data to infer surface impedance.
Denoising diffusion-based generative approach to learning-to-rank that models joint distribution over query-document features.
Differentially private algorithm for computing top singular vectors using adaptive power iteration with privacy guarantees.