Learning Contextual Runtime Monitors for Safe AI-Based Autonomy
Context-aware runtime monitors for safe AI-based autonomous systems using ensemble ML controllers.
Context-aware runtime monitors for safe AI-based autonomous systems using ensemble ML controllers.
Theoretical framework for online learning with partial feedback and collection version spaces.
Rate-distortion framework for lossy compression of transformer intermediate representations to reduce inference compute and memory.
Analysis of safety properties in diffusion-based LLMs versus autoregressive LLMs, showing robustness against jailbreak attacks.
Study of capability acquisition in transformers tracking geometric changes and linear probes across model scales and algorithmic task difficulty levels.
Quantitative selection theorems proving that strong task performance under uncertainty necessitates world models and belief-like memory structures in agents.
Efficient evaluation algorithm for excess risk of large-scale empirical risk minimization using black-box access with single dataset.
Machine unlearning approach using key deletion in model architecture for privacy-compliant data removal without requiring full training data access.
Adaptive stock price prediction framework using autoencoder-gated dual node transformers with RL control for regime-dependent market behavior.
Theoretical framework for RLHF with multi-source imperfect preferences, deriving regret bounds when feedback comes from multiple annotators with systematic mismatches.
Trillion-parameter scientific multimodal foundation model with advanced agent capabilities spanning 100+ scientific domains and general reasoning tasks.
Method for learning low-dimensional policy manifolds in reinforcement learning through state-occupancy matching to improve sample efficiency.
Novel GNN architecture using cross-attentive cohesive subgraph embedding to address oversquashing and improve information flow in graph neural networks.
Theoretical analysis of safety verification limits for self-improving AI systems, formalizing compatibility between bounded risk and unbounded utility.
Uses LLM to dynamically generate curriculum over actions for RL agents, progressively introducing complex actions during training.
Studies how deep networks assign higher density to simpler out-of-distribution data than in-distribution test data.
Multimodal representation learning framework for e-commerce product understanding combining reasoning with product attributes.
Prompt-based continual learning method for next activity prediction that handles concept drift without catastrophic forgetting.
Language agents that learn adaptive policies at test-time through environment interactions, improving performance via iterative refinement.
Data-driven framework using skeleton-based biomechanical analysis and motion modeling for personalized dart training.
Pipeline using street-view image analysis and ML imputation to extract building elevation data for flood risk assessment in Texas.
Federated learning approach using HAPS networks with weighted client selection to handle non-IID data distributions.
Causal k-means clustering algorithm to identify heterogeneous treatment effects across unknown subgroups.
Parameter-efficient adaptation method for foundation models via black-box visual prompting without full parameter access.
Framework for approximating probability distributions using weighted particles via maximum mean discrepancy and gradient flows.
Empirical study measuring how prompt and response characteristics impact energy consumption and inference costs of LLM operations.
Develops polynomial-time algorithm for solving Stampacchia variational inequalities under the Minty condition.
Proposes multi-timescale variants of primal-dual hybrid gradient algorithm for distributed optimization problems.
Deep learning framework for image zero-watermarking using noise-adversarial training to learn distortion-invariant features.
Introduces AICO, a tool for testing feature significance in supervised learning models to improve interpretability and fairness.
Presents method to train generative models that learn causally disentangled latent representations using context modules.
Proposes boosted quantile regression neural networks with entropy analysis for predicting patterns in complex dynamical systems.
Evaluates 5 LLMs on fairness and inclusion bias in summarizing parliamentary proceedings, measuring representation gaps across demographic groups.
Theoretical study of zeroth-order query complexity for sampling from logconcave distributions using only function evaluations, no gradient information required.
Gauss-Newton reinforcement learning method for model predictive control offering second-order convergence with faster training than first-order RL methods.
HEAS framework for agent-based simulation combining hierarchical evolution with metric standardization to improve multi-objective policy search in complex systems.
Uses LLMs to analyze 150+ years of German parliamentary debates on migration, demonstrating constraint-free large-scale political text analysis without manual annotation.
LLM-based framework for loop invariant synthesis to accelerate program verification with sound evaluation methodology.
Study of adversarial robustness in conformal novelty detection using learning-based frameworks with FDR guarantees.
Method for adapting coverage levels in conformal prediction based on individual sample characteristics.
Framework for linear contextual bandits leveraging pretrained models for feature imputation in partially observed contexts.
Safe reinforcement learning approach using adaptive action scaling to reduce constraint violations during training.
Sentiment-guided augmentation technique for multimodal sentiment analysis addressing data scarcity in video, audio, and text.
System for reducing remote video inference latency through on-device correction with lightweight models for robotics and edge devices.
Multi-agent framework using LLMs for interpreting gene clusters from RNA-seq data in antimicrobial resistance research.
Quantum classifier using Hamming distance measurements with classical post-processing for improved noise robustness.
Geometric approach to post-hoc debiasing in vision-language models by treating bias as subspace rather than individual coordinates.
Self-supervised learning framework using masked autoencoders to learn view-invariant representations from multi-view radiology data.
Deterministic world models for verification of vision-based control systems avoiding stochastic latent variable overapproximation.
Reinforcement learning approach to control stylized motion of animated character robot using animation references.