Inverse Neural Operator for ODE Parameter Optimization
Two-stage framework using Conditional Fourier Neural Operators to recover hidden ODE parameters from sparse observations.
Two-stage framework using Conditional Fourier Neural Operators to recover hidden ODE parameters from sparse observations.
Framework for WiFi CSI sensing across multiple stations handling missing features and limited labeled data using self-supervised learning.
Analyzes the role of reversible instance normalization in time series forecasting, addressing distribution shifts in temporal and spatial data.
Uses reinforcement learning to adapt LLM-based recommender systems for dynamic, need-specific objectives and complex recommendation goals.
Proposes optimal balancing weights for estimating individual treatment effects in multi-treatment scenarios using causal representation learning.
EnTransformer generative architecture for multivariate probabilistic forecasting with reliable uncertainty quantification.
Chem4DLLM multimodal LLM interprets 4D molecular trajectories to explain chemical dynamics and reactions.
MobileKernelBench evaluates LLM capabilities for generating efficient compute kernels optimized for mobile devices.
Circuit mapping and mechanistic interpretability of Geneformer foundation model reveals redundancy and layer-dependent control.
Addresses over-squashing in graph neural networks via effective resistance rewiring considering global connectivity constraints.
Formalizes statistical and structural identifiability as distinct properties explaining representation learning model stability.
Extends DeepONets operator approximation framework to topological settings generalizing Chen-Chen theorem.
Theoretical analysis of generalization in multipass preconditioned SGD relating population risk curvature to effective dimension.
Deep learning metamodeling for nonlinear stochastic dynamic systems accounting for parametric and predictive uncertainty.
Flowcean framework automates data-driven model generation for cyber-physical systems with modular architecture.
Studies unitary matrix product states from tensor networks for generative modeling with strong physical interpretability.
Analyzes frequentist consistency of prior-data fitted networks for causal inference compared to classical estimators.
Training-free decoding acceleration for LLMs exploiting stable attention patterns within semantic spans during generation.
Mathematically proves chemical reaction networks can solve tasks requiring hidden layers in spiking neural networks.
Develops multi-label temporal convolutional framework for predicting transcription factor binding sites in DNA.
Addresses cross-domain reinforcement learning challenges when source and target domains have different state or action spaces.
Uses LLMs with feedback memory to automate neural architecture search for CNNs on consumer GPUs without fine-tuning.
Proposes MMDDPG algorithm for stable reinforcement learning under environmental disturbances and model uncertainties.
Cornserve distributed serving system for any-to-any multimodal models with different input/output modalities and scaling characteristics.
Automated generation of high-performance RL environments using prompt templates, verification, and agent-assisted repair for <$10 compute cost.
IsoCompute scaling laws for optimal allocation of sampling compute across rollouts, problems, and update steps in LLM RL post-training.
Theoretical analysis of catastrophic forgetting in continual post-training of generative models under two-mode mixture abstraction.
Neural Thickets shows task-specific expert solutions exist in pretrained weight distributions, enabling discovery through structured optimization.
Perplexity's analysis of security considerations for frontier AI agents based on operating agentic systems at scale.
STAMP framework for text privatization using task-aware token-level privacy budget allocation balancing privacy sensitivity and task utility.
Feature-matching objective for LLM fine-tuning targeting sequence-level statistics without task-specific verifiers.
Large-scale entity matching benchmark with 755K labeled pairs for multilingual compliance workflows benchmarked with LLMs.
Analytical theory connecting LLM hyperparameters to speculative decoding throughput efficiency without training.
End-to-end TinyML system for autonomous navigation on ESP32 microcontroller with quantized CNN.
Latent diffusion framework for drug-target affinity prediction with improved cold-start generalization.
Deep neural network regression approach using minimum error entropy principle for dependent observations.
Automated metadata curation for museum video archives using multimodal ML with database grounding.
Self-supervised ML approach for symbolic simplification of mathematical expressions using oracle trajectories.
PACED framework for efficient LLM distillation by focusing training on problems at frontier of student competence.
Graph-based transformer approach for learning domain name embeddings from DNS queries for intrusion detection.
Security-focused steering mechanisms for LLM-based code generation using internal representations to prevent vulnerable code.
Evaluation of frontier AI models' autonomous capabilities on multi-step cyber attack scenarios across 18-month period.
Analysis of how LLM outputs change through iterative reprocessing, examining convergence behavior in generation chains.
Methods for calibrating predictive distributions from forecasting systems, especially for rare event prediction.
Audio ML system for detecting cough segments in tuberculosis screening using pre-trained architectures.
Framework for evaluating and disentangling latent representations in VAEs, with focus on tabular data interpretation.
Open-source Python framework for standardized evaluation of generative models for single-cell gene expression data with consistent metrics.
Machine learning-enhanced method for modeling nonlinear gas flow in porous media with pressure-dependent permeability.
Exploration strategy for contextual bandits with black-box reward models using regularization-induced exploration techniques.
Deep learning approach for MRI parameter mapping using denoising priors to reduce aliasing artifacts in accelerated imaging.