Finance-Informed Neural Network: Learning the Geometry of Option Pricing
Finance-informed neural network for option pricing and hedging using self-supervised replication objective based on dynamic hedging theory.
Finance-informed neural network for option pricing and hedging using self-supervised replication objective based on dynamic hedging theory.
General Time-series Model with frequency-domain attention for enhanced representation learning on diverse time-series downstream tasks.
Higher-order guided diffusion model for graph generation that captures non-Euclidean topology using higher-order graph structures.
Riemannian Gaussian Variational Flow Matching for generative modeling on manifolds applied to material and protein design.
Aggregation-free federated learning method for medical image classification using multi-dimensional similarity knowledge distillation across heterogeneous client models.
Framework for compressing large LLM-based ReAct agents into smaller student models while preserving reasoning and action consistency.
Framework for generative modeling with enforced physical constraints using split augmented Langevin sampling for scientific applications.
Hierarchical differential model for inferring system degradation from sensor data by disentangling slow and fast temporal dynamics.
Text-trained LLMs perform zero-shot extrapolation of PDE dynamics, revealing three-stage in-context learning mechanism for spatiotemporal forecasting.
Mathematical study of Busemann functions in Wasserstein space with applications to geometric machine learning and data slicing.
Development of conformal prediction method that ensures counterfactual fairness in prediction sets for fair decision-making under uncertainty.
Zeroth-order optimization approach for continual learning that improves memory efficiency and addresses plasticity-stability tradeoffs without gradient computation.
Research on unifying in-context learning and activation steering as instances of a broader framework using belief dynamics to control LLM behavior at inference time.
Provides theoretical explanation for OOD detection failure on single-domain data through domain feature collapse phenomenon.
Proposes WaveLSFormer, wavelet-based Transformer for intraday trading policy learning with multi-scale decomposition.
Introduces RAT+, structured dilated attention architecture that enables sparse inference while maintaining long-range connectivity and accuracy.
Develops MSFlow, a flow matching approach for de novo molecular structure elucidation from mass spectrometry data.
Proposes controllable exploration strategy for RLVR training of multi-modal LLMs to address entropy collapse and policy degradation.
Investigates gene expression prediction from DNA sequences, showing long sequence modeling can degrade performance despite prior focus on extending sequence length.
Introduces FlashOptim, memory-efficient optimizers for mixed-precision neural network training reducing per-parameter memory requirements.
Shows preference labels in LLM-as-judge training can function as covert communication channels, challenging assumptions about semantic supervision.
Studies parallel Bayesian optimization for expensive black-box function evaluation with theoretical regret bounds.
Investigates tokenizer pretraining impact on physics foundation models for emulating complex multiphysics phenomena in data-limited settings.
Structure-aware set transformers with temporal and variable-type attention for asynchronous clinical time series in EHR data.
Conditional optimal transport maps using unbalanced framework for outlier-robust conditional generative modeling.
Analyzes how MDP design choices (state composition, rewards, dynamics) affect sim-to-real transfer in reinforcement learning for industrial control.
Studies harmonic loss with non-Euclidean distance layers as alternative to cross-entropy for neural network training.
Instance unlearning method for diffusion models removing specific outputs without text prompts, addressing unpromptable undesired generations.
Contract And Conquer algorithm for provably computing adversarial examples against black-box deep neural networks.
Proposes iterative selection of Gaussian mixture priors to prevent posterior collapse in variational autoencoders.
Theoretical bounds on bias introduced by low-dimensional representation learning in conditional average treatment effect estimation.
AI system analyzing police bodycam footage at scale to assess officer-public interactions and improve government accountability.
Statistical analysis of instrumental variable testability in nonlinear treatment effect models with non-constant effects.
RouteNet-Gauss integrates testbed hardware with machine learning model for efficient network simulation and performance estimation.
Generative Predictive Control method augments frozen diffusion policies with action-conditioned world models for improved robot control without retraining.
Applies multi-agent reinforcement learning to greenhouse gas offset credit markets for emissions control and carbon project trading simulation.
Extends Random Dot Product Graph model to accommodate weighted graphs with heterogeneous edge distributions for network analysis.
Data-driven survey identifying 14,648 papers on LLM limitations from 2022-2025 using automated classification and expert validation across 250,000 academic papers.
Novel algorithm for multi-agent reinforcement learning using uncertainty quantification and selective exploration to improve sample efficiency in joint action spaces.
Research paper on measuring whether LLMs comprehend user intent beyond surface-level text patterns, addressing training-inference gaps in language models.
Reinforcement fine-tuning approach for LLMs applied to point-of-interest recommendation with improved semantic indexing.
Neural framework simulating OS GUIs by predicting screen frames from user inputs using RNN and diffusion models.
Open benchmark suite comparing paired encoder and decoder architectures for NLP tasks with controlled parameter counts.
Adapter parameters for efficient multi-task LLM inference on-device via task merging for compositional learning.
Agentic Design Review System orchestrates multiple AI agents to collaboratively analyze graphic designs with meta-agent coordination.
arXiv paper analyzes theoretical limitations of embedding-based retrieval for diverse tasks including reasoning and code generation.
arXiv paper provides theoretical analysis of Flow Matching generative modeling with Lipschitz-bounded statistical guarantees.
DiDi-Instruct trains fast few-step language generation via distillation from discrete diffusion LLMs while maintaining quality.
arXiv paper applies contrastive diffusion guidance to inverse problems with partially specified non-smooth operators like floorplan reconstruction.
arXiv paper initiates theoretical analysis of learning with access to two competing provers for evaluating opaque model properties.