Orion-Bix: Bi-Axial Attention for Tabular In-Context Learning
Orion-Bix: Bi-Axial Attention for Tabular In-Context Learning
Orion-Bix: Bi-Axial Attention for Tabular In-Context Learning
A unified framework for geometry-independent operator learning in cardiac electrophysiology simulations
Generalization of Diffusion Models Arises with a Balanced Representation Space
EvoXplain: When Machine Learning Models Agree on Predictions but Disagree on Why -- Measuring Mechanistic Multiplicity Across Training Runs
Discount Model Search for Quality Diversity Optimization in High-Dimensional Measure Spaces
Building Production-Ready Probes For Gemini
Shortest-Path Flow Matching with Mixture-Conditioned Bases for OOD Generalization to Unseen Conditions
Provably Robust Bayesian Counterfactual Explanations under Model Changes
Tractable Gaussian Phase Retrieval with Heavy Tails and Adversarial Corruption with Near-Linear Sample Complexity
Rank-1 Approximation of Inverse Fisher for Natural Policy Gradients in Deep Reinforcement Learning
Analysis of Control Bellman Residual Minimization for Markov Decision Problem
Implicit Hypothesis Testing and Divergence Preservation in Neural Network Representations
Transform-Augmented GRPO Improves Pass@k
TABES: Trajectory-Aware Backward-on-Entropy Steering for Masked Diffusion Models
Scalable Spatio-Temporal SE(3) Diffusion for Long-Horizon Protein Dynamics
Expanding the Capabilities of Reinforcement Learning via Text Feedback
Learning to Explore with Parameter-Space Noise: A Deep Dive into Parameter-Space Noise for Reinforcement Learning with Verifiable Rewards
Entropy-Guided Dynamic Tokens for Graph-LLM Alignment in Molecular Understanding
The Label Horizon Paradox: Rethinking Supervision Targets in Financial Forecasting
A Function-Space Stability Boundary for Generalization in Interpolating Learning Systems
Live or Lie: Action-Aware Capsule Multiple Instance Learning for Risk Assessment in Live Streaming Platforms
Learning, Solving and Optimizing PDEs with TensorGalerkin: an efficient high-performance Galerkin assembly algorithm
Variational Speculative Decoding: Rethinking Draft Training from Token Likelihood to Sequence Acceptance
Escaping Local Minima Provably in Non-convex Matrix Sensing: A Deterministic Framework via Simulated Lifting
ContextBench: A Benchmark for Context Retrieval in Coding Agents
Dense Neural Networks are not Universal Approximators
A Thermodynamic Theory of Learning Part II: Critical Period Closure and Continual Learning Failure
Distribution-Free Robust Predict-Then-Optimize in Function Spaces
Breaking the Simplification Bottleneck in Amortized Neural Symbolic Regression
Learning to Remember, Learn, and Forget in Attention-Based Models
Importance inversion transfer identifies shared principles for cross-domain learning
UniComp: A Unified Evaluation of Large Language Model Compression via Pruning, Quantization and Distillation
Beyond Student: An Asymmetric Network for Neural Network Inheritance
Fully-automated sleep staging: multicenter validation of a generalizable deep neural network for Parkinson's disease and isolated REM sleep behavior disorder
A Controlled Study of Double DQN and Dueling DQN Under Cross-Environment Transfer
Infusion: Shaping Model Behavior by Editing Training Data via Influence Functions
Neural Score Matching for High-Dimensional Causal Inference
Airway Tree Modeling Using Dual-channel 3D UNet 3+ with Vesselness Prior
Fine-grained Analysis of Non-parametric Estimation for Pairwise Learning
Diffusion posterior sampling for simulation-based inference in tall data settings
Tensor learning with orthogonal, Lorentz, and symplectic symmetries
Exponential time differencing for matrix-valued dynamical systems
Bridging Explainability and Embeddings: BEE Aware of Spuriousness
When Speculation Spills Secrets: Side Channels via Speculative Decoding In LLMs
End to End Collaborative Synthetic Data Generation
Auditing a Dutch Public Sector Risk Profiling Algorithm Using an Unsupervised Bias Detection Tool
EmbBERT: Attention Under 2 MB Memory
Translate Policy to Language: Flow Matching Generated Rewards for LLM Explanations
Multi-Objective Bayesian Optimization for Networked Black-Box Systems: A Path to Greener Profits and Smarter Designs
Decentralized Reinforcement Learning for Multi-Agent Multi-Resource Allocation via Dynamic Cluster Agreements