Hybrid Energy-Based Models for Physical AI: Provably Stable Identification of Port-Hamiltonian Dynamics
Energy-based modeling framework for port-Hamiltonian system identification with provable stability guarantees, applied to physical AI.
Analysis of OOD anomaly where deep networks assign higher density to simple out-of-distribution data than in-distribution test data.
MOON3.0 multimodal representation learning for e-commerce product understanding using reasoning-aware MLLMs to capture fine-grained attributes.
Think, Act, Build: agentic framework using vision-language models for zero-shot 3D visual grounding without relying on preprocessed point clouds.
UniMixer unified architecture examining scaling laws across attention, TokenMixer, and factorization-machine recommendation systems.
Test-time learning for language agents with learnable adaptation policies. Improves agent behavior through iterative refinement at inference.
Dignified Peer framework countering sycophancy and evasiveness in aligned LLMs through anti-sycophancy and empathy.
MyPhoneBench evaluation framework measuring privacy compliance in phone-use agents during mobile task completion.
ORBIT generates 20K training queries for search agents integrating LMs with web search using scalable and verifiable methods.
DR-LoRA assigns dynamic ranks to expert modules in MoE models for efficient parameter-specific fine-tuning of LLMs.
Triadic Cognitive Architecture for tool-using agents with principled bounds on information-acquisition costs and deliberation.
BIOGEN multi-agent reasoning framework using evidence-grounding for transcriptomic interpretation in antimicrobial resistance.
TaCarla comprehensive benchmarking dataset for end-to-end autonomous driving with perception and planning tasks.
MemFactory unified framework for training and inference in memory-augmented LLMs using RL to optimize memory operations.
Sven optimization algorithm exploiting natural loss decomposition using Moore-Penrose pseudoinverse for efficient neural network training.
Framework training LLMs to forecast supply chain disruptions using calibrated probabilistic forecasts from disruption outcomes.
UQ-SHRED adds uncertainty quantification to shallow recurrent decoder networks for sparse spatiotemporal reconstruction.
Online machine learning framework for multi-resolution energy system design optimization and performance analysis.
JetPrism diagnoses convergence issues in Conditional Flow Matching for physics simulations and inverse problems.
Distributed graph modeling approach for detecting money laundering transaction patterns at scale.
Tutorial on Bayesian Optimization as a principled framework for automating scientific discovery using surrogate models.
Principled layer-wise optimization approach for model merging via data-free covariance estimation without task-specific training.
SECURE framework addressing robustness issues in deep learning models for autonomous driving collision prediction.
GPU-accelerated inference algorithm for multivariate Hawkes processes achieving O(N) complexity with parallelization.
Novel Langevin-based algorithm for adaptive inverse reinforcement learning using Malliavin calculus for gradient estimation.
PI-JEPA: Physics-informed surrogate model for multiphysics simulation exploiting unlabeled parameter fields via latent prediction.
Residuals-based offline reinforcement learning approach for high-stakes applications with restrictive data coverage assumptions.
Benchmark datasets and evaluation protocols for machine learning methods on photoplethysmography medical signals.
Train-to-Test scaling laws optimizing model size, training tokens, and inference samples jointly for compute-optimal LLM deployment.
Study of reward hacking in LLM RL showing reproducible failure patterns and mitigation strategies using representation-level signals.
Hierarchical RL framework for privacy-preserving synthetic clinical data generation combining LLMs with structured learning.
CuTeGen: LLM-based agentic framework for automated generation and optimization of high-performance GPU kernels using CuTe abstraction.
Comparative study of Evolution Strategies vs GRPO for LLM post-training showing ES achieves comparable accuracy with different parameter geometry.
Residual decomposition framework for improving classifier performance on long-tailed datasets beyond standard logit adjustment.
Self-supervised framework for learning clinical ECG image representations without access to raw signal recordings.
ZEUS: Training-free acceleration method for diffusion models using second-order predictors to reduce sampling steps.
Care-Conditioned Neuromodulation framework for LLM-based dialogue agents that balances helpfulness with user autonomy preservation.
EEG seizure detection method using graph neural networks with self-supervised learning and information bottleneck principles.
Influence-Guided PPO framework for LLM post-training that filters noisy rollouts using data attribution to improve training efficiency.
Research on training LLMs to develop both in-context and in-weights learning capabilities simultaneously via contrastive context sampling.
Novel reinforcement learning algorithm addressing noisy temporal difference errors in deep RL through pseudo-quantization methods.
arXiv paper on expert-choice routing for diffusion language models. Deterministic load balancing improves throughput and convergence vs token-choice.
arXiv paper on CRIT, graph-based automatic data synthesis for cross-modal multi-hop reasoning. Generates complementary image-text data.
arXiv paper on label shift estimation with incremental prior updates. Addresses distribution mismatch between training and deployment.
arXiv paper on coupled query-key dynamics for scaled dot-product attention. Improves language modeling perplexity by 6-7% on WikiText-103.
arXiv paper introducing MiCA, parameter-efficient LLM fine-tuning method adapting minor singular vector subspaces. Outperforms LoRA on knowledge retention.
arXiv paper on transformer encoder-decoder with multimodal learning for wind structural health monitoring and digital twins.
arXiv paper on MATA-Former for ICU risk prediction using semantic-aware temporal alignment. Clinical-logic-aligned transformer architecture.
arXiv paper applying Koopman operator methods for multivariable control of turbofan engines. Meta-heuristic extended dynamic mode decomposition.
arXiv paper on DDCL, differentiable end-to-end framework for unsupervised prototype-based representation learning. Integrates feature learning with clustering.