SmartBench: Evaluating LLMs in Smart Homes with Anomalous Device States and Behavioral Contexts
SmartBench: evaluation benchmark for LLMs in smart home environments with anomalous device detection.
SmartBench: evaluation benchmark for LLMs in smart home environments with anomalous device detection.
HEARTS: benchmark for evaluating LLM reasoning on diverse health time series tasks and modalities.
SR-TTT: test-time training for LLMs with infinite context via fast weights, improving long-context reasoning.
Trust-aware federated learning framework for bone healing classification in distributed medical environments.
GAN-based hurricane bias correction for storm surge and wind forecasting beyond gauge stations.
Geodesic gradient descent: learning-rate-free Riemannian optimization algorithm for complex manifolds.
Ensemble learning framework for financial risk detection in ERP systems with leakage-safe evaluation.
Monte-Carlo sampling method for improving guidance in SDE-based diffusion generative models.
ATLAS: reinforcement finetuning framework for scaling agent capabilities with large toolspaces using small language models.
Synthetic EHR generation pipeline with clinical consistency validation for privacy-preserving health data sharing.
ProtAlign: contrastive learning framework for protein sequence-structure alignment using language models.
Hybrid approach combining time series foundation models with regression models for electricity price forecasting capturing temporal and cross-variate patterns.
Safe Transformer: Modular approach adding explicit safety bit as interpretable information bottleneck for controllable alignment in language models.
Orion: First open system enabling direct LLM training and inference on Apple Neural Engine (ANE) with compiler pipeline and on-device training support.
Reinforcement learning approach for safe neural navigation in variable-density crowds that generalizes beyond training conditions.
Super-resolution transformer optimization using FlashAttention with rank-factorized implicit neural bias to reduce computational burden.
Efficient decentralized framework for training diffusion models with heterogeneous objectives across isolated experts with reduced computational requirements.
Constrained generation framework for diffusion models handling complex feasible regions in robotics and autonomous driving applications.
Method to stabilize Group Relative Policy Optimization (GRPO) for diffusion language models by addressing reward collapse issues in post-training.
DIRECTER: Dynamic rejection steering technique for LLMs to improve instruction following while avoiding oversteering that degrades output quality.
Multi-objective protein sequence design balancing designability with properties like solubility and thermostability.
Quantum implicit neural representations in autoencoders and VAEs for image reconstruction and generation tasks.
Drift detection method for imbalanced streaming data using unbiased cluster descriptors in dynamic systems.
SHAP explainability methods for improving interpretability of ML models in Alzheimer's disease diagnosis and prognosis.
Physics-informed neural networks optimization via sparse QUBO and coreset construction for efficient collocation point selection.
Meta-learning approach using graph convolutional networks to handle traffic flow prediction under network disruptions.
Symbolic machine learning for failure detection in chemical processes, addressing interpretability vs neural network approaches.
Theoretical analysis of coordinate freedom and metric dependence in neural network representation spaces.
Deep reinforcement learning approach using heterogeneous graph transformers for job shop scheduling problems.
Graph diffusion and spatial attention framework for imputing sparse spatial transcriptomics data in gene expression analysis.
xaitimesynth: Reusable Python package for evaluating time series attribution methods using synthetic ground truth data with class-discriminating features.
Physics-informed diffusion model for generating synthetic rare weather events preserving physical consistency. Addresses data scarcity for tropical cyclone detection.
OPR: Lightweight mechanism preventing premature policy convergence in deep RL by maintaining buffer of high-performing episodes during optimization.
NEST: Device placement optimization for distributed deep learning that jointly considers parallelism, memory, and network topology without post-hoc feasibility fixes.
Framework for cooperative multi-agent RL with submodular rewards modeling overlapping agent contributions. First formal analysis of diminishing returns in team coordination.
Rectified flow and Ginzburg-Landau guidance methods for 3D gravity and magnetic inversion in subsurface ore detection. Physics-based inverse problem solving.
C3 algorithm for multi-agent reinforcement learning with LLMs. Addresses credit assignment problem in sparse reward scenarios through contextual counterfactual analysis.
Gaussian Linear Unit activation function analyzing mathematical relationships among modern transformer alternatives to ReLU.
Stochastic attention mechanism via Langevin sampling on Hopfield energy enabling temperature-controlled retrieval and generation.
Physics-informed surrogate model for ferroelectric NAND retention analysis replacing expensive TCAD simulations.
LLM-based CAD program generation using design procedures and geometric constraints for parametric model synthesis.
Generative models for tabular data using XGBoost as score estimator with denoising diffusion for small and large datasets.
Eigenspectral framework analyzing information flow in LLM feed-forward networks through lightweight spectral metrics.
Mixture-of-experts approach for state space models with expert specialization while maintaining computational efficiency.
Physics-consistent neural networks for Cosserat elasticity modeling deformation and director fields in microstructured materials.
Reinforcement learning formalism for coupled-dynamics environments specifying joint distributions across counterfactual actions.
Study of graph sparsification impact on GNN pipeline performance and scalability for billion-node graphs.
Reinforcement learning approach for chart comprehension in vision-language models using verifiable rewards for symbolic reasoning.
Imitation learning analysis for quadruped locomotion showing effectiveness in small data regimes via limit cycle structure.
Optimal transport framework for conditional generative modeling robust to outliers using unbalanced transport.