M$^2$RNN: Non-Linear RNNs with Matrix-Valued States for Scalable Language Modeling
M²RNN introduces matrix-valued hidden states in RNNs to achieve greater expressive power than transformers for language modeling tasks.
M²RNN introduces matrix-valued hidden states in RNNs to achieve greater expressive power than transformers for language modeling tasks.
Theory compiler framework automates translation of formal domain knowledge into neural network architectural constraints with correctness guarantees.
SPARQ framework combines spiking neural networks with quantization-aware training and RL-guided early exits for energy-efficient edge AI deployment.
Bilateral Decoupled Decay improves soft clipping in LLM reasoning with RLVR, addressing gradient divergence and enabling better exploration during policy optimization.
ES-Merging merges biological multimodal LLMs using embedding space signals to enable cross-modal scientific discovery beyond single-modality specialization.
Spatiotemporal graph-based deep learning for detecting energy losses, theft, and inefficiencies in oil and gas production networks under distribution shifts.
OFA-TAD proposes generalist one-for-all anomaly detection for tabular data with cross-domain generalization, replacing dataset-specific training approaches.
Model-based debiasing framework for recommendation systems addressing heterogeneous biases across user, content, and model dimensions in ranking aggregation.
Trainless GUI grounding for MLLM-based agents using element-level inference to map natural language to UI components without fine-tuning or large datasets.
Graph convolutional network for disease prediction in beehives using spatio-temporal modeling of inter-hive relationships from sensor networks.
Addresses linear sketching problem for data streaming, achieving near-perfect recovery from compact sketch summaries with lightweight computational procedures.
Geometric deep learning framework using GNNs to predict cold spray particle impact responses from simulation data for deposition process modeling.
Combines causal representation learning with local sparse attention for system identification, enabling interpretable deep learning of dynamical systems without predefined function libraries.
Unlearning-based sliding window approach for continual learning under concept drift, enabling models to adapt to non-stationary data streams without explicit task boundaries.
Physics-informed ML framework predicts stress-strain behavior of additively manufactured materials by combining data-driven models with physical constraints.
Proposes trust-region search algorithm for aligning diffusion and flow models to target rewards at inference time without requiring differentiable reward models.
Visualizes critic loss landscapes in online actor-critic reinforcement learning to improve interpretability of algorithm performance under changing system dynamics.
Demonstrates deep neural networks can meta-learn task sequencing from few demonstrations, enabling generalization to new sequencing problems without task-specific training.
CausalEvolve improves LLM-based AI agents for open-ended scientific discovery by adding causal guidance and knowledge organization mechanisms to program evolution.
Extends critic match loss landscape visualization from online to off-policy reinforcement learning to reveal optimization geometry in critic learning.
FlashHead provides efficient drop-in replacement for classification head in language model inference, reducing parameters and compute by ~50%.
Multi-scale graph learning framework for fraud detection in transaction networks handling sparse anomalies and temporal drift.
Loss landscape visualization framework for interpreting reinforcement learning algorithms, demonstrated on critic-based control methods.
Delightful policy gradient method that addresses variance issues in policy gradient updates by accounting for action likelihood under current policy.
Proactive routing system that selects between black-box models and interpretable surrogates with distribution-free safety guarantees.
Fairness evaluation framework for automated prior authorization systems addressing demographic differences in clinical decision-making.
Deep learning methodology to predict and correct thermal limit bias in boiling water reactors for nuclear power plant operations.
EARCP ensemble architecture dynamically weights heterogeneous expert models based on performance and inter-model coherence for sequential decision making.
AgentTrace provides causal graph tracing for post-hoc failure diagnosis in deployed multi-agent systems through execution log analysis.
Chain-of-Trajectories framework enables content-aware sampling schedules in diffusion models via graph-theoretic planning without additional training.
Cross-RAG applies retrieval-augmented generation with cross-attention to improve zero-shot time series forecasting using foundation models.
Training-free protein sequence generation using stochastic attention over Hopfield energy without requiring training or pretraining data.
Multimodal deep learning approach combining time-series EHR data and clinical notes for predicting patient deterioration in ICU settings.
DeFRiS applies decentralized federated reinforcement learning for IoT application scheduling across heterogeneous devices while preserving privacy.
GNNVerifier uses graph neural networks to verify and correct task plans generated by LLMs in autonomous agent systems, reducing hallucinations.
CAMD proposes coverage-aware decoding for multimodal LLMs to allocate compute efficiently by identifying easy vs hard reasoning cases.
Method to measure decision boundary geometry of neural networks using local surface volumes to analyze model accuracy and robustness properties.
POLCA framework uses LLMs as optimizers to automatically improve complex systems like prompts and multi-turn agents through numerical rewards and text feedback.
HO-SFL proposes hybrid-order split federated learning to reduce memory costs of backpropagation on edge devices while maintaining convergence speed.
Orthogonal Subspace Clustering method for high-dimensional data using matrix decomposition and factor analysis with theoretical guarantees.
LaPro-DTA framework for generalizable drug-target affinity prediction using latent dual-view drug representations and salient protein features.
GARCH-FIS hybrid model combining fuzzy inference with GARCH for financial time series forecasting with dynamic parameter adaptation.
Multi-task genetic algorithm with multi-granularity encoding for protein-nucleotide binding site prediction.
Universe Routing framework addressing epistemic control in self-evolving agents by managing epistemologically incompatible reasoning frameworks.
OpenReservoirComputing: Python library for GPU-accelerated reservoir computing in JAX with automatic differentiation and JIT compilation.
Theoretical analysis of dataset distillation showing how gradient-based learning extracts and encodes task-relevant information into synthetic data.
Mechanistic analysis of multi-stream transformer architectures with manifold-constrained hyper-connections using ablation and causal methods.
SafeDriver-IQ framework for real-time driver safety scoring using inverse crash probability modeling with continuous risk quantification.
Two-stage multimodal framework combining weather foundation models and satellite data for fine-grained solar irradiance forecasting.
Sample-efficient hypergradient estimation method for decentralized bi-level reinforcement learning with leader-follower agents.