TabSieve: Explicit In-Table Evidence Selection for Tabular Prediction
Framework for tabular prediction using explicit in-table evidence selection, making row context auditable and interpretable.
Framework for tabular prediction using explicit in-table evidence selection, making row context auditable and interpretable.
Physics-based state estimation method using potential-energy gating for robust filtering in bistable stochastic systems.
Diffusion LLM approach for CUDA kernel code generation leveraging parallel token generation and non-sequential refinement.
Parameter-efficient fine-tuning method viewing adaptation as neuromodulation-inspired mode selection and rescaling of pretrained computations.
U-Former ODE architecture for fast probabilistic forecasting of irregularly sampled time series using neural differential equations.
ML framework for reliable network traffic forecasting using deep learning adapted for traffic characteristics.
Multi-tenant model serving system handling seamless model updates with dynamic decision threshold management.
Framework treating temperature as adaptive meta-policy in LLM reinforcement learning to improve exploration-exploitation tradeoff.
Method for fair classification without demographic information using spectral uncertainty sets.
Benchmark for evaluating LLM safety under repeated inference via prompt stress testing, addressing consistency failures in deployment.
Method for learning stochastic partial differential equations from spatiotemporal observations using latent-variable formulation and deep learning.
Enhances off-policy RL sample efficiency by constraining initial representations to address distribution shift and stabilize training.
SpaTeoGL framework uses spatiotemporal graph learning for interpretable seizure onset zone analysis from intracranial EEG data.
TopoFair analyzes topological biases in graph link prediction benchmarks beyond homophily to improve fairness in recommendations.
DeepFusionKernel optimizes agentic LLM inference by fusing SwiGLU blocks, reducing memory bandwidth bottlenecks by 9-13% on A100/H100.
CAAL framework uses confidence-aware active learning to estimate atmospheric particle properties from limited observational data.
Multi-agent simulation study using opponent shaping for sustainable investment policies addressing climate change coordination.
Robust optimization approach for resilient network design with regenerators under link and node uncertainty.
Proposes A²V-SLP, alignment-aware variational framework for disentangled sign language production using articulator-specific distributions.
Analyzes in-context learning in LLMs through Gaussian Process lens, studying function learning from few demonstrations at inference.
Studies mixed-bit quantization strategies for world model planning, analyzing where precision allocation matters most.
Diffusion-based framework downscales low-resolution weather forecasts to probabilistic high-resolution predictions without model-specific tuning.
Addresses reference policy mismatch in DPO by handling pessimistic preference pairs where reference model prefers rejected responses.
Introduces PAC learning framework for predicting conditional averages over arbitrary neighborhoods rather than target concepts.
Extends Puzzle NAS framework to optimize gpt-oss-120B into 88B model using MoE pruning and attention optimization for inference acceleration.
Proposes temporally consistent adversarial perturbations for time series forecasting models to improve robustness against attacks.
Addresses predictive multiplicity in AI decision support systems, measuring individual performance disagreement under the AI Act framework.
PMFL framework improves federated learning in heterogeneous scenarios using model-contrastive learning with historical training information.
LRBTC system uses dual LLM and VLM architecture for automated quality control of pharmaceutical content covering regulatory, brand, and technical compliance.
RAM-Net architecture combines linear attention efficiency with full attention expressivity using selectively addressable memory to reduce information loss.
Manifold-aware temporal domain generalization for LLMs handling distribution shifts via parameter-efficient geometric reformulation.
Momentum LMS theory for non-stationary streaming data analyzing stability and regret without i.i.d. assumptions.
Analysis of differential privacy impact on firing-rate statistics in federated spiking neural networks for neuromorphic learning.
FedGRPO: federated learning method for foundation models using group-relative rewards from domain clients with privacy preservation.
PrefillShare: shared prefill module reducing redundant computation across multiple LLMs in disaggregated serving with KV cache reuse.
Reciprocal-space generative pipeline for crystalline materials using Fourier transforms and diffusion models with symmetry constraints.
Online RL approach training LLMs for HPC code generation using real supercomputer runtime performance (GFLOPS) as rewards.
Automatic soccer event detection from player trajectories without ball tracking using possession path inference.
Empirical GPs: principled framework for learning kernel functions automatically rather than handcrafting from standard functions.
Novel method learning structured latent representations using metric spaces for multimodal state estimation in RL without explicit noise assumptions.
Theoretical analysis of offline RL under Q*-approximation and partial coverage, answering whether Q*-realizability enables sample efficiency.
Few-shot Bayesian optimization framework exploiting auxiliary information from experiments for expensive black-box design problems.
KAN-FIF applies spline-parameterized neural networks for tropical cyclone estimation on resource-constrained meteorological satellite devices.
Meta-Sel: lightweight supervised meta-learning approach for efficient demonstration selection in in-context learning with tight prompt budgets.
Investigates whether LLMs trained with RL spontaneously exploit reward function loopholes without malicious intent, examining alignment risks.
Theoretical analysis of on-policy distillation showing it as special case of KL-constrained RL with reward extrapolation improvements.
Introduces damped harmonic oscillators method for irregular time series modeling as alternative to Transformers and Neural ODEs.
Proposes TIME benchmark addressing limitations in time series foundation models including data composition, integrity, and task formulation issues.
SafeNeuron provides neuron-level safety alignment mechanism for LLMs by targeting safety-critical parameters to prevent alignment bypass attacks.
Group Relative Policy Optimization enables amortized molecular design learning transferable to unseen molecules rather than instance-specific optimization.