Soft Equivariance Regularization for Invariant Self-Supervised Learning
Soft equivariance regularization augments invariance-based self-supervised learning to preserve transformation-dependent structure for robustness.
Soft equivariance regularization augments invariance-based self-supervised learning to preserve transformation-dependent structure for robustness.
Analysis of multimodal LLM generalization showing RGB-only approaches fail to generalize across cameras due to camera parameter entanglement.
Conformal prediction framework for solar flare regression with uncertainty quantification and false alarm reduction.
PolyBlocks MLIR-based compiler infrastructure for AI chips and frameworks using analytical cost models for automated high-performance code generation.
Calibrated Credit Intelligence combines Bayesian uncertainty with gradient boosting for fair credit risk scoring under distribution shift.
ML surrogate models for predicting steady-state flow through porous media in cold plate topology optimization.
Best-of-Tails method for inference-time alignment balancing optimism and pessimism in LLM candidate selection with imperfect reward models.
CREDO combines conformal prediction with credal methods for regression with distribution-free coverage and epistemic uncertainty quantification.
Theoretical analysis of contextual bilateral trade with heavy-tailed valuations and infinite variance using epoch-based pricing.
SymLang framework uses language-guided program synthesis with symmetry constraints to discover governing equations from noisy experimental data.
Analysis of fairness constraints in ML systems showing when enforcing fairness leads to worse outcomes for affected groups.
Self-evolving NLP extraction system that adapts to domain-specific taxonomies and emerging terminology in specialized fields like medical and legal.
Privacy analysis of DNA foundation model embeddings showing inversion attacks can reconstruct genomic sequences from shared embeddings.
Theoretical analysis of post-training linear autoregressive models with policy gradients, proving convergence properties under margin conditions.
SISA-based machine unlearning framework for removing poisoned training data influence in power transformer fault diagnosis models without full retraining.
Graph-based reinforcement learning framework for power distribution network resilience incorporating topological features for outage management.
Analyzes causal masking problem in fairness frameworks where zero average treatment effect hides underlying causal relationships.
Method for discovering interpretable audio attributes using multimodal LLMs for low-resource audio classification with high reliability.
Analysis of safety alignment in small language models using weak supervision and Self-MOA approach for reducing human annotation costs while maintaining usefulness.
Framework for temporal transportation of treatment effects from randomized trials to different time periods.
Token caching optimization for vision-language navigation models accounting for visual and semantic dynamics to reduce inference cost.
Distribution-free trajectory optimization for non-Gaussian stochastic systems using conformal inference with closed-loop guarantees.
Deep generative spatiotemporal model for probabilistic epidemic forecasting with uncertainty quantification.
Automated segmentation and annotation of bowel sounds for objective clinical auscultation using digital acoustic sensors.
Deep learning method for conditional rank-rank regression to measure intergenerational socio-economic mobility with covariate adjustment.
Framework for conditional image generation with flow maps using variational approach for solving inverse problems and adding constraints.
Token merging optimization for Segment Anything Model to improve inference speed while preserving segmentation quality and handling SAM's attention architecture.
Proposes giving advanced AI systems a primary goal of accepting shutdown to address control concerns.
Lightweight and explainable jamming detection for 5G networks using Tsetlin machines for RF interference identification.
Agricultural vision competition focused on data-centric AI and model generalization under real-world distribution shifts rather than model design alone.
Quality estimation for machine translation in low-resource scenarios across multiple domains and language pairs using prompting techniques.
Research on generalization of RL-trained vision-language mobile agents for GUI automation. Addresses lack of standardized benchmarks and open-source RL systems for interactive task learning.
Method for preserving LLM safety alignment during fine-tuning by constraining safety-critical tokens, addressing alignment drift.
Knowledge-grounded NL2SQL system handling heterogeneous SQL dialects with semantic correctness and dialect-specific syntax compliance.
Lightweight backbone for 3D point cloud recognition using nonparametric adaptive embeddings and geometric modulation.
Transformer variant decomposing residual stream into token and context components for interpretable language modeling.
Framework for auditable fine-tuning and inference of proprietary LLMs on cloud platforms with cryptographic verification.
Monograph on probabilistic inference and learning theory using Stein's method with applications to variational gradient descent.
Lightweight on-device adaptation framework for speech enhancement models addressing dynamic acoustic scene changes with frozen backbone.
Wireless sensing system addressing bistatic phase offset calibration for sub-wavelength scale contactless sensing.
World model learning approach using symmetry exploration to capture physical invariances and conservation laws for extrapolative generalization.
Tool for verifying and explaining RL policies for multi-bridge network maintenance with formal safety guarantees and interpretability.
Generative-reconstructive-discriminative network with ROI attention for industrial surface defect detection and localization.
Re-evaluation of LiRA membership inference attacks under realistic assumptions, questioning prior effectiveness claims with realistic threat models.
Systematic comparison of four training objectives (cross-entropy, prototype, triplet, AP loss) for out-of-distribution detection in image classification.
Procedural dataset generation framework for engine sounds with embedded control annotations for automotive audio synthesis.
Security analysis of large vision-language models vulnerable to semantic slot filling attacks that elicit unsafe outputs.
Algorithm for neural spike waveform compression and classification using adaptive level crossing and latent feature representation.
Hierarchical multi-agent system for Kubernetes autoscaling addressing resource waste through coordinated pod and node scaling policies.
arXiv paper on Staged Multi-Agent Training (SMAT) for co-adaptive exoskeleton control, using curriculum learning to mirror human motor adaptation.