FORGE: Fine-grained Multimodal Evaluation for Manufacturing Scenarios
Fine-grained benchmark evaluating multimodal LLMs on manufacturing scenarios.
Fine-grained benchmark evaluating multimodal LLMs on manufacturing scenarios.
Privacy-preserving synthetic data generation using LLMs with differential privacy mechanisms.
LLM-based tool for identifying HIV-related stigma in clinical narratives.
Data selection framework for multi-turn dialogue instruction tuning addressing dataset noise and inconsistency.
Genetic programming combined with latent space optimization for symbolic regression via neural encoders.
Attack exploiting denoising irreversibility in diffusion language models to bypass safety alignment.
Optimization for geometric foundation models in monocular SLAM via efficient keyframe selection.
Empirical study measuring bias amplification in multi-agent systems topologies and feedback loops.
Text-to-speech synchronization method for automated dubbing with phonetic alignment.
Interactive ASR system with semantic coherence evaluation and human-like correction mechanisms.
Physics-guided surrogate learning for turbulent flow control without heavy RL computational costs.
Framework for managing hierarchical instruction conflicts in multi-source LLM agent environments.
Theoretical connection between Transformers, diffusion maps, and magnetic Laplacians through Markov geometry.
Framework for evaluating fairness and equity across patient subgroups in brain tumor segmentation models.
Study of deliberative alignment for deeper safety in reasoning LLMs via attribution analysis.
Analysis of working memory limitations in LLMs and comparison with biological systems.
RWKV-based RL approach with explicit belief state representation for partial observability problems.
Computational model using Transformer self-prior to simulate mirror self-recognition behavior without external rewards.
Theoretical analysis comparing entropy regularization and covariance-based mechanisms for controlling policy collapse in RL-enhanced LLMs.
Framework for robust structured prediction using Tsallis reweighting and task-agnostic prompting with XML structure for group-robust fine-tuning.
Guide-Core Policies framework for black-box LLM agents where guide models generate structured strategies executed by core models reducing inference costs.
Review of explainable AI mechanisms for human activity recognition in healthcare, assistive living, and smart environments applications.
Unified visual encoding and decoding framework from neural activity modeling consistency between brain stimulus prediction and reconstruction.
Self-supervised learning on satellite imagery predicting mycorrhizal fungal biodiversity at landscape scales for ecosystem monitoring.
Analysis of preference encoding in looped transformer internal states using lightweight evaluator heads on RLHF dataset.
Methods for personalizing generative user interfaces addressing subjective preferences through preference divergence analysis and sparse feedback.
Self-supervised semantic enrichment method for medical vision-language datasets addressing reporting bias in radiology reports.
Multimodal EHR model for pediatric emergency triage using modality dropout to improve generalizability across demographics.
Convergence analysis of randomized Kaczmarz and SGD with greedy step size proving O(1/t^3/4) last-iterate convergence rate.
Temporal-aware network improving simultaneous speech translation policy to balance quality and latency with temporal context awareness.
Sampling methods for diffusion language models balancing speed, quality, and diversity through tempered confidence-based remasking.
Hybrid condition monitoring framework combining data-driven learning with physics-based insights for industrial systems reliability.
Physical reservoir computing inspired by biological vestibular system addressing hardware complexity with designed uncoupled topology.
Fine-tuning small language models for domain-specific code generation in production environments with strict latency requirements.
Kaczmarz-based preference learning algorithms for real-time matchmaking with stable convergence replacing recency-biased normalization.
Extension of Muon optimizer reducing computational overhead in foundation model pre-training through adaptive second-moment preconditioning.
Decentralized learning framework combining adaptive gradients and compressed communication for federated settings with multiple local training steps.
First benchmark for multi-source domain generalization in automatic sleep staging with noisy labels across institutions and devices.
Closed-form method for concept erasure in diffusion models using double projections without iterative optimization.
Cross-validated self-attention with denoising for automatic modulation classification under low signal-to-noise conditions.
Characterizes necessity and sufficiency conditions for reward poisoning attacks in reinforcement learning with linear MDPs.
Heterogeneous graph network with critical-path awareness for long-horizon flexible job-shop scheduling using rolling horizon optimization.
Theoretical analysis of why transformers learn optimal DDPM denoiser for multi-token Gaussian mixture models.
Survey on attention sink phenomenon in transformers, covering utilization, interpretation, and mitigation strategies.
Automated DNN optimization for PPG-based blood pressure estimation on resource-constrained wearable devices.
Distributed consensus-based framework for recursive multi-output Gaussian processes in large-scale streaming settings.
Temporally augmented graph attention network for affordance classification from EEG sequential data.
Interprets internal computation of Leela Chess Zero transformer using sparse decomposition to explain grandmaster-level reasoning.
Spatial-temporal graph neural networks for virtual metering in sparsely instrumented district heating networks.
Theoretical bounds on Hessian eigenspectrum for cross-entropy loss in nonlinear neural networks.