Anonymization-Enhanced Privacy Protection for Mobile GUI Agents: Available but Invisible
Privacy protection method for mobile GUI agents using anonymization to mitigate exposure of sensitive data during screen processing.
Privacy protection method for mobile GUI agents using anonymization to mitigate exposure of sensitive data during screen processing.
Causal continuous rotary positional encoding improving 3D feature alignment in multimodal LLMs for spatial reasoning tasks.
Language-Action pre-training framework enabling zero-shot transfer of robot policies across different embodiments without fine-tuning.
Data attribution method to trace and mitigate undesirable emergent behaviors in LLM post-training by identifying responsible datapoints.
Applies variational autoencoder mathematics to formalize religious epistemology and perennialism.
3D Gaussian Splatting simulation framework for training visual navigation policies in dynamic environments with moving obstacles.
Method to improve fine-grained visual perception in multimodal LLMs by distilling zoomed regions without repeated inference calls.
Benchmark and methodology for evaluating LLM-based PDF-to-JSON extraction at enterprise scale with complex schema requirements.
Theoretical analysis of Jacobian spectra in deep neural networks to explain implicit bias in gradient-based training.
Formalizes agent skills as composable packages enabling LLMs to extend capabilities dynamically without retraining, covering architecture, acquisition, and security.
MedXIAOHE medical vision-language foundation model with entity-aware continual pretraining achieving state-of-the-art on medical benchmarks.
Prior-guided symbolic regression framework incorporating scientific constraints to discover interpretable equations consistent with physical principles.
Directional Concentration Uncertainty framework for flexible uncertainty quantification in generative models across tasks and modalities.
Blueprint system for multimodal retrieval in engineering archives using layout-aware VLM OCR and identifier normalization.
Comparative study of ML/DL architectures on MNIST-1D dataset for distinguishing neural network performance on controlled benchmarks.
Review and proposed metrics for evaluating multi-iteration active learning query methods and sample selection strategies.
Multi-objective Bayesian optimization framework for designing cryoprotectant cocktails balancing ice suppression and cell toxicity.
Diffusion-based downscaling of climate emulator for high-resolution weather projections addressing training instability and computational costs.
Tutorial on PyCM library for comprehensive multi-class classifier evaluation and performance metrics comparison.
Research showing language model circuits are prompt-specific within tasks, not uniform across prompts, using causal communication analysis.
Federated learning framework for nonlinear temporal dynamics using graph attention for interpretable cross-client relationships without sharing raw data.
Research identifies rank collapse phenomenon in federated low-rank adaptation with heterogeneous clients, proposing mitigation for privacy-preserving fine-tuning.
TrasMuon optimizer enhances Muon-style methods with trust-region adaptive scaling to address training sensitivity and high-energy bursts.
Novel optimization framework for monotone non-convex functions unifying DR-submodular and OSS functions with theoretical convergence analysis.
Study demonstrating singular vectors of attention heads align with learned features in language models, providing mechanistic interpretability foundations.
QuaRK framework combines quantum reservoir computing with classical readout for time series learning with theoretical guarantees.
Fast algorithm for multiplication-free dimension reduction via element selection as alternative to PCA for reducing model parameters and accelerating inference.
Research on out-of-support generalization reformulates extrapolation beyond training data distribution as sequence modeling in weight space to prevent catastrophic failure.
Neural network approach for semantic communication in MU-MIMO OFDM systems handling multi-user interference and frequency-selective fading.
Optimal multiway-split decision trees for interpretable clustering that balance accuracy and explainability while reducing computational costs.
Investigates benchmark data leakage in LLM-based recommender systems where models memorize training data, inflating evaluation metrics.
Graph embedding method using distributional kernels for community detection without optimization, addressing over-smoothing in GNNs.
Detects unusual evolving trends in time series using concept of time series chains capturing ordered temporal pattern changes.
Fairness approach for federated learning addressing cumulative utility parity under intermittent and heterogeneous client participation.
Zero-order optimization method for LLM fine-tuning via learnable direction sampling, reducing memory requirements without backpropagation.
Optimized certainty equivalent risk-controlling prediction sets for safety-critical applications with tail behavior and worst-case guarantees.
Interactive multi-objective system for treatment planning in cervical cancer brachytherapy using ML-assisted clinical decision-making.
Vision-language model calibration approach for class-incremental learning that addresses representation rigidity and accumulated errors in analytic CIL.
Studies sparsification-approximation tradeoffs for correlation clustering, analyzing how much edge information is needed for LP-based guarantees.
Physics-informed neural networks for denoising magnetic field data in airborne navigation systems when GPS is unavailable.
Uses attention head entropy to predict answer correctness in LLMs, enabling detection of hallucinations without external evaluators.
First optimal regret bound for policy optimization in contextual multi-armed bandits with general offline function approximation.
OPO-CMDP presents first policy optimization algorithm for contextual MDPs with general offline function approximation achieving near-optimal regret bounds.
HBVLA applies 1-bit post-training quantization to vision-language-action models for efficient deployment on resource-constrained robots and edge devices.
Bi-level optimization framework using neural networks for operational optimization of thermal power systems with hierarchical variables.
Novel matrix-free eigendecomposition method using discrete double-bracket flows that is invariant to isotropic noise shifts.
Study of instruction-tuning data selection for LLMs using semantic representation similarity to identify redundancy in large-scale datasets.
MEMTS introduces parameterized memory for domain adaptation of time series foundation models to handle temporal distribution shifts and domain-specific patterns.
MechPert predicts transcriptional responses to unseen genetic perturbations using mechanistic consensus as inductive bias, combining knowledge graphs with LLM reasoning.
Cast-R1 applies tool-augmented sequential decision policies and iterative reasoning to time series forecasting, enabling autonomous evidence acquisition and prediction revision.