Using Synthetic Data for Machine Learning-based Childhood Vaccination Prediction in Narok, Kenya
Using synthetic data for ML-based childhood vaccination prediction in nomadic populations in Kenya.
Using synthetic data for ML-based childhood vaccination prediction in nomadic populations in Kenya.
Dynamic policy optimization bridging SFT and RL for LLMs, addressing bias-variance tradeoff in post-training through adaptive loss weighting.
Empirical study on effectiveness of advanced optimizers for multi-task learning, identifying overlooked factors in optimization approaches.
Analyzes calibration and paraphrase sensitivity in medical vision-language models using predictive entropy and uncertainty quantification.
SeqComm-DFL: Multi-agent coordination via sequential communication and decision-focused learning for value-aware message generation.
WOMBET: World model-based framework for experience transfer in robotics RL, generating and utilizing prior data for sample efficiency.
Hierarchical implicit flow Q-learning for offline goal-conditioned reinforcement learning with improved policy expressiveness.
SentryFuse: Framework for efficient multimodal model compression on edge devices with sensor dropout robustness via zero-shot pruning.
Graph neural network architecture addressing heterophilic graphs using switchable attention mechanism for monophily-aware learning.
Named entity identification and anonymization system for cybercrime datasets on Telegram with speech-to-text transcription.
Solution to NeurIPS 2023 LLM Efficiency Challenge: Fine-tuning LLaMA 70B on single A100 GPU within 24-hour constraint.
U-Cast: Efficient probabilistic weather forecasting model using standard U-Net architecture, simplifying state-of-art approaches.
PDYffusion: Diffusion model for long-horizon spatiotemporal prediction incorporating physics-based constraints and uncertainty quantification.
Proposes PML-MA method for partial multi-label learning using feature-label modal alignment to handle noisy labels.
arXiv paper proposing Temporal Patch Shuffle data augmentation for time series forecasting preserving temporal coherence and improving generalization.
arXiv paper integrating graph-based embeddings into event sequence models for user-item interactions in fraud and recommendation systems.
arXiv paper on GeoPAS geometric probing approach for automated algorithm selection in continuous black-box optimization.
arXiv paper on EquiformerV3, advancing SE(3)-equivariant graph attention Transformers for efficiency, expressivity, and 3D atomistic modeling.
arXiv paper proposing score-driven rating system extending classical Elo rating to accommodate diverse game outcomes and rankings.
arXiv paper on CORA, conformal risk-controlled GUI agents using vision language models with formal safety guarantees for mobile automation.
arXiv paper proposing truncated rectified flow policy for maximum entropy RL enabling one-step multimodal action distribution sampling.
arXiv paper augmenting distillation process dataset with simulations for deep learning-based anomaly detection in chemical batch processes.
arXiv paper developing generalization and scaling theory for Mixture-of-Experts Transformers with covering-number bounds and routing overhead analysis.
arXiv paper on GNN-based deep reinforcement learning scheduler for cloud workflow DAGs optimizing completion time and energy consumption.
arXiv paper analyzing statistical properties of ancient I-Ching King Wen sequence, finding no improvement to neural network training.
arXiv paper proposing DiffHLS framework using GNNs and LLM code embeddings for high-level synthesis quality prediction via differential learning.
arXiv paper investigating LLM pretraining geometry and common minima to improve downstream generalization without changing loss function.
arXiv paper applying causal inference to study relationship between parking infrastructure and EV adoption in Scotland.
arXiv paper on distributed online convex optimization with compressed communication, establishing optimal regret bounds for large-scale applications.
Physics-informed meta-learning framework (KAPI) combining meta-learned predictor with least-squares corrector for solving parametric linear PDEs.
Novel stability-enhanced Gaussian process VAE for training low-dimensional LTI systems from high-dimensional video data using probabilistic and physical models.
Hierarchical deep learning framework for predicting vehicle turning movements at signalized intersections.
Online learning method for nonstationary multivariate time series with concept drift applied to sintering quality prediction.
Weakly-supervised clustering approach for handling label noise in partial multi-label learning scenarios.
Neural network method for solving high-dimensional Gross-Pitaevskii equations with dimension-independent computational cost.
Controlled study of dataset scaling laws in attention-only decoder architecture across power-of-two subset sizes.
Machine unlearning approach using relearning convergence delay metric to remove contaminated data from pretrained models.
Online activation subspace learning (OASIS) to reduce memory requirements during LLM training through low-rank projections.
Scalable method for generating node embeddings on massive distributed graphs with millions to billions of nodes.
AdaCubic optimizer using adaptive cubic regularization with Hutchinson's method for approximating Hessian in deep learning.
One-step diffusion model for efficient chest X-ray report generation reducing inference latency compared to autoregressive models.
Method for safely updating deep reinforcement learning policies while preserving safety guarantees on previously encountered tasks.
Hardware optimization using electro-optic nonlinearities to replace softmax bottleneck in transformer attention mechanisms.
Application of world models paradigm to computational epidemiology for reasoning about latent disease burden and intervention effects.
High-fidelity cyber operations simulator (NetForge_RL) using temporal graph networks for multi-agent reinforcement learning in cybersecurity.
OmniBehavior benchmark for evaluating LLMs as user simulators on long-horizon, cross-scenario behavior traces from real-world data.
Investigation of self-sovereign AI agents that can economically sustain themselves without human involvement using LLMs and agent frameworks.
Analysis of how bias mitigation reshapes embedding spaces in BERT and Llama2 through representational analysis of gender-occupation associations.
Systematic evaluation of chain-of-thought vs zero-shot prompting across temperature settings using Grok-4.1 for extended reasoning LLMs.
Research on attention-based sampling for diffusion language models enabling parallel decoding instead of sequential auto-regressive approach.