Reinforcement Unlearning via Group Relative Policy Optimization
Method for removing memorized sensitive data from LLMs via group relative policy optimization, addressing GDPR/EU AI Act compliance without full retraining.
Method for removing memorized sensitive data from LLMs via group relative policy optimization, addressing GDPR/EU AI Act compliance without full retraining.
Out-of-distribution detection for diffusion models using group equivariance properties learned from in-distribution convolutional data.
Graph transformer with cardinality-preserving attention channels for molecular property prediction with limited labeled data.
System optimization for autoregressive video generation using 2-bit KV-cache quantization to reduce memory from 30GB and enable hardware deployment.
Multi-armed bandit algorithms for sequential decision-making with hidden, time-varying states and unobserved confounders.
Multimodal multiview human activity recognition model supporting arbitrary view combinations using contrastive learning and mixture-of-experts.
Study showing feature salience rather than task-informativeness drives XAI model explanations, questioning assumption about importance attribution.
Using interpretable LLM features as supervision signal for RL-based open-ended tasks, demonstrated on hallucination reduction.
Novel automated jailbreak attack method that evades classifier-based safeguards in frontier LLMs without requiring white/grey-box access.
Research on fundamental limitations of adversarial training for LLMs, showing models remain vulnerable to distribution-shifted attacks like prompt rewriting.
Efficient transformer-based model for separating multiple speakers from single-channel audio, addressing the cocktail party problem.
Framework evaluating language model agency through multi-turn negotiation games, testing six public LLMs against real-world interactive scenarios.
Diffusion-based data augmentation method for medical sequence classification with controllable semantic and sequential generation.
Deep learning approach for medical imaging using autoassociative learning to capture discrete object categories instead of continuous features.
Open-vocabulary semantic segmentation using large-scale vision-language models like CLIP to recognize unseen objects in images.
Federated learning system using secure enclaves to provide verifiable claims about model training data and algorithms, preventing malicious deviation.
Research on parameter-efficient fine-tuning of language models in federated settings, addressing privacy and efficiency for distributed training on resource-constrained devices.
Machine learning approach for geospatial PFAS contamination mapping using satellite and geospatial data.
Game-theoretic analysis of strategic hiring behavior in labor markets using common algorithmic evaluation.
LLM agent framework enabling natural language interaction with water distribution system simulator EPANET.
Demand estimation using pre-trained deep learning models to extract embeddings from product images and text.
Self-supervised framework for interpretable denoising of low-dose CT images from single images.
Representation learning approach for spectral image analysis that generalizes across different camera types.
Technique for controlling voice impression characteristics in zero-shot text-to-speech synthesis.
Method for view-invariant learning in vision-language navigation for embodied AI agents.
Framework for constrained generation of Chinese Songci poetry with LLMs using structural and tonal constraints.
Computational model of social learning combining linguistic guidance with sensorimotor experience for AI systems.
Framework for combining synthetic and real data in statistical inference with distribution-free guarantees.
Generative framework for probabilistic multivariate time series forecasting using conditional whitening.
Analysis of expert routing patterns in multilingual Mixture-of-Experts language models across languages.
Method for reducing vocabulary size in auto-regressive language models while maintaining lossless compression.
Technique for precise control over attribute intensity in LLM outputs via targeted representation editing.
Deep learning pipeline for cosmological inference combining weak lensing and galaxy clustering data from Dark Energy Survey.
Mathematical framework for accelerating ergodic averages in dynamical systems using weighted Birkhoff averaging methods.
Restricted Boltzmann Machines model ground-state manifolds of frustrated magnets. Physics simulation, not AI tools.
Randomized Masked Fine-Tuning reduces PII memorization in LLMs during fine-tuning while maintaining performance. Privacy-preserving technique.
Self-attention training analysis via optimal transport theory for tabular classification. Theoretical perspective on transformers.
KANELÉ: Kolmogorov-Arnold Networks optimized for FPGA lookup table deployment. Efficient neural network inference framework.
Adaptive sampling for detecting bifurcation boundaries in fluid dynamics simulations. Scientific computing, not AI-focused.
Theoretical analysis of SGD learning dynamics in high-dimensional multi-index models. Fundamental ML research.
SEISMO: LLM agent for sample-efficient molecular optimization using trajectory awareness. Applies agents to chemistry.
Time-varying AdamW schedules (beta, weight-decay) for language model training exploiting power-law data structure. Improves LLM training efficiency.
Compiler optimization using machine learning for phase ordering decisions. Developer tools, not AI-focused.
Vision-language models for autonomous driving safety assessment and planning. Applies VLMs to scene understanding and decision-making.
Statistical analysis of model collapse in iterative training with synthetic data. Shows conditions for improvement despite contamination.
Investigation of whether self-examination language in LLMs reflects computation or confabulation. Analyzes LLM interpretability via activation patterns.
Equation discovery to learn gradient descent dynamics and accelerate optimization without computing gradients. ML acceleration.
Speech analysis method for relative voice impression estimation between utterances. Paralinguistic feature research.
Robot policy learning that handles long observation histories by selecting key frames. Addresses spurious correlations in imitation learning.
Quantum circuit synthesis using machine learning to translate algorithms to hardware gates. Domain-specific, not AI tools.