genriesz: A Python Package for Automatic Debiased Machine Learning with Generalized Riesz Regression
Open-source Python package implementing debiased machine learning via Riesz regression for causal parameter estimation.
Open-source Python package implementing debiased machine learning via Riesz regression for causal parameter estimation.
Decentralized optimization method with adaptive stepsizes for multi-agent networks using three-operator splitting.
Adaptive regularization framework for maintaining LLM safety during fine-tuning while preserving utility.
Analysis of self-distillation in ridge regression showing generalization improvements with precise asymptotics.
Omniprediction algorithm extending binary case to multiclass settings with bounds across multiple losses.
Unsupervised anomaly detection framework using conditional flow matching for autonomous vehicle safety validation.
Sequential hypothesis testing algorithm for Markovian data with asymptotically optimal expected stopping time.
Online meta-learning approach for geospatial discovery using latent concepts with strategic sampling under constraints.
Multi-agent autonomous framework for designing, implementing, and verifying numerical PDE solvers without manual tuning.
Contrastive learning framework for semantic embeddings in e-commerce search with graded relevance.
Pruning technique for diffusion language models reducing inference cost by reconsidering attention sink preservation.
Data augmentation scheme for Raman spectroscopy with correlated annotations in biotechnology applications.
Graph machine learning method for estimating causal effects in social networks with interference and confounding.
Algorithm for gradient testing and estimation using only comparison oracle for smooth functions.
Risk-aware decision making algorithms for restless bandits incorporating downside risk mitigation.
Benchmark for evaluating physical safety risks of LLMs controlling robotic systems like drones with threat classification.
Surrogate model combining PointNet and DeepONet for predicting nonlinear fields on complex 3D geometries.
Online learning algorithm for Stackelberg games with contextual information achieving improved regret bounds.
Hierarchical meta-reinforcement learning approach using self-improving skills to handle noisy offline demonstrations.
Reversible Runge-Kutta solvers for neural differential equations in generative models with improved numerical stability.
Continual learning approach for foundation models addressing stability-plasticity trade-off during post-training on new classes/domains.
Parameter-efficient fine-tuning method using orthogonal adaptation on principal subspaces for adapting large models efficiently.
Survey demystifying common beliefs about oversmoothing, oversquashing, heterophily and long-range tasks in graph neural networks.
Analyzes KL-regularization design choices in policy gradient algorithms for LLM reasoning, comparing forward/reverse KL variants.
Supervised graph contrastive learning framework for gene regulatory networks addressing biological validity of perturbations.
Game theory algorithms for two-player zero-sum games with unknown payoffs estimated through bandit feedback.
Generative model for directed graphs using dual attention and asymmetric encoding to capture ordered relationships.
Strict Subgoal Execution method for hierarchical RL that improves long-horizon planning by validating subgoal feasibility.
AXLearn production system for scalable hardware-agnostic training of large models with modular software architecture.
Instance-wise adaptive sampling framework for constructing efficient training datasets for supervised inverse problem learning.
Extends reinforcement learning to continuous-time systems using Hamilton-Jacobi-Bellman equations for irregular interaction frequencies.
Deep learning methods for predicting electronic-structure Hamiltonians in materials, advancing generalization across diverse atomic systems.
First watermarking method for diffusion language models that generate tokens non-sequentially, addressing unique DLM challenges.
Analyzes overthinking in reasoning LLMs and proposes early exit using entropy after thinking tags to improve efficiency.
Extends prompt optimization to multimodal LLMs, proposing methods to optimize across text, images, video and other modalities.
Hardware design tool using ML to generate synthetic netlists for chip optimization training without long design turnaround times.
Introduces ConDA, a contrastive learning layer for organizing diffusion model latent spaces to enable controllable generation.
Proposes pi-Flow, a policy-based approach to improve few-step diffusion model distillation by predicting network-free policies.
ML research on mitigating algorithmic bias in clustering through fairness constraints and group balance representation.
System for accelerating neural network inference on mobile devices through fine-grained CPU-GPU co-execution and synchronization optimization.
LRT-Diffusion applies risk-aware sequential hypothesis testing to improve diffusion policy guidance for offline reinforcement learning.
Semi-supervised preference optimization framework for LLM alignment using limited labeled paired feedback data.
PREPO method improving data efficiency for LLM reinforcement learning with verifiable rewards using intrinsic exploration signals.
Model-agnostic local explanation method using MARS and N-ball sampling for high-fidelity black-box model interpretability.
Distributionally robust reinforcement learning approach using general function approximation for policy robustness under environment shift.
Theoretical analysis of sample complexity for data-driven approaches to blind inverse problems with interpretability concerns.
Reinforcement learning approach to discover local climate indices for improving rainfall prediction in Thailand.
Unified framework connecting physics-informed neural networks and neural operators for learning PDE solvers.
Theoretical analysis of active learning label complexity for decision trees with provable polylogarithmic guarantees.
Temporal graph pattern machine for learning transferable representations in dynamic networks without restrictive assumptions.