Isolater - Feed

Ax Ziming Wang, Changwu Huang, Ke Tang, Xin Yao 2/27/2026

Procedural Fairness in Machine Learning

Defines and formalizes procedural fairness in machine learning models drawing from philosophy and psychology, addressing fairness beyond distributive metrics.

Ax Jingren Liu, Zhong Ji, YunLong Yu, Jiale Cao, Yanwei Pang, Jungong Han, Xuelong Li 2/27/2026

Parameter-Efficient Fine-Tuning for Continual Learning: A Neural Tangent Kernel Perspective

Analyzes parameter-efficient fine-tuning for continual learning using Neural Tangent Kernel theory to understand model adaptation and catastrophic forgetting mitigation.

Ax Lorenzo Colantonio (Department of Physics, Sapienza University of Rome), Andrea Cacioppo (Department of Physics, Sapienza University of Rome), Federico Scarpati (Department of Physics, Sapienza University of Rome), Maria Chiara Angelini (Department of Physics, Sapienza University of Rome), Federico Ricci-Tersenghi (Department of Physics, Sapienza University of Rome), Stefano Giagu (Department of Physics, Sapienza University of Rome) 2/27/2026

Efficient Graph Coloring with Neural Networks: A Physics-Inspired Approach for Large Graphs

Physics-inspired neural framework using graph neural networks to solve large-scale graph coloring combinatorial optimization problems near algorithmic phase transitions.

Ax Hanlin Gu, Hong Xi Tae, Chee Seng Chan, Lixin Fan 2/27/2026

Towards Privacy-Guaranteed Label Unlearning in Vertical Federated Learning: Few-Shot Forgetting without Disclosure

Proposes label unlearning method for vertical federated learning using representation-level manifold mixup to enable privacy-preserving model unlearning.

Ax Junhao Liu, Haonan Yu, Xin Zhang 2/27/2026

Beyond Attribution: Unified Concept-Level Explanations

Model-agnostic explanation technique integrating concept-based approaches with diverse explanation forms beyond attribution methods.

Ax Ziyi Zhang, Li Shen, Sen Zhang, Deheng Ye, Yong Luo, Miaojing Shi, Dongjing Shan, Bo Du, Dacheng Tao 2/27/2026

Aligning Few-Step Diffusion Models with Dense Reward Difference Learning

Reinforcement learning framework for aligning few-step diffusion models with downstream objectives using stepwise policy optimization.

Ax Orestis Oikonomou, Levi Lingsch, Dana Grund, Siddhartha Mishra, Georgios Kissas 2/27/2026

Neuro-Symbolic AI for Analytical Solutions of Differential Equations

Neuro-symbolic framework automating discovery of analytical solutions to differential equations using formal grammars and continuous search.

Ax Qingyue Zhao, Kaixuan Ji, Heyang Zhao, Tong Zhang, Quanquan Gu 2/27/2026

Towards a Sharp Analysis of Offline Policy Learning for $f$-Divergence-Regularized Contextual Bandits

Theoretical analysis of sample complexity for offline reinforcement learning with f-divergence regularization in contextual bandits.

Ax Christian Kl\"otergens, Tim Dernedde, Lars Schmidt-Thieme, Vijaya Krishna Yalavarthi 2/27/2026

Mixing It Up: Exploring Mixer Networks for Irregular Multivariate Time Series Forecasting

Evaluation of MLP-Mixer architectures for forecasting irregular multivariate time series with missing values.

Ax Sina Salek, Joseph Enguehard 2/27/2026

Using the Path of Least Resistance to Explain Deep Networks

Alternative to Integrated Gradients for neural network interpretability using model-induced metrics instead of straight attribution paths.

Ax Mirja Granfors, Jes\'us Pineda, Blanca Zufiria Gerbol\'es, Joana B. Pereira, Carlo Manzo, Giovanni Volpe 2/27/2026

Global graph features unveiled by unsupervised geometric deep learning

Unsupervised geometric deep learning framework for capturing local and global graph structure using hourglass autoencoder.

Ax Jacob Comeau, Mathieu Bazinet, Pascal Germain, Cem Subakan 2/27/2026

Sample Compression for Self Certified Continual Learning

Continual learning method using sample compression theory to provide computable guarantees while avoiding catastrophic forgetting.

Ax Youguang Chen, George Biros 2/27/2026

Extensions of the regret-minimization algorithm for optimal design

Extension of regret minimization algorithms for optimal experimental design and active learning sample selection.

Ax Tongrui Su, Qingbin Li, Shengyu Zhu, Wei Chen, Xueqi Cheng 2/27/2026

RaPA: Enhancing Transferable Targeted Attacks via Random Parameter Pruning

Method for improving transferable adversarial attacks via random parameter pruning to reduce reliance on small parameter subsets.

Ax Changhai Zhou, Qian Qiao, Yuhua Zhou, Yuxin Wu, Shichao Weng, Weizhong Zhang, Cheng Jin 2/27/2026

Large Language Model Compression with Global Rank and Sparsity Optimization

Compression technique for large language models using global rank and sparsity optimization with layer-wise weight allocation.

Ax Shai Feldman, Stephen Bates, Yaniv Romano 2/27/2026

Conformal Prediction with Corrupted Labels: Uncertain Imputation and Robust Re-weighting

Framework for uncertainty quantification with corrupted or missing labels using conformal prediction with robust re-weighting.

Ax Shengyu Feng, Weiwei Sun, Shanda Li, Ameet Talwalkar, Yiming Yang 2/27/2026

FrontierCO: Real-World and Large-Scale Evaluation of Machine Learning Solvers for Combinatorial Optimization

Large-scale benchmark for evaluating machine learning solvers on combinatorial optimization using real-world industrial datasets.

Ax Rafa{\l} Karczewski, Markus Heinonen, Alison Pouplin, S{\o}ren Hauberg, Vikas Garg 2/27/2026

The Spacetime of Diffusion Models: An Information Geometry Perspective

Information geometry perspective on diffusion model latent spaces, analyzing deterministic and stochastic decoders.

Ax Giannis Nikolentzos, Konstantinos Skianis 2/27/2026

On the Lipschitz Continuity of Set Aggregation Functions and Neural Networks for Sets

Theoretical analysis of Lipschitz continuity properties for neural networks operating on set-structured data.

Ax Rohan Gupta, Erik Jenner 2/27/2026

RL-Obfuscation: Can Language Models Learn to Evade Latent-Space Monitors?

Research on whether language models can learn to evade latent-space safety monitors, with implications for LLM alignment and security.

Ax Jiyi Wang, Jingyang Ke, Bo Dai, Anqi Wu 2/27/2026

Learning Task-Agnostic Motifs to Capture the Continuous Nature of Animal Behavior

Framework for discovering continuous motor motifs in animal behavior using latent basis functions instead of discrete syllables.

Ax Magda Dubois, Harry Coppock, Mario Giulianelli, Timo Flesch, Lennart Luettgau, Cozmin Ududec 2/27/2026

Skewed Score: A statistical framework to assess autograders

Statistical framework for evaluating LLM-as-a-judge systems, addressing reliability and bias issues in automated LLM output evaluation.

Ax Siddharth Rout, Eldad Haber, Stephane Gaudreault 2/27/2026

Fast and Flexible Probabilistic Forecasting of Dynamical Systems using Flow Matching and Physical Perturbation

Flow matching approach for probabilistic forecasting of dynamical systems using physical perturbations instead of standard Gaussian perturbations.

Ax Zilei Shao, Anji Liu, Guy Van den Broeck 2/27/2026

Zero-Variance Gradients for Variational Autoencoders

Silent Gradients approach for training VAEs by restricting decoder architecture to reduce gradient estimation variance.

Ax Xiannan Huang, Shuhan Qiu, Jiayuan Du, Chao Yang 2/27/2026

Online time series prediction using feature adjustment

Online time series forecasting method using feature adjustment to handle distribution shift in sequential data deployment.

Ax Victor Chard\`es 2/27/2026

Random Matrix Theory-guided sparse PCA for single-cell RNA-seq data

Sparse PCA method using random matrix theory for dimensionality reduction in noisy single-cell RNA-seq data.

Ax Woosung Koh, Juyoung Suk, Sungjun Han, Se-Young Yun, Jamin Shin 2/27/2026

Predicting LLM Reasoning Performance with Small Proxy Model

rBridge framework enabling small proxy models (≤1B params) to predict reasoning performance of larger LLMs, optimizing dataset scaling.

Ax Takuya Kanayama, Yuki Ito, Tomoyuki Tamura, Masayuki Karasuyama 2/27/2026

Information-Theoretic Bayesian Optimization for Bilevel Optimization Problems

Bayesian optimization method for bilevel optimization problems with expensive black-box functions at both levels.

Ax O. Duranthon, P. Marion, C. Boyer, B. Loureiro, L. Zdeborov\'a 2/27/2026

Statistical Advantage of Softmax Attention: Insights from Single-Location Regression

Theoretical analysis of softmax attention mechanisms in LLMs through single-location regression task, addressing why softmax dominates alternative activation functions.

Ax Aleksandr Dremov, David Grangier, Angelos Katharopoulos, Awni Hannun 2/27/2026

Compute-Optimal Quantization-Aware Training

Study of compute-optimal allocation between full-precision and quantization-aware training phases for neural networks.

Ax James Oldfield, Philip Torr, Ioannis Patras, Adel Bibi, Fazl Barez 2/27/2026

Beyond Linear Probes: Dynamic Safety Monitoring for Language Models

Dynamic safety monitoring system for LLMs that adaptively adjusts computation based on input difficulty.

Ax Chunsan Hong, Seonho An, Min-Soo Kim, Jong Chul Ye 2/27/2026

Improving Discrete Diffusion Unmasking Policies Beyond Explicit Reference Policies

Method for improving masked diffusion models by learning better unmasking policies beyond rule-based position scheduling.

Ax Nirmit Joshi, Gene Li, Siddharth Bhandari, Shiva Prasad Kasiviswanathan, Cong Ma, Nathan Srebro 2/27/2026

Learning to Answer from Correct Demonstrations

Imitation learning approach for training models with multiple acceptable answers from limited correct demonstrations.

Ax Bernardo Williams, Victor M. Yeom-Song, Marcelo Hartmann, Arto Klami 2/27/2026

Simplex-to-Euclidean Bijections for Categorical Flow Matching

Method for learning probability distributions on simplexes via smooth bijections and Aitchison geometry.

Ax Hung-Yueh Chiang, Chi-Chih Chang, Yu-Chen Lu, Chien-Yu Lin, Kai-Chiang Wu, Mohamed S. Abdelfattah, Diana Marculescu 2/27/2026

UniQL: Unified Quantization and Low-rank Compression for Adaptive Edge LLMs

UniQL framework combining quantization and low-rank compression with adaptive on-device pruning for edge LLM deployment.

Ax Florent Draye, Anson Lei, Hsiao-Ru Pan, Ingmar Posner, Bernhard Sch\"olkopf 2/27/2026

Sparse Attention Post-Training for Mechanistic Interpretability

Post-training method achieving 99.6% attention sparsity without performance loss for mechanistic interpretability research.

Ax Hao Bai, Alexey Taymanov, Tong Zhang, Aviral Kumar, Spencer Whitehead 2/27/2026

WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks

WebGym: largest open-source environment with 300k tasks for training visual web agents on realistic websites.

Ax Jinshi Liu, Pan Liu, Lei He 2/27/2026

A Confidence-Variance Theory for Pseudo-Label Selection in Semi-Supervised Learning

Confidence-Variance theory framework for improved pseudo-label selection in semi-supervised learning beyond fixed thresholds.

Ax Trong Khiem Tran, Manh Cuong Dao, Phi Le Nguyen, Thao Nguyen Truong, Trong Nghia Hoang 2/27/2026

Rethinking Cross-Modal Fine-Tuning: Optimizing the Interaction between Feature Alignment and Target Fitting

Method for optimizing interaction between feature alignment and target fitting in cross-modal model fine-tuning.

Ax Winfried Ripken, Michael Plainer, Gregor Lied, Thorben Frank, Oliver T. Unke, Stefan Chmiela, Frank No\'e, Klaus-Robert M\"uller 2/27/2026

Learning Hamiltonian Flow Maps: Mean Flow Consistency for Large-Timestep Molecular Dynamics

Framework for learning Hamiltonian flow maps to enable stable large-timestep molecular dynamics simulations.

Ax Youngjoon Lee, Hyukjoon Lee, Seungrok Jung, Andy Luo, Jinu Gong, Yang Cao, Joonhyuk Kang 2/27/2026

Beyond Fixed Rounds: Data-Free Early Stopping for Practical Federated Learning

Data-free early stopping framework for federated learning using task vector growth rate monitoring.

Ax Rituparna Datta, Zihan Guan, Baltazar Espinoza, Yiqi Su, Priya Pitre, Srini Venkatramanan, Naren Ramakrishnan, Anil Vullikanti 2/27/2026

Agentic Framework for Epidemiological Modeling

EPIAGENT agentic framework that automatically synthesizes and calibrates epidemiological simulators via iterative program synthesis.

Ax Wei Chen, Jiacheng Li, Shigui Li, Zhiqi Lin, Junmei Yang, John Paisley, Delu Zeng 2/27/2026

A Minimum Variance Path Principle for Accurate and Stable Score-Based Density Ratio Estimation

Framework for score-based density ratio estimation addressing path-variance issues in practical training objectives.

Ax Andrea Montanari, Zihao Wang 2/27/2026

Phase Transitions for Feature Learning in Neural Networks

Theoretical analysis of phase transitions in neural network feature learning on multi-index models.

Ax Tao Huang, Rui Wang, Xiaofei Liu, Yi Qin, Li Duan, Liping Jing 2/27/2026

Detecting Misbehaviors of Large Vision-Language Models by Evidential Uncertainty Quantification

Technique to detect misbehaviors and hallucinations in large vision-language models using evidential uncertainty quantification.

Ax Kaizheng Wang, Ghifari Adam Faza, Fabio Cuzzolin, Siu Lun Chau, David Moens, Hans Hallez 2/27/2026

Learning Credal Ensembles via Distributionally Robust Optimization

Method for training ensemble models that quantify epistemic uncertainty via distributionally robust optimization.

Ax Ruishan Guo, Yibing Liu, Guoxin Ma, Yan Wang, Yueyang Zhang, Long Xia, Kecheng Chen, Zhiyuan Sun, Daiting Shi 2/27/2026

When Less is More: The LLM Scaling Paradox in Context Compression

Study of LLM scaling paradox showing larger compressor models can reduce context reconstruction faithfulness despite lower training loss.

Ax Truong Minh Huy, Edward Hirst 2/27/2026

Versor: A Geometric Sequence Architecture

Novel sequence architecture using Conformal Geometric Algebra instead of linear operations for improved generalization and interpretability.

Ax Wenkai Yang, Weijie Liu, Ruobing Xie, Kai Yang, Saiyong Yang, Yankai Lin 2/27/2026

Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation

On-policy distillation method that aligns student models with teacher logit distributions, theoretically framed as KL-constrained RL with reward extrapolation.

Ax DatologyAI, :, Aldo Gael Carranza, Kaleigh Mentzer, Ricardo Pio Monti, Alex Fang, Alvin Deng, Amro Abbas, Anshuman Suri, Brett Larsen, Cody Blakeney, Darren Teh, David Schwab, Diego Kiner, Fan Pan, Haakon Mongstad, Haoli Yin, Jack Urbanek, Jason Lee, Jason Telanoff, Josh Wills, Luke Merrick, Maximilian B\"other, Parth Doshi, Paul Burstein, Pratyush Maini, Rishabh Adiga, Siddharth Joshi, Spandan Das, Tony Jiang, Vineeth Dorna, Zhengping Wang, Bogdan Gaza, Ari Morcos, Matthew Leavitt 2/27/2026

\"UberWeb: Insights from Multilingual Curation for a 20-Trillion-Token Dataset

Analysis of multilingual data curation across 13 languages for 20-trillion-token dataset, addressing multilinguality challenges in foundation models.