Isolater - Feed

Ax Siqi Wang, Aoming Liu, Bryan A. Plummer 2/24/2026

Noise-Aware Generalization: Robustness to In-Domain Noise and Out-of-Domain Generalization

Noise-Aware Generalization: training methods handling label noise and domain shifts simultaneously.

Ax Jialin Chen, Haolan Zuo, Haoyu Peter Wang, Siqi Miao, Pan Li, Rex Ying 2/24/2026

Towards A Universal Graph Structural Encoder

Universal graph encoder for learning transferable structural representations across diverse graph domains.

Ax Hao Xu, Xiangru Jian, Xinjian Zhao, Wei Pang, Chao Zhang, Suyuchen Wang, Qixin Zhang, Zhengyuan Dong, Joao Monteiro, Bang Liu, Qiuzhuang Sun, Tianshu Yu 2/24/2026

GraphOmni: A Comprehensive and Extensible Benchmark Framework for Large Language Models on Graph-theoretic Tasks

GraphOmni: comprehensive benchmark for evaluating LLM reasoning on graph-theoretic tasks in natural language.

Ax Simone Papicchio, Simone Rossi, Luca Cagliero, Paolo Papotti 2/24/2026

Think2SQL: Reinforce LLM Reasoning Capabilities for Text2SQL

Think2SQL: reinforcement learning approach improving LLM reasoning for complex text-to-SQL generation.

Ax Ming Xu, Jinrong Xiang, Zilong Xie, Xiangfu Meng 2/24/2026

Learning to Rank Critical Road Segments via Heterogeneous Graphs with Origin-Destination Flow Integration

HetGL2R: heterogeneous graph framework for ranking critical road segments using origin-destination flows.

Ax Hongze Li, Zesheng Zhou, Zhenbiao Cao, Xinhui Li, Wei Chen, Xiaojin Zhang 2/24/2026

FedSDAF: Leveraging Source Domain Awareness for Enhanced Federated Domain Generalization

FedSDAF: federated domain generalization method that leverages source domain awareness to improve generalization to unseen domains.

Ax Chethan Krishnamurthy Ramanaik, Arjun Roy, Tobias Callies, Eirini Ntoutsi 2/24/2026

GRILL: Restoring Gradient Signal in Ill-Conditioned Layers for More Effective Adversarial Attacks on Autoencoders

GRILL: adversarial attack method for autoencoders addressing ill-conditioned gradient propagation.

Ax Juhani Kivimäki, Jakub Białek, Wojtek Kuberski, Jukka K. Nurminen 2/24/2026

Performance Estimation in Binary Classification Using Calibrated Confidence

Performance estimation for binary classifiers without ground truth labels using calibrated confidence.

Ax Shanda Li, Tanya Marwah, Junhong Shen, Weiwei Sun, Andrej Risteski, Yiming Yang, Ameet Talwalkar 2/24/2026

CodePDE: An Inference Framework for LLM-driven PDE Solver Generation

CodePDE: LLM-based inference framework generating code for solving partial differential equations.

Ax Lin Zhu, Yijun Bian, Lei You 2/24/2026

FairSHAP: Preprocessing for Fairness Through Attribution-Based Data Augmentation

FairSHAP: preprocessing framework using Shapley values for identifying and mitigating bias in ML models.

Ax Shudi Weng, Chao Ren, Ming Xiao, Mikael Skoglund 2/24/2026

Heterogeneity-Aware Client Sampling for Optimal and Efficient Federated Learning

Federated learning client sampling strategy addressing heterogeneous communication and computational capabilities.

Ax Rafał Karczewski, Markus Heinonen, Alison Pouplin, Søren Hauberg, Vikas Garg 2/24/2026

The Spacetime of Diffusion Models: An Information Geometry Perspective

Information geometry analysis of diffusion model latent spaces, examining geodesic decoding and stochastic decoders.

Ax Yaoyu Zhu, Di Huang, Hanqi Lyu, Xiaoyun Zhang, Chongxiao Li, Wenxuan Shi, Yutong Wu, Jianan Mu, Jinghua Wang, Yang Zhao, Pengwei Jin, Shuyao Cheng, Shengwen Liang, Xishan Zhang, Rui Zhang, Zidong Du, Qi Guo, Xing Hu, Yunji Chen 2/24/2026

QiMeng-CodeV-R1: Reasoning-Enhanced Verilog Generation

QiMeng-CodeV-R1 applies RLVR to Verilog generation from natural language, introducing a verifiable reward signal for hardware design automation.

Ax Jun Wu, Patrick Huang, Jiangtao Wen, Yuxing Han 2/24/2026

It Takes a Good Model to Train a Good Model: Generalized Gaussian Priors for Optimized LLMs

Shows that LLM weights follow generalized Gaussian distributions and proposes an end-to-end optimization framework for improved training efficiency.

Ax Jiaheng Dong, Hong Jia, Soumyajit Chatterjee, Abhirup Ghosh, James Bailey, Ting Dang 2/24/2026

E-BATS: Efficient Backpropagation-Free Test-Time Adaptation for Speech Foundation Models

E-BATS enables efficient test-time adaptation for speech models under domain shift without backpropagation, reducing memory overhead.

Ax Giacomo Baldan, Qiang Liu, Alberto Guardone, Nils Thuerey 2/24/2026

Physics vs Distributions: Pareto Optimal Flow Matching with Physics Constraints

Physics vs Distributions framework for flow matching that balances physical constraints with distributional accuracy in generative modeling.

Ax Thomas Marwitz, Alexander Colsmann, Ben Breitung, Christoph Brabec, Christoph Kirchlechner, Eva Blasco, Gabriel Cadilha Marques, Horst Hahn, Michael Hirtz, Pavel A. Levkin, Yolita M. Eggeler, Tobias Schlöder, Pascal Friederich 2/24/2026

Predicting New Research Directions in Materials Science using Large Language Models and Concept Graphs

Uses LLMs and concept graphs to extract scientific concepts from materials science abstracts and discover novel research connections.

Ax Geng Zhang, Yuxuan Han, Yuxuan Lou, Yiqi Zhang, Wangbo Zhao, Yang You 2/24/2026

MoNE: Replacing Redundant Experts with Lightweight Novices for Structured Pruning of MoE

MoNE prunes MoE models by replacing redundant experts with lightweight novices, reducing memory overhead while maintaining performance.

Ax Dalia Rodríguez-Salas 2/24/2026

Symbolic Branch Networks: Tree-Inherited Neural Models for Interpretable Multiclass Classification

Symbolic Branch Networks inherit architecture from decision tree ensembles to create interpretable neural models for multiclass classification.

Ax Tim Beyer, Yan Scholten, Leo Schwinn, Stephan Günnemann 2/24/2026

Sampling-aware Adversarial Attacks Against Large Language Models

Demonstrates sampling-aware adversarial attacks against LLMs that leverage stochastic sampling to more accurately assess robustness.

Ax Seonghyun Park, Kiyoung Seong, Soojung Yang, Rafael Gómez-Bombarelli, Sungsoo Ahn 2/24/2026

Learning Collective Variables from BioEmu with Time-Lagged Generation

Uses BioEmu with time-lagged generation to learn collective variables for enhanced molecular dynamics simulations of rare events.

Ax Shiyu Chen, Cencheng Shen, Youngser Park, Carey E. Priebe 2/24/2026

Graph Neural Networks Powered by Encoder Embedding for Improved Node Learning

Improves GNN node learning performance using statistically grounded graph encoder embedding initialization for faster convergence.

Ax Haris Khan, Sadia Asif, Shumaila Asif 2/24/2026

Modular Delta Merging with Orthogonal Constraints: A Scalable Framework for Continual and Reversible Model Composition

MDM-OC enables scalable, reversible model composition with orthogonal constraints to prevent task interference and catastrophic forgetting.

Ax Guojiang Zhao, Zixiang Lu, Yutang Ge, Sihang Li, Zheng Cheng, Haitao Lin, Lirong Wu, Hanchen Xia, Hengxing Cai, Wentao Guo, Hongshuai Wang, Mingjun Xu, Siyu Zhu, Guolin Ke, Linfeng Zhang, Zhifeng Gao 2/24/2026

MolReasoner: Toward Effective and Interpretable Reasoning for Molecular LLMs

MolReasoner enhances LLM reasoning for molecular tasks with domain-specific semantics, reducing hallucinations and improving interpretability.

Ax Francesco Leonardi, Markus Orsi, Jean-Louis Reymond, Kaspar Riesen 2/24/2026

GEDAN: Learning the Edit Costs for Graph Edit Distance

GEDAN learns edit costs for graph edit distance, an NP-hard problem, improving on existing approximation methods with learnable cost functions.

Ax Linghao Zhu, Yiran Guan, Dingkang Liang, Jianzhong Ju, Zhenbo Luo, Bin Qin, Jian Luan, Yuliang Liu, Xiang Bai 2/24/2026

Shuffle-R1: Efficient RL framework for Multimodal Large Language Models via Data-centric Dynamic Shuffle

Shuffle-R1 framework improves RL efficiency for multimodal LLMs through data-centric dynamic shuffling to address advantage collapsing and rollout silencing.

Ax Mateusz Praski, Jakub Adamczyk, Wojciech Czech 2/24/2026

Benchmarking Pretrained Molecular Embedding Models For Molecular Representation Learning

Comprehensive evaluation of 25 pretrained molecular embedding models across 25 datasets for molecular property prediction and drug design.

Ax Jihyun Lim, Junhyuk Jo, Chanhyeok Ko, Young Min Go, Jimin Hwa, Sunwoo Lee 2/24/2026

Biased Local SGD for Efficient Deep Learning on Heterogeneous Systems

Proposes biased local SGD for efficient parallel neural network training on heterogeneous computing systems with varying resource availability.

Ax Valter Schütz, Han Wu, Reza Rezvan, Linus Aronsson, Morteza Haghir Chehreghani 2/24/2026

AFABench: A Generic Framework for Benchmarking Active Feature Acquisition

AFABench framework for benchmarking active feature acquisition methods that dynamically select informative features under acquisition cost constraints.

Ax Minqi Jiang, Andrei Lupu, Yoram Bachrach 2/24/2026

Bootstrapping Task Spaces for Self-Improvement

Presents Exploratory Iteration (ExIt), a family of RL methods enabling agents to self-improve through iterative refinement without fixed iteration limits.

Ax Wei Chen, Yuqian Wu, Yuanshao Zhu, Xixuan Hao, Shiyu Wang, Xiaofang Zhou, Yuxuan Liang 2/24/2026

Select, then Balance: Exploring Exogenous Variable Modeling of Spatio-Temporal Forecasting

Explores exogenous variable modeling in spatio-temporal forecasting systems to improve prediction accuracy.

Ax Geon Lee, Bhuvesh Kumar, Clark Mingxuan Ju, Tong Zhao, Kijung Shin, Neil Shah, Liam Collins 2/24/2026

Sequential Data Augmentation for Generative Recommendation

Data augmentation strategies for generative recommendation systems improving generalization in sequential user behavior prediction.

Ax Niccolò Rocchi, Fabio Stella, Cassio de Campos 2/24/2026

Towards Privacy-Aware Bayesian Networks: A Credal Approach

Privacy-aware Bayesian network approach using credal sets for secure public release of probabilistic graphical models.

Ax Yiyuan Pan, Zhe Liu, Hesheng Wang 2/24/2026

Wonder Wins Ways: Curiosity-Driven Exploration through Multi-Agent Contextual Calibration

Multi-agent RL with curiosity-driven exploration using contextual calibration to distinguish novelty from environmental stochasticity.

Ax Yinuo Ren, Wenhao Gao, Lexing Ying, Grant M. Rotskoff, Jiequn Han 2/24/2026

DriftLite: Lightweight Drift Control for Inference-Time Scaling of Diffusion Models

DriftLite: Training-free particle-based approach for inference-time diffusion model adaptation to new distributions.

Ax Shirin Alanova, Kristina Kazistova, Ekaterina Galaeva, Alina Kostromina, Vladimir Smirnov, Redko Dmitry, Alexey Dontsov, Maxim Zhelnin, Evgeny Burnaev, Egor Shvetsov 2/24/2026

Lightweight error mitigation strategies for post-training N:M activation sparsity in LLMs

Error mitigation methods for post-training N:M activation sparsity in LLMs enabling dynamic input-adaptive compression.

Ax Xingjian Wu, Jianxin Jin, Wanghui Qiu, Peng Chen, Yang Shu, Bin Yang, Chenjuan Guo 2/24/2026

Aurora: Towards Universal Generative Multimodal Time Series Forecasting

Aurora: Multimodal foundation model for cross-domain time series forecasting integrating text and temporal data.

Ax Narada Maugin, Tristan Cazenave 2/24/2026

SpinGPT: A Large-Language-Model Approach to Playing Poker Correctly

SpinGPT applies an LLM approach to poker strategy, addressing the computational limits of counterfactual regret minimization (CFR) in multi-player game settings.

Ax Aman Gupta, Rafael Celente, Abhishek Shivanna, D. T. Braithwaite, Gregory Dexter, Shao Tang, Hiroto Udagawa, Daniel Silva, Rohan Ramanath, S. Sathiya Keerthi 2/24/2026

Effective Quantization of Muon Optimizer States

8-bit blockwise quantization of Muon optimizer states reducing memory overhead for large-scale LLM pretraining.

Ax Wei Wang, Dong-Dong Wu, Ming Li, Jingxiong Zhang, Gang Niu, Masashi Sugiyama 2/24/2026

Accessible, Realistic, and Fair Evaluation of Positive-Unlabeled Learning Algorithms

Framework for standardizing evaluation of positive-unlabeled learning algorithms under consistent experimental settings.

Ax Hao Chen, Tao Han, Jie Zhang, Song Guo, Lei Bai 2/24/2026

STCast: Adaptive Boundary Alignment for Global and Regional Weather Forecasting

Weather forecasting method using adaptive boundary alignment for regional and global predictions with spatial-temporal modeling.

Ax Jubayer Ibn Hamid, Ifdita Hasan Orney, Ellen Xu, Chelsea Finn, Dorsa Sadigh 2/24/2026

Polychromic Objectives for Reinforcement Learning

Polychromic objectives for RL fine-tuning preventing policy collapse and preserving diversity in pretrained model behaviors.

Ax Jaewoo Lee, Minsu Kim, Sanghyeok Choi, Inhyuck Song, Sujin Yun, Hyeongyu Kang, Woocheol Shin, Taeyoung Yun, Kiyoung Om, Jinkyoo Park 2/24/2026

Diffusion Alignment as Variational Expectation-Maximization

Diffusion Alignment as Variational EM framework addressing reward over-optimization and mode collapse in diffusion model alignment.

Ax Yuchen Cai, Ding Cao, Xin Xu, Zijun Yao, Yuqing Huang, Zhenyu Tan, Benyi Zhang, Guangzhong Sun, Guiquan Liu, Junfeng Fang 2/24/2026

On Predictability of Reinforcement Learning Dynamics for Large Language Models

Analysis of RL-induced parameter dynamics in LLMs revealing rank-1 dominance in reasoning improvements and predictability of training trajectories.

Ax Kwanhee Lee, Hyeondo Jang, Dongyeop Lee, Dan Alistarh, Namhoon Lee 2/24/2026

The Unseen Frontier: Pushing the Limits of LLM Sparsity with Surrogate-Free ADMM

Surrogate-free ADMM method for LLM pruning achieving >50% sparsity without accuracy degradation, breaking through conventional compression limits.

Ax Anirudh Subramanyam, Yuxin Chen, Robert L. Grossman 2/24/2026

Scaling Laws Revisited: Modeling the Role of Data Quality in Language Model Pretraining

Scaling law formalization incorporating data quality parameter for language model pretraining, extending traditional model/dataset size relationships.

Ax Xiangyu Shi, Marco Chiesa, Gerald Q. Maguire Jr., Dejan Kostic 2/24/2026

KVComm: Enabling Efficient LLM Communication through Selective KV Sharing

KVComm: Communication framework for multi-agent LLM systems using selective key-value sharing instead of natural language or hidden states.

Ax Nirjhar Das, Mohit Sharma, Praharsh Nanavati, Kirankumar Shiragur, Amit Deshpande 2/24/2026

Cost Efficient Fairness Audit Under Partial Feedback

Fairness auditing framework for classifiers with partial feedback using cost-aware data acquisition strategies.

Ax Philipp Becker, Niklas Freymuth, Serge Thilges, Fabian Otto, Gerhard Neumann 2/24/2026

TROLL: Trust Regions improve Reinforcement Learning for Large Language Models

TROLL: Trust region-based RL method improving upon PPO clipping for LLM fine-tuning, achieving more stable and effective reward-based training.

Ax Yuchen Zhu, Wei Guo, Jaemoo Choi, Petr Molodyk, Bo Yuan, Molei Tao, Yongxin Chen 2/24/2026

Enhancing Reasoning for Diffusion LLMs via Distribution Matching Policy Optimization

Novel RL algorithm for diffusion LLMs using distribution matching policy optimization to improve reasoning capabilities and match autoregressive LLM performance.