Isolater - Feed

Ax Yannik Hahn, Jan Voets, Antonin Koenigsfeld, Hasan Tercan, Tobias Meisen 2/18/2026

Out of Distribution Detection for Efficient Continual Learning in Quality Prediction for Arc Welding

Out-of-distribution detection for continual learning in arc welding quality prediction using VQ-VAE Transformer architecture.

Ax Binghang Lu, Changhong Mou, Guang Lin 2/18/2026

Morephy-Net: An Evolutionary Multi-objective Optimization for Replica-Exchange-based Physics-informed Neural Operator Learning Networks

Morephy-Net uses multi-objective evolutionary optimization for physics-informed neural operators on parametric PDEs in noisy regimes.

Ax Jean-Michel Tucny, Abhisek Ganguly, Santosh Ansumali, Sauro Succi 2/18/2026

Randomness and signal propagation in physics-informed neural networks (PINNs): A neural PDE perspective

Analyzes signal propagation and weight randomness in physics-informed neural networks using spectral/statistical properties.

Ax Sarah Seifi, Anass Ibrahimi, Tobias Sukianto, Cecilia Carbonelli, Lorenzo Servadei, Robert Wille 2/18/2026

GenFacts-Generative Counterfactual Explanations for Multi-Variate Time Series

GenFacts generates valid counterfactual explanations for multivariate time series using class-discriminative VAE.

Ax Ehsan Futuhi, Nathan R. Sturtevant 2/18/2026

Learning Admissible Heuristics for A*: Theory and Practice

Learns admissible heuristics for A* search algorithms using constrained optimization to guarantee solution optimality.

Ax Jinwoo Kim, Xingyue Huang, Krzysztof Olejniczak, Kyungbin Min, Michael Bronstein, Seunghoon Hong, \.Ismail \.Ilkan Ceylan 2/18/2026

Flock: A Knowledge Graph Foundation Model via Learning on Random Walks

Flock: knowledge graph foundation model using random walk learning for zero-shot link prediction on novel entities and relations.

Ax Jacob Feitelberg, Dwaipayan Saha, Kyuseong Choi, Zaid Ahmad, Anish Agarwal, Raaz Dwivedi 2/18/2026

TabImpute: Universal Zero-Shot Imputation for Tabular Data

TabImpute: zero-shot universal imputation for tabular data with missing values using language model approach.

Ax Wendi Li, Changdae Oh, Sharon Li 2/18/2026

General Exploratory Bonus for Optimistic Exploration in RLHF

General exploratory bonus method for optimistic exploration in RLHF that avoids bias toward reference model high-probability regions.

Ax Tao Tao, Maissam Barkeshli 2/18/2026

Learning Pseudorandom Numbers with Transformers: Permuted Congruential Generators, Curricula, and Interpretability

Transformer models learn permuted congruential generator sequences via in-context prediction with curriculum learning and interpretability analysis.

Ax Linqi Zhou, Mathias Parger, Ayaan Haque, Jiaming Song 2/18/2026

Terminal Velocity Matching

Terminal Velocity Matching generalizes flow matching for one/few-step generative modeling with Wasserstein distance bounds.

Ax Haoyu Lei, Chin Wa Lau, Kaiwen Zhou, Nian Guo, Farzan Farnia 2/18/2026

Syndrome-Flow Consistency Model Achieves One-step Denoising Error Correction Codes

Applies consistency models to error correction codes for one-step neural decoding in low-latency communication settings.

Ax Luca Colombo, Fabrizio Pittorino, Daniele Zambon, Carlo Baldassi, Manuel Roveri, Cesare Alippi 2/18/2026

BEP: A Binary Error Propagation Algorithm for Binary Neural Networks Training

BEP algorithm trains binary neural networks with constrained weights/activations via error propagation for resource-constrained deployment.

Ax Ata Akbari Asanjan, Milad Memarzadeh, Bryan Matthews, Nikunj Oza 2/18/2026

Improving Variational Autoencoder using Random Fourier Transformation: An Aviation Safety Anomaly Detection Case-Study

Improves VAE and autoencoder training using random Fourier transformation with frequency principle analysis for aviation safety anomaly detection.

Ax Anantha Sharma 2/18/2026

ARGUS: Adaptive Rotation-Invariant Geometric Unsupervised System

ARGUS detects distributional drift in high-dimensional data streams using local statistics over fixed spatial partitions of data manifold.

Ax Seunghwan Jang, SooJean Han 2/18/2026

Stratified Hazard Sampling: Minimal-Variance Event Scheduling for CTMC/DTMC Discrete Diffusion and Flow Models

Stratified hazard sampling reduces variance in discrete diffusion/flow models by optimizing event scheduling in CTMC/DTMC processes.

Ax Nilin Abrahamsen 2/18/2026

PROMA: Projected Microbatch Accumulation for Reference-Free Proximal Policy Updates

PROMA: reference-free proximal policy method for LLM training that controls KL divergence via gradient projection without reference model.

Ax Francisco Giral, \'Alvaro Manzano, Ignacio G\'omez, Ricardo Vinuesa, Soledad Le Clainche 2/18/2026

GenDA: Generative Data Assimilation on Complex Urban Areas via Classifier-Free Diffusion Guidance

GenDA reconstructs high-resolution urban wind fields from sparse sensor data using graph-based diffusion and classifier-free guidance.

Ax Wang Zixian 2/18/2026

Orthogonalized Policy Optimization:Decoupling Sampling Geometry from Optimization Geometry in RLHF

OPO: theoretical framework for LLM alignment using constrained proximal policy optimization with work-dissipation principle and chi-square geometry.

Ax Gong Gao, Weidong Zhao, Xianhui Liu, Ning Jia 2/18/2026

Improving Policy Exploitation in Online Reinforcement Learning with Instant Retrospect Action

Instant Retrospect Action algorithm improves policy exploitation in online RL through Q-network representation learning.

Ax Md Muhtasim Munif Fahim, Soyda Humyra Yesmin, Saiful Islam, Md. Palash Bin Faruque, Md. A. Salam, Md. Mahfuz Uddin, Samiul Islam, Tofayel Ahmed, Md. Binyamin, Md. Rezaul Karim 2/18/2026

Green-NAS: A Global-Scale Multi-Objective Neural Architecture Search for Robust and Efficient Edge-Native Weather Forecasting

Green-NAS multi-objective neural architecture search optimizes weather forecasting models for efficiency and carbon footprint.

Ax Wei Chen, Jiacheng Li, Shigui Li, Zhiqi Lin, Junmei Yang, John Paisley, Delu Zeng 2/18/2026

Don't Forget Its Variance! The Minimum Path Variance Principle for Accurate and Stable Score-Based Models

MinPV Principle minimizes path variance in score-based models to improve accuracy and stability.

Ax Raj Ghugare, Micha{\l} Bortkiewicz, Alicja Ziarko, Benjamin Eysenbach 2/18/2026

On the Role of Iterative Computation in Reinforcement Learning

Analyzes role of iterative computation in RL, showing policies benefit from additional compute beyond fixed parameters.

Ax Jian Qian, Chen-Yu Wei 2/18/2026

Achieving Optimal Static and Dynamic Regret Simultaneously in Bandits with Deterministic Losses

Algorithm achieving simultaneous optimal static and dynamic regret in adversarial multi-armed bandits.

Ax Lior Cohen, Ofir Nabati, Kaixin Wang, Navdeep Kumar, Shie Mannor 2/18/2026

Horizon Imagination: Efficient On-Policy Rollout in Diffusion World Models

Horizon Imagination improves efficiency of diffusion-based world models for RL by denoising multiple future observations.

Ax Tatsuya Sagawa, Ryosuke Kojima 2/18/2026

How Well Do Large-Scale Chemical Language Models Transfer to Downstream Tasks?

Systematic evaluation of chemical language model scaling on molecular property prediction downstream tasks.

Ax Alexander W. Goodall, Francesco Belardinelli 2/18/2026

Safe Reinforcement Learning via Recovery-based Shielding with Gaussian Process Dynamics Models

Recovery-based shielding framework integrates Gaussian process models with RL for provably safe control in continuous systems.

Ax Xiongxiao Xu, Solomon Abera Bekele, Brice Videau, Kai Shu 2/18/2026

Online GPU Energy Optimization with Switching-Aware Bandits

Online GPU energy optimization using bandit algorithms to reduce power consumption in HPC systems.

Ax Lorenzo Croissant (CREST, FAIRPLAY, ENSAE Paris) 2/18/2026

Linear Bandits beyond Inner Product Spaces, the case of Bandit Optimal Transport

Extends linear bandits theory beyond inner product spaces using optimal transport for recommendation and clinical systems.

Ax Xiongxiao Xu, Haoran Wang, Yueqing Liang, Philip S. Yu, Yue Zhao, Kai Shu 2/18/2026

Can Multimodal LLMs Perform Time Series Anomaly Detection?

Evaluates multimodal LLMs and vision-language models for time series anomaly detection in systems monitoring.

Ax Frank Nielsen 2/18/2026

Curved representational Bregman divergences and their applications

Mathematical analysis of curved Bregman divergences and their applications in statistical learning.

Ax Zhanliang Wang, Da Wu, Quan Nguyen, Zhuoran Xu, Kai Wang 2/18/2026

Multimodal Integrated Knowledge Transfer to Large Language Models through Preference Optimization with Biomedical Applications

MINT framework aligns LLMs with biomedical knowledge using preference optimization on multimodal data.

Ax Yiwei Ou, Xiaobin Ren, Ronggui Sun, Guansong Gao, Kaiqi Zhao, Manfredo Manfredini 2/18/2026

Scale-Invariant Regret Matching and Online Learning with Optimal Convergence: Bridging Theory and Practice in Zero-Sum Games

Convergence analysis of regret matching in zero-sum games bridging theoretical and practical game-solving methods.