HDPO: Hybrid Distillation Policy Optimization via Privileged Self-Distillation
Training method for LLMs on mathematical reasoning combining RL with privileged self-distillation to improve learning on hard problems.
C++ implementation of neural network verification tool supporting bound propagation methods for DNN formal analysis.
Safe reinforcement learning method addressing constraint violations in off-policy exploration through constrained optimistic exploration Q-learning.
ML method for deep-sea microbial analysis that uses knowledge-enhancement techniques to compensate for small datasets.
Online multi-robot task assignment and route scheduling in smart factories using wireless communication under partial observability.
DIET: structured pruning method for LLMs using dimension-wise global importance scores that adapt to task-specific requirements.
Uses LLMs to generate portable patient embeddings from clinical time series that transfer across hospitals with minimal retraining.
Analysis of design challenges in iterative generative optimization using LLMs for self-improving agents; identifies hidden choices engineers must make.
Dimension-free zeroth-order estimator for PINNs addressing spatial derivative complexity and memory overhead in high-dimensional PDEs.
Iterative unsupervised framework for feature selection and clustering in high-dimensional data by recovering influential features.
Generative framework using Lagrangian relaxation-guided score-based generation to solve mixed-integer linear programming with diverse solutions.
MoE-Sieve: routing-guided LoRA fine-tuning framework for MoE models that adapts to skewed expert routing patterns for efficiency.
Investigates optimal sensor placement for GNN-based leakage detection in water distribution networks.
Dual guidance approach for RL-based LLM training combining external verification and internal experience to improve reasoning task performance.
Graph representation learning for analog circuit electrical equivalence to support electronic design automation tasks.
Causal inference framework for learning disentangled representations from multiplex graphs by separating shared and layer-specific information.
RLHF-aligned LLMs exhibit response homogenization limiting uncertainty estimation; analyzes alignment tax impact across different tasks and sampling methods.
Gossip-based distributed machine learning algorithms for IoT networks with privacy constraints and limited computation/communication resources.
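As a generic illustration of the gossip primitive underlying such methods (not the paper's specific algorithm; the topology and values here are made up), each node repeatedly averages its local value with a random neighbor, and all nodes converge to the global mean without any central coordinator:

```python
import numpy as np

def gossip_average(values, neighbors, rounds=200):
    """Randomized pairwise gossip: each round, one node averages
    its value with a uniformly chosen neighbor. Pairwise averaging
    preserves the global sum, so all nodes converge to the mean."""
    rng = np.random.default_rng(0)
    x = np.array(values, dtype=float)
    for _ in range(rounds):
        i = rng.integers(len(x))              # wake a random node
        j = rng.choice(neighbors[i])          # pick one of its neighbors
        x[i] = x[j] = (x[i] + x[j]) / 2.0     # pairwise averaging step
    return x

# Hypothetical ring of 4 IoT nodes holding local measurements.
vals = [1.0, 3.0, 5.0, 7.0]
ring = {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [2, 0]}
est = gossip_average(vals, ring)
```

The same primitive extends to model parameters instead of scalars; communication stays local and peer-to-peer, which is what makes it attractive under IoT resource and privacy constraints.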
Graph convolutional networks using reservoir computing to handle complex, dynamic graph data and capture long-range dependencies.
Bayesian optimization framework for tuning control policies using human preferences and pairwise comparisons instead of quantitative evaluations.
Neural operator learning method combining linear and nonlinear effects for efficient PDE solving without repeated solution computation.
FPGA-based implementation of weightless neural networks using Tsetlin automata for on-chip training and inference with low latency and complexity.
Scalable RL pipeline for improving LLM code generation through synthetic data and curriculum learning, addressing data diversity challenges at scale.
Transformer architecture for multivariate time series forecasting using multi-resolution representations to capture short-term and long-range dependencies.
Study on privacy vulnerabilities in deep learning time series imputation models, demonstrating membership inference attacks in black-box settings.
Nonnegative matrix factorization approach using maximum-volume basis vectors for identifying NMF solutions in highly mixed data.
Methods for assessing adversarial attack vulnerability and augmenting identity recognition models trained on small LiDAR skeleton datasets.
Framework for probabilistic time series forecasting that explicitly models heteroscedasticity and time-varying conditional variances in nonstationary dynamics.
ReGuider representation-level supervision method improves time series forecasting by capturing extreme patterns and salient dynamics in temporal representations.
DeepDTF dual-branch transformer framework predicts cancer drug response from multi-omics data, addressing cross-modal alignment in precision oncology.
Vision-language model approach for image clustering using LLM-generated text features with adaptive semantic centers to improve inter-class discriminability.
Cost-Sensitive Neighborhood Aggregation (CSNA) GNN layer uses per-edge routing to treat heterophilous edges differently depending on whether they lie in an adversarial or an informative regime.
Framework uses LLMs to automatically design reward functions for cooperative multi-agent reinforcement learning, synthesizing executable reward programs from environment instrumentation.
Multi-agent reinforcement learning approach for decentralized adaptive traffic signal control using learned coordination in partially observable environments.
MolEvolve framework uses LLM guidance with evolutionary search for interpretable molecular optimization, addressing activity cliffs and lack of interpretability.
LSTM functional models learn nonlinear mappings from wave-vessel time series to predict parametric roll episodes and statistical shifts in ship responses.
CUA-Suite dataset provides massive human-annotated continuous video demonstrations for training computer-use agents on desktop automation tasks, addressing data bottleneck.
Transfer learning framework using LSTM and conformal prediction for lithium-ion battery state-of-health forecasting across manufacturing variations.
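For context on the conformal-prediction component, here is a minimal split-conformal sketch on synthetic residuals (not the paper's model; the Gaussian data and 90% level are illustrative assumptions). A quantile of held-out calibration residuals gives a radius whose symmetric interval covers new points at roughly the target rate:

```python
import numpy as np

def conformal_radius(residuals_cal, alpha=0.1):
    """Split conformal prediction: the ceil((n+1)(1-alpha))-th smallest
    calibration residual gives a radius q such that [yhat - q, yhat + q]
    covers a fresh exchangeable point with probability >= 1 - alpha."""
    n = len(residuals_cal)
    k = int(np.ceil((n + 1) * (1 - alpha)))   # conformal quantile index
    return np.sort(residuals_cal)[min(k, n) - 1]

rng = np.random.default_rng(1)
# Pretend the point forecaster predicts 0, so residuals are |y|.
y_cal = rng.normal(0.0, 1.0, 500)
q = conformal_radius(np.abs(y_cal), alpha=0.1)
y_test = rng.normal(0.0, 1.0, 2000)
coverage = np.mean(np.abs(y_test) <= q)       # empirically near 0.9
```

The appeal for state-of-health forecasting is that the coverage guarantee is distribution-free, so it can wrap an LSTM transferred across manufacturing variations without re-deriving error models.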
Theoretical work on uniform laws of large numbers in product spaces extending VC dimension theory under product distribution assumptions.
Sequential-AMPC uses recurrent neural networks to approximate nonlinear model predictive control offline, reducing online computation for embedded hardware control systems.
AI agents using Claude Code autonomously discovered novel adversarial attack algorithms for LLMs that outperform 30+ existing methods in jailbreaking and prompt injection attacks.
Agentic Variation Operators replace fixed mutation/crossover in evolutionary search with autonomous coding agents consulting lineage and domain knowledge.
TuneShift-KD enables knowledge distillation and transfer of fine-tuned specialized knowledge to newer LLM architectures without access to original training data.
Multi-dimensional evaluation framework for uncertainty attribution methods in explainable AI addressing inconsistent evaluation across heterogeneous proxy tasks.
UI-Voyager is a self-evolving mobile GUI agent using rejection fine-tuning and credit assignment to learn from failed trajectories in long-horizon tasks.
RAVEN applies generative pretraining to structured electronic health records using recurrence-aware next-visit event prediction on 1M+ patient dataset.
DreamerAD enables efficient RL for autonomous driving via a latent world model, achieving an 80x speedup by compressing diffusion sampling from 100 steps to 1.
Multilevel Euler-Maruyama method accelerates diffusion model solving via multi-level approximators with polynomial speedup in HTMC regime.
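For readers unfamiliar with the multilevel idea, here is a generic multilevel Euler-Maruyama Monte Carlo sketch on geometric Brownian motion (a standard toy SDE, not the diffusion-model setting of the paper; all parameters are illustrative). The estimator telescopes a cheap coarse level plus coupled fine-minus-coarse corrections:

```python
import numpy as np

def em_path(x0, mu, sig, T, n_steps, dW):
    """Euler-Maruyama discretization of dX = mu*X dt + sig*X dW."""
    x, dt = x0, T / n_steps
    for k in range(n_steps):
        x = x + mu * x * dt + sig * x * dW[k]
    return x

def mlmc_mean(mu=0.05, sig=0.2, T=1.0, L=4, n_samples=20000, seed=0):
    """Multilevel estimator of E[X_T]: level l uses 2^l Euler steps;
    corrections E[f_l - f_{l-1}] use coupled Brownian increments
    (coarse increments are sums of consecutive fine increments)."""
    rng = np.random.default_rng(seed)
    est = 0.0
    for l in range(L + 1):
        nf = 2 ** l
        dW = rng.normal(0.0, np.sqrt(T / nf), (n_samples, nf))
        fine = np.array([em_path(1.0, mu, sig, T, nf, w) for w in dW])
        if l == 0:
            est += fine.mean()
        else:
            dWc = dW.reshape(n_samples, nf // 2, 2).sum(axis=2)
            coarse = np.array([em_path(1.0, mu, sig, T, nf // 2, w)
                               for w in dWc])
            est += (fine - coarse).mean()   # telescoping correction
    return est

est = mlmc_mean()   # true E[X_T] = exp(mu*T) ~ 1.0513
```

Because the coupled corrections have small variance, most samples can be spent on the cheap coarse level, which is the source of the speedup the multilevel analysis formalizes.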
KARMA applies LLMs to personalized search at Taobao by addressing knowledge-action gap through regularized multimodal alignment for next-item prediction.
Deletion-Insertion Diffusion language models replace masking paradigm with discrete diffusion processes for improved computational efficiency and generation flexibility.