Isolater - Feed

Ax Luigi Ciceri, Corrado Mio, Jianyi Lin, Gabriele Gianini 2/17/2026

Geometry-Aware Physics-Informed PointNets for Modeling Flows Across Porous Structures

Physics-informed PointNets and geometry-aware neural operators for modeling flows across porous structures with coupled physics and diverse geometries.

Ax Anton Korznikov, Andrey Galichin, Alexey Dontsov, Oleg Rogov, Ivan Oseledets, Elena Tutubalina 2/17/2026

Sanity Checks for Sparse Autoencoders: Do SAEs Beat Random Baselines?

Sanity checks validating whether sparse autoencoders recover meaningful features beyond random baselines for neural network interpretability.

Ax Xuanbo Su, Hao Luo, Yingfang Zhang, Lijun Zhang 2/17/2026

ROAST: Rollout-based On-distribution Activation Steering Technique

ROAST uses on-distribution rollouts for parameter-efficient LLM activation steering at inference time, replacing off-distribution supervision with continuous soft scaling.

Ax Rizhen Hu, Yuan Cao, Boao Kong, Mou Sun, Kun Yuan 2/17/2026

Synergistic Intra- and Cross-Layer Regularization Losses for MoE Expert Specialization

Plug-and-play regularization losses for Mixture-of-Experts models promoting expert specialization across intra- and cross-layers without structural modifications.

Ax Max Fomin 2/17/2026

When Benchmarks Lie: Evaluating Malicious Prompt Classifiers Under True Distribution Shift

Comprehensive analysis of malicious prompt classifier robustness under distribution shift with 18 datasets spanning jailbreaks and prompt injections for LLM agents.

Ax Yiran Guo, Zhongjian Qiao, Yingqi Xie, Jie Liu, Dan Ye, Ruiqing Zhang, Shuang Qiu, Lijie Xu 2/17/2026

Deep Dense Exploration for LLM Reinforcement Learning via Pivot-Driven Resampling

Pivot-driven resampling technique for deep dense exploration in LLM RL, discovering high-quality trajectories within limited sampling budget from language space.

Ax Nicolas Zumarraga, Thomas Kaar, Ning Wang, Maxwell A. Xu, Max Rosenblattl, Markus Kreft, Kevin O'Sullivan, Paul Schmiedmayer, Patrick Langer, Robert Jakob 2/17/2026

TS-Haystack: A Multi-Scale Retrieval Benchmark for Time Series Language Models

TS-Haystack benchmark evaluates time series language models on long-context retrieval with millions of datapoints, requiring precise temporal localization.

Ax Jinbo Wang, Binghui Li, Zhanpeng Zhou, Mingze Wang, Yuxuan Sun, Jiaqi Zhang, Xunliang Cai, Lei Wu 2/17/2026

Fast Catch-Up, Late Switching: Optimal Batch Size Scheduling via Functional Scaling Laws

Characterizes optimal batch size scheduling for large-scale deep learning under fixed data budget using functional scaling law framework.

Ax Omin Kwon, Yeonjae Kim, Doyeon Kim, Minseo Kim, Yeonhong Park, Jae W. Lee 2/17/2026

MAGE: All-[MASK] Block Already Knows Where to Look in Diffusion LLM

MAGE optimizes KV cache memory access in block diffusion LLMs for long-context settings using dynamic sparse attention adapted to block diffusion uniqueness.

Ax Seyedsaman Emami, Daniel Hern\'andez-Lobato, Gonzalo Mart\'inez-Mu\~noz 2/17/2026

Robust multi-task boosting using clustering and local ensembling

RMB-CLE framework for multi-task learning integrating error-based task clustering with local ensembling to mitigate negative transfer from unrelated tasks.

Ax Yaxuan Kong, Hoyoung Lee, Yoontae Hwang, Alejandro Lopez-Lira, Bradford Levy, Dhagash Mehta, Qingsong Wen, Chanyeol Choi, Yongjae Lee, Stefan Zohren 2/17/2026

Evaluating LLMs in Finance Requires Explicit Bias Consideration

Analysis identifying five recurring biases in financial LLM applications: look-ahead, survivorship, narrative, objective, and cost bias that invalidate deployment claims.

Ax Pinqiao Wang, Sheng Li 2/17/2026

Multi-Agent Debate: A Unified Agentic Framework for Tabular Anomaly Detection

MAD framework treats tabular anomaly detection as multi-agent debate, leveraging disagreement from heterogeneous model families under distribution shift and rare-anomaly regimes.

Ax Manal Rahal, Bestoun S. Ahmed, Roger Renstr\"om, Robert Stener 2/17/2026

Cross-household Transfer Learning Approach with LSTM-based Demand Forecasting

Transfer learning approach using LSTM for cross-household hot water demand forecasting to optimize heat pump operation and reduce energy waste.

Ax Yilun Kuang, Yash Dagade, Deep Chakraborty, Erik Learned-Miller, Randall Balestriero, Tim G. J. Rudner, Yann LeCun 2/17/2026

Radial-VCReg: More Informative Representation Learning Through Radial Gaussianization

Radial-VCReg augments VCReg with radial Gaussianization loss for improved self-supervised representation learning by aligning feature norms with Chi distribution.

Ax Boning Zhou, Ziyu Wang, Han Hong, Haoqi Hu 2/17/2026

Integrating Unstructured Text into Causal Inference: Empirical Evidence from Real Data

Framework leveraging transformer-based language models for causal inference from unstructured text, comparing estimates against structured data baselines.

Ax Lamine Rihani 2/17/2026

Reverse N-Wise Output-Oriented Testing for AI/ML and Quantum Computing Systems

Testing methodology for AI/ML and quantum systems addressing high-dimensional inputs, probabilistic outputs, and evaluation of trustworthiness, fairness, and robustness.

Ax Ruomeng Ding, Tianwei Gao, Thomas P. Zollo, Eitan Bachmat, Richard Zemel, Zhun Deng 2/17/2026

Whom to Query for What: Adaptive Group Elicitation via Multi-Turn LLM Interactions

Framework for adaptive multi-turn LLM interactions to efficiently elicit group-level information from surveys, optimizing respondent selection and questioning strategy.

Ax Kris Shengjun Dong, Sahil Modi, Dima Nikiforov, Sana Damani, Edward Lin, Siva Kumar Sastry Hari, Christos Kozyrakis 2/17/2026

KernelBlaster: Continual Cross-Task CUDA Optimization via Memory-Augmented In-Context Reinforcement Learning

KernelBlaster uses agentic workflows with in-context RL to optimize CUDA code across GPU architectures, aggregating knowledge from prior optimizations without expensive finetuning.

Ax Edwin Chen, Zulekha Bibi 2/17/2026

Machine Learning as a Tool (MLAT): A Framework for Integrating Statistical ML Models as Callable Tools within LLM Agent Workflows

MLAT framework exposes pre-trained ML models as callable tools within LLM agent workflows, enabling agents to invoke quantitative predictions and reason about outputs contextually.

Ax Songyuan Li, Jia Hu, Ahmed M. Abdelmoniem, Geyong Min, Haojun Huang, Jiwei Huang 2/17/2026

DeepFusion: Accelerating MoE Training via Federated Knowledge Distillation from Heterogeneous Edge Devices

Federated learning approach (DeepFusion) for training MoE-based LLMs using knowledge distillation from heterogeneous edge devices, enabling privacy-preserving distributed training.

Ax Hani Beirami, M M Manjurul Islam 2/17/2026

Conformal Signal Temporal Logic for Robust Reinforcement Learning Control: A Case Study

Applies conformal Signal Temporal Logic (STL) specifications to enhance safety and robustness of RL control in aerospace (F-16 simulation), encoding control objectives formally.

Ax Zhi Zhang, Zhen Han, Costas Mavromatis, Qi Zhu, Yunyi Zhang, Sheng Guan, Dingmin Wang, Xiong Zhou, Shuai Wang, Soji Adeshina, Vassilis Ioannidis, Huzefa Rangwala 2/17/2026

Train Less, Learn More: Adaptive Efficient Rollout Optimization for Group-Based Reinforcement Learning

Adaptive efficient rollout optimization (Train Less, Learn More) for Group Relative Policy Optimization in LLM post-training, reducing redundant rollouts when group outcomes are identical.

Ax Mathias Jackermeier, Mattia Giuri, Jacques Cloete, Alessandro Abate 2/17/2026

Zero-Shot Instruction Following in RL via Structured LTL Representations

Zero-shot instruction following in multi-task RL using linear temporal logic (LTL) representations to specify temporally extended tasks for generalist agent policies.

Ax Kensuke Ajimoto, Yuma Yamamoto, Yoshifumi Kusunoki, Tomoharu Nakashima 2/17/2026

A Study on Multi-Class Online Fuzzy Classifiers for Dynamic Environments

Multi-class online fuzzy classifier for dynamic environments with human-defined antecedent fuzzy sets and learned consequent values in streaming data settings.

Ax Abdelali Bouyahia, Fr\'ed\'eric LeBlanc, Mario Marchand 2/17/2026

The geometry of invariant learning: an information-theoretic analysis of data augmentation and generalization

Information-theoretic framework explaining data augmentation's role in generalization and invariance learning, providing theoretical justification for augmentation effectiveness.

Ax Prithwijit Chowdhury, Ahmad Mustafa, Mohit Prabhushankar, Ghassan AlRegib 2/17/2026

A unified framework for evaluating the robustness of machine-learning interpretability for prospect risking

Framework for evaluating robustness of ML interpretability methods (LIME, SHAP) in hydrocarbon prospect risking using geophysical tabular data classification.

Ax Arnav Chavan, Nahush Lele, Udbhav Bamba, Sankalp Dayal, Aditi Raghunathan, Deepak Gupta 2/17/2026

S2D: Selective Spectral Decay for Quantization-Friendly Conditioning of Neural Activations

Addresses activation outliers in transformer quantization through spectral decay technique (S2D), establishing correlation between pre-training scale and outlier severity with theoretical analysis.

Ax Ian Su, Gaurav Purushothaman, Jey Narayan, Ruhika Goel, Kevin Zhu, Sunishchal Dev, Yash More, Maheep Chaudhary 2/17/2026

Broken Chains: The Cost of Incomplete Reasoning in LLMs

Framework analyzing reasoning modalities (code, natural language, hybrid) in LLMs under token constraints, evaluating performance tradeoffs for reasoning-specialized models.

Ax Hasi Hays 2/17/2026

Selective Synchronization Attention

Novel attention mechanism (SSA) replacing dot-product self-attention with Kuramoto model solution, reducing quadratic complexity and grounding in biological neural computation.

Ax Lei Chen, Yuan Meng, Xiaoyu Zhan, Zhi Wang, Wenwu Zhu 2/17/2026

WiSparse: Boosting LLM Inference Efficiency with Weight-Aware Mixed Activation Sparsity

Training-free activation sparsity method (WiSparse) for efficient LLM inference considering weight-aware interactions and inter-block sensitivity, reducing computation and memory access.

Ax Huaming Du, Tao Hu, Yijie Huang, Yu Zhao, Guisong Liu, Tao Gu, Gang Kou, Carl Yang 2/17/2026

Traceable Latent Variable Discovery Based on Multi-Agent Collaboration

Multi-agent collaboration framework for discovering latent causal variables, overcoming limitations of traditional causal discovery algorithms that assume no latent confounders.

Ax Hong Li, Zhen Zhou, Honggang Zhang, Yuping Luo, Xinyue Wang, Han Gong, Zhiyuan Liu 2/17/2026

Silent Inconsistency in Data-Parallel Full Fine-Tuning: Diagnosing Worker-Level Optimization Misalignment

Identifies 'silent inconsistency' in data-parallel fine-tuning of LLMs where worker-level optimization dynamics misalign despite synchronized parameters, impacting training quality.

Ax Chang Liu, Yiran Zhao, Lawrence Liu, Yaoqi Ye, Csaba Szepesv\'ari, Lin F. Yang 2/17/2026

LACONIC: Length-Aware Constrained Reinforcement Learning for LLM

Reinforcement learning approach (LACONIC) for controlling LLM response length during training without fixed heuristic reward shaping, addressing inference latency and computational overhead.

Ax Aadirupa Saha, Amith Bhat, Haipeng Luo 2/17/2026

One Good Source is All You Need: Near-Optimal Regret for Bandits under Heterogeneous Noise

Multi-armed bandit algorithm (SOAR) for heterogeneous noise sources that adaptively selects data sources to minimize regret, applicable to federated or multi-source learning scenarios.

Ax Buze Zhang, Jinkai Tao, Zilang Zeng, Neil He, Ali Maatouk, Menglin Yang, Rex Ying 2/17/2026

Parameter-Efficient Fine-Tuning of LLMs with Mixture of Space Experts

Novel parameter-efficient fine-tuning method for LLMs using mixture of experts in alternative geometric spaces (hyperbolic, spherical) to capture complex language data structures.

Ax Alejandro Francisco Queiruga 2/17/2026

Divine Benevolence is an $x^2$: GLUs scale asymptotically faster than MLPs

Theoretical analysis using numerical methods to explain why GLU variants scale better than MLPs in frontier LLMs, grounding empirical architectural choices in function approximation theory.

Ax Chaosheng Dong, Peiyao Xiao, Yijia Wang, Kaiyi Ji 2/17/2026

DeepMTL2R: A Library for Deep Multi-task Learning to Rank

Open-source framework for multi-task learning to rank using transformers and self-attention for multiple relevance criteria.

Ax Francesco Emanuele Stradi, Kalana Kalupahana, Matteo Castiglioni, Alberto Marchesi, Nicola Gatti 2/17/2026

Truly Adapting to Adversarial Constraints in Constrained MABs

Constrained MAB algorithm for non-stationary environments with unknown constraints under adversarial and stochastic settings.

Ax Qinqi Lin, Ningning Ding, Lingjie Duan, Jianwei Huang 2/17/2026

Governing AI Forgetting: Auditing for Machine Unlearning Compliance

Economic framework for auditing machine unlearning compliance with regulatory data deletion requirements.

Ax Shishir Sharma, Doina Precup, Theodore J. Perkins 2/17/2026

Fluid-Agent Reinforcement Learning

Framework for multi-agent RL with dynamic agent creation and reproduction, extending MARL beyond fixed agent counts.

Ax Qian Liyan, Zhang Yao, Yuan Ye, Zhang Zhaoke, Fang Jin, Jiang Shimiao, Zhang Jin, Li Ke, Liu Beijiang, Xu Chenglin, Zhang Yifan, Jia Xiaoqian, Qin Xiaoshuai, Huang Xingtao 2/17/2026

DCTracks: An Open Dataset for Machine Learning-Based Drift Chamber Track Reconstruction

Open dataset and benchmarks for ML-based drift chamber track reconstruction with GNNs.

Ax Isam Vrce, Andreas Kassler, G\"ok\c{c}e Aydos 2/17/2026

RNM-TD3: N:M Semi-structured Sparse Reinforcement Learning From Scratch

Semi-structured sparsity method (N:M) for training deep RL agents from scratch with hardware acceleration.

Ax Matteo Bollini, Gianmarco Genalti, Francesco Emanuele Stradi, Matteo Castiglioni, Alberto Marchesi 2/17/2026

Replicable Constrained Bandits

Study of algorithmic replicability in constrained multi-armed bandit problems for reproducible ML experiments.

Ax Minh Nguyen 2/17/2026

Decoupled Continuous-Time Reinforcement Learning via Hamiltonian Flow

Continuous-time reinforcement learning method using Hamiltonian flow for event-driven control problems.

Ax Tianyi Ma, Yiyang Li, Yiyue Qian, Zheyuan Zhang, Zehong Wang, Chuxu Zhang, Yanfang Ye 2/17/2026

OPBench: A Graph Benchmark to Combat the Opioid Crisis

Graph learning benchmark dataset for evaluating methods on opioid crisis prediction and intervention.

Ax Karim Galliamov, Syed M Ahsan Kazmi, Adil Khan, Ad\'in Ram\'irez Rivera 2/17/2026

Concepts' Information Bottleneck Models

Information bottleneck regularizer for concept bottleneck models to improve interpretability while maintaining accuracy.

Ax Rohit Raj Rai, Abhishek Dhaka, Amit Awekar 2/17/2026

Alignment Adapter to Improve the Performance of Compressed Deep Learning Models

Lightweight adapter aligns compressed DL model embeddings with original models to improve performance in resource-constrained deployment.

Ax Adri\'an Javaloy, Antonio Vergari 2/17/2026

An Embarrassingly Simple Way to Optimize Orthogonal Matrices at Scale

Optimization method for orthogonal matrix constraints in machine learning, improving upon Landing algorithm for scalability.

Ax Farzan Farnia, Mohammad Jalali, Azim Ospanov 2/17/2026

Exposing Diversity Bias in Deep Generative Models: Statistical Origins and Correction of Diversity Error

Analyzes and corrects diversity bias in deep generative models by comparing sample diversity to underlying data distribution.

Ax David Chanin, Adri\`a Garriga-Alonso 2/17/2026

SynthSAEBench: Evaluating Sparse Autoencoders on Scalable Realistic Synthetic Data

SynthSAEBench toolkit for large-scale synthetic benchmarking of sparse autoencoders with realistic feature characteristics.