Isolater - Feed

Ax Anton Altenbernd, Philipp Wiesner, Odej Kao 4/2/2026

Exploring Silent Data Corruption as a Reliability Challenge in LLM Training

Analysis of silent data corruption during LLM training on hardware, studying gradient corruption impacts and detection mechanisms.

Ax Björn Roman Kohlberger (EctoSpace, Dublin, Ireland) 4/2/2026

Spectral Compact Training: Pre-Training Large Language Models via Permanent Truncated SVD and Stiefel QR Retraction

Spectral Compact Training method reduces LLM training memory footprint by replacing dense weight matrices with truncated SVD factors.

Ax Sayed Hashim, Frank Soboczenski, Paul Cairns 4/2/2026

BioCOMPASS: Integrating Biomarkers into Transformer-Based Immunotherapy Response Prediction

Transformer-based model with biomarkers for immunotherapy response prediction, improving generalization across diverse cancer datasets.

Ax Lala Shakti Swarup Ray, Mengxi Liu, Alcina Pinto, Deepika Gurung, Daniel Geissler, Paul Lukowicz, Bo Zhou 4/2/2026

ActivityNarrated: An Open-Ended Narrative Paradigm for Wearable Human Activity Understanding

Open-ended narrative framework for wearable human activity recognition using compositional, unscripted activities instead of closed-set classification.

Ax Swapnil Parekh 4/2/2026

Thinking Wrong in Silence: Backdoor Attacks on Continuous Latent Reasoning

ThoughtSteer backdoor attack exploiting continuous reasoning in language models that operate silently in hidden states without token output.

Ax Nikita Gabdullin, Ilya Androsov 4/2/2026

Using predefined vector systems to speed up neural network multimillion class classification

Method to reduce neural network multi-class classification complexity from O(n) to O(1) by leveraging known latent space geometry properties.

Ax Dharma Teja Vooturi, Dhiraj Kalamkar, Dipankar Das, Bharat Kaul 4/2/2026

Scalable Pretraining of Large Mixture of Experts Language Models on Aurora Super Computer

Optimus training library for pretraining mixture-of-experts LLMs at exascale on the Aurora supercomputer, demonstrating scaling to thousands of GPU tiles.

Ax Yuchang Jiang, Jan Dirk Wegner, Vivien Sainte Fare Garnot 4/2/2026

MIRANDA: MId-feature RANk-adversarial Domain Adaptation toward climate change-robust ecological forecasting with deep learning

Deep learning method for plant phenology prediction using domain adaptation to improve climate change forecasting in ecological systems.

Ax Martin Jaraiz 4/2/2026

Cost-Penalized Fitness in FMA-Orchestrated Mixture of Experts: Experimental Evidence for Molecular Memory in Domain Adaptation

Experimental evaluation of Free-Market Algorithm orchestrated Mixture-of-Experts with cost-penalized fitness for domain adaptation.

Ax Yuhang Li, Donghyun Lee, Ruokai Yin, Priyadarshini Panda 4/2/2026

Optimal Brain Decomposition for Accurate LLM Low-Rank Approximation

Optimal decomposition technique for low-rank approximation of LLM weights enabling efficient fine-tuning and inference.

Ax Zhanzhi Lou, Hui Chen, Yibo Li, Qian Wang, Bryan Hooi 4/2/2026

Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies

Method for language agents to optimize test-time adaptation policies through iterative refinement during inference.

Ax Huaiyang Wang, Xiaojie Li, Deqing Wang, Haoyi Zhou, Zixuan Huang, Yaodong Yang, Jianxin Li, Yikun Ban 4/2/2026

Policy Improvement Reinforcement Learning

Reinforcement learning approach with verification for iteratively improving LLM policies based on actual performance gains.

Ax Zheng Zhang, Cuong C. Nguyen, David Rosewarne, Kevin Wells, Gustavo Carneiro 4/2/2026

Fatigue-Aware Learning to Defer via Constrained Optimisation

Framework for human-AI cooperation that models fatigue-induced performance degradation in learning-to-defer systems.

Ax Antonin Sulc 4/2/2026

Event Embedding of Protein Networks: Compositional Learning of Biological Function

Compositional embedding method for protein networks using additive sequence models on biological interaction data.

Ax Haorui Ma, Dennis Frauen, Valentyn Melnychuk, Stefan Feuerriegel 4/2/2026

Orthogonal Learner for Estimating Heterogeneous Long-Term Treatment Effects

Orthogonal learning approach for estimating heterogeneous long-term treatment effects combining experiments and observational data.

Ax Hsin-Ling Hsu, Min-Yu Chen, Nai-Chia Chen, Yan-Ru Chen, Yi-Ling Chang, Fang Yu 4/2/2026

WARP: Guaranteed Inner-Layer Repair of NLP Transformers

Method for verifiable repair of transformer vulnerabilities to adversarial perturbations with inner-layer guarantees.

Ax Ruijie Hao, Longfei Zhang, Yang Dai, Yang Ma, Xingxing Liang, Guangquan Cheng 4/2/2026

Flow-based Policy With Distributional Reinforcement Learning in Trajectory Optimization

Flow-based reinforcement learning policy with distributional approach for capturing multimodal solutions in trajectory optimization.

Ax Nikolai Merkel, Ruben Mayer, Volker Markl, Hans-Arno Jacobsen 4/2/2026

EmbedPart: Embedding-Driven Graph Partitioning for Scalable Graph Neural Network Training

Graph partitioning technique using embeddings to enable scalable distributed training of graph neural networks.

Ax Rafael Sojo, Pedro Larrañaga, Concha Bielza 4/2/2026

Transfer learning for nonparametric Bayesian networks

Transfer learning methodologies for Bayesian network structure learning with scarce data.

Ax Philip Jordan, Maryam Kamgarpour 4/2/2026

Model-Based Learning of Near-Optimal Finite-Window Policies in POMDPs

Model-based learning approach for finite-window policies in partially observable Markov decision processes.

Ax Zhichen Liu, Tianle Lun, Zhibin Wen, Hao An, Yulin Ou, Jianhui Xu, Hao Zhang, Wenyi Fang, Yang Zheng, Yang Xu 4/2/2026

Fast and Accurate Probing of In-Training LLMs' Downstream Performances

Method for efficiently evaluating LLM downstream performance during training without expensive full inference.

Ax Jinzhao Li, Nan Jiang, Yexiang Xue 4/2/2026

Approximating Pareto Frontiers in Stochastic Multi-Objective Optimization via Hashing and Randomization

Algorithmic approach to multi-objective optimization via hashing and randomization for identifying Pareto frontiers.

Ax Kazuya Takabatake, Shotaro Akaho 4/2/2026

Reconsidering Dependency Networks from an Information Geometry Perspective

Theoretical analysis of dependency networks from an information-geometry perspective for modeling complex systems.

Ax Zhantao Chen, Dongyi He, Jin Fang, Xi Chen, Yisuo Liu, Xiaozhen Zhong, Xuejun Hu 4/2/2026

Toward Personalized Darts Training: A Data-Driven Framework Based on Skeleton-Based Biomechanical Analysis and Motion Modeling

Data-driven sports training framework using skeleton-based biomechanical analysis and motion modeling for dart throwing.

Ax Xiangpeng Li, Yu-Hsuan Ho, Sam D Brody, Ali Mostafavi 4/2/2026

Property-Level Flood Risk Assessment Using AI-Enabled Street-View Lowest Floor Elevation Extraction and ML Imputation Across Texas

AI pipeline extracting building elevation data from street-view imagery with ML imputation for flood risk assessment.

Ax Gleb Rodionov 4/2/2026

Reasoning Shift: How Context Silently Shortens LLM Reasoning

Analysis showing how irrelevant context degrades LLM reasoning performance despite test-time scaling capabilities.

Ax Kai Nelson, Tobias Kreiman, Sergey Levine, Aditi S. Krishnapriyan 4/2/2026

Bridging the Simulation-to-Experiment Gap with Generative Models using Adversarial Distribution Alignment

Generative model approach using adversarial distribution alignment to bridge simulation-to-experiment gap in scientific domains.

Ax Cai Zhou, Zekai Wang, Menghua Wu, Qianyu Julie Zhu, Flora C. Shi, Chenyu Wang, Ashia Wilson, Tommi Jaakkola, Stephen Bates 4/2/2026

Online Reasoning Calibration: Test-Time Training Enables Generalizable Conformal LLM Reasoning

ORCA framework calibrating LLM sampling through conformal prediction to improve test-time reasoning efficiency and generalization.

Ax Prasanjit Dey, Soumyabrata Dev, Angela Meyer, Bianca Schoen-Phelan 4/2/2026

NeuroDDAF: Neural Dynamic Diffusion-Advection Fields with Evidential Fusion for Air Quality Forecasting

Physics-informed neural network combining diffusion-advection with evidential fusion for air quality forecasting.

Ax Ken M. Nakanishi 4/2/2026

Screening Is Enough

Multiscreen mechanism for language models enabling absolute query-key relevance assessment beyond relative attention redistribution.

Ax Youssef Mroueh, Carlos Fonseca, Brian Belgodere, David Cox 4/2/2026

CliffSearch: Structured Agentic Co-Evolution over Theory and Code for Scientific Algorithm Discovery

CliffSearch agent framework for scientific algorithm discovery combining LLM-guided search with structured evolution of theory and code.

Ax Piyush Garg, Diana R. Gergel, Andrew E. Shao, Galen J. Yacalis 4/2/2026

The Recipe Matters More Than the Kitchen: Mathematical Foundations of the AI Weather Prediction Pipeline

Mathematical framework analyzing what determines forecast skill in AI weather prediction, emphasizing training methodology over architecture.

Ax Yuxuan Bao, Xingyue Zhang, J. Nathan Kutz 4/2/2026

LAtent Phase Inference from Short time sequences using SHallow REcurrent Decoders (LAPIS-SHRED)

LAPIS-SHRED method for reconstructing spatio-temporal dynamics from sparse observations using shallow recurrent decoders.

Ax Shikhar Bharadwaj, Chin-Jou Li, Kwanghee Choi, Eunjung Yeo, William Chen, Shinji Watanabe, David R. Mortensen 4/2/2026

An Empirical Recipe for Universal Phone Recognition

PhoneticXEUS model for robust multilingual phone recognition trained on large-scale data with pretrained representations.

Ax Wanxin Li, Denver McNeney, Nivedita Prabhu, Charlene Zhang, Renee Barr, Matthew Kitching, Khanh Dao Duc, Anthony S. Boyce 4/2/2026

Scalable Identification and Prioritization of Requisition-Specific Personal Competencies Using Large Language Models

LLM-based recruitment tool identifying requisition-specific competencies through dynamic few-shot prompting and reflection.

Ax Kyunghoon Hur, Heeyoung Kwak, Jinsu Jang, Nakhwan Kim, Edward Choi 4/2/2026

Multi-lingual Multi-institutional Electronic Health Record based Predictive Model

Text-based harmonization approach using LLMs to unify multi-institutional EHR data without explicit schema standardization.

Ax Nabeel Ahmad Saidd 4/2/2026

Decomposable Reward Modeling and Realistic Environment Design for Reinforcement Learning-Based Forex Trading

Modular RL framework with decomposable reward modeling and realistic environment design for Forex trading applications.

Ax Ernest Fokoué, Gregory Babbitt, Yuval Levental 4/2/2026

Isomorphic Functionalities between Ant Colony and Ensemble Learning: Part II-On the Strength of Weak Learnability and the Boosting Paradigm

Mathematical analysis establishing isomorphism between ant colony behavior and ensemble learning methods like boosting.

Ax Christin Pagels, Simon Hacks, Rob Henk Bemthuis 4/2/2026