A Depth-Aware Comparative Study of Euclidean and Hyperbolic Graph Neural Networks on Bitcoin Transaction Systems
Comparison of Euclidean and hyperbolic graph neural networks for analyzing Bitcoin transaction networks and fraud detection.
Analysis showing noisy data significantly degrades reinforcement learning with verifiable rewards despite claims of robustness.
Constrained reinforcement learning approach for hierarchical instruction following in LLMs with priority-ordered system prompts.
Experience replay mechanism preserving diversity in on-policy reinforcement learning for LLM reasoning using Jensen-Shannon divergence.
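As a rough illustration of the diversity measure named in this entry (not the paper's actual replay mechanism), the Jensen-Shannon divergence between two discrete distributions can be computed as follows; the function name and smoothing constant are my own choices:

```python
import numpy as np

def js_divergence(p, q, eps=1e-12):
    """Jensen-Shannon divergence between two discrete distributions.

    JSD(P || Q) = 0.5 * KL(P || M) + 0.5 * KL(Q || M), with M = (P + Q) / 2.
    Symmetric in P and Q, and bounded by log(2) under the natural log.
    """
    p = np.asarray(p, dtype=float) + eps  # smooth to avoid log(0)
    q = np.asarray(q, dtype=float) + eps
    p /= p.sum()
    q /= q.sum()
    m = 0.5 * (p + q)
    kl = lambda a, b: np.sum(a * np.log(a / b))
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)
```

In a replay setting, such a score could compare the answer distribution of a stored trajectory against the current policy's, keeping the buffer diverse by retaining high-divergence samples.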
Credit assignment method using execution traces to improve GRPO performance in code generation tasks with verifiable rewards.
Study of specialized pretraining strategy using domain data during pretraining to improve finetuning performance and reduce forgetting.
Transfer learning method for adapting drug-response prediction models from cell lines to patient tumors.
Fine-tuning approach for improving mathematical reasoning in LLMs by optimizing exploration-aware trajectories with verifiable rewards.
Dual consensus mechanism for improving reinforcement learning from verifiable rewards in LLMs, avoiding convergence to spurious answers.
Physics-informed neural network surrogate model for fluid dynamics simulation near solid boundaries.
Foundation model approach for EEG analysis using latent prediction training for brain-computer interfaces and clinical applications.
Study of how large reasoning models use backtracking and self-verification to detect and correct errors in complex logical reasoning tasks.
Research on steering behavior in 35B MoE language models using sparse autoencoders and probe vectors to identify and control agentic traits.
MLP architecture with learned structural dropout and input-dependent gating for conditional computation and regularization.
Federated learning framework for non-IID distributed scenarios using generative one-shot learning without foundation model dependencies.
Spectral initialization method for neural networks designed for function parameterization using prior information.
Study on bias mitigation and generalization in age prediction models using causal analysis and invariant representations.
Methods for adding persistent memory to frozen encoder-decoder LLMs using continuous latent space adapters for multi-session learning.
Solver for distributional counterfactual explanations using optimal transport with statistical certification for model interpretability.
LLM compression method using capability-guided budget allocation that interprets what model components encode before pruning.
Optimal uncertainty bounds for multivariate kernel regression using Gaussian processes with applications to safe learning-based control.
High-frequency time series dataset at millisecond resolution for training and evaluating time series foundation models.
Test-time scaling and confidence calibration strategy using internal model information for improved reinforcement learning.
Foundation model for structured data with linear complexity for handling extremely large datasets in healthcare, finance, and e-commerce.
Unsupervised autoencoder regularization by aligning pairwise distances between latent and input spaces on learned manifolds.
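A minimal sketch of the pairwise-distance alignment idea in this entry, written in NumPy with my own normalization choice (dividing each distance matrix by its mean to make the comparison scale-invariant); the paper's exact regularizer may differ:

```python
import numpy as np

def pairwise_dists(X):
    """Euclidean distance matrix between the rows of X."""
    sq = np.sum(X**2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * X @ X.T
    return np.sqrt(np.maximum(d2, 0.0))  # clip tiny negatives from roundoff

def distance_alignment_loss(x_batch, z_batch):
    """Penalize mismatch between normalized pairwise distances in input
    space and latent space; would be added to the reconstruction loss."""
    dx = pairwise_dists(x_batch)
    dz = pairwise_dists(z_batch)
    dx /= dx.mean() + 1e-12  # scale-invariant comparison
    dz /= dz.mean() + 1e-12
    n = dx.shape[0]
    return np.sum((dx - dz) ** 2) / (n * (n - 1))
```

The loss is zero whenever the latent codes are a rigid or uniformly scaled copy of the inputs, and grows as the encoder distorts the batch's geometry.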
Deep learning methods for tabular data using representation correction to improve on in-context-learning and pretraining paradigms.
Analysis of when unsupervised reinforcement learning succeeds in LLM mathematical reasoning, addressing scalability of outcome-based RL.
Time reparameterization technique for improving machine learning reduced-order models of stiff dynamical systems.
Gaussian process classifier for multi-class problems using simplex geometry and Aitchison geometry for probability calibration.
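For context on the Aitchison geometry this entry invokes (a sketch of the standard transform, not the paper's classifier), the centered log-ratio (clr) map takes probability vectors on the simplex to zero-sum vectors in Euclidean space, where Gaussian-process machinery applies:

```python
import numpy as np

def clr(p, eps=1e-12):
    """Centered log-ratio transform: probability vector on the simplex
    -> zero-sum vector in Euclidean space (Aitchison geometry)."""
    logp = np.log(np.asarray(p, dtype=float) + eps)
    return logp - logp.mean()

def clr_inverse(y):
    """Softmax maps a clr vector back onto the simplex."""
    e = np.exp(y - y.max())  # subtract max for numerical stability
    return e / e.sum()
```

The round trip `clr_inverse(clr(p))` recovers `p`, so calibration can be done in the unconstrained clr space and mapped back to probabilities.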
Method for discrete reasoning using self-aware Markov models that correct errors in masked diffusion models through adaptive denoising.
Study on how Transformers develop internal geometric representations of grid-world environments through next-token prediction.
Research on matrix inversion updates for streaming outlier detection using Christoffel function and online learning.
Finsler geometry method for trajectory inference incorporating discrete, directed lineage priors in dynamical systems.
Deep-kernel-learning BEACON framework for automated microscopy discovery using novelty-driven target-space search.
Federated learning models predicting postoperative complications using multi-center healthcare data while preserving privacy.
Study showing chain-of-thought prompting degrades uncertainty quantification in vision-language models despite improving reasoning.
GeMA latent manifold benchmarking method for complex systems like rail networks using machine learning frontier estimation.
Analysis of quantized optimizer states in LLM pre-training, studying state staleness and effectiveness of reset strategies.
SpecMoE mixture-of-experts foundation model for cross-species EEG decoding with spectral and temporal signal analysis.
Bayesian hierarchical model inferring psychometric variables from neural and behavioral data in implicit association tests.
Contextual bandit algorithm combining dense arm features, non-linear rewards, and time-varying correlation for recommendations.
pADAM generative framework learning shared probabilistic priors across heterogeneous PDE families for multi-physics simulation.
SOMP algorithm for scaling gradient inversion attacks on LLMs, revealing privacy risks from shared gradients in large batch settings.
Conservative stochastic control framework for treatment optimization from irregularly sampled medical patient trajectories.
Method using adaptive moment estimation to stabilize guided diffusion sampling for image restoration and generation tasks.
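As a minimal illustration of adaptive moment estimation applied to a gradient sequence (the Adam update rule itself, not this paper's diffusion-guidance scheme; the function name and defaults are mine):

```python
import numpy as np

def adam_smoothed_steps(grads, lr=0.1, beta1=0.9, beta2=0.999, eps=1e-8):
    """Return Adam-style smoothed updates for a sequence of gradients:
    exponential moving averages of the first and second moments with
    bias correction, as in the Adam optimizer."""
    m = np.zeros_like(np.asarray(grads[0], dtype=float))
    v = np.zeros_like(m)
    steps = []
    for t, g in enumerate(grads, start=1):
        g = np.asarray(g, dtype=float)
        m = beta1 * m + (1 - beta1) * g        # first-moment EMA
        v = beta2 * v + (1 - beta2) * g ** 2   # second-moment EMA
        m_hat = m / (1 - beta1 ** t)           # bias correction
        v_hat = v / (1 - beta2 ** t)
        steps.append(lr * m_hat / (np.sqrt(v_hat) + eps))
    return steps
```

The EMAs damp step-to-step noise in the guidance signal while the second moment rescales its magnitude, which is the stabilizing effect the entry alludes to.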
Research on Gaussian mean estimation under realizable contamination with missing data patterns.
RaDAR: relation-aware diffusion contrastive learning for sparse collaborative filtering recommendations.
Stochastic resetting mechanism accelerates policy convergence in reinforcement learning on tabular environments.
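A toy sketch of stochastic resetting in tabular Q-learning (a generic chain-MDP illustration under my own parameter choices, not the paper's experimental setup): with small probability per step the agent is teleported back to the start state, cutting short unrewarding excursions.

```python
import random

def q_learning_with_resetting(n_states=6, episodes=300, reset_prob=0.05,
                              alpha=0.5, gamma=0.95, eps=0.2, seed=0):
    """Tabular Q-learning on a 1-D chain with the goal at the right end.
    With probability reset_prob per step, the agent is teleported back
    to state 0 (stochastic resetting) after the Q update."""
    rng = random.Random(seed)
    Q = [[0.0, 0.0] for _ in range(n_states)]  # actions: 0 = left, 1 = right
    for _ in range(episodes):
        s = 0
        while s != n_states - 1:
            # epsilon-greedy action selection
            a = rng.randrange(2) if rng.random() < eps else (0 if Q[s][0] > Q[s][1] else 1)
            s2 = max(s - 1, 0) if a == 0 else min(s + 1, n_states - 1)
            r = 1.0 if s2 == n_states - 1 else 0.0
            Q[s][a] += alpha * (r + gamma * max(Q[s2]) - Q[s][a])
            s = 0 if rng.random() < reset_prob else s2  # stochastic reset
    return Q
```

After training, the greedy policy moves right from every non-terminal state; the reset only relocates the agent and never corrupts the update target, so the learned values stay consistent.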
Dynamic meta-layer aggregation defends federated learning against Byzantine adversaries and untargeted attacks.
Gauge-invariant spectral transformers for scalable graph neural operators maintaining symmetry in inductive learning.