Isolater - Feed

Ax Antonio \'Alvarez-L\'opez, Marcos Matabuena 3/26/2026

Continuous-Time Learning of Probability Distributions: A Case Study in a Digital Trial of Young Children with Type 1 Diabetes

Continuous-time learning framework for probability distributions applied to glucose monitoring in pediatric diabetes clinical trial.

Ax Jeonghye Kim, Xufang Luo, Minbeom Kim, Sangmook Lee, Dohyung Kim, Jiwon Jeon, Dongsheng Li, Yuqing Yang 3/26/2026

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Analysis of why self-distillation degrades LLM reasoning capability by suppressing epistemic verbalization and expression of uncertainty.

Ax Cursor Reseach, :, Aaron Chan, Ahmed Shalaby, Alexander Wettig, Aman Sanger, Andrew Zhai, Anurag Ajay, Ashvin Nair, Charlie Snell, Chen Lu, Chen Shen, Emily Jia, Federico Cassano, Hanpeng Liu, Haoyu Chen, Henry Wildermuth, Jacob Jackson, Janet Li, Jediah Katz, Jiajun Yao, Joey Hejna, Josh Warner, Julius Vering, Kevin Frans, Lee Danilek, Less Wright, Lujing Cen, Luke Melas-Kyriazi, Michael Truell, Michiel de Jong, Naman Jain, Nate Schmidt, Nathan Wang, Niklas Muennighoff, Oleg Rybkin, Paul Loh, Phillip Kravtsov, Rishabh Yadav, Sahil Shah, Sam Kottler, Alexander M Rush, Shengtong Zhang, Shomil Jain, Sriram Sankar, Stefan Heule, Stuart H. Sul, Sualeh Asif, Victor Rong, Wanqi Zhu, William Lin, Yuchen Wu, Yuri Volkov, Yury Zemlyanskiy, Zack Holbrook, Zhiyuan Zhang 3/26/2026

Composer 2 Technical Report

Composer 2 model specialized for agentic software engineering with long-term planning and coding abilities trained via continued pretraining and reinforcement learning.

Ax John Ray B. Martinez 3/26/2026

Multi-Agent Reasoning with Consistency Verification Improves Uncertainty Calibration in Medical MCQA

Multi-agent framework with verification for improving calibration and accuracy in medical multiple-choice question answering.

Ax Raju Chowdhury, Tanmay Sen, Prajamitra Bhuyan, Biswabrata Pradhan 3/26/2026

Trust Region Constrained Bayesian Optimization with Penalized Constraint Handling

Bayesian optimization method combining penalty formulation and trust region strategy for constrained black-box optimization.

Ax Saahil Mathur, Ryan David Rittner, Vedant Ajit Thakur, Daniel Stuart Schiff, Tunazzina Islam 3/26/2026

Retrieval Improvements Do Not Guarantee Better Answers: A Study of RAG for AI Policy QA

Study evaluating RAG systems on AI policy analysis showing retrieval improvements don't guarantee better answers on complex regulatory documents.

Ax Hao Wang, Zhichao Chen, Zhaoran Liu, Haozhe Li, Degui Yang, Xinggao Liu, Haoxuan Li 3/26/2026

Entire Space Counterfactual Learning for Reliable Content Recommendations

Counterfactual learning approach for conversion rate estimation in recommender systems addressing data sparsity and selection bias.

Ax Dmitrii Krylov, Armin Karamzade, Roy Fox 3/26/2026

Moonwalk: Inverse-Forward Differentiation

Inverse-forward differentiation method to reduce memory requirements for backpropagation by avoiding activation storage.

Ax Parsa Moradi, Behrooz Tahmasebi, Mohammad Ali Maddah-Ali 3/26/2026

Coded Computing for Resilient Distributed Computing: A Learning-Theoretic Framework

Learning-theoretic framework for coded computing in distributed systems to handle slow, faulty, or compromised servers.

Ax Jiancheng Xie, Lou C. Kohler Voinov, Noga Mudrik, Gal Mishne, Adam Charles 3/26/2026

Multiway Multislice PHATE: Visualizing Hidden Dynamics of RNNs through Training

Visualization technique for understanding RNN internal dynamics during training using multislice PHATE algorithm.

Ax Hao Wang, Zhichao Chen, Zhaoran Liu, Xu Chen, Haoxuan Li, Zhouchen Lin 3/26/2026

Proximity Matters: Local Proximity Enhanced Balancing for Treatment Effect Estimation

Statistical method for heterogeneous treatment effect estimation using local proximity constraints in observational data.

Ax Himanshu Pandey, Anshima Singh, Ratikanta Behera 3/26/2026

An efficient wavelet-based physics-informed neural network for multiscale problems

Physics-informed neural networks using wavelet decomposition to improve training on differential equations with rapid oscillations and steep gradients.

Ax Kaixi Bao, Chenhao Li, Yarden As, Andreas Krause, Marco Hutter 3/26/2026

Symmetry-Guided Memory Augmentation for Efficient Locomotion Learning

arXiv paper on Symmetry-Guided Memory Augmentation (SGMA) improving efficiency of RL-based legged locomotion training.

Ax Jimmy Gammell, Anand Raghunathan, Abolfazl Hashemi, Kaushik Roy 3/26/2026

Learning to Localize Leakage of Cryptographic Sensitive Variables

arXiv paper on machine learning techniques to detect and localize power/radiation leakage of cryptographic keys from hardware implementations.

Ax Yifeng Zhang, Yilin Liu, Ping Gong, Peizhuo Li, Mingfeng Fan, Guillaume Sartoretti 3/26/2026

Unicorn: A Universal and Collaborative Reinforcement Learning Approach Towards Generalizable Network-Wide Traffic Signal Control

arXiv paper on multi-agent reinforcement learning for adaptive traffic signal control in heterogeneous urban networks.

Ax Hao Xu, Xiangru Jian, Xinjian Zhao, Wei Pang, Chao Zhang, Suyuchen Wang, Qixin Zhang, Zhengyuan Dong, Joao Monteiro, Bang Liu, Qiuzhuang Sun, Tianshu Yu 3/26/2026

GraphOmni: A Comprehensive and Extensible Benchmark Framework for Large Language Models on Graph-theoretic Tasks

arXiv paper: GraphOmni benchmark framework evaluating LLM reasoning on graph-theoretic tasks with diverse formats and serializations.

Ax Christiaan Meijer, E. G. Patrick Bos 3/26/2026

Explainable embeddings with Distance Explainer

arXiv paper introducing Distance Explainer method for post-hoc interpretability of embedded vector spaces in ML models.

Ax Adnan Oomerjee, Zafeirios Fountas, Haitham Bou-Ammar, Jun Wang 3/26/2026

Bottlenecked Transformers: Periodic KV Cache Consolidation for Generalised Reasoning

arXiv paper on Bottlenecked Transformers: KV cache consolidation technique for scaling inference-time reasoning in LLMs.

Ax Marco Fumero, Luca Moschella, Emanuele Rodol\`a, Francesco Locatello 3/26/2026

Navigating the Latent Space Dynamics of Neural Models

arXiv paper interpreting neural networks as dynamical systems on latent manifolds, analyzing autoencoder vector fields.

Ax Chantal Pellegrini, Ege \"Ozsoy, David Bani-Harouni, Matthias Keicher, Nassir Navab 3/26/2026

EHR2Path: Scalable Modeling of Longitudinal Patient Pathways from Multimodal Electronic Health Records

arXiv paper on scalable longitudinal patient pathway modeling from multimodal EHR data using neural networks for condition forecasting.

Ax Kefan Song, Amir Moeini, Peng Wang, Lei Gong, Rohan Chandra, Shangtong Zhang, Yanjun Qi 3/26/2026

Reward Is Enough: LLMs Are In-Context Reinforcement Learners

Research paper demonstrating LLMs perform in-context reinforcement learning during inference. ICRL prompting framework enables inference-time self-improvement.

Ax Zhiyuan Zhao, Juntong Ni, Shangqing Xu, Haoxin Liu, Wei Jin, B. Aditya Prakash 3/26/2026

TimeRecipe: A Time-Series Forecasting Recipe via Benchmarking Module Level Effectiveness

TimeRecipe benchmarks module-level effectiveness of components in time-series forecasting architectures.

Ax Jinzhou Wu, Baoping Tang, Qikang Li, Yi Wang, Cheng Li, Shujian Yu 3/26/2026

When Brain Foundation Model Meets Cauchy-Schwarz Divergence: A New Framework for Cross-Subject Motor Imagery Decoding

Brain foundation model with Cauchy-Schwarz divergence for cross-subject motor imagery EEG decoding in BCIs.

Ax Parth Naik, Harikrishnan N B 3/26/2026

A Compression Based Classification Framework Using Symbolic Dynamics of Chaotic Maps

Classification framework using symbolic dynamics, chaotic maps, and data compression for pattern recognition.

Ax Omar Bekdache, Naresh Shanbhag 3/26/2026

DART: A Server-side Plug-in for Resource-efficient Robust Federated Learning

DART adds server-side robustness to federated learning for edge devices without expensive client-side computation.

Ax Yifan Hu, Jie Yang, Tian Zhou, Peiyuan Liu, Yujin Tang, Rong Jin, Liang Sun 3/26/2026

Bridging Past and Future: Distribution-Aware Alignment for Time Series Forecasting

TimeAlign uses contrastive learning and representation alignment for time series forecasting by bridging input-target distributions.

Ax Viktor Kovalchuk, Denis Son, Arman Bolatov, Mohsen Guizani, Samuel Horv\'ath, Maxim Panov, Martin Tak\'a\v{c}, Eduard Gorbunov, Nikita Kotelevskii 3/26/2026

Who to Trust? Aggregating Client Predictions in Federated Distillation

Theoretical analysis of federated distillation with weighted aggregation of client predictions under class mismatch.

Ax H. N. Mhaskar, Ryan O'Dowd 3/26/2026

A signal separation view of classification

Alternative classification approach using signal separation and trigonometric polynomial kernels for compact metric spaces.

Ax Suhyeon Lee, Jong Chul Ye 3/26/2026

PromptLoop: Plug-and-Play Prompt Refinement via Latent Feedback for Diffusion Model Alignment

PromptLoop refines prompts for diffusion models using sequential reinforcement learning feedback during sampling.

Ax Jo\v{z}e M. Ro\v{z}anec, Tina \v{Z}ezlin, Laurentiu Vasiliu, Dunja Mladeni\'c, Radu Prodan, Dumitru Roman 3/26/2026

Fiaingen: A financial time series generative method matching real-world data quality

Generative method for synthetic financial time series data to address data shortage in ML models for trading and investment.

Ax Sai Karthikeya Vemuri, Adithya Ashok Chalain Valapil, Tim B\"uchner, Joachim Denzler 3/26/2026

RamPINN: Recovering Raman Spectra From Coherent Anti-Stokes Spectra Using Embedded Physics

Physics-informed neural network for recovering Raman spectra from CARS measurements using scientific theory as inductive bias.

Ax Petrus Mikkola, Luigi Acerbi, Arto Klami 3/26/2026

Score-Based Density Estimation from Pairwise Comparisons

Develops score-based density estimation from pairwise comparisons for learning from human feedback and expert knowledge elicitation.

Ax Divyat Mahajan, Sachin Goyal, Badr Youbi Idrissi, Mohammad Pezeshki, Ioannis Mitliagkas, David Lopez-Paz, Kartik Ahuja 3/26/2026

Beyond Multi-Token Prediction: Pretraining LLMs with Future Summaries

Proposes future summary pretraining for LLMs as alternative to next-token prediction, addressing limitations in long-horizon reasoning and planning tasks.

Ax Zhiyuan Zhao, Haoxin Liu, B. Aditya Prakash 3/26/2026

Tackling Time-Series Forecasting Generalization via Mitigating Concept Drift

Addresses distribution shift in time-series forecasting by identifying concept drift and temporal shift, proposing mitigation strategies for generalization.

Ax Woo-Jin Ahn, Sang-Ryul Baek, Yong-Jun Lee, Hyun-Duck Choi, Myo-Taeg Lim 3/26/2026

OffSim: Offline Simulator for Model-based Offline Inverse Reinforcement Learning

OffSim proposes model-based offline inverse RL framework to learn environmental dynamics and reward functions from offline data without manual definition.

Ax Yu-Chen Kuo, Yi-Ju Tseng 3/26/2026

MedM2T: A MultiModal Framework for Time-Aware Modeling with Electronic Health Record and Electrocardiogram Data

MedM2T is a multimodal framework integrating sparse time series encoding and hierarchical fusion for healthcare data with electronic health records and ECG signals.

Ax Alvaro Prat, Leo Zhang, Charlotte M. Deane, Yee Whye Teh, Garrett M. Morris 3/26/2026

SigmaDock: Untwisting Molecular Docking With Fragment-Based SE(3) Diffusion

SigmaDock uses fragment-based SE(3) diffusion for molecular docking in drug discovery, improving upon generative approaches with better chemical plausibility.

Ax Donggyu Min, Seongjin Choi, Dong-Kyu Kim 3/26/2026

Deep Reinforcement Learning for Dynamic Origin-Destination Matrix Estimation in Microscopic Traffic Simulations Considering Credit Assignment

Applies deep RL to dynamic origin-destination matrix estimation in traffic simulations, addressing credit assignment across temporal vehicle dynamics.

Ax Zhixiong Zhao, Haomin Li, Fangxin Liu, Yuncheng Lu, Zongwu Wang, Tao Yang, Li Jiang, Haibing Guan 3/26/2026

QUARK: Quantization-Enabled Circuit Sharing for Transformer Acceleration by Exploiting Common Patterns in Nonlinear Operations

QUARK is an FPGA acceleration framework using quantization to exploit common patterns in transformer nonlinear operations for efficient inference.

Ax Sebasti\'an Andr\'es Cajas Ord\'o\~nez, Luis Fernando Torres Torres, Mackenzie J. Meni, Carlos Andr\'es Duran Paredes, Eric Arazo, Cristian Bosch, Ricardo Simon Carbajo, Yuan Lai, Leo Anthony Celi 3/26/2026

Uncertainty Makes It Stable: Curiosity-Driven Quantized Mixture-of-Experts

Proposes curiosity-driven quantized Mixture-of-Experts framework using Bayesian uncertainty for deploying neural networks on resource-constrained devices.

Ax Perceval Beja-Battais (CB), Alain Grosset\^ete (CB), Nicolas Vayatis (CB) 3/26/2026

Enhancing Nuclear Reactor Core Simulation through Data-Based Surrogate Models

Uses data-driven surrogate models to improve Model Predictive Control for nuclear reactor core simulation.

Ax Radman Rakhshandehroo, Daniel Coombs 3/26/2026

Reward Engineering for Spatial Epidemic Simulations: A Reinforcement Learning Platform for Individual Behavioral Learning

ContagionRL is a Gymnasium-compatible RL platform for reward engineering in spatial epidemic simulations, enabling systematic study of learned behavioral strategies.

Ax Haichen Hu, David Simchi-Levi 3/26/2026