Isolater - Feed

Ax Pedro Jim\'enez-Gonz\'alez, Miguel C. Soriano, Lucas Lacasa 6/30/2026

Scalar Representations of Neural Network Training Dynamics

Application of scalar embeddings to analyze neural network training trajectories as temporal networks for understanding optimization dynamics.

Ax Tianyu Wang, Gourav Rattihalli, Aditya Dhakal, Junbo Li, Zhiwei Ren, Dejan Milojicic, Longfei Shangguan 6/30/2026

Predict, Reuse, and Repair: Accelerating Dynamic Sparse Attention for Long-Context LLM Decoding

Speculate-reuse-repair runtime optimizing dynamic sparse attention for long-context LLM decoding by exploiting temporal locality in block selections.

Ax Lennart Purucker, Andrej Tschalzev, Nick Erickson, Gioia Blayer, David Holzm\"uller, Alan Arazi, Alexander Pfefferle, Mustafa Tajjar, Ga\"el Varoquaux, Frank Hutter 6/30/2026

Beyond IID: How General Are Tabular Foundation Models, Really?

Evaluation of tabular foundation models on diverse out-of-distribution tasks revealing limitations of current benchmark protocols and model generalization.

Ax Alexis Jacq, Guillaume Couairon, Valentin De Bortoli, Quentin Berthet, Arnaud Doucet, Romuald Elie 6/30/2026

Diffusion Fine-tuning with Rewarded Moment Matching Distillation

Framework combining diffusion model distillation with RL fine-tuning via Rewarded Moment Matching Distillation to improve generative quality.

Ax Jinda Lu, Kexin Huang, Junkang Wu, Shuo Yang, Jinghan Li, Chiyu Ma, Shaohang Wei, Xiang Wang, Guoyin Wang, Jingren Zhou 6/30/2026

Experience Augmented Policy Optimization for LLM Reasoning

Method for improving LLM reasoning via reinforcement learning with verifiable rewards by reusing accumulated experience rather than on-policy optimization from scratch.

Ax Ran Canetti, Shafi Goldwasser, Or Zamir 6/30/2026

Proofs of Ownership for Machine Learning Models

Formal framework for proving ML model ownership through game-theoretic analysis between model owner, thief, and judge.

Ax Liang Wang, Zhaoyang Xi, Zekai Xiang, Heng Meng, Qishan Zhang, Pingyi Zhou, Jin Liu, Litao Chen 6/30/2026

Arko-T: A Foundation Model for Text-to-Structured 3D Generation

4B-parameter text-to-design model generating executable parametric CAD programs from natural language descriptions for mechanical part design.

Ax Huaqing Zhang, Jingchu Gai, Juno Kim, Bingbin Liu, Andrej Risteski 6/30/2026

When Does Online Imitation Learning Help in LLM Post-Training? The Role of (Non-)Realizability Beyond Horizon

Research on when online imitation learning improves LLM post-training, showing benefits depend on realizability rather than error accumulation reduction.

Ax Max Fomin, Elad David, Amit LeVi 6/30/2026

Internal-State Probes Read the Situation, Not the Action: Three Negative Results for Pre-Action Misalignment Monitoring

Study of internal-state probes for monitoring AI agents, finding they read situation context rather than enabling pre-action misalignment detection across model families.

Ax Myung Jun Kim, Maximilian Schambach, Frank Essenberger, Andre Sres, Johannes H\"ohne 6/30/2026

Exploring Differences Between Tabular Enterprise Data and Public Benchmarks

Analysis of statistical characteristics and performance measurements of enterprise tabular data versus public ML benchmarks for business applications.

Ax Songxin Zhang, Zejian Xie, Zhuoyang Song, Cong lin, Junyu Lu, Jiaxing Zhang, Bingyi Jing 6/30/2026

HSAP: A Hierachical Sequence-aware Parallelism for Hybrid-Context Generative Models

Proposes HSAP sequence parallelism framework for hybrid-context packed sequences in large language models, fixing cross-contamination in causal attention.

Ax Thai-Khanh Nguyen, Ngoc-Bich-Uyen Vo, Thieu N. Vo, Tan M. Nguyen, Cuong Pham 6/30/2026

MuonSSM: Orthogonalizing State Space Models for Sequence Modeling

Introduces MuonSSM framework stabilizing state space models for long-sequence modeling by conditioning update geometry rather than recurrent weights.

Ax Davide Domini, Gianluca Aguzzi, Ivana Dusparic, Danilo Pianini, Mirko Viroli 6/30/2026

Discovering Collaboration from Novelty: Random Network Distillation for Clustered Federated Learning

Proposes lightweight approach for clustered federated learning using random network distillation to discover client collaborations without coupling cluster assignment to training.

Ax Mark Rhee, Jamie Simon, Dhruva Karkada 6/30/2026

Muon learns balanced solutions in matrix factorization without slow saddle-to-saddle dynamics

Studies Muon optimizer dynamics on matrix factorization problems, showing it avoids slow saddle-to-saddle transitions compared to gradient descent.

Ax Srinivasa Rao P., Vangmayi P Reddy 6/30/2026

Informational Frustration in Neural Manifolds: Shannon Bottlenecks and the Limits of Learnability

Theoretical framework linking information theory and topology to explain generalization in overparameterized deep networks, addressing theory-practice gap.

Ax Woojoo Na, Jennifer Dy 6/30/2026

ITSPACE: Monotone Gaussian Optimal Transport Updates

Proposes ITSPACE algorithm for optimal transport updates on covariance matrices using Bures-Wasserstein distance for domain adaptation.

Ax Matan Schliserman, Gon Buzaglo, Itay Evron, Daniel Soudry 6/30/2026

Convergence of Continual Learning in Homogeneous Deep Networks

Theoretical analysis of convergence properties in continual learning with deep networks, characterizing sequential projections onto task margin sets.

Ax Kan Zhu, Mathew Jacob, Chenxi Ma, Yi Pan, Stephanie Wang, Arvind Krishnamurthy, Baris Kasikci 6/30/2026

TraceLab: Characterizing Coding Agent Workloads for LLM Serving

TraceLab characterizes real-world coding agent workloads and LLM serving patterns across multiple models for systems optimization.

Ax Ting-Wen Ko, Jonas Geiping 6/30/2026

Attractor States Emerge in Multi-Turn LLM Conversations

Study analyzing attractor state emergence in multi-turn LLM conversations, showing topic-independent stable behaviors in debate interactions.

Ax Mohit Raghavendra, Anisha Gunjal, Aakash Sabharwal, Yunzhong He 6/30/2026

SWE-INTERACT: Reimagining SWE Benchmarks as User-Driven Long-Horizon Coding Sessions

SWE-Interact testbed evaluating coding agents on multi-turn interactive tasks with progressive user requirements instead of complete upfront specifications.

Ax Haoran Jin, Xiting Wang, Shijie Ren, Hong Xie, Defu Lian 6/30/2026

C$^{2}$R: Cross-sample Consistency Regularization Mitigates Feature Splitting and Absorption in Sparse Autoencoders

Cross-sample Consistency Regularization method addressing feature splitting and absorption problems in Sparse Autoencoders for LLM interpretation.

Ax Subramanyam Sahoo, Aman Chadha, Vinija Jain, Divya Chaudhary 6/30/2026

Pessimism's Paradox: Conservative Offline Training Amplifies Reward Hacking During Online Adaptation in Reasoning Models

Study showing conservative offline training paradoxically amplifies reward hacking in reasoning models during online adaptation with DPO.

Ax Philip Zmushko, Egor Petrov, Nursultan Abdullaev, Mikhail Khrushchev, Samuel Horv\'ath 6/30/2026

One-Step Gradient Delay is Not a Barrier for Large-Scale Asynchronous Pipeline Parallel LLM Pretraining

Analysis showing one-step gradient delay doesn't hinder asynchronous pipeline parallelism for large-scale LLM pretraining with PipeDream-2BW.

Ax Xuanfan Ni, Liyan Xu, Chenyang Lyu, Longyue Wang, Mo Yu, Lemao Liu, Fandong Meng, Jie Zhou, Piji Li 6/30/2026

ReFreeKV: Towards Threshold-Free KV Cache Compression

ReFreeKV method for KV cache compression in LLM inference without requiring pre-determined domain-specific thresholds.

Ax Daoming Wan, Yizheng Huang, Jimmy X. Huang 6/30/2026

TextClusterLab: An Integrated Framework for Reliable Text Clustering Studies

TextClusterLab framework for reliable evaluation of text clustering algorithms addressing dataset quality and semantic boundary challenges.

Ax Yichuan Wang, Zhifei Li, Zirui Wang, Paul Teiletche, Lesheng Jin, Matei Zaharia, Joseph E. Gonzalez, Sewon Min 6/30/2026

PIXELRAG: Web Screenshots Beat Text for Retrieval-Augmented Generation

PixelRAG method for retrieval-augmented generation using website screenshots in pixel space instead of parsed text for improved context.

Ax Charles L. Wang, Keir Dorchen, Peter Jin 6/30/2026

Agentic Safety is an Epistemic Property, Not a Behavioral One

Argues AI agent safety is epistemic property dependent on system correctability during learning, not just current behavior snapshots.

Ax Irene Strauss, Alexandra Butoi, Ryan Cotterell 6/30/2026

Generating in the Limit with Infinitely Many Hallucinations

Theoretical framework studying language generation in the limit and hallucinations as unavoidable consequence of learning.

Ax Santosh Jaiswal 6/30/2026

Zero-Label Driving Scenario Complexity Detection via Joint Embedding Predictive Architecture

Unsupervised method for detecting complex driving scenarios using Joint Embedding Predictive Architecture without labels.

Ax Jiasheng Wang, Tanun Jitwatcharakomol, Piyawadee Jongpradubgiat, Simeng Zhu 6/30/2026

RADIANT-PET: Reasoning-Augmented PET/CT Lesion Segmentation with Large Language Models and Reinforcement Learning

RADIANT-PET framework combining segmentation models with LLM adjudication for improved lesion segmentation in PET/CT medical imaging.

Ax Emily Bejerano, Federico Tondolo, Devang Gupta, Aaron Mano Cherian, Taeyoo Kim, Ayaan Qayyum, Xiaofan Yu, Xiaofan Jiang 6/30/2026

RadarTwin: Scene-Specific mmWave Radar Simulation and Learning for Mobile Indoor Perception

RadarTwin framework for generating synthetic mmWave radar training data using 3D reconstruction and vision-language models for mobile perception.

Ax Can Demircan, Marcel Binz, Alireza Modirshanechi, Eric Schulz 6/30/2026

Meta-learning as a principle for human-like visual representations

Research proposing meta-learning as principle for human-like visual representations in neural networks to support open-ended task flexibility.

Ax Yunhun Nam, Jongheon Jeong 6/30/2026

Vision-driven Preference Synthesis for Mitigating Hallucinations in VLMs

Method to reduce hallucinations in Vision-Language Models using preference alignment constructed from vision-driven synthesis rather than intervention-based approaches.

Ax Bruno Caro-V\'asquez, Carola Figueroa-Flores, Gast\'on Marquez 6/30/2026

Reinforcement Learning for Software Vulnerability Analysis: A Systematic Review with Emphasis on C/C++ Source Code and Static Analysis

Systematic review of reinforcement learning techniques for C/C++ vulnerability detection and static analysis following PRISMA guidelines.

Ax Jonghyeon Park, Olivier Jiyoun Jung, Myungwoo Oh 6/30/2026

LoRA-Tuned Large Language Models for Dementia Detection via Multi-View Speech-Derived Features

LoRA fine-tuning of LLMs for dementia detection using multi-modal speech features with automatic speech recognition transcripts.

Ax Yang Liu, Yuming Chen 6/30/2026

Event-Conditioned Diagnostics of Kinematic, Contact, and Object-Permanence Fields in Passive Object-State World Models

Research on how world models organize physical information in latent representations using diagnostic protocols for passive object-state prediction.

Ax Chanju Park, Dario Bocchi, Francesco D'Amico, Biagio Lucini, Gert Aarts 6/30/2026

Spectral phase transitions and trainability in neural network learning dynamics

Theoretical analysis of spectral phase transitions in neural network weight matrices during SGD training.

Ax Kevin Der, Harish Kamath, Ben Thompson 6/30/2026

Turn-Averaged SAEs for Feature Discovery and Long-Context Attribution

Turn-averaged sparse autoencoders for interpretable feature extraction in language models with long context.

Ax Matteo Farina, Vishaal Udandarao, Thao Nguyen, Selim Kuzucu, Maximilian B\"other, Andreas Hochlehnert, Adhiraj Ghosh, Marianna Nezhurina, Karsten Roth, Joschka Struber, Yuhui Zhang, Sebastian Dziadzio, Elaine Sui, Soumya Jahagirdar, Dhruba Ghosh, Hasan Hammoud, Thomas De Min, Simone Caldarella, Jehanzeb Mirza, Sedrick Keh, Mehdi Cherti, Hilde Kuehne, Bernt Schiele, Serena Yeung-Levy, Muhammad Ferjad Naeem, Federico Tombari, Ana Klimovic, Elisa Ricci, Matthias Bethge, Sewoong Oh, Ameya Prabhu, Alessio Tonioni, Jenia Jitsev, Massimiliano Mancini, Ludwig Schmidt, Nikhil Parthasarathy 6/30/2026

DataComp-VLM: Improved Open Datasets for Vision-Language Models

DataComp benchmark for evaluating vision-language model dataset curation strategies with 160 open datasets.

Ax Chad A. Capps 6/30/2026

Depth-Staggered Fibonacci Spacing for Sparse Attention: Static Schedules Beat Learned Dilation and Extrapolate Where Dense Attention Fails

Study of sparse attention mechanisms using Fibonacci-spaced offsets in language models with depth-based scheduling.

Ax Binh Nguyen, Colleen Josephson, Mircea Teodorescu, Gert Cauwenberghs, Jason Eshraghian 6/30/2026