Isolater - Feed

Ax Simone Betteti, Luca Laurenti 4/3/2026

Hybrid Energy-Based Models for Physical AI: Provably Stable Identification of Port-Hamiltonian Dynamics

Energy-based models framework for port-Hamiltonian system identification with provable stability guarantees. Physical AI application.

Ax Weyl Lu, Chenjie Hao, Yubei Chen 4/3/2026

Deep Networks Favor Simple Data

Analysis of OOD anomaly where deep networks assign higher density to simple out-of-distribution data than in-distribution test data.

Ax Junxian Wu, Chenghan Fu, Zhanheng Nie, Daoze Zhang, Bowen Wan, Wanxian Guan, Chuan Yu, Jian Xu, Bo Zheng 4/3/2026

MOON3.0: Reasoning-aware Multimodal Representation Learning for E-commerce Product Understanding

MOON3.0 multimodal representation learning for e-commerce product understanding using reasoning-aware MLLMs to capture fine-grained attributes.

Ax Haibo Wang, Zihao Lin, Zhiyang Xu, Lifu Huang 4/3/2026

Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding

Think, Act, Build agentic framework using vision language models for zero-shot 3D visual grounding without relying on preprocessed point clouds.

Ax Mingming Ha, Guanchen Wang, Linxun Chen, Xuan Rao, Yuexin Shi, Tianbao Ma, Zhaojie Liu, Yunqian Fan, Zilong Lu, Yanan Niu, Han Li, Kun Gai 4/3/2026

UniMixer: A Unified Architecture for Scaling Laws in Recommendation Systems

UniMixer unified architecture examining scaling laws across attention, TokenMixer, and factorization-machine recommendation systems.

Ax Zhanzhi Lou, Hui Chen, Yibo Li, Qian Wang, Bryan Hooi 4/3/2026

Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies

Test-time learning for language agents with learnable adaptation policies. Improves agent behavior through iterative refinement at inference.

Ax Xiangqi Wang, Yue Huang, Haomin Zhuang, Kehan Guo, Xiangliang Zhang 4/3/2026

Dual Optimal: Make Your LLM Peer-like with Dignity

Dignified Peer framework countering sycophancy and evasiveness in aligned LLMs through anti-sycophancy and empathy.

Ax Zhengyang Tang, Ke Ji, Xidong Wang, Zihan Ye, Xinyuan Wang, Yiduo Guo, Ziniu Li, Chenxin Li, Jingyuan Hu, Shunian Chen, Tongxu Luo, Jiaxi Bi, Zeyu Qin, Shaobo Wang, Xin Lai, Pengyuan Lyu, Junyi Li, Can Xu, Chengquan Zhang, Han Hu, Ming Yan, Benyou Wang 4/3/2026

Do Phone-Use Agents Respect Your Privacy?

MyPhoneBench evaluation framework measuring privacy compliance in phone-use agents during mobile task completion.

Ax Nandan Thakur, Zijian Chen, Xueguang Ma, Jimmy Lin 4/3/2026

ORBIT: Scalable and Verifiable Data Generation for Search Agents on a Tight Budget

ORBIT generates 20K training queries for search agents integrating LMs with web search using scalable and verifiable methods.

Ax Guanzhi Deng, Bo Li, Ronghao Chen, Xiujin Liu, Zhuo Han, Huacan Wang, Lijie Wen, Linqi Song 4/3/2026

DR-LoRA: Dynamic Rank LoRA for Fine-Tuning Mixture-of-Experts Models

DR-LoRA assigns dynamic ranks to expert modules in MoE models for efficient parameter-specific fine-tuning of LLMs.

Ax Davide Di Gioia 4/3/2026

Cognitive Friction: A Decision-Theoretic Framework for Bounded Deliberation in Tool-Using Agents

Triadic Cognitive Architecture for tool-using agents with principled bounds on information-acquisition costs and deliberation.

Ax Elias Hossain, Mehrdad Shoeibi, Ivan Garibay, Niloofar Yousefi 4/3/2026

BIOGEN: Evidence-Grounded Multi-Agent Reasoning Framework for Transcriptomic Interpretation in Antimicrobial Resistance

BIOGEN multi-agent reasoning framework using evidence-grounding for transcriptomic interpretation in antimicrobial resistance.

Ax Tugrul Gorgulu, Atakan Dag, M. Esat Kalfaoglu, Halil Ibrahim Kuru, Baris Can Cam, Halil Ibrahim Ozturk, Ozsel Kilinc 4/3/2026

TaCarla: A comprehensive benchmarking dataset for end-to-end autonomous driving

TaCarla comprehensive benchmarking dataset for end-to-end autonomous driving with perception and planning tasks.

Ax Ziliang Guo, Ziheng Li, Bo Tang, Feiyu Xiong, Zhiyu Li 4/3/2026

MemFactory: Unified Inference & Training Framework for Agent Memory

MemFactory unified framework for training and inference in memory-augmented LLMs using RL to optimize memory operations.

Ax Samuel Bright-Thonney, Thomas R. Harvey, Andre Lukas, Jesse Thaler 4/3/2026

Sven: Singular Value Descent as a Computationally Efficient Natural Gradient Method

Sven optimization algorithm exploiting natural loss decomposition using Moore-Penrose pseudoinverse for efficient neural network training.

Ax Benjamin Turtel, Paul Wilczewski, Kris Skotheim 4/3/2026

Forecasting Supply Chain Disruptions with Foresight Learning

Framework training LLMs to forecast supply chain disruptions using calibrated probabilistic forecasts from disruption outcomes.

Ax Mars Liyao Gao, Yuxuan Bao, Amy S. Rude, Xinwei Shen, J. Nathan Kutz 4/3/2026

UQ-SHRED: uncertainty quantification of shallow recurrent decoder networks for sparse sensing via engression

UQ-SHRED adds uncertainty quantification to shallow recurrent decoder networks for sparse spatiotemporal reconstruction.

Ax Oluwamayowa O. Amusat, Luka Grbcic, Remi Patureau, M. Jibran S. Zuberi, Dan Gunter, Michael Wetter 4/3/2026

An Online Machine Learning Multi-resolution Optimization Framework for Energy System Design Limit of Performance Analysis

Online machine learning framework for multi-resolution energy system design optimization and performance analysis.

Ax Zeyu Xia, Tyler Kim, Trevor Reed, Judy Fox, Geoffrey Fox, Adam Szczepaniak 4/3/2026

JetPrism: diagnosing convergence for generative simulation and inverse problems in nuclear physics

JetPrism diagnoses convergence issues in Conditional Flow Matching for physics simulations and inverse problems.

Ax Haseeb Tariq, Alen Kaja, Marwan Hassani 4/3/2026

Detecting Complex Money Laundering Patterns with Incremental and Distributed Graph Modeling

Distributed graph modeling approach for detecting money laundering transaction patterns at scale.

Ax Zhongwei Yu, Rasul Tutunov, Alexandre Max Maraval, Zikai Xie, Zhenzhi Tan, Jiankang Wang, Zijing Li, Liangliang Xu, Qi Yang, Jun Jiang, Sanzhong Luo, Zhenxiao Guo, Haitham Bou-Ammar, Jun Wang 4/3/2026

Efficient and Principled Scientific Discovery through Bayesian Optimization: A Tutorial

Tutorial on Bayesian Optimization as a principled framework for automating scientific discovery using surrogate models.

Ax Marawan Gamal Abdel Hameed, Derek Tam, Pascal Jr Tikeng Notsawo, Colin Raffel, Guillaume Rabusseau 4/3/2026

Model Merging via Data-Free Covariance Estimation

Principled layer-wise optimization approach for model merging via data-free covariance estimation without task-specific training.

Ax Wenjing Wang, Wenxuan Wang, Songning Lai 4/3/2026

SECURE: Stable Early Collision Understanding via Robust Embeddings in Autonomous Driving

SECURE framework addressing robustness issues in deep learning models for autonomous driving collision prediction.

Ax Ahmer Raza, Hudson Smith 4/3/2026

Massively Parallel Exact Inference for Hawkes Processes

GPU-accelerated inference algorithm for multivariate Hawkes processes achieving O(N) complexity with parallelization.

Ax Vikram Krishnamurthy, Luke Snow 4/3/2026

Malliavin Calculus for Counterfactual Gradient Estimation in Adaptive Inverse Reinforcement Learning

Novel Langevin-based algorithm for adaptive inverse reinforcement learning using Malliavin calculus for gradient estimation.

Ax Brandon Yee, Pairie Koh 4/3/2026

PI-JEPA: Label-Free Surrogate Pretraining for Coupled Multiphysics Simulation via Operator-Split Latent Prediction

PI-JEPA: Physics-informed surrogate model for multiphysics simulation exploiting unlabeled parameter fields via latent prediction.

Ax Qing Zhu, Xian Yu 4/3/2026

Residuals-based Offline Reinforcement Learning

Residuals-based offline reinforcement learning approach for high-stakes applications with restrictive data coverage assumptions.

Ax Urs Hackstein, Jordi Alastruey, Philip Aston, Ciaran Bench, Peter H. Charlton, Loic Coquelin, Nando Hegemann, Vaidotas Marozas, Mohammad Moulaeifard, Manasi Nandi, Andrius Petrenas, Oskar Pfeffer, Mantas Rinkevicius, Andrius Solosenko, Nils Strodthoff, Sara Vardanega 4/3/2026

Benchmark Problems and Benchmark Datasets for the evaluation of Machine and Deep Learning methods on Photoplethysmography signals: the D4 report from the QUMPHY project

Benchmark datasets and evaluation protocols for machine learning methods on photoplethysmography medical signals.

Ax Nicholas Roberts, Sungjun Cho, Zhiqi Gao, Tzu-Heng Huang, Albert Wu, Gabriel Orlanski, Avi Trost, Kelly Buchanan, Aws Albarghouthi, Frederic Sala 4/3/2026

Test-Time Scaling Makes Overtraining Compute-Optimal

Train-to-Test scaling laws optimizing model size, training tokens, and inference samples jointly for compute-optimal LLM deployment.

Ax Rui Wu, Ruixiang Tang 4/3/2026

When Reward Hacking Rebounds: Understanding and Mitigating It with Representation-Level Signals

Study of reward hacking in LLM RL showing reproducible failure patterns and mitigation strategies using representation-level signals.

Ax Arshia Ilaty, Hossein Shirazi, Amir Rahmani, Hajar Homayouni 4/3/2026

DISCO-TAB: A Hierarchical Reinforcement Learning Framework for Privacy-Preserving Synthesis of Complex Clinical Data

Hierarchical RL framework for privacy-preserving synthetic clinical data generation combining LLMs with structured learning.

Ax Tara Saba, Anne Ouyang, Xujie Si, Fan Long 4/3/2026

CuTeGen: An LLM-Based Agentic Framework for Generation and Optimization of High-Performance GPU Kernels using CuTe

CuTeGen: LLM-based agentic framework for automated generation and optimization of high-performance GPU kernels using CuTe abstraction.

Ax William Hoy, Binxu Wang, Xu Pan 4/3/2026

Matching Accuracy, Different Geometry: Evolution Strategies vs GRPO in LLM Post-Training

Comparative study of Evolution Strategies vs GRPO for LLM post-training showing ES achieves comparable accuracy with different parameter geometry.

Ax Zhanliang Wang, Hongzhuo Chen, Quan Minh Nguyen, Mian Umair Ahsan, Kai Wang 4/3/2026

Beyond Logit Adjustment: A Residual Decomposition Framework for Long-Tailed Reranking

Residual decomposition framework for improving classifier performance on long-tailed datasets beyond standard logit adjustment.

Ax Hung Manh Pham, Jialu Tang, Aaqib Saeed, Dong Ma, Bin Zhu, Pan Zhou 4/3/2026

Learning ECG Image Representations via Dual Physiological-Aware Alignments

Self-supervised framework for learning clinical ECG image representations without access to raw signal recordings.

Ax Yixiao Wang, Ting Jiang, Zishan Shao, Hancheng Ye, Jingwei Sun, Mingyuan Ma, Jianyi Zhang, Yiran Chen, Hai Li 4/3/2026

ZEUS: Accelerating Diffusion Models with Only Second-Order Predictor

ZEUS: Training-free acceleration method for diffusion models using second-order predictors to reduce sampling steps.

Ax Shalima Binta Manir, Tim Oates 4/3/2026

Care-Conditioned Neuromodulation for Autonomy-Preserving Supportive Dialogue Agents

Care-Conditioned Neuromodulation framework for LLM-based dialogue agents that balances helpfulness with user autonomy preservation.

Ax Lincan Li, Rikuto Kotoge, Xihao Piao, Zheng Chen, Yushun Dong 4/3/2026

Optimizing EEG Graph Structure for Seizure Detection: An Information Bottleneck and Self-Supervised Learning Approach

EEG seizure detection method using graph neural networks with self-supervised learning and information bottleneck principles.

Ax Dong Shu, Denghui Zhang, Jessica Hullman 4/3/2026

Learning from the Right Rollouts: Data Attribution for PPO-based LLM Post-Training

Influence-Guided PPO framework for LLM post-training that filters noisy rollouts using data attribution to improve training efficiency.

Ax Deeptanshu Malu, Deevyanshu Malu, Aditya Nemiwal, Sunita Sarawagi 4/3/2026

Training In-Context and In-Weights Mixtures Via Contrastive Context Sampling

Research on training LLMs to develop both in-context and in-weights learning capabilities simultaneously via contrastive context sampling.

Ax Taisuke Kobayashi 4/3/2026

Pseudo-Quantized Actor-Critic Algorithm for Robustness to Noisy Temporal Difference Error

Novel reinforcement learning algorithm addressing noisy temporal difference errors in deep RL through pseudo-quantization methods.

Ax Shuibai Zhang, Caspian Zhuang, Chihan Cui, Zhihan Yang, Fred Zhangzhi Peng, Yanxin Zhang, Haoyue Bai, Zack Jia, Yang Zhou, Guanhua Chen, Ming Liu 4/3/2026

Expert-Choice Routing Enables Adaptive Computation in Diffusion Language Models

arXiv paper on expert-choice routing for diffusion language models. Deterministic load balancing improves throughput and convergence vs token-choice.

Ax Junyoung Sung, Seungwoo Lyu, Minjun Kim, Sumin An, Arsha Nagrani, Paul Hongsuck Seo 4/3/2026

CRIT: Graph-Based Automatic Data Synthesis to Enhance Cross-Modal Multi-Hop Reasoning

arXiv paper on CRIT, graph-based automatic data synthesis for cross-modal multi-hop reasoning. Generates complementary image-text data.

Ax Yunrui Zhang, Gustavo Batista, Salil S. Kanhere 4/3/2026

Label Shift Estimation With Incremental Prior Update

arXiv paper on label shift estimation with incremental prior updates. Addresses distribution mismatch between training and deployment.

Ax Barak Gahtan, Alex M. Bronstein 4/3/2026

Coupled Query-Key Dynamics for Attention

arXiv paper on coupled query-key dynamics for scaled dot-product attention. Improves language modeling perplexity by 6-7% on WikiText-103.

Ax Sten R\"udiger, Sebastian Raschka 4/3/2026

MiCA Learns More Knowledge Than LoRA and Full Fine-Tuning

arXiv paper introducing MiCA, parameter-efficient LLM fine-tuning method adapting minor singular vector subspaces. Outperforms LoRA on knowledge retention.

Ax Feiyu Zhou, Marios Impraimakis 4/3/2026

Transformer self-attention encoder-decoder with multimodal deep learning for response time series forecasting and digital twin support in wind structural health monitoring

arXiv paper on transformer encoder-decoder with multimodal learning for wind structural health monitoring and digital twins.

Ax Zhichong Zheng, Xiaohang Nie, Xueqi Wang, Yuanjin Zhao, Haitao Zhang, Yichao Tang 4/3/2026

MATA-Former & SIICU: Semantic Aware Temporal Alignment for High-Fidelity ICU Risk Prediction

arXiv paper on MATA-Former for ICU risk prediction using semantic-aware temporal alignment. Clinical-logic-aligned transformer architecture.

Ax David Grasev 4/3/2026

Koopman-Based Nonlinear Identification and Adaptive Control of a Turbofan Engine

arXiv paper applying Koopman operator methods for multivariable control of turbofan engines. Meta-heuristic extended dynamic mode decomposition.

Ax Giansalvo Cirrincione 4/3/2026

DDCL: Deep Dual Competitive Learning: A Differentiable End-to-End Framework for Unsupervised Prototype-Based Representation Learning

arXiv paper on DDCL, differentiable end-to-end framework for unsupervised prototype-based representation learning. Integrates feature learning with clustering.