D5P4: Partition Determinantal Point Process for Diversity in Parallel Discrete Diffusion Decoding
D5P4 framework applies determinantal point processes to discrete diffusion decoding for diverse parallel text generation.
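As an illustration of the kind of selection step a DPP-based decoder relies on, here is a minimal sketch (not D5P4's actual algorithm): greedy MAP selection under an L-ensemble kernel built from candidate embeddings and quality scores, which picks a subset of continuations that is both high-scoring and mutually diverse. The kernel construction, function names, and toy data below are assumptions for illustration only.

```python
# Illustrative sketch: greedy MAP selection under a determinantal point process,
# used here to pick a diverse subset of candidate continuations during parallel decoding.
import numpy as np

def dpp_greedy_select(embeddings: np.ndarray, quality: np.ndarray, k: int) -> list:
    """Greedily pick k indices maximising log det of the DPP kernel submatrix.

    embeddings: (n, d) unit-normalised candidate representations
    quality:    (n,) positive per-candidate quality scores
    """
    # L-ensemble kernel: quality scales the diagonal, cosine similarity the off-diagonal.
    sim = embeddings @ embeddings.T
    L = quality[:, None] * sim * quality[None, :]

    selected, remaining = [], list(range(len(quality)))
    for _ in range(min(k, len(remaining))):
        best_idx, best_gain = remaining[0], -np.inf
        for i in remaining:
            idx = selected + [i]
            # Diversity-aware gain: log det of the kernel restricted to the chosen set.
            sign, logdet = np.linalg.slogdet(L[np.ix_(idx, idx)])
            gain = logdet if sign > 0 else -np.inf
            if gain > best_gain:
                best_idx, best_gain = i, gain
        selected.append(best_idx)
        remaining.remove(best_idx)
    return selected

# Example: 6 candidates in a 4-d embedding space, pick 3 diverse, high-quality ones.
rng = np.random.default_rng(0)
emb = rng.normal(size=(6, 4))
emb /= np.linalg.norm(emb, axis=1, keepdims=True)
print(dpp_greedy_select(emb, quality=np.exp(rng.normal(size=6)), k=3))
```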
Algorithm for generalized symmetric matrix factorization with exactness properties and non-Lipschitz optimization.
Method for splitting pretrained language models into specialized domain-specific models using continued pretraining strategies.
Multi-agent framework for grounding vision-language navigation using probabilistic reasoning about spatial relations and metric constraints.
Evaluates State Space Models as vision encoders for Vision-Language Models, comparing SSM backbones to transformer-based alternatives.
DreamPartGen generates semantically grounded 3D objects with part-level decomposition using text-to-3D diffusion methods.
DriveTok proposes efficient 3D tokenization for multi-view driving scenes to improve autonomous driving systems and world models.
Nemotron-Cascade 2: 30B open-weight MoE LLM with strong reasoning and agentic capabilities, achieving IMO Gold Medal performance.
Method for designing adaptive noise schedules in diffusion models for image and video generation using spectral guidance.
NavTrust benchmark evaluates trustworthiness of embodied navigation agents under real-world corruptions in Vision-Language Navigation and Object-Goal Navigation tasks.
Establishes improved learning rates for stochastic gradient descent and Nesterov accelerated gradient with generalization performance guarantees.
Chat Incremental Pattern Constructor extracts ordered token-transition rules from text for interpretable, rule-based machine learning.
Optimization methods for inverse classification problems including counterfactual explanations and adversarial examples using logistic and softmax classifiers.
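For the counterfactual-explanation side of that line, a minimal sketch assuming a plain logistic-regression classifier: gradient descent over the input minimizes the target-class loss plus a proximity penalty, which is the textbook counterfactual formulation rather than the paper's specific optimizer. The weights, penalty strength, and step sizes below are illustrative.

```python
# Illustrative sketch: gradient-based counterfactual search for a logistic classifier.
import numpy as np

def counterfactual(x, w, b, target=1, lam=0.1, lr=0.5, steps=200):
    """Gradient descent on BCE(target) + lam * ||x' - x||^2 over the input x'."""
    x_cf = x.copy()
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(w @ x_cf + b)))       # predicted P(y = 1)
        grad = (p - target) * w + 2 * lam * (x_cf - x)  # gradient of loss w.r.t. x'
        x_cf -= lr * grad
    return x_cf

w, b = np.array([1.5, -2.0]), -0.25
x = np.array([-1.0, 1.0])                               # originally classified as class 0
x_new = counterfactual(x, w, b, target=1)
print(x_new, 1 / (1 + np.exp(-(w @ x_new + b))))        # probability should now exceed 0.5
```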
CADGL uses context-aware deep graph learning for predicting drug-drug interactions with improved generalization and robustness.
μLO derives Maximal Update Parametrization for learned optimizers to improve meta-generalization across network widths and unseen tasks.
Flow matching approach with a large-scale synthetic dataset for solving the inverse ellipsometry problem of reconstructing optical film properties.
ODE-constrained generative model for synthesizing realistic 12-lead ECG training data to address the scarcity of labeled medical recordings.
Cliqueformer uses structured transformers for model-based optimization in design problems like protein engineering via offline learning.
VOGP algorithm using Gaussian process bandits for black-box vector optimization with incomplete order relations and Pareto optimality guarantees.
Theoretical analysis showing shallow nonlinear networks learn linearly separable features with polynomial width scaling relative to data dimension.
Methods to achieve real-world efficiency gains from token filtering in LLM training through improved sparsity and adaptive filtering strategies.
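To make the token-filtering idea concrete, a minimal, assumption-laden sketch of loss-based filtering: keep only the highest-loss tokens in the language-modeling objective. Note that this naive version drops gradients but saves no wall-clock compute by itself, which is precisely the gap the line above is concerned with; the function and ratio are illustrative.

```python
# Illustrative sketch: per-token loss filtering, keeping only the hardest tokens.
import torch
import torch.nn.functional as F

def filtered_lm_loss(logits, targets, keep_ratio=0.5):
    """logits: (batch, seq, vocab); targets: (batch, seq). Keeps only high-loss tokens."""
    per_token = F.cross_entropy(logits.flatten(0, 1), targets.flatten(), reduction="none")
    k = max(1, int(keep_ratio * per_token.numel()))
    top_losses, _ = torch.topk(per_token, k)   # hardest (highest-loss) tokens
    return top_losses.mean()

logits = torch.randn(2, 16, 1000, requires_grad=True)
targets = torch.randint(0, 1000, (2, 16))
filtered_lm_loss(logits, targets).backward()
```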
Survey of Part-Prototype Models for explainable AI, examining their interpretability mechanisms and their limitations in competitiveness against alternative approaches.
Two neural architectures for precipitation nowcasting integrating weather station data and radar measurements for improved forecast skill.
OPUS-VFL addresses privacy-utility tradeoffs and incentive mechanisms in Vertical Federated Learning with heterogeneous client resources.
Causal intervention framework for interpreting Variational Autoencoders mechanistically, addressing interpretability of generative models.
Shapley Value-based alternating training framework for multimodal fusion that balances dominant and minor modalities.
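The Shapley-value machinery behind such balancing can be sketched exactly for a small number of modalities: treat each modality as a player and score a coalition by some utility, e.g. validation accuracy with the remaining modalities masked out. The toy utility table and names below are assumptions, not the paper's training procedure.

```python
# Illustrative sketch: exact Shapley values over a small set of modalities.
from itertools import combinations
from math import factorial

def shapley_values(players, utility):
    """Exact Shapley values (cost is exponential in the number of players)."""
    n = len(players)
    values = {p: 0.0 for p in players}
    for p in players:
        others = [q for q in players if q != p]
        for r in range(len(others) + 1):
            for coalition in combinations(others, r):
                weight = factorial(r) * factorial(n - r - 1) / factorial(n)
                marginal = utility(frozenset(coalition) | {p}) - utility(frozenset(coalition))
                values[p] += weight * marginal
    return values

# Toy utility: audio alone is weak, vision alone is decent, together they are best.
acc = {frozenset(): 0.10, frozenset({"audio"}): 0.40,
       frozenset({"vision"}): 0.65, frozenset({"audio", "vision"}): 0.80}
print(shapley_values(["audio", "vision"], lambda s: acc[frozenset(s)]))
```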
Statistical framework for fairness testing in algorithmic systems that accounts for sampling error and handles intersectional demographic analysis.
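For the simplest instance of such a test, a sketch assuming a demographic-parity comparison between two groups: a two-proportion z-test that reports the rate gap with a confidence interval, making sampling error explicit. Intersectional analysis and the paper's actual estimator are beyond this toy example.

```python
# Illustrative sketch: two-proportion z-test for a demographic-parity gap.
from math import sqrt, erf

def parity_gap_test(pos_a, n_a, pos_b, n_b, z=1.96):
    """Positive-outcome rate gap between groups A and B, with 95% CI and two-sided p-value."""
    p_a, p_b = pos_a / n_a, pos_b / n_b
    gap = p_a - p_b
    se = sqrt(p_a * (1 - p_a) / n_a + p_b * (1 - p_b) / n_b)    # unpooled SE for the CI
    pooled = (pos_a + pos_b) / (n_a + n_b)
    se0 = sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))     # pooled SE under H0
    z_stat = gap / se0
    p_value = 2 * (1 - 0.5 * (1 + erf(abs(z_stat) / sqrt(2))))  # two-sided normal tail
    return gap, (gap - z * se, gap + z * se), p_value

print(parity_gap_test(pos_a=180, n_a=400, pos_b=150, n_b=420))
```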
Analysis of communication scheduling in decentralized learning showing benefits of concentrating synchronization in later training stages.
GeoReg uses LLMs with satellite imagery and geospatial data for socio-economic indicator estimation in data-scarce regions via few-shot regression.
Research on Online Convex Optimization algorithms for heavy-tailed gradient distributions, extending beyond finite variance assumptions.
Physics-informed neural network framework predicting fatigue life of steels under nuclear reactor conditions.
Neural network architecture using nonharmonic Fourier series for scientific machine learning applications.
Transformer architecture with dual attention for multivariate time-series anomaly detection using temporal invariants.
Theoretical analysis of parameter norm scaling in overparameterized linear regression and diagonal networks.
Framework evaluating faithfulness of chain-of-thought reasoning in large audio language models for multimodal tasks.
Flow-matching models for 3D point cloud generation using optimal transport and meanflow for single-step inference acceleration.
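A minimal sketch of the conditional flow-matching objective such work builds on (not the paper's model or its meanflow distillation): regress a velocity network onto the straight-line target x1 - x0 along the interpolant x_t = (1 - t) * x0 + t * x1. The tiny MLP, tensor shapes, and per-sample time handling are illustrative assumptions.

```python
# Illustrative sketch: conditional flow-matching loss for point-cloud-shaped data.
import torch
import torch.nn as nn

class VelocityNet(nn.Module):
    def __init__(self, dim=3, hidden=128):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(dim + 1, hidden), nn.SiLU(),
                                 nn.Linear(hidden, hidden), nn.SiLU(),
                                 nn.Linear(hidden, dim))

    def forward(self, x, t):
        # Concatenate the time scalar onto each point's coordinates.
        return self.net(torch.cat([x, t], dim=-1))

def cfm_loss(model, x1):
    """x1: (batch, num_points, 3) target point clouds; x0 is Gaussian noise."""
    x0 = torch.randn_like(x1)
    t = torch.rand(x1.shape[0], 1, 1).expand(-1, x1.shape[1], 1)  # one t per sample
    x_t = (1 - t) * x0 + t * x1
    v_target = x1 - x0
    v_pred = model(x_t, t)
    return ((v_pred - v_target) ** 2).mean()

model = VelocityNet()
loss = cfm_loss(model, torch.randn(4, 256, 3))
loss.backward()
```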
KAN-based feature selection framework for tabular data via spline-based importance scoring. Specialized ML technique.
Sub-quadratic attention algorithm removing bounded-entry restrictions for LLM inference speedup. Foundational LLM efficiency research.
Quantization technique for vision encoders using prefix registers to handle outliers. Optimization research for multimodal models.
Theoretical reinforcement learning on decision-estimation coefficients for adversarial MDPs. Pure RL theory.
Conditional flow matching for precipitation forecasting. Weather prediction ML, not core AI interests.
Diffusion-Transformer model converting images directly to G-code for 3D printing. Applied ML, domain-specific.
Genetic algorithm for sample reweighting to mitigate ML bias. Fairness-focused, not primary tech interests.
Continual learning research on how replay buffer size affects feature retention versus classifier forgetting. Specialized ML theory.
Neural ODEs for quantum many-body dynamics simulation. Physics-focused ML, not core AI interests.
Algorithm extraction from Discrete Transformers via symbolic program synthesis. Addresses representation entanglement in interpretability.
EEG foundation model for brain-computer interfaces with biophysical grounding. Neuroscience domain, not AI/tech stack focused.
Research analyzing mechanistic changes when post-training autoregressive models into masked diffusion models. Studies model internals via circuit analysis.
Machine learning for materials science: multimodal models predict dielectric elastomer properties under limited data. Domain-specific ML, not AI-focused.
Unified theoretical framework for model merging explaining effectiveness across heterogeneous fine-tuning hyperparameters with scaling laws.