Isolater - Feed

Ax Hadi Reisizadeh, Jiajun Ruan, Yiwei Chen, Soumyadeep Pal, Sijia Liu, Mingyi Hong 2/24/2026

Leak@$k$: Unlearning Does Not Make LLMs Forget Under Probabilistic Decoding

Leak@k: study showing existing LLM unlearning methods fail under probabilistic decoding despite success under greedy decoding evaluation.

Ax Hamza Virk, Sandro Amaglobeli, Zuhayr Syed 2/24/2026

Blind Inverse Game Theory: Jointly Decoding Rewards and Rationality in Entropy-Regularized Competitive Games

Blind-IGT: inverse game theory method jointly decoding rewards and rationality in entropy-regularized competitive games with unknown rationality parameter.

Ax Bill Chunyuan Zheng, Vivek Myers, Benjamin Eysenbach, Sergey Levine 2/24/2026

Multistep Quasimetric Learning for Scalable Goal-conditioned Reinforcement Learning

Quasimetric learning method for goal-conditioned RL using multi-step returns to estimate temporal distance between observations over long horizons.

Ax Bernardo Perrone Ribeiro, Jana Faganeli Pucer 2/24/2026

FlowCast: Advancing Precipitation Nowcasting with Conditional Flow Matching

FlowCast: conditional flow matching method for radar-based precipitation nowcasting addressing uncertainty and high-dimensional data modeling.

Ax Zhenshuo Zhang, Minxuan Duan, Youran Ye, Hongyang R. Zhang 2/24/2026

Scalable Multi-Objective and Meta Reinforcement Learning via Gradient Estimation

Gradient estimation method for multi-objective and meta reinforcement learning, partitioning n objectives into k groups for language model preference optimization.

Ax Patryk Krukowski, Jan Miksa, Piotr Helm, Jacek Tabor, Pawe{\l} Wawrzy\'nski, Przemys{\l}aw Spurek 2/24/2026

InTAct: Interval-based Task Activation Consolidation for Continual Learning

InTAct: continual learning approach using interval-based task activation consolidation with mathematical guarantees against catastrophic forgetting.

Ax German Gritsai, Megan Richards, Maxime M\'eloux, Kyunghyun Cho, Maxime Peyrard 2/24/2026

MIST: Mutual Information Estimation Via Supervised Training

MIST: neural network-based mutual information estimator trained on 625K synthetic distributions with known ground-truth MI.

Ax Rui Xue, Shichao Zhu, Liang Qin, Tianfu Wu 2/24/2026

E2E-GRec: An End-to-End Joint Training Framework for Graph Neural Networks and Recommender Systems

E2E-GRec framework for end-to-end joint training of GNNs and recommender systems, replacing two-stage pipeline approach.

Ax Xiao Wu, Ting-Zhu Huang, Liang-Jian Deng, Xiaobing Yu, Yu Zhong, Shangqi Deng, Ufaq Khan, Jianghao Wu, Xiaofeng Liu, Imran Razzak, Xiaojun Chang, Yutong Xie 2/24/2026

SelfAI: A self-directed framework for long-horizon scientific discovery

SelfAI multi-agent system for self-directed long-horizon scientific discovery with human-in-the-loop workflows and exploration trade-offs.

Ax Yaswanth Chittepu, Raghavendra Addanki, Tung Mai, Anup Rao, Branislav Kveton 2/24/2026

ML-Tool-Bench: Tool-Augmented Planning for ML Tasks

ML-Tool-Bench framework for tool-augmented planning in autonomous ML agents orchestrating data analysis and model optimization workflows.

Ax Yifan Zhang, Zixiang Chen, Yifeng Liu, Zhen Qin, Huizhuo Yuan, Kangping Xu, Yang Yuan, Quanquan Gu, Andrew Chi-Chih Yao 2/24/2026

Group Representational Position Encoding

GRAPE framework unifying positional encoding mechanisms using group actions for multiplicative rotations and additive biases.

Ax Luca Miglior, Matteo Tolloso, Alessio Gravina, Davide Bacciu 2/24/2026

Can You Hear Me Now? A Benchmark for Long-Range Graph Propagation

ECHO benchmark for evaluating graph neural networks on long-range graph propagation and interaction tasks.

Ax Daniel M. Jimenez-Gutierrez, Mehrdad Hassanzadeh, David Solans, Mohammed Elbamby, Nicolas Kourtellis, Aris Anagnostopoulos, Ioannis Chatzigiannakis, Andrea Vitaletti 2/24/2026

Clust-PSI-PFL: A Population Stability Index Approach for Clustered Non-IID Personalized Federated Learning

Clustered personalized federated learning framework using Population Stability Index to handle non-IID data across clients.

Ax Mohammad Meymani, Roozbeh Razavi-Far 2/24/2026

Divided We Fall: Defending Against Adversarial Attacks via Soft-Gated Fractional Mixture-of-Experts with Randomized Adversarial Training

Soft-gated fractional mixture-of-experts with randomized adversarial training to defend ML models against adversarial attacks.

Ax Erin Carson, Xinye Chen 2/24/2026

Precision Autotuning for Linear Solvers via Reinforcement Learning

RL framework for adaptive precision tuning in linear solvers using contextual bandit approach to balance precision and efficiency.

Ax Siba Smarak Panigrahi, Jovana Videnovi\'c, Maria Brbi\'c 2/24/2026

HeurekaBench: A Benchmarking Framework for AI Co-scientist

HeurekaBench benchmarking framework for evaluating LLM-based AI co-scientist agents on end-to-end scientific analysis tasks.

Ax Joonwon Seo 2/24/2026

Mathematical Foundations of Polyphonic Music Generation via Structural Inductive Bias

Mathematical framework for polyphonic music generation using structural inductive bias and smart embeddings on Beethoven sonatas.

Ax Sucheta Ghosh, Felix Dietrich, Zahra Monfared 2/24/2026

Contrastive and Multi-Task Learning on Noisy Brain Signals with Nonlinear Dynamical Signatures

Multitask learning framework with denoising autoencoder for EEG signal analysis combining motor imagery and emotion recognition.

Ax Kecheng Cai, Chao Peng, Chenyang Xu, Xia Chen, Yi Wang, Shuo Shi, Qiyuan Liang 2/24/2026

Self-Augmented Mixture-of-Experts for QoS Prediction

Mixture-of-experts model with self-augmentation for Quality of Service prediction in web service recommendation systems.

Ax Alessandro Londei, Matteo Benati, Denise Lanzieri, Vittorio Loreto 2/24/2026

Inverting Self-Organizing Maps: A Unified Activation-Based Framework

Method for inverting Self-Organizing Maps as generative models using activation patterns and distance geometry to reconstruct inputs.

Ax Akila Sampath, Vandana Janeja, Jianwu Wang 2/24/2026

PhysE-Inv: A Physics-Encoded Inverse Modeling approach for Arctic Snow Depth Prediction

Physics-informed inverse modeling framework for Arctic snow depth prediction combining process-based constraints with data-driven learning.

Ax Liheng Yu, Zhe Zhao, Yuxuan Wang, Pengkun Wang, Xiaofeng Cao, Binwu Wang, Yang Wang 2/24/2026

FaLW: A Forgetting-aware Loss Reweighting for Long-tailed Unlearning

Machine unlearning method addressing long-tailed distributions in forget sets using forgetting-aware loss reweighting for privacy compliance.

Ax Paul Whitten, Francis Wolff, Chris Papachristou 2/24/2026

Explainability Methods for Hardware Trojan Detection: A Systematic Comparison

Systematic comparison of explainability methods for detecting malicious hardware trojans in integrated circuits.

Ax Jialei Liu, C. Emre Koksal, Ming Shi 2/24/2026

Bi-Level Online Provisioning and Scheduling with Switching Costs and Cross-Level Constraints

Online optimization algorithm for bi-level resource provisioning and scheduling with switching costs and cross-level constraints.

Ax Yuxin Lu, Zhen Peng, Xiqiang Xia, Jie Wang 2/24/2026

A Novel VAE-DML Fusion Framework for Causal Analysis of Greenwashing in the Mining Industry

Framework combining VAE and deep metric learning for causal analysis of greenwashing in mining industry environmental disclosure.

Ax Dung Anh Hoang, Cuong Pham anh Trung Le, Jianfei Cai, Thanh-Toan Do 2/24/2026

Gradient-Aligned Calibration for Post-Training Quantization of Diffusion Models

Gradient-aligned calibration method for post-training quantization of diffusion models to accelerate inference and reduce memory usage.

Ax Weiqing He, Xiang Li, Li Shen, Weijie Su, Qi Long 2/24/2026

Improving the Trade-off Between Watermark Strength and Speculative Sampling Efficiency for Language Models

Research on trade-off between LLM watermarking strength and speculative sampling efficiency, proposing methods to improve both simultaneously.

Ax Stefanos Pertigkiozoglou, Mircea Petrache, Shubhendu Trivedi, Kostas Daniilidis 2/24/2026

Recurrent Equivariant Constraint Modulation: Learning Per-Layer Symmetry Relaxation from Data

Method for learning per-layer equivariance relaxation in neural networks without manual hyperparameter tuning, improving optimization dynamics.

Ax Soyeon Hong, Jinchan Kim, Jaegook You, Seungtaek Choi, Suha Kwak, Hyunsouk Cho 2/24/2026

TextME: Bridging Unseen Modalities Through Text Descriptions

TextME enables multimodal expansion using text-only training, projecting diverse modalities into LLM embeddings without paired datasets.

Ax Cristian Manca, Christian Scano, Giorgio Piras, Fabio Brau, Maura Pintor, Battista Biggio 2/24/2026

SAGE-5GC: Security-Aware Guidelines for Evaluating Anomaly Detection in the 5G Core Network

Machine learning-based anomaly detection for 5G networks evaluated under realistic conditions without IID assumptions and adaptive attackers.

Ax Kevin Han, Yuhang Zhou, Mingze Gao, Gedi Zhou, Serena Li, Abhishek Kumar, Xiangjun Fan, Weiwei Li, Lizhu Zhang 2/24/2026

EBPO: Empirical Bayes Shrinkage for Stabilizing Group-Relative Policy Optimization

EBPO addresses stability issues in Group Relative Policy Optimization for LLM reasoning via Empirical Bayes shrinkage, reducing variance and gradient problems.

Ax Rui Wu, Li YongJun 2/24/2026

Causal Schr\"odinger Bridges: Constrained Optimal Transport on Structural Manifolds

arXiv paper on causal Schrödinger bridges for constrained optimal transport in generative modeling under causal interventions.

Ax Paul Saegert, Ullrich K\"othe 2/24/2026

Breaking the Simplification Bottleneck in Amortized Neural Symbolic Regression

arXiv paper on amortized neural symbolic regression addressing expression simplification bottleneck for discovering interpretable analytical expressions.

Ax Jinbo Wang, Binghui Li, Zhanpeng Zhou, Mingze Wang, Yuxuan Sun, Jiaqi Zhang, Xunliang Cai, Lei Wu 2/24/2026

Fast Catch-Up, Late Switching: Optimal Batch Size Scheduling via Functional Scaling Laws

arXiv paper analyzing optimal batch size scheduling for large-scale deep learning using functional scaling law framework.

Ax Hani Beirami, M M Manjurul Islam 2/24/2026

Conformal Signal Temporal Logic for Robust Reinforcement Learning Control: A Case Study

arXiv paper on formal temporal logic specifications enhancing safety of reinforcement learning control in aerospace F-16 simulation.

Ax Samira Nazari, Mohammad Saeed Almasi, Mahdi Taheri, Ali Azarpeyvand, Ali Mokhtari, Ali Mahani, Christian Herglotz 2/24/2026

HAWX: A Hardware-Aware FrameWork for Fast and Scalable ApproXimation of DNNs

arXiv paper on HAWX, hardware-aware framework for fast DNN approximation using multi-level sensitivity scoring and heterogeneous approximate computing.

Ax Jim Zhao, Tin Sum Cheng, Wojciech Masarczyk, Aurelien Lucchi 2/24/2026

Optimizer choice matters for the emergence of Neural Collapse

arXiv paper analyzing role of optimizer choice in Neural Collapse emergence during deep neural network training terminal phase.

Ax Nikunj Gupta, James Zachary Hare, Jesse Milzman, Rajgopal Kannan, Viktor Prasanna 2/24/2026

Action-Graph Policies: Learning Action Co-dependencies in Multi-Agent Reinforcement Learning

arXiv paper proposing Action-Graph Policies for modeling action dependencies and coordination in multi-agent reinforcement learning systems.

Ax Yicheng Lang, Changsheng Wang, Yihua Zhang, Mingyi Hong, Zheng Zhang, Wotao Yin, Sijia Liu 2/24/2026

Powering Up Zeroth-Order Training via Subspace Gradient Orthogonalization

arXiv paper on zeroth-order optimization for fine-tuning large-scale models via subspace gradient orthogonalization, improving accuracy-efficiency tradeoff.

Ax Martin Andersson, Benny Avelin 2/24/2026

Exploring Singularities in point clouds with the graph Laplacian: An explicit approach

Theoretical analysis of graph Laplacian methods for detecting singularities in point cloud manifolds with explicit bounds and geometric estimation tests.

Ax Louis Grenioux, Maxence Noble, Marylou Gabri\'e, Alain Oliviero Durmus 2/24/2026

Stochastic Localization via Iterative Posterior Sampling

Investigates stochastic localization techniques for sampling from unnormalized densities using score-based learning.

Ax Xinyu Liu, Hai Zhang 2/24/2026

Model Selection and Parameter Estimation of One-Dimensional Gaussian Mixture Models

Studies optimal sampling complexity for estimating model order and parameters in one-dimensional Gaussian mixture models.

Ax Saeed Masiha, Saber Salehkaleybar, Niao He, Negar Kiyavash, Patrick Thiran 2/24/2026

Optimal Local Convergence Rates of Stochastic First-Order Methods under Local $\alpha$-PL

Research on local convergence rates of stochastic first-order methods under Polyak-Lojasiewicz conditions, a theoretical ML optimization problem.

HN minimal_action 2/24/2026

Against AI Enthusiasm and AI Fear: The Interface Problem

Essay examining the interface problem between AI capabilities and real-world impact, citing Sakana AI's autonomous research system achieving peer-review publication.

HN seawolf2357 2/24/2026

Do Bubbles Form When AIs Simulate Capitalism?

Research paper demonstrating multiple AI agents connected to live trading APIs all bankrupted within 30 minutes due to LLM hallucination causing false market citations.

HN kristoff0601 2/24/2026

Show HN: Indie AI Directory – A Curated List of Indie AI Tools

Curated directory of indie AI tools, startups, and APIs created by independent developers and solo founders with searchable categorization.

HN ajainvivek 2/24/2026

ReasonDB – A database that reasons through your documents

AI-native document database built in Rust enabling AI agents to reason through documents via structural reasoning rather than vector similarity retrieval.

HN tikue 2/24/2026

Show HN: An AI voice agent that navigates IVR and negotiates retention discounts

AI voice agent that autonomously navigates IVR phone systems and negotiates customer retention discounts.

HN parthsamin 2/24/2026

Show HN: Thisorthis.ai – Compare responses from 50 AI models side-by-side

thisorthis.ai compares responses from 47+ text and image models side-by-side. Users submit one prompt and see outputs from ChatGPT, Claude, Gemini and others simultaneously with SmartPick LLM evaluation.

HN evo-dragon 2/24/2026

Show HN: Anno – API that cuts AI web-scraping token costs by 90%

Anno API extracts clean structured text from web pages, reducing AI agent token consumption by 93% (600 vs 15,000 tokens per page). HTTP-based with ensemble extraction and confidence scoring.