Isolater - Feed

Ax Zhiliang Chen, Alfred Wei Lun Leong, Shao Yong Ong, Apivich Hemachandra, Gregory Kang Ruey Lau, Chuan-Sheng Foo, Zhengyuan Liu, Nancy F. Chen, Bryan Kian Hsiang Low 2/20/2026

The Chicken and Egg Dilemma: Co-optimizing Data and Model Configurations for LLMs

Method for jointly optimizing data mixture and model architecture configurations during LLM training to avoid suboptimal individual choices.

Ax Xinxu Wei, Rong Zhou, Lifang He, Yu Zhang 2/20/2026

Diffusion-Guided Pretraining for Brain Graph Foundation Models

Diffusion-guided pretraining approach for brain graph foundation models using semantic-aware augmentation strategies.

Ax Iv\'an Arcuschin, David Chanin, Adri\`a Garriga-Alonso, Oana-Maria Camburu 2/20/2026

Biases in the Blind Spot: Detecting What LLMs Fail to Mention

Automated black-box pipeline for detecting unverbalized biases in LLM reasoning traces and chain-of-thought explanations.

Ax Roberto Molinaro, Niall Siegenheim, Henry Martin, Mark Frey, Niels Poulsen, Philipp Seitz, Marvin Vincent Gabler 2/20/2026

Universal Diffusion-Based Probabilistic Downscaling

Universal diffusion-based framework for converting low-resolution weather forecasts to probabilistic high-resolution predictions without model fine-tuning.

Ax Beatrix M. G. Nielsen, Emanuele Marconato, Luigi Gresele, Andrea Dittadi, Simon Buchholz 2/20/2026

Logit Distance Bounds Representational Similarity

Analysis showing logit distance bounds representational similarity in discriminative models including autoregressive language models.

Ax Chenda Duan, Yipeng Zhang, Sotaro Kanai, Yuanyi Ding, Atsuro Daida, Pengyue Yu, Tiancheng Zheng, Naoto Kuroda, Shaun A. Hussain, Eishi Asano, Hiroki Nariai, Vwani Roychowdhury 2/20/2026

Omni-iEEG: A Large-Scale, Comprehensive iEEG Dataset and Benchmark for Epilepsy Research

Large-scale intracranial EEG dataset and benchmarks for epilepsy research and seizure localization using data-driven approaches.

Ax Jung Min Choi, Vijaya Krishna Yalavarthi, Lars Schmidt-Thieme 2/20/2026

HPMixer: Hierarchical Patching for Multivariate Time Series Forecasting

HPMixer model for long-term multivariate time series forecasting using hierarchical patching to capture periodic patterns and residuals.

Ax Naoki Masuyama, Takanori Takebayashi, Yusuke Nojima, Chu Kiong Loo, Hisao Ishibuchi, Stefan Wermter 2/20/2026

A Parameter-free Adaptive Resonance Theory-based Topological Clustering Algorithm Capable of Continual Learning

ART-based topological clustering algorithm that eliminates need for manual parameter tuning and supports continual learning.

Ax Fran\c{c}ois Bachoc, Tommaso Cesari, Roberto Colomboni 2/20/2026

A Parametric Contextual Online Learning Theory of Brokerage

Online learning theory framework for contextual brokerage between traders with sequential asset trading decisions.

Ax Samet Demir, Zafer Dogan 2/20/2026

Input-Label Correlation Governs a Linear-to-Nonlinear Transition in Random Features under Spiked Covariance

Theoretical analysis of linear-to-nonlinear transition in random feature models under spiked covariance and input-label correlation.

Ax Rachel Ma, Jingyi Qu, Andreea Bobu, Dylan Hadfield-Menell 2/20/2026

Goal Inference from Open-Ended Dialog

Framework for embodied AI agents to infer user goals from open-ended dialog using LLMs for efficient task accomplishment.

Ax Aditya Dutt, Ishikaa Lunawat, Manpreet Kaur 2/20/2026

Multi-View 3D Reconstruction using Knowledge Distillation

Knowledge distillation pipeline to compress Dust3r foundation model for efficient 3D reconstruction and visual localization.

Ax Kaleel Mahmood, Shaoyi Huang 2/20/2026

Efficient Context Propagating Perceiver Architectures for Auto-Regressive Language Modeling

Perceiver architecture for auto-regressive language modeling reducing attention complexity from quadratic to semi-linear.

Ax Bettina Messmer, Vinko Sabol\v{c}ec, Martin Jaggi 2/20/2026

Enhancing Multilingual LLM Pretraining with Model-Based Data Selection

Model-based data filtering framework for multilingual LLM pretraining that identifies diverse, high-quality training samples.

Ax Alan Luo, Kaiwen Yuan 2/20/2026

Simple Self Organizing Map with Vision Transformers

Combining Self-Organizing Maps with Vision Transformers to improve performance on smaller datasets through explicit inductive biases.

Ax Andr\'e Barreto, Vincent Dumoulin, Yiran Mao, Mark Rowland, Nicolas Perez-Nieves, Bobak Shahriari, Yann Dauphin, Doina Precup, Hugo Larochelle 2/20/2026

Capturing Individual Human Preferences with Reward Features

Learning user-specialized reward models for reinforcement learning from human feedback to capture individual preference disagreement.

Ax Ting Qiao, Yingjia Wang, Xing Liu, Sixing Wu, Jianbin Li, Yiming Li 2/20/2026

Cert-SSBD: Certified Backdoor Defense with Sample-Specific Smoothing Noises

Certified defense against backdoor attacks in deep neural networks using sample-specific smoothing noise.

Ax Jacob Carlson, Melissa Dell 2/20/2026

A Unifying Framework for Robust and Efficient Inference with Unstructured Data

Framework for handling unstructured data feature extraction with neural networks while accounting for measurement bias in economic analysis.

Ax Dmitriy Shopkhoev, Ammar Ali, Magauiya Zhussip, Valentin Malykh, Stamatios Lefkimmiatis, Nikos Komodakis, Sergey Zagoruyko 2/20/2026

ReplaceMe: Network Simplification via Depth Pruning and Transformer Block Linearization

ReplaceMe: training-free depth pruning method replacing transformer blocks with linear operations for efficient model compression.

Ax Miguel Aguilera, Sosuke Ito, Artemy Kolchinsky 2/20/2026

Inferring entropy production in many-body systems using nonequilibrium maximum entropy

Method for inferring entropy production in high-dimensional stochastic systems using nonequilibrium maximum entropy principle.

Ax Thibaud Gloaguen, Robin Staab, Nikola Jovanovi\'c, Martin Vechev 2/20/2026

LLM Fingerprinting via Semantically Conditioned Watermarks

LLM fingerprinting via semantically conditioned watermarks that survive finetuning and quantization without being easily detected.

Ax Bosung Kim, Prithviraj Ammanabrolu 2/20/2026

Beyond Needle(s) in the Embodied Haystack: Environment, Architecture, and Training Considerations for Long Context Reasoning

∞-THOR framework for long-horizon embodied AI tasks with Needle(s) in Embodied Haystack benchmark for testing long-context reasoning in agents.

Ax Maximilian Kreutner, Marlene Lutz, Markus Strohmaier 2/20/2026

Persona-driven Simulation of Voting Behavior in the European Parliament with Large Language Models

Using persona-driven prompting to simulate European Parliament voting behavior with LLMs, addressing political bias in model responses.

Ax Jan C. Schulze, Alexander Mitsos 2/20/2026

Nonlinear Model Order Reduction of Dynamical Systems in Process Engineering: Review and Comparison

Review of nonlinear model order reduction methods for creating computationally efficient dynamical system models in process engineering.

Ax Mert Cemri, Nived Rajaraman, Rishabh Tiwari, Xiaoxuan Liu, Kurt Keutzer, Ion Stoica, Kannan Ramchandran, Ahmad Beirami, Ziteng Sun 2/20/2026

$\texttt{SPECS}$: Faster Test-Time Scaling through Speculative Drafts

SPECS: method for faster test-time scaling in LLMs through speculative drafts, balancing reasoning accuracy with user-facing latency.

Ax Szymon Pawlonka, Miko{\l}aj Ma{\l}ki\'nski, Jacek Ma\'ndziuk 2/20/2026

Bongard-RWR+: Real-World Representations of Fine-Grained Concepts in Bongard Problems

Bongard Problems benchmark using real-world images to test abstract visual reasoning and fine-grained concept identification in models.

Ax Mahdi Farahbakhsh, Vishnu Teja Kunde, Dileep Kalathil, Krishna Narayanan, Jean-Francois Chamberland 2/20/2026

Inference-Time Search Using Side Information for Diffusion-Based Image Reconstruction

Proposes inference-time search algorithm that guides diffusion model sampling with side information for improved image reconstruction in inverse problems.

Ax Yasaman Haghighi, Bastien van Delft, Mariam Hassan, Alexandre Alahi 2/20/2026

LayerSync: Self-aligning Intermediate Layers

LayerSync regularizes diffusion models using their own intermediate layer representations to improve generation quality and training efficiency.

Ax Marisa C. Peczuh, Nischal Ashok Kumar, Ryan Baker, Blair Lehman, Danielle Eisenberg, Caitlin Mills, Payu Wittawatolarn, Kushaan Naskar, Keerthi Chebrolu, Sudhip Nashi, Cadence Young, Brayden Liu, Sherry Lachman, Andrew Lan 2/20/2026

Toward LLM-Supported Automated Assessment of Critical Thinking Subskills

Uses LLMs for automated assessment of critical thinking skills in educational contexts, addressing evaluation of evidence and claim reliability.

Ax Zhuojin Li, Marco Paolieri, Leana Golubchik 2/20/2026

A Study on Inference Latency for Vision Transformers on Mobile Devices

Benchmarks inference latency of 190 Vision Transformers on mobile devices compared to CNNs, analyzing architectural factors affecting performance.

Ax Ahmed Aboulfotouh, Hatem Abou-Zeid 2/20/2026

Multimodal Wireless Foundation Models

Extends wireless foundation models to accept multiple input modalities for improved task performance and adaptation across varying conditions.

Ax Akash Doshi, Pinar Sen, Kirill Ivanov, Wei Yang, June Namgoong, Runxin Wang, Rachel Wang, Taesang Yoo, Jing Jiang, Tingfang Ji 2/20/2026

AI/ML based Joint Source and Channel Coding for HARQ-ACK Payload

Applies transformer-based deep learning for joint source-channel coding of non-uniformly distributed HARQ-ACK bits in wireless communications.

Ax Mozes Jacobs, Thomas Fel, Richard Hakim, Alessandra Brondetta, Demba Ba, T. Andy Keller 2/20/2026

Block-Recurrent Dynamics in Vision Transformers

Introduces Block-Recurrent Hypothesis explaining Vision Transformer depth as block-recurrent computational flow for mechanistic interpretation.

Ax Sijia li, Xinran Li, Shibo Chen, Jun Zhang 2/20/2026

Puzzle it Out: Local-to-Global World Model for Offline Multi-Agent Reinforcement Learning

Proposes world model approach for offline multi-agent RL using local-to-global puzzle solving to overcome conservative policies and improve generalization.

Ax Yiyao Yang 2/20/2026

Beyond Predictive Uncertainty: Reliable Representation Learning with Structural Constraints

Framework for treating representation reliability as a first-class property in machine learning, beyond traditional predictive uncertainty quantification.

Ax Amanuel Anteneh, Kyungeun Kim, J. M. Schwarz, Israel Klich, Olivier Pfister 2/20/2026

Laser interferometry as a robust neuromorphic platform for machine learning

Proposes implementing optical neural networks using linear optical resources and phase-shift encoding for neuromorphic machine learning hardware.

Ax Yongxin Deng, Zhen Fang, Sharon Li, Ling Chen 2/20/2026

Beyond In-Domain Detection: SpikeScore for Cross-Domain Hallucination Detection

SpikeScore method for detecting LLM hallucinations that generalizes across domains, addressing the gap in cross-domain hallucination detection for real-world deployment.

Ax Kapilan Balagopalan, Yinan Li, Yao Zhao, Tuan Nguyen, Anton Daitche, Houssam Nassif, Kwang-Sung Jun 2/20/2026

Fixed Budget is No Harder Than Fixed Confidence in Best-Arm Identification up to Logarithmic Factors

Theoretical analysis showing fixed-budget and fixed-confidence best-arm identification in K-armed bandits have equivalent optimal sample complexities up to logarithmic factors.

Ax Luciano Melodia 2/20/2026

Universal Coefficients and Mayer-Vietoris Sequence for Groupoid Homology

Mathematical study of homology in ample groupoids using Moore complexes and continuous étale homomorphisms, with Mayer-Vietoris sequences.

Ax Muhammad J. Alahmadi, Peng Gao, Feiyi Wang, Dongkuan Xu 2/20/2026

Accelerating Large-Scale Dataset Distillation via Exploration-Exploitation Optimization

Proposes exploration-exploitation optimization for dataset distillation to compress large datasets into synthetic versions while maintaining model performance.

Ax Ha Na Cho, Sairam Sutari, Alexander Lopez, Hansen Bow, Kai Zheng 2/20/2026

Building Safe and Deployable Clinical Natural Language Processing under Temporal Leakage Constraints

Addresses temporal leakage in clinical NLP models for discharge planning, proposing methods to prevent overconfident predictions from deployment artifacts.

Ax Binchuan Qi 2/20/2026

Conjugate Learning Theory: Uncovering the Mechanisms of Trainability and Generalization in Deep Neural Networks

Conjugate learning theory framework characterizes trainability and generalization of deep neural networks using convex duality and mini-batch SGD analysis.

Ax Sushant Mehta, Logan Ritchie, Suhaas Garre, Nick Heiner, Edwin Chen 2/20/2026

EnterpriseBench Corecraft: Training Generalizable Agents on High-Fidelity RL Environments

CoreCraft is a high-fidelity enterprise RL simulation environment with 2,500+ entities and 23 tools for training generalizable AI agents in customer support scenarios.

Ax Nivya Talokar, Ayush K Tarun, Murari Mandal, Maksym Andriushchenko, Antoine Bosselut 2/20/2026

Helpful to a Fault: Measuring Illicit Assistance in Multi-Turn, Multilingual LLM Agents

STING benchmark measures how LLM agents can be misused over multiple turns and across languages to assist with illegal tasks, testing multi-step harmful goal execution.

Ax Yixue Zhang, Kun Wu, Zhi Gao, Zhen Zhao, Pei Ren, Zhiyuan Xu, Fei Liao, Xinhua Wang, Shichao Fan, Di Wu, Qiuxuan Feng, Meng Li, Zhengping Che, Chang Liu, Jian Tang 2/20/2026

RoboGene: Boosting VLA Pre-training via Diversity-Driven Agentic Framework for Real-World Task Generation

RoboGene uses an agentic framework to automatically generate diverse robotic manipulation tasks for training vision-language-action models, addressing data scarcity in robot learning.

HN novemp 2/20/2026

AI Impact Summit 2026: How we're partnering to make AI work for everyone

Google announces AI Impact Summit 2026 partnerships and investments for broad AI adoption, marketing content with limited technical depth.

HN abliterationai 2/20/2026

Show HN: Berean Labs – Free AI-powered penetration testing for web apps

Berean Labs open-source autonomous AI penetration testing tool for detecting client-side vulnerabilities, exposed secrets, and web app misconfigurations.

HN mickamy 2/20/2026

Show HN: SQL-tap now has a browser-based Web UI for real-time SQL monitoring

SQL-tap: transparent SQL proxy with new browser-based Web UI for real-time query inspection, EXPLAIN, filtering, and analysis.

HN greesil 2/20/2026

California introduces a bill (AB-2047) that will limit the use of 3D printers

California legislative bill on 3D printer firearm prevention technology, not AI-related.

HN yakshithk_ 2/20/2026

Theres no mainstream AI video editing tool?

User asks about AI video editing tools availability. Low-effort question without substantive content or research.