Isolater - Feed

Ax Boya Xiong, Shuo Wang, Weifeng Ge, Guanhua Chen, Yun Chen 2/17/2026

Enhancing Delta Compression in LLMs via SVD-based Quantization Error Minimization

SVD-based quantization for compressing fine-tuned LLM delta parameters with minimized quantization error for storage efficiency.

Ax Xingyue Huang, Mikhail Galkin, Michael M. Bronstein, \.Ismail \.Ilkan Ceylan 2/17/2026

HYPER: A Foundation Model for Inductive Link Prediction with Knowledge Hypergraphs

HYPER foundation model for inductive link prediction with knowledge hypergraphs handling novel entities and relation types.

Ax Hen Davidov, Shai Feldman, Gilad Freidkin, Yaniv Romano 2/17/2026

Calibrated Predictive Lower Bounds on Time-to-Unsafe-Sampling in LLMs

Novel safety measure for LLMs measuring generations needed to trigger unsafe responses, with calibrated predictive bounds for evaluation.

Ax Hanyu Pei, Jing-Xiao Liao, Qibin Zhao, Ting Gao, Shijun Zhang, Xiaoge Zhang, Feng-Lei Fan 2/17/2026

NeuronSeek: On Stability and Expressivity of Task-driven Neurons

NeuronSeek uses symbolic regression to discover optimal neuron formulations and construct task-driven neural networks.

Ax Fabio Merizzi, Harilaos Loukos 2/17/2026

Vision Transformers for Multi-Variable Climate Downscaling: Emulating Regional Climate Models with a Shared Encoder and Multi-Decoder Architecture

Vision Transformers for climate downscaling using shared encoder and multi-decoder architecture to emulate regional climate models.

Ax Yuta Sato, Kazuhiko Kawamoto, Hiroshi Kera 2/17/2026

Chain of Thought in Order: Discovering Learning-Friendly Orders for Arithmetic

Studies optimal ordering of intermediate steps in chain-of-thought reasoning for arithmetic tasks, improving learning efficiency.

Ax Xiaohang Tang, Rares Dolga, Sangwoong Yoon, Ilija Bogunovic 2/17/2026

wd1: Weighted Policy Optimization for Reasoning in Diffusion Language Models

Weighted policy optimization method for reinforcement learning in diffusion-based LLMs, addressing intractable likelihood approximation.

Ax Yuxi Liu, Konpat Preechakul, Kananart Kuwaranancharoen, Yutong Bai 2/17/2026

The Serial Scaling Hypothesis

Theoretical framework distinguishing inherently sequential problems that cannot be efficiently parallelized, relevant to LLM reasoning.

Ax Joshua Dimasaka, Christian Gei{\ss}, Emily So 2/17/2026

DeepC4: Deep Conditional Census-Constrained Clustering for Large-scale Multitask Spatial Disaggregation of Urban Morphology

Spatial disaggregation clustering method for large-scale urban morphology mapping using Earth observation data.

Ax Xuan Liu, Siru Ouyang, Xianrui Zhong, Jiawei Han, Huimin Zhao 2/17/2026

FGBench: A Dataset and Benchmark for Molecular Property Reasoning at Functional Group-Level in Large Language Models

Benchmark dataset for evaluating LLM reasoning on molecular properties at functional group level for chemistry applications.

Ax Zhaomin Wu, Mingzhe Du, See-Kiong Ng, Bingsheng He 2/17/2026

Beyond Prompt-Induced Lies: Investigating LLM Deception on Benign Prompts

Investigates spontaneous deception in LLMs on benign prompts, revealing trustworthiness risks in reasoning and planning tasks.

Ax Md Sultanul Arifin, Abu Nowshed Sakib, Yeasir Rayhan, Tanzima Hashem 2/17/2026

Lightning Prediction under Uncertainty: DeepLight with Hazy Loss

Deep learning architecture for lightning occurrence prediction using uncertainty-aware loss function.

Ax Jeremy Carleton, Debajoy Mukherjee, Srinivas Shakkottai, Dileep Kalathil 2/17/2026

MAVIS: Multi-Objective Alignment via Inference-Time Value-Guided Selection

Inference-time method for balancing multiple conflicting objectives in LLM outputs without expensive per-objective fine-tuning.

Ax Zayd M. K. Zuhri, Erland Hilman Fuadi, Alham Fikri Aji 2/17/2026

Predicting the Order of Upcoming Tokens Improves Language Modeling

Token order prediction auxiliary objective improves language model performance, offering alternative to multi-token prediction for next-token training.

Ax Arjun Basandrai, Shourya Jain, K. Ilanthenral 2/17/2026

ART: Adaptive Resampling-based Training for Imbalanced Classification

Adaptive resampling method for imbalanced classification that adjusts training data distribution based on class-wise learning difficulty.

Ax Minh Vu, Konstantinos Slavakis 2/17/2026

Online reinforcement learning via sparse Gaussian mixture model Q-functions

Online policy-iteration RL framework using sparse Gaussian mixture model Q-functions with interpretable exploration mechanisms.

Ax Kaiwen Zheng, Huayu Chen, Haotian Ye, Haoxiang Wang, Qinsheng Zhang, Kai Jiang, Hang Su, Stefano Ermon, Jun Zhu, Ming-Yu Liu 2/17/2026

DiffusionNFT: Online Diffusion Reinforcement with Forward Process

Online reinforcement learning method for diffusion models addressing intractable likelihoods, enabling RLHF-style training without solver restrictions.

Ax Binghui Li, Fengling Chen, Zixun Huang, Lean Wang, Lei Wu 2/17/2026

Functional Scaling Laws in Kernel Regression: Loss Dynamics and Learning Rate Schedules

Analyzes scaling laws for loss dynamics and learning rate schedules in SGD on kernel regression, with implications for LLM training.

Ax Rohan Chauhan, Ioannis Panageas 2/17/2026

Learning the Inverse Temperature of Ising Models under Hard Constraints using One Sample

Theoretical study on estimating inverse temperature parameters in truncated Ising models using single samples.

Ax Anton Korznikov, Andrey Galichin, Alexey Dontsov, Oleg Y. Rogov, Ivan Oseledets, Elena Tutubalina 2/17/2026

The Rogue Scalpel: Activation Steering Compromises LLM Safety

Demonstrates that activation steering for LLM control can compromise safety mechanisms, causing models to comply with harmful requests.

Ax Aman Gupta, Rafael Celente, Abhishek Shivanna, D. T. Braithwaite, Gregory Dexter, Shao Tang, Hiroto Udagawa, Daniel Silva, Rohan Ramanath, S. Sathiya Keerthi 2/17/2026

Effective Quantization of Muon Optimizer States

8-bit quantization technique for Muon optimizer states in LLM pre-training, reducing memory overhead while maintaining training efficiency.

Ax Steve Hong, Runa Eschenhagen, Bruno Mlodozeniec, Richard Turner 2/17/2026

Better Hessians Matter: Studying the Impact of Curvature Approximations in Influence Functions

Studies how curvature approximations (GGN, K-FAC) in influence functions affect data attribution accuracy for deep learning models.

Ax Zhaomin Wu, Haodong Zhao, Ziyang Wang, Jizhou Guo, Qian Wang, Bingsheng He 2/17/2026

LLM DNA: Tracing Model Evolution via Functional Representations

Method to trace evolutionary relationships between LLMs through functional representations, enabling better model management and understanding of fine-tuning/distillation lineages.

Ax Patrick Langer, Thomas Kaar, Max Rosenblattl, Maxwell A. Xu, Winnie Chow, Martin Maritsch, Robert Jakob, Ning Wang, Juncheng Liu, Aradhana Verma, Brian Han, Daniel Seung Kim, Henry Chubb, Scott Ceresnak, Aydin Zahedivash, Alexander Tarlochan Singh Sandhu, Fatima Rodriguez, Daniel McDuff, Elgar Fleisch, Oliver Aalami, Filipe Barata, Paul Schmiedmayer 2/17/2026

OpenTSLM: Time-Series Language Models for Reasoning over Multivariate Medical Text- and Time-Series Data

OpenTSLM: time-series language models integrating multivariate medical time-series as native modality. Enables LLMs to handle temporal clinical data.

Ax Steve Hong, Samuel Belkadi 2/17/2026

Multi-scale Autoregressive Models are Laplacian, Discrete, and Latent Diffusion Models in Disguise

arXiv paper: Visual Autoregressive models reinterpreted as Laplacian latent pyramid with learned coarse-to-fine refinement. Formal analysis of design trade-offs.

Ax Yukun Zhang, Xueqing Zhou 2/17/2026

Where to Add PDE Diffusion in Transformers

Research on optimal placement of PDE diffusion layers in transformer architectures using heat equation-based smoothing for local geometric priors.

HN cher-nov 2/17/2026

SciTech SNAP Graphics has gone open source

Legacy SciTech SNAP Graphics device driver codebase released as open source.

HN tuwenbo0120 2/17/2026

Show HN: M-Courtyard – Fine-tune LLMs on your Mac with zero code

Zero-code desktop app for fine-tuning LLMs locally on Apple Silicon with full pipeline from documents to Ollama export.

HN Sean-Der 2/17/2026

Show HN: Broadcast Box – Self-hosted low latency streaming

Self-hosted low-latency streaming application using WebRTC for sub-second broadcast.

HN andyngdz 2/17/2026

Show HN: ExoGen – Open-Source Local Stable Diffusion Client

Privacy-focused desktop app for local Stable Diffusion image generation with HuggingFace model support.

HN JohnnyCode 2/17/2026

DelegateOS: Cryptographic delegation tokens for multi-agent sys(DeepMind paper)

DeepMind's cryptographic delegation protocol for multi-agent systems with accountability chains and MCP integration.

HN Meetvelde 2/17/2026

Almost Every infrastructure decision I endorse or regret after 4 years

Retrospective on infrastructure decisions at startup including cloud provider choice, database selection, and tradeoffs.

HN niraj-agarwal 2/17/2026

LLM-generated skills work, if you generate them afterwards

Research showing LLM-generated skills provide no benefit; models cannot reliably author procedural knowledge they consume.

HN gmays 2/17/2026

Manus AI launched 24/7 Agent via Telegram and got suspended

Manus AI launches persistent AI agents with Telegram integration; account suspended shortly after launch. Platform expansion planned via WhatsApp.

HN spirodonfl 2/17/2026

Show HN: Purely Vibe Coded Asmongold Simulator

Vibe-coded Asmongold simulator game with author expressing skepticism about AI-based code generation.

HN oceanwaves 2/17/2026

Agent-evals: Overlap, boundary, and metacognitive scoring for coding agents

Evaluation framework for coding agents detecting overlap, boundary violations, and coverage gaps via static and live analysis.

HN Shmungus 2/17/2026

Show HN: The first financial intelligence MCP server live trading signals Claude

MCP server providing Claude real-time access to trading signals from Reddit, SEC filings, FDA approvals, and Congressional trades.

HN DoomedWheel1027 2/17/2026

Show HN: Forage – MCP server that lets AI agents find and install their own MCPs

MCP server enabling AI agents to discover, install, and learn new tools automatically without restarts or manual configuration.

HN madawei2699 2/17/2026

Show HN: Constrained DSL for Reliable LLM Decisions

Constrained DSL for generating reliable LLM decision logic with schema-driven prompts and deterministic execution for quantitative tasks.

HN zekejohn 2/17/2026

Show HN: An Open-source React UI library for ASCII animations

React library for ASCII animations that converts video to character grids with performance optimization.

HN tpierce89 2/17/2026

Shard – A Distributed P2P AI Network for Shared Inference

Minimal title-only entry about distributed P2P network for AI inference without substantive content.

HN johnhamlin 2/17/2026

Meta patented an AI that lets you keep posting from beyond the grave

Meta patent allowing LLM simulation of deceased users' social media activity for continued posting.

HN tjco 2/17/2026

Game developers and pixel artists are losing their jobs

Marketing content for generative sprite creation tool for game developers, claiming to replace artist jobs.

HN akbarnama 2/17/2026

India's 'AI Impact Summit' Promises Little More Than Spectacle

Opinion on India's AI policy summit with focus on governance and homelessness concerns.

HN logicprog 2/17/2026

Beating GPT-2 for less than $100 – Andrej Karpathy

Andrej Karpathy explores training language models competitive with GPT-2 for under $100, analyzing cost-effectiveness improvements since 2019.

HN bpolania 2/17/2026

Show HN: Bulwark – Open-source governance layer for AI agents (Rust, MCP-native)

Open-source governance proxy in Rust for controlling AI agent access to tools, providing audit trails and content moderation via MCP.

HN nihalwashere 2/17/2026

Why I Built Reader: Open-source web scraping for LLMs

Open-source web scraper optimized for LLM consumption, cleaning HTML noise for agent accessibility to web content.

HN cpeterso 2/17/2026

New GitHub repository settings to configure pull request access

GitHub repository settings update allowing maintainers to disable or restrict pull requests. Not AI/ML related.

HN dankrieg 2/17/2026

GrowthClaw, Distribution Infrastructure for OpenClaw

GrowthClaw is an open-source marketing operating system for agent-driven workflows, converting goals into task pipelines with human approval and evaluation.

HN VorpalWay 2/17/2026

AI is destroying Open Source, and it's not even good yet

Opinion piece critiquing AI's impact on open source, discussing hallucinations, agent harassment, and OpenAI recruitment.