Isolater - Feed

Ax Itamar Tsayag, Ofir Lindenbaum 3/11/2026

Uncovering a Winning Lottery Ticket with Continuously Relaxed Bernoulli Gates

Method for finding sparse subnetworks in neural networks using continuously relaxed Bernoulli gates, improving lottery ticket hypothesis efficiency.

Ax Ronald Sielinski 3/11/2026

Quantifying Uncertainty in AI Visibility: A Statistical Framework for Generative Search Measurement

Research framework quantifying uncertainty in AI visibility metrics for generative search, addressing non-deterministic citation behavior.

Ax Heesup Yun, Isaac Kazuo Uyehara, Earl Ranario, Lars Lundqvist, Christine H. Diepenbrock, Brian N. Bailey, J. Mason Earles 3/11/2026

Using Vision Language Foundation Models to Generate Plant Simulation Configurations via In-Context Learning

Benchmark evaluating vision language models on generating plant simulation configurations for digital twins via in-context learning.

Ax Abdul Rehman Akbar, Samuel Wales-McGrath, Alejadro Levya, Lina Gokhale, Rajendra Singh, Wei Chen, Anil Parwani, Muhammad Khalid Khan Niazi 3/11/2026

PathoScribe: Transforming Pathology Data into a Living Library with a Unified LLM-Driven Framework for Semantic Retrieval and Clinical Integration

PathoScribe framework using LLMs for semantic retrieval and clinical reasoning over pathology reports to unlock institutional knowledge.

Ax Hezhao Zhang, Huang-Cheng Chou, Shrikanth Narayanan, Thomas Hain 3/11/2026

VoxEmo: Benchmarking Speech Emotion Recognition with Speech LLMs

VoxEmo benchmark for evaluating speech emotion recognition using speech LLMs with generative interfaces, addressing prompt sensitivity and emotional ambiguity.

Ax Pranav Mantini, Shishir K. Shah 3/11/2026

BiCLIP: Domain Canonicalization via Structured Geometric Transformation

BiCLIP extends vision-language models to specialized domains using structured geometric transformations based on canonical relationships.

Ax Yuxin Tang, Zhiyuan Xin, Zhimin Ding, Xinyu Yao, Daniel Bourgeois, Tirthak Patel, Chris Jermaine 3/11/2026

Automated Tensor-Relational Decomposition for Large-Scale Sparse Tensor Computation

Automated tensor-relational decomposition method for large-scale sparse tensor computation on relational database systems.

Ax Edward Izgorodin 3/11/2026

Semantic Level of Detail: Multi-Scale Knowledge Representation via Heat Kernel Diffusion on Hyperbolic Manifolds

Semantic Level of Detail (SLoD) framework enables continuous resolution control for knowledge graphs in AI memory systems via heat kernel diffusion.

Ax Tony Mason 3/11/2026

Arbiter: Detecting Interference in LLM Agent System Prompts

Arbiter framework detects interference patterns in LLM coding agent system prompts using formal evaluation rules, tested on Claude Code, Codex CLI, and Gemini CLI.

Ax Tam Nguyen, Moses Ndebugre, Dheeraj Arremsetty 3/11/2026

Security Considerations for Multi-agent Systems

Security framework identifying distinct vulnerabilities in multi-agent systems with delegated tool authority and inter-agent communication.

Ax Aishwarya Fursule, Shruti Kshirsagar, Anderson R. Avila 3/11/2026

Gender Fairness in Audio Deepfake Detection: Performance and Disparity Analysis

Analysis of gender fairness disparities in audio deepfake detection systems.

Ax Bhada Yun, Evgenia Taranova, Dana Feng, Renn Su, April Yi Wang 3/11/2026

AI Phenomenology for Understanding Human-AI Experiences Across Eras

AI phenomenology framework examining subjective human-AI experiences beyond performance metrics and usability scales.

Ax Tony Mason 3/11/2026

The Missing Memory Hierarchy: Demand Paging for LLM Context Windows

Research framing LLM context windows as L1 cache; proposes demand paging and virtual memory hierarchy for efficient token reuse.

Ax Janakan Sivaloganathan, Ainaz Jamshidi, Andriy Miranskyy, Lei Zhang 3/11/2026

Automating Detection and Root-Cause Analysis of Flaky Tests in Quantum Software

Automated detection and root-cause analysis pipeline for flaky tests in quantum software systems.

Ax Tenny Yin, Zhiting Mei, Zhonghe Zheng, Miyu Yamane, David Wang, Jade Sceats, Samuel M. Bateman, Lihan Zha, Apurva Badithela, Ola Shorinwa, Anirudha Majumdar 3/11/2026

PlayWorld: Learning Robot World Models from Autonomous Play

PlayWorld: autonomous pipeline training action-conditioned video models for robot simulators from large-scale datasets.

Ax Zekun Long, Ali Zia, Guanyiman Fu, Vivien Rolland, Jun Zhou 3/11/2026

WS-Net: Weak-Signal Representation Learning and Gated Abundance Reconstruction for Hyperspectral Unmixing via State-Space and Weak Signal Attention Fusion

WS-Net uses state-space modeling and attention mechanisms for hyperspectral image unmixing, addressing weak signal collapse in abundance estimation tasks.

Ax Hongyu Cao, Jinghan Zhang, Kunpeng Liu, Dongjie Wang, Feng Xia, Haifeng Chen, Xiaohua Hu, Yanjie Fu 3/11/2026

Sim2Act: Robust Simulation-to-Decision Learning via Adversarial Calibration and Group-Relative Perturbation

Sim2Act improves simulation-to-reality transfer for robot policies using adversarial calibration and perturbation methods to handle prediction errors in decision-critical regions.

Ax Xingyu Bruce Liu, Mira Dontcheva, Dingzeyu Li 3/11/2026

A Text-Native Interface for Generative Video Authoring

Doki is a text-native interface for generative video creation, enabling users to author videos through natural language writing instead of specialized video editing tools.

Ax Md Selim Sarowar, Omer Tariq, Sungho Kim 3/11/2026

GST-VLA: Structured Gaussian Spatial Tokens for 3D Depth-Aware Vision-Language-Action Models

GST-VLA introduces Gaussian spatial tokenization for vision-language-action models, adding 3D geometric structure awareness to improve robot perception and decision-making.

Ax Alvaro Paredes Amorin, Andre Python, Christoph Weisser 3/11/2026

Not All News Is Equal: Topic- and Event-Conditional Sentiment from Finetuned LLMs for Aluminum Price Forecasting

Finetuned LLMs extract sentiment signals from textual data to forecast aluminum commodity prices, exploring when these signals are most predictive.

Ax Rongxiang Zeng, Yongqi Dong 3/11/2026

Latent World Models for Automated Driving: A Unified Taxonomy, Evaluation Framework, and Open Challenges

Survey paper on latent world models and vision-language-action systems for autonomous driving, covering taxonomy, evaluation frameworks, and challenges.

Ax Yuheng Wang, Yuji Lin, Dongrun Zhu, Jiayue Cai, Sunil Kalia, Harvey Lui, Chunqi Chang, Z. Jane Wang, Tim K. Lee 3/11/2026

Composed Vision-Language Retrieval for Skin Cancer Case Search via Joint Alignment of Global and Local Representations

Vision-language retrieval framework for skin cancer case search using composed image-text queries with global and local representation alignment.

Ax Xiyao Wang, Xiaoyu Tan, Yang Dai, Yuxuan Fu, Shuo Li, Xihe Qiu 3/11/2026

VIVID-Med: LLM-Supervised Structured Pretraining for Deployable Medical ViTs

VIVID-Med uses frozen LLMs as structured semantic teachers for pretraining medical vision transformers, improving clinical image analysis.

Ax Jiang Gao, Xiangyu Dong, Haozhou Li, Haoran Zhao, Yaoming Zhou, Xiaoguang Ma 3/11/2026

PM-Nav: Priori-Map Guided Embodied Navigation in Functional Buildings

Language-driven embodied navigation system using semantic priori-maps and chain-of-thought prompting for functional buildings.

Ax Yifan Han, Zhongxi Chen, Yuxuan Zhao, Congsheng Xu, Yanming Shao, Yichuan Peng, Yao Mu, Wenzhao Lian 3/11/2026

DexHiL: A Human-in-the-Loop Framework for Vision-Language-Action Model Post-Training in Dexterous Manipulation

Human-in-the-loop framework for post-training vision-language-action models in robotic dexterous manipulation tasks.

Ax Junjie Yin, Jiaju Li, Hanfa Xing 3/11/2026

QUSR: Quality-Aware and Uncertainty-Guided Image Super-Resolution Diffusion Model

Diffusion model for image super-resolution with quality-aware and uncertainty-guided modules for real-world degradation.

Ax Zhen Zhang, Jielei Chu, Tianrui Li 3/11/2026

Causally Sufficient and Necessary Feature Expansion for Class-Incremental Learning

Research on mitigating catastrophic forgetting in class-incremental learning using causal feature expansion methods.

Ax Tzu-Heng Huang, Sirajul Salekin, Javier Movellan, Frederic Sala, Manjot Bilkhu 3/11/2026

RubiCap: Rubric-Guided Reinforcement Learning for Dense Image Captioning

Rubric-guided reinforcement learning framework for dense image captioning that improves diversity and generalization over supervised distillation from VLMs.

Ax Siyang Cai, Cangyuan Li, Yinhe Han, Ying Wang 3/11/2026

Wrong Code, Right Structure: Learning Netlist Representations from Imperfect LLM-Generated RTL

Method learning netlist representations from LLM-generated imperfect RTL code, scaling beyond small circuits using self-correction and structural learning.

Ax Jie Li, Qishun Yang, Nuo Li 3/11/2026

GIAT: A Geologically-Informed Attention Transformer for Lithology Identification

Geologically-informed attention transformer for lithology identification from well logs, integrating domain priors with interpretable deep learning.

Ax Haoran Yang, Jiacheng Bao, Yucheng Xin, Haoming Song, Yuyang Tian, Bin Zhao, Dong Wang, Xuelong Li 3/11/2026

ZeroWBC: Learning Natural Visuomotor Humanoid Control Directly from Human Egocentric Video

Visuomotor control for humanoid robots learning natural whole-body behaviors from human egocentric video without expensive teleoperation data.

Ax Jianing Yang, Yusuke Fujita, Yui Sudo 3/11/2026

DuplexCascade: Full-Duplex Speech-to-Speech Dialogue with VAD-Free Cascaded ASR-LLM-TTS Pipeline and Micro-Turn Optimization

Full-duplex speech-to-speech dialogue system combining cascaded ASR-LLM-TTS without VAD segmentation, enabling natural conversational interaction.

Ax Lina Berrayana, Ahmed Heakl, Abdullah Sohail, Thomas Hofmann, Salman Khan, Wei Chen 3/11/2026

Latent-DARM: Bridging Discrete Diffusion And Autoregressive Models For Reasoning

Framework bridging discrete diffusion language models with autoregressive models to enable non-sequential global reasoning and plan revision in multi-agent systems.

Ax Benjamin Reichman, Adar Avasian, Samuel Webster, Larry Heck 3/11/2026

Emotion is Not Just a Label: Latent Emotional Factors in LLM Processing

Study analyzing emotion as a latent representational factor in LLM reasoning and attention mechanisms, rather than just a prediction target.

Ax Chenhui Zuo, Jinhao Xu, Michael Qian Vergnolle, Yanan Sui 3/11/2026

Embodied Human Simulation for Quantitative Design and Analysis of Interactive Robotics

Simulation framework for analyzing human-robot interaction dynamics by modeling human biomechanics and motor responses in physical collaborative systems.

Ax Shuang Liu, Ao Yu, Linkang Cheng, Xiwen Huang, Li Zhao, Junhui Liu, Zhiting Lin, Yu Liu 3/11/2026

BridgeDiff: Bridging Human Observations and Flat-Garment Synthesis for Virtual Try-Off

Virtual try-off system reconstructing flat-garment representations from dressed person images by bridging on-body appearance and canonical layouts.

Ax Kanishkha Jaisankar, Pranav M. Pawar, Diana Susane Joseph, Raja Muthalagu, Mithun Mukherjee 3/11/2026

Multi-model approach for autonomous driving: A comprehensive study on traffic sign-, vehicle- and lane detection and behavioral cloning

Deep learning approach combining traffic sign, vehicle, and lane detection with behavioral cloning for autonomous vehicle perception and control.

Ax Jann Krausse, Zhe Su, Kyrus Mama, Maryada, Klaus Knobloch, Giacomo Indiveri, J\"urgen Becker 3/11/2026