Isolater - Feed

Ax Min Zhang 3/26/2026

Language-Guided Structure-Aware Network for Camouflaged Object Detection

Language-guided network for camouflaged object detection in computer vision using textual semantic priors.

Ax Xiangsen Chen, Ruilong Wu, Yanyan Lan, Ting Ma, Yang Liu 3/26/2026

MolEvolve: LLM-Guided Evolutionary Search for Interpretable Molecular Optimization

MolEvolve combines LLM guidance with evolutionary search for interpretable molecular optimization, addressing activity cliffs.

Ax Xingming Li, Runke Huang, Yanan Bao, Yuye Jin, Yuru Jiao, Qingyong Hu 3/26/2026

When AI Meets Early Childhood Education: Large Language Models as Assessment Teammates in Chinese Preschools

LLMs assess teacher-child interactions in Chinese preschools for scalable early childhood education monitoring.

Ax Bjørnar Vassøy, Benjamin Kille, Helge Langseth 3/26/2026

Exploring How Fair Model Representations Relate to Fair Recommendations

Studies fairness in recommender systems, examining the relationship between fair model representations and fair recommendations.

Ax Songyang Liu, Chaozhuo Li, Chenxu Wang, Jinyu Hou, Zejian Chen, Litian Zhang, Zheng Liu, Qiwei Ye, Yiming Hei, Xi Zhang, Zhongyuan Wang 3/26/2026

ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers

ClawKeeper adds safety mechanisms to OpenClaw autonomous agent runtime, addressing vulnerabilities in tool integration and command execution.

Ax Ben Chen, Siyuan Wang, Yufei Ma, Zihan Liang, Xuxin Zhang, Yue Lv, Ying Yang, Huangyu Dai, Lingtao Mao, Tong Zhao, Zhipeng Qian, Xinyu Sun, Zhixin Zhai, Yang Zhao, Bochao Liu, Jingshan Lv, Xiao Liang, Hui Kong, Jing Chen, Han Li, Chenyi Lei, Wenwu Ou, Kun Gai 3/26/2026

OneSearch-V2: The Latent Reasoning Enhanced Self-distillation Generative Search Framework

OneSearch-V2 improves generative retrieval for search systems with latent reasoning and self-distillation. Industrial-scale framework.

Ax Xiangru Jian, Shravan Nayak, Kevin Qinghong Lin, Aarash Feizi, Kaixin Li, Patrice Bechard, Spandana Gella, Sai Rajeswar 3/26/2026

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Large-scale annotated video demonstration dataset for computer-use agents enabling automation of complex desktop workflows with continuous video sequences.

Ax Domenique Zipperling, Lukas Schmidt, Benedikt Hahn, Niklas Kühl, Steven Kimbrough 3/26/2026

Integrating Causal Machine Learning into Clinical Decision Support Systems: Insights from Literature and Practice

Integration of causal machine learning into clinical decision support systems with clinician-facing interfaces for interpretable treatment-specific reasoning.

Ax Badri Narayana Patro 3/26/2026

Counting Without Numbers & Finding Without Words

Multimodal system for reuniting lost pets using animal vocalizations and cognitive science insights beyond appearance-only matching.

Ax Alexander Panfilov, Peter Romov, Igor Shilov, Yves-Alexandre de Montjoye, Jonas Geiping, Maksym Andriushchenko 3/26/2026

Claudini: Autoresearch Discovers State-of-the-Art Adversarial Attack Algorithms for LLMs

Autoresearch pipeline using Claude Code LLM agent to autonomously discover novel white-box adversarial attack algorithms outperforming 30+ existing methods.

Ax Emily Schiller, Teodor Chiaburu, Marco Zullich, Luca Longo 3/26/2026

No Single Metric Tells the Whole Story: A Multi-Dimensional Evaluation Framework for Uncertainty Attributions

Multi-dimensional evaluation framework for uncertainty attribution methods in explainable AI with aligned proxy tasks and metrics.

Ax Zichuan Lin, Feiyu Liu, Yijun Yang, Jiafei Lyu, Yiming Gao, Yicheng Liu, Zhicong Lu, Yangbin Yu, Mingyu Yang, Junyou Li, Deheng Ye, Jie Jiang 3/26/2026

UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience

Mobile GUI agent using rejection fine-tuning to learn from failed trajectories and improve credit assignment for long-horizon tasks.

Ax Florian Stilz, Vinkle Srivastav, Nassir Navab, Nicolas Padoy 3/26/2026

CliPPER: Contextual Video-Language Pretraining on Long-form Intraoperative Surgical Procedures for Event Recognition

Video-language foundation model pretraining on surgical procedure videos for zero-shot event recognition in intraoperative settings.

Ax Fanjun Bu, Chenyang Yuan, Hiroshi Yasuda 3/26/2026

SEGAR: Selective Enhancement for Generative Augmented Reality

Framework combining diffusion-based world models with selective enhancement for temporally coherent augmented reality applications.

Ax Dana Serditova, Kevin Tang 3/26/2026

A Sociolinguistic Analysis of Automatic Speech Recognition Bias in Newcastle English

Sociolinguistic analysis of bias in automatic speech recognition systems using Newcastle English dialect data.

Ax Samuel Taiwo, Mohd Amaluddin Yusoff 3/26/2026

Evaluating Chunking Strategies For Retrieval-Augmented Generation in Oil and Gas Enterprise Documents

Empirical study comparing chunking strategies for RAG systems in oil and gas documents, evaluating fixed-size, recursive, semantic, and structure-aware approaches.

Ax Keliang Li, Yansong Li, Hongze Shen, Mengdi Liu, Hong Chang, Shiguang Shan 3/26/2026

LensWalk: Agentic Video Understanding by Planning How You See in Videos

Agentic video understanding framework using Vision-Language Models with active planning to seek evidence from raw video during reasoning.

Ax Martin Jaraiz 3/26/2026

The Free-Market Algorithm: Self-Organizing Optimization for Open-Ended Complex Systems

Free-Market Algorithm metaheuristic using distributed supply-and-demand dynamics for open-ended optimization with emergent fitness.

Ax Duc Vu, Anh Nguyen, Chi Tran, Anh Tran 3/26/2026

Anti-I2V: Safeguarding your photos from malicious image-to-video generation

Adversarial attack methods to protect images from malicious diffusion-based image-to-video generation models.

Ax Qijia He, Xunmei Liu, Hammaad Memon, Ziang Li, Zixian Ma, Jaemin Cho, Jason Ren, Daniel S Weld, Ranjay Krishna 3/26/2026

VFIG: Vectorizing Complex Figures in SVG with Vision-Language Models

Vision-Language Models for converting rasterized figures into editable SVG vector graphics automatically.

Ax Saahil Mathur, Ryan David Rittner, Vedant Ajit Thakur, Daniel Stuart Schiff, Tunazzina Islam 3/26/2026

Retrieval Improvements Do Not Guarantee Better Answers: A Study of RAG for AI Policy QA

Study of RAG systems applied to AI policy analysis using AGORA corpus, examining reliability challenges in dense legal language domains.

Ax Debodeep Banerjee, Stefano Teso, Burcu Sayin, Andrea Passerini 3/26/2026

Learning To Guide Human Decision Makers With Vision-Language Models

Vision-Language Models for supporting human decision-making in high-stakes domains like medical diagnosis through collaborative human-AI systems.

Ax Soumyadeep Dhar 3/26/2026

The Collaboration Paradox: Why Generative AI Requires Both Strategic Intelligence and Operational Stability in Supply Chain Management

Study of AI agents powered by LLMs in multi-echelon supply chain simulation investigating emergent strategic behavior and dynamics like the bullwhip effect.

Ax Jessica M. Lundin, Usman Nasir Nakakana, Guillaume Chabot-Couture 3/26/2026

From Guidelines to Guarantees: A Graph-Based Evaluation Harness for Domain-Specific Evaluation of LLMs

Graph-based evaluation framework for domain-specific LLM benchmarking using clinical guidelines transformed into queryable knowledge graphs with dynamic query instantiation.

Ax Shichao Weng, Zhiqiang Wang, Yuhua Zhou, Rui Lu, Ting Liu, Zhiyang Teng, Xiaozhang Liu, Hanmeng Liu 3/26/2026

GeoSketch: A Neural-Symbolic Approach to Geometric Multimodal Reasoning with Auxiliary Line Construction and Affine Transformation

GeoSketch: Neural-symbolic approach for geometric reasoning in MLLMs using auxiliary line construction and affine transformations for problem solving.

Ax Chenwei Tang, Lin Long, Xinyu Liu, Jingyu Xing, Zizhou Wang, Joey Tianyi Zhou, Jiawei Du, Liangli Zhen, Jiancheng Lv 3/26/2026

SAG-Agent: Enabling Long-Horizon Reasoning in Strategy Games via Dynamic Knowledge Graphs

SAG-Agent: LLM-based agent using dynamic knowledge graphs for long-horizon reasoning in strategy games via GUI interaction without APIs.

Ax Xiaohan Zhang, Tian Gao, Mingyue Cheng, Bokai Pan, Ze Guo, Yaguo Liu, Xiaoyu Tao, Qi Liu 3/26/2026

CastMind: An Interaction-Driven Agentic Reasoning Framework for Cognition-Inspired Time Series Forecasting

CastMind: Agentic reasoning framework for time series forecasting using iterative refinement with temporal features, domain knowledge, and case-based references.

Ax Yan Chen, Yu Zou, Jialei Zeng, Haoran You, Xiaorui Zhou, Aixi Zhong 3/26/2026

Pharos-ESG: A Framework for Multimodal Parsing, Contextual Narration, and Hierarchical Labeling of ESG Report

Pharos-ESG: Multimodal framework for parsing and labeling ESG reports with hierarchical document understanding and narrative generation.

Ax Qihao Liu, Luoxin Ye, Wufei Ma, Yu-Cheng Chou, Alan Yuille 3/26/2026

Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning

Generative Adversarial Reasoner: Framework using adversarial RL to improve LLM reasoning capabilities and reduce mathematical errors through co-evolved reasoner-discriminator training.

Ax Xinyu Zhu, Yuzhu Cai, Zexi Liu, Bingyang Zheng, Cheng Wang, Rui Ye, Yuzhi Zhang, Linfeng Zhang, Weinan E, Siheng Chen, Yanfeng Wang 3/26/2026

Toward Ultra-Long-Horizon Agentic Science: Cognitive Accumulation for Machine Learning Engineering

Research on enabling ultra-long-horizon autonomous agents with cognitive accumulation for multi-week ML engineering experiments.

Ax Dingyi Yang, Junqi Zhao, Xue Li, Ce Li, Boyang Li 3/26/2026

Are LLMs Smarter Than Chimpanzees? An Evaluation on Perspective Taking and Knowledge State Estimation

Evaluates LLM performance on perspective-taking and knowledge state estimation tasks comparing cognitive abilities to chimpanzees.

Ax Jingyu Li, Zhaocheng Du, Qianhui Zhu, Kaiyuan Li, Zhicheng Zhang, Song-Li Wu, Chaolang Li, Pengwen Dai 3/26/2026

CollectiveKV: Decoupling and Sharing Collaborative Information in Sequential Recommendation

CollectiveKV framework reduces inference latency in Transformer-based sequential recommendation systems through KV cache optimization.

Ax Reva Schwartz, Carina Westling, Morgan Briggs, Marzieh Fadaee, Isar Nejadgholi, Matthew Holmes, Fariza Rashid, Maya Carlyle, Afaf Taïk, Kyra Wilson, Peter Douglas, Theodora Skeadas, Gabriella Waters, Rumman Chowdhury, Thiago Lacerda 3/26/2026

CIRCLE: A Framework for Evaluating AI from a Real-World Lens

CIRCLE framework for evaluating AI systems across six lifecycle stages, bridging the gap between benchmarks and real-world deployment outcomes.

Ax Zhiyu Ni, Yifeng Xiao, Zheng Liang 3/26/2026

Agentified Assessment of Logical Reasoning Agents

Framework for evaluating logical reasoning agents with agentified assessment, standardized interfaces, and structured failure tracking.

Ax Christian Greisinger, Steffen Eger 3/26/2026

TikZilla: Scaling Text-to-TikZ with High-Quality Data and Reinforcement Learning

TikZilla: Dataset and reinforcement learning approach for scaling text-to-TikZ scientific figure generation from high-quality training data.

Ax Yan Zhang, Simiao Ren, Ankit Raj, En Wei, Dennis Ng, Alex Shen, Jiayu Xue, Yuxin Zhang, Evelyn Marotta 3/26/2026

GPT4o-Receipt: A Dataset and Human Study for AI-Generated Document Forensics

GPT4o-Receipt: Benchmark of 1,235 receipt images comparing AI-generated vs authentic documents evaluated by LLMs and humans.

Ax Vishnu Narayanan Anilkumar, Abhijith Sreesylesh Babu, Trieu Hai Vo, Mohankrishna Kolla, Alexander Cuneo 3/26/2026

Relationship-Aware Safety Unlearning for Multimodal LLMs

Framework for relationship-aware safety unlearning in multimodal LLMs addressing relational safety failures without collateral damage.

Ax Shuai Wang, Dhasarathy Parthasarathy, Robert Feldt, Yinan Yu 3/26/2026

DomAgent: Leveraging Knowledge Graphs and Case-Based Reasoning for Domain-Specific Code Generation

DomAgent: Framework combining knowledge graphs and case-based reasoning with LLMs for domain-specific code generation tasks.

Ax Zining Fang, Chunhui Liu, Bin Xu, Ming Chen, Xiaowei Hu, Cheng Xue 3/26/2026

PhySe-RPO: Physics and Semantics Guided Relative Policy Optimization for Diffusion-Based Surgical Smoke Removal

PhySe-RPO: Diffusion-based framework for surgical smoke removal using physics and semantics-guided relative policy optimization.

Ax Hao Wang, Zhichao Chen, Zhaoran Liu, Haozhe Li, Degui Yang, Xinggao Liu, Haoxuan Li 3/26/2026

Entire Space Counterfactual Learning for Reliable Content Recommendations

Counterfactual learning approach for CVR estimation in recommender systems addressing data sparsity and sample selection bias.

Ax Huaming Du, Cancan Feng, Yuqian Lei, Chenyang Zhang, Guisong Liu, Gang Kou, Carl Yang, Yu Zhao 3/26/2026

A Comprehensive Survey on Enterprise Financial Risk Analysis from Big Data and LLMs Perspective

Survey on enterprise financial risk analysis using big data and LLM technologies for financial prediction and management.

Ax Dmitrii Krylov, Armin Karamzade, Roy Fox 3/26/2026

Moonwalk: Inverse-Forward Differentiation

Moonwalk: Inverse-forward differentiation technique addressing backpropagation's memory limitation for training deeper neural networks.

Ax Weisheng Gong, Chen He, Kaijie Su, Qingyong Li, Tong Wu, Z. Jane Wang 3/26/2026

DIDLM: A SLAM Dataset for Difficult Scenarios Featuring Infrared, Depth Cameras, LIDAR, 4D Radar, and Others under Adverse Weather, Low Light Conditions, and Rough Roads

DIDLM: Multi-sensor SLAM dataset with infrared, depth, LiDAR, 4D radar for adverse weather and low-light robotic navigation scenarios.

Ax Arthur Jacot, Alexandre Kaiser 3/26/2026

Hamiltonian Mechanics of Feature Learning: Bottleneck Structure in Leaky ResNets

Theoretical analysis of feature learning in Leaky ResNets using Hamiltonian mechanics and representation geodesics.

Ax Hao Wang, Zhichao Chen, Zhaoran Liu, Xu Chen, Haoxuan Li, Zhouchen Lin 3/26/2026

Proximity Matters: Local Proximity Enhanced Balancing for Treatment Effect Estimation

Method for heterogeneous treatment effect estimation from observational data using local proximity balancing to reduce treatment selection bias.

Ax Aleksei Staroverov, Muhammad Alhaddad, Aditya Narendra, Konstantin Mironov, Aleksandr Panov 3/26/2026

Dynamic Neural Potential Field: Online Trajectory Optimization in the Presence of Moving Obstacles

Dynamic Neural Potential Field: Learning-enhanced MPC framework coupling Transformer-based predictor with classical optimization for robot obstacle avoidance.

Ax Kaixi Bao, Chenhao Li, Yarden As, Andreas Krause, Marco Hutter 3/26/2026

Symmetry-Guided Memory Augmentation for Efficient Locomotion Learning

SGMA: Framework combining symmetry-guided experience augmentation and memory inference to improve reinforcement learning efficiency for legged locomotion.

Ax Nina Corvelo Benz, Stratis Tsirtsis, Eleni Straitouri, Ivi Chatzi, Ander Artola Velasco, Suhas Thejaswi, Manuel Gomez-Rodriguez 3/26/2026

Evaluation of Large Language Models via Coupled Token Generation

Evaluation framework for large language models addressing randomization in coupled token generation with a causal modeling approach.

Ax Yifeng Zhang, Yilin Liu, Ping Gong, Peizhuo Li, Mingfeng Fan, Guillaume Sartoretti 3/26/2026

Unicorn: A Universal and Collaborative Reinforcement Learning Approach Towards Generalizable Network-Wide Traffic Signal Control

Unicorn: Multi-agent reinforcement learning approach for adaptive traffic signal control in heterogeneous urban networks.

Ax Merkourios Simos, Alberto Silvio Chiappa, Alexander Mathis 3/26/2026

KINESIS: Motion Imitation for Human Musculoskeletal Locomotion

KINESIS: Model-free reinforcement learning framework for human motion imitation with musculoskeletal constraints and biomechanical joint modeling.