Federated learning approach for person re-identification that addresses statistical heterogeneity and communication efficiency in privacy-preserving surveillance systems.
Addresses mode collapse in reinforcement learning fine-tuning by introducing polychromic objectives that preserve policy diversity and enable better exploration.
Proposes end-to-end integration of data-driven learning and existing knowledge for predicting transcriptional responses to genetic perturbations in biological systems.
Evaluates whether large vision-language models can effectively guide blind and low-vision individuals, addressing how to measure real-world utility beyond standard metrics.
TempoControl enables fine-grained temporal control in text-to-video generative models, allowing users to specify when visual elements appear in the generated sequence without retraining.
Mathematical analysis of incoherence in goal-conditioned autoregressive models fine-tuned with reinforcement learning.
Multi-agent reasoning framework for interpreting gene clusters in antimicrobial resistance studies using transcriptomic data.
Fair division method for indivisible payoffs in coalitional games using Shapley value.
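A minimal sketch of the exact Shapley value for a small characteristic-function game (the paper's indivisible-payoff allocation rule is not reproduced; the toy game and names below are illustrative):

```python
from itertools import permutations
from math import factorial

def shapley_values(players, v):
    """Exact Shapley value: average each player's marginal contribution
    over all orderings; v maps frozensets of players to coalition payoff."""
    phi = {p: 0.0 for p in players}
    for order in permutations(players):
        coalition = set()
        for p in order:
            before = v(frozenset(coalition))
            coalition.add(p)
            phi[p] += v(frozenset(coalition)) - before
    n_orders = factorial(len(players))
    return {p: total / n_orders for p, total in phi.items()}

# Toy 3-player game: any coalition of two or more players earns 1.
v = lambda S: 1.0 if len(S) >= 2 else 0.0
print(shapley_values(["a", "b", "c"], v))  # symmetric players -> 1/3 each
```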
Conformal prediction framework for assessing correctness of LLM outputs with user-defined tolerance levels.
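For orientation, a minimal split-conformal sketch under an assumed setup: nonconformity scores from a calibration set with known correctness labels yield a quantile threshold that meets the user-chosen tolerance alpha; the scoring rule and names are illustrative rather than the paper's.

```python
import numpy as np

def conformal_threshold(cal_scores, alpha):
    """Split-conformal quantile: a fresh score from the same distribution
    falls at or below this threshold with probability >= 1 - alpha."""
    n = len(cal_scores)
    level = min(np.ceil((n + 1) * (1 - alpha)) / n, 1.0)
    return np.quantile(cal_scores, level, method="higher")

rng = np.random.default_rng(0)
cal_scores = rng.uniform(size=500)        # e.g. 1 - model's correctness score
tau = conformal_threshold(cal_scores, alpha=0.1)

new_score = 0.42
print(tau, new_score <= tau)              # accept as "likely correct" or abstain
```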
Benchmarking framework using embeddings to detect gender bias in LLMs used for educational feedback on student essays.
Multimodal framework for myocardial scar segmentation combining ECG signals with cardiac MRI imaging.
DuoTok source-aware dual-track tokenizer preserving high-fidelity reconstruction, predictability, and cross-track correspondence for music language models.
Study showing structured prompts significantly improve LLM evaluation accuracy and reduce prompt-dependent variance in benchmark frameworks like HELM.
OmniFusion modular approach for simultaneous multilingual multimodal translation combining speech recognition and translation in open-source LLM pipelines.
Lumos framework for formally certifying language model system behaviors using imperative probabilistic programming with graph-based prompt generation.
GPERT framework for event-based 3D Gaussian splatting balancing accuracy and temporal resolution using geometric-photometric event camera data.
Study demonstrating evasive injection techniques that bypass ML-based prompt injection detectors in retrieval-augmented LLM systems.
Analysis showing steering vectors in LLMs are fundamentally non-identifiable with large equivalence classes, questioning interpretability of activation steering methods.
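A toy illustration of the equivalence-class idea under a linear-readout assumption (not the paper's construction): steering vectors that differ by a null-space direction of the downstream map shift activations differently yet produce identical outputs.

```python
import torch

torch.manual_seed(0)
W = torch.randn(8, 16)            # linear readout from a 16-dim hidden state
h = torch.randn(16)               # hidden state to be steered
v = torch.randn(16)               # one candidate steering vector

# Any direction in W's null space can be added to v without changing behavior.
_, _, Vh = torch.linalg.svd(W)    # full SVD: rows 8..15 of Vh span the null space
null_dir = Vh[-1]
v_equiv = v + 3.0 * null_dir      # a different vector, same behavioral effect

print(torch.allclose(W @ (h + v), W @ (h + v_equiv), atol=1e-4))  # True
```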
FIRE reinitialization method balancing stability-plasticity tradeoff in continual learning for deep neural networks through Frobenius-isometry constraints.
Empirical evaluation of LLM-generated ACSL formal specification annotations for C programs, assessing automatic verification without human assistance.
CoCoDiff training-free style transfer framework using diffusion models and correspondence consistency for fine-grained region-wise semantic preservation.
Empirical evaluation of GPTutor LLM tutoring system comparing embedded proof-review feedback versus chatbot support for discrete mathematics learning.
TaCarla comprehensive benchmark dataset for end-to-end autonomous driving, providing perception and planning information for autonomous vehicle research.
SWE-CI benchmark evaluating LLM-powered agents on repository-level codebase maintenance via continuous integration and multi-step feature iterations.
RoboClaw agentic framework unifying data collection, policy learning, and deployment for long-horizon robotic tasks with vision-language-action systems.
CHIMERA-Bench standardized benchmark dataset for epitope-specific antibody design enabling fair comparison of computational design methods.
OPERA framework for data pruning in dense retrieval models that improves both efficiency and effectiveness of domain-specific finetuning through heterogeneous pair selection.
Self-attention CycleGAN method for harmonizing multi-site MRI data using tri-planar context to address scanner-induced distribution shifts.
Survey of 6,793 Mexican high school students examining how different motivational profiles relate to generative AI tool usage in math and writing.
Demonstrates LLM-based AI agents autonomously executing high energy physics analysis pipelines including event selection, background estimation, and statistical testing.
KidGym benchmark dataset based on children's intelligence tests to evaluate multimodal LLMs on visual reasoning tasks.
Framework using LLMs to automate reward design for multi-agent reinforcement learning by synthesizing executable reward programs.
Experiential Reflective Learning framework enabling LLM agents to self-improve by leveraging past interactions and adapting to specialized environments.
Mechanistic interpretability analysis of how LLMs verbalize confidence scores versus actual accuracy using linear probes and activation steering.
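A hedged sketch of the linear-probe side of such an analysis; random stand-ins replace real LLM activations and verbalized confidences, and the setup is generic rather than the paper's.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
acts = rng.normal(size=(1000, 64))                      # per-answer hidden activations
correct = (acts[:, 0] + 0.3 * rng.normal(size=1000) > 0).astype(int)
verbal_conf = np.clip(0.6 + 0.1 * rng.normal(size=1000), 0, 1)  # model-stated confidence

probe = LogisticRegression(max_iter=1000).fit(acts[:700], correct[:700])
probe_conf = probe.predict_proba(acts[700:])[:, 1]

# Compare how well each signal tracks actual correctness on held-out answers.
print("probe  gap:", np.abs(probe_conf - correct[700:]).mean())
print("verbal gap:", np.abs(verbal_conf[700:] - correct[700:]).mean())
```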
Neuro-symbolic approach combining neural networks with domain knowledge for process anomaly detection in event logs.
Vision2Web: Hierarchical benchmark for evaluating AI agents on website development tasks from UI-to-code to full-stack implementation.
CarbonEdge: Carbon-aware deep learning inference framework for edge computing optimizing environmental impact alongside latency and throughput.
CDH-Bench: Benchmark evaluating vision-language models' commonsense-driven hallucinations when visual evidence conflicts with common sense.
LG-HCC proposes geometry-aware compression for 3D Gaussian Splatting to reduce storage overhead while maintaining rendering quality.
HISA improves efficiency of sparse attention mechanisms by optimizing hierarchical indexing to reduce bottlenecks in token-level key selection for LLMs.
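A generic top-k key-selection sparse-attention sketch for reference; HISA's hierarchical index, which is the paper's contribution, is not reproduced here.

```python
import torch

torch.manual_seed(0)
q = torch.randn(1, 128, 64)          # (batch, queries, dim)
k = torch.randn(1, 1024, 64)         # (batch, keys, dim)
v = torch.randn(1, 1024, 64)
topk = 32

scores = q @ k.transpose(-1, -2) / 64 ** 0.5           # (1, 128, 1024)
idx = scores.topk(topk, dim=-1).indices                # selected keys per query
sparse = torch.full_like(scores, float("-inf")).scatter(-1, idx, scores.gather(-1, idx))
out = torch.softmax(sparse, dim=-1) @ v                # attend only to selected keys
print(out.shape)                                       # (1, 128, 64)
```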
MemFactory: unified inference and training framework for agent memory integration with RL optimization of memory operations in LLM agents.
FigAgent: multi-agent framework for automatic method illustration generation in AI papers via drawing middleware orchestration.
Reduced density matrix method from quantum chemistry for predicting and interpreting phase transitions during deep learning model training.
Optimizer-aware gradient-based online data selection framework for sequential LLM fine-tuning with step-dependent sample utility estimation.
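A toy version of gradient-based data selection under plain-SGD assumptions (the paper's optimizer-aware, step-dependent utility estimate is not reproduced): score each candidate by how well its gradient aligns with a held-out-batch gradient, then keep the top-k; all names are illustrative.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
model = nn.Linear(8, 1)
loss_fn = nn.MSELoss()

def flat_grad(x, y):
    """Flattened gradient of the loss on (x, y) w.r.t. model parameters."""
    model.zero_grad()
    loss_fn(model(x), y).backward()
    return torch.cat([p.grad.flatten() for p in model.parameters()])

xs, ys = torch.randn(32, 8), torch.randn(32, 1)          # candidate pool
val_x, val_y = torch.randn(16, 8), torch.randn(16, 1)    # held-out batch
ref = flat_grad(val_x, val_y)

scores = torch.tensor([flat_grad(xs[i:i+1], ys[i:i+1]) @ ref for i in range(32)])
topk = scores.topk(8).indices    # examples whose updates best align with the val gradient
print(topk)
```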
Personalized federated fine-tuning approach for language models on distributed heterogeneous task datasets with improved generalization.
Evolution strategies for deep RL pretraining, offering a derivative-free, computationally efficient alternative to standard deep reinforcement learning.
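A minimal evolution-strategies loop on a stand-in objective, showing the derivative-free update; in the RL setting the score would be an episode return, and the hyperparameters here are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)
theta = np.zeros(10)                          # policy parameters
score = lambda p: -np.sum((p - 1.0) ** 2)     # stand-in for an episode return

sigma, lr, pop = 0.1, 0.01, 50
for _ in range(300):
    eps = rng.normal(size=(pop, theta.size))              # parameter perturbations
    returns = np.array([score(theta + sigma * e) for e in eps])
    adv = (returns - returns.mean()) / (returns.std() + 1e-8)
    theta += lr / (pop * sigma) * eps.T @ adv              # derivative-free gradient estimate
print(score(theta))                            # climbs toward 0 from -10
```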
Continual learning framework for resource-constrained agents using stochastic bridge diffusion process for temporal memory management.
Perspective on sustainability challenges in AI-driven molecular and materials discovery across QM data, training, and automation pipelines.
Empirical evidence that classifier-based safety gates fail for self-improving AI systems across multiple model architectures.
Symbolic mixture-of-experts model for predicting cross-location hurricane evacuation behavior with population-level adaptation.