Structured Legal Document Generation in India: A Model-Agnostic Wrapper Approach with VidhikDastaavej
VidhikDastaavej: model-agnostic wrapper for structured legal document generation in India using LLMs, with a large-scale anonymized private-document dataset.
arXiv paper comparing linguistic features of AI-generated vs human responses to mental health queries. Application of LLMs in healthcare.
arXiv paper introducing Distance Explainer for generating post-hoc explanations of embedded vector spaces using saliency-based techniques.
arXiv paper on Bottlenecked Transformers using periodic KV cache consolidation for improved reasoning with auxiliary latent-space computation.
arXiv paper on RestoreVAR using visual autoregressive modeling for fast all-in-one image restoration, replacing slower diffusion approaches.
arXiv paper on 3D Gaussian Splatting for wideband RF signal modeling. Computer vision/signal processing, not AI/ML development.
arXiv paper demonstrating in-context reinforcement learning (ICRL) emerges during LLM inference. Proposes ICRL prompting for inference-time self-improvement.
arXiv paper exploring persona prompts as jailbreak attack vector against LLMs. Security analysis of prompt injection vulnerabilities.
arXiv paper on SafeSieve, progressive pruning algorithm for LLM-based multi-agent systems to reduce token overhead and communication redundancy.
arXiv paper introducing TimeAlign, representation learning method using contrastive alignment for time series forecasting.
arXiv paper on PromptLoop for iterative prompt refinement in diffusion models via latent RL feedback. Improves generalization and robustness.
Proposes declarative OS interfaces to improve computer-use agents' ability to interact with GUIs, replacing error-prone imperative action sequences.
SyTTA: Label-free test-time adaptation for LLMs using only 4 extra tokens, enabling domain-specific deployment without fine-tuning on expensive labeled data.
Network traffic characterization study analyzing ChatGPT, Copilot, and Gemini usage patterns and their impact on internet infrastructure.
Proposes future summary prediction as alternative to next-token prediction during LLM pretraining, improving long-horizon reasoning and planning capabilities.
OffSim: Model-based offline inverse reinforcement learning framework that learns environment dynamics and reward functions from offline data without manual specification.
Multilingual LLM watermarking robustness study showing current methods fail on low-resource languages, proposes back-translation approach for 100+ language coverage.
QUARK: FPGA acceleration framework leveraging quantization and common patterns in nonlinear operations to accelerate transformer inference.
Curiosity-driven quantized Mixture-of-Experts framework using Bayesian uncertainty routing for accurate inference on resource-constrained devices.
ContagionRL: Gymnasium-compatible RL platform for reward engineering in spatial epidemic simulations, enabling systematic evaluation of behavioral learning strategies.
Unified distillation and adaptation framework for diffusion models enabling fast, high-quality image generation in novel domains with single-stage pipeline.
Vision-Language-Action models enhanced via Tweedie discrete diffusion for improved generalization and fine-grained control in robotic manipulation tasks.
Proposes goal-oriented multi-agent semantic networking architecture for 6G services integrating AI-native communication with network-level intelligence.
Biomedical vision-language pretraining approach that captures fine-grained correspondences in scientific figures and text, improving domain-specific representations.
Proposes adaptive frame selection method for long-form video understanding with large multimodal models, reducing computational overhead while maintaining query awareness.
Research on LLM-based agents for decision support, proposing collaborative sensemaking approach where agents act as partners rather than answer engines to improve human-AI complementarity.
ODMA proposes on-demand memory allocation strategy for efficient LLM serving on low-bandwidth accelerators, addressing limitations of static pre-allocation and fine-grained paging.
Physics-driven computing using magnetic tunnel junction dynamics for neuromorphic working memory, demonstrating energy efficiency over GPUs on vision tasks.
Theoretical comparison between DNNs as discrete dynamical systems and physics-based differential equation solvers on benchmark PDEs.
Analysis of textual reasoning in blind image quality assessment models. Investigates information flow between image, text, and quality predictions.
Probability-guided token selection for SFT to address overfitting to single reference answers. Leverages multiple references while managing data costs.
100M high-quality Chinese image-text dataset for vision-language pre-training. Addresses bottleneck in Chinese VLP model development.
Scalable compliance evaluation framework for multi-policy AI governance. Integrates comprehensive model-card format and streamlines policy compliance burden.
Reference-free hallucination detection for LLM-generated code review comments. Identifies context misalignment without ground truth, enabling practical adoption in code review automation.
Framework addressing sycophancy in LLM decision support systems through premise governance. Proposes structured verification for deep-uncertainty decisions.
Self-distillation approach for machine unlearning in text-to-image diffusion models. Balances effective forgetting with retention of unrelated concepts.
Statistical analysis of variance in agentic system evaluations. Shows single-run pass@1 scores on SWE-Bench vary substantially (2.2-6.0%), calling for improved evaluation methodology.
Hierarchical framework for log anomaly detection that preserves component execution structure. Addresses spurious correlations in flat-sequence approaches.
AceGRPO combines adaptive curriculum learning with GRPO for autonomous ML engineering agents. Addresses behavioral stagnation and data inefficiency in long-horizon optimization tasks.
Joint audio-video generation model for synchronized customization of video identity and audio timbre from reference inputs.
Heterogeneous multi-agent framework treating diverse LLM models as specialized tools. Introduces orchestrator calibration for efficient test-time scaling through coordinated tool calling.
Smooth gate functions for stabilizing GRPO LLM training. Replaces hard clipping with sigmoid-based gating to improve optimization stability in reasoning tasks.
Theoretical analysis of offline reinforcement learning with general function approximation and parametric policies, extending beyond finite action spaces.
Open-source framework for deploying DARPA AIxCC cyber reasoning systems locally. Makes competition CRSs usable outside original infrastructure with improved accessibility.
Evaluation framework for persona-adaptive LLM-powered agents in multi-modal settings, addressing user-aware behavior in customer experience management.
Mathematical analysis of Collatz conjecture dynamics using modular arithmetic and combinatorial methods, conducted with LLM assistance; otherwise pure mathematics research rather than AI/ML development.
Red-teaming Vision-Language-Action models through quality diversity prompt generation to improve robot policy robustness.
AgentDrift: reveals safety risks in LLM agent recommendations when tools are corrupted, showing that these failures are hidden by standard evaluation metrics.