Isolater - Feed

Ax Sang-Hoon Lee, Ha-Yeong Choi 16d ago

ReGen: Hierarchical Multi-Prompt Representation Generation for Efficient Waveform Diffusion Models

ReGen proposes hierarchical multi-prompt representation generation for efficient waveform diffusion models to address representation alignment issues in diffusion Transformers.

Ax Nada Zine, Tristan Coignion, Vincenzo Stoico, Clément Quinton, Romain Rouvoy, Patricia Lago 16d ago

Attention to Detail: Evaluating Energy, Performance, and Accuracy Trade-offs Across vLLM Configurations

Empirical evaluation of vLLM inference engine configurations measuring trade-offs between energy consumption, performance, and output quality.

Ax Julius Störk 16d ago

Interference and Retention in Continual Learning

Analysis of task interference as a mechanism for modeling forgetting in continual learning using path-averaged curvature.

Ax Alfredo Garrachón Ruiz, Tomás de la Rosa, Daniel Borrajo 16d ago

Git-Assistant: Planning-Based Support for Updating Git Repositories

Git-Assistant combines LLMs with automated planning to help developers with repository management tasks and version control workflows.

Ax Jayadeva, Madhur Aswani 16d ago

All you need is SAMPAT

SAMPAT neural architecture using polynomials and analytic transformations for interpretable learning of continuous functions.

Ax Gwenn Beets, Anniek Jansen, Saar Hommes, Ruben D. Vromans, Leonie Westerbeek, Supraja Sankaran, Julia C. M. van Weert, Emiel J. Krahmer, Nadine Bol 16d ago

LLMs for health: Perceived benefits, risks, intention to use AI chatbots, and willingness to self-disclose across sensitive health topics

Online experiment examining user perceptions of benefits/risks and disclosure willingness when using AI chatbots for health topics.

Ax Saviz Changizi, Nasibeh Mohammadzadeh, Mohammad Shojafar, Rahim Tafazolli 16d ago

Blockchain-Linked Auditable Decision Management for Telecom/IoT Fraud-Control Requests

QLoRA-tuned LLM system for telecom fraud detection with blockchain-based decision management and auditability.

Ax Maxim Chupilkin 16d ago

Geopolitical alignment: Endorsement effects in large language models

Study testing whether four LLMs show geopolitical bias when evaluating international policies presented with different country endorsements.

Ax Pedro P. Santos, Fábio Vital, Alberto Sardinha, Francisco S. Melo 16d ago

Risk-Aware General-Utility Markov Decision Processes

Research on risk-aware Markov decision processes where agents optimize risk measures of objective value distributions based on state visitation frequency.

Ax Kwan Soo Shin, In Seok Kang, Yunkyung Min 16d ago

Creativity, honesty and designed forgetting emerge in small hyperbolic language models

Studies emergence of creativity, honesty, and selective memory in small hyperbolic language models as companions.

Ax Miguel Arana-Catania, Gillian Pink, Glenn Roe 16d ago

Automatic Thematic Indexing of Large Literary Corpora: A Machine Learning Approach to Voltaire's Complete Works

ML approach to automatic thematic indexing of literary corpora, demonstrated on Voltaire's Complete Works.

Ax Miguel Arana-Catania, Catherine Conisbee, Matthew Kidd 16d ago

Letting the Data Speak: Extracting Keywords from Crowdsourced Collections with AI

Evaluates NLP approaches for automated keyword extraction from crowdsourced digital archives using the Their Finest Hour collection.

Ax Zixin Chen, Peng Liu, Haobo Li, Rui Sheng, Jianhong Tu, Xiaodong Deng, Fei Huang, Kashun Shum, Dayiheng Liu, Huamin Qu 16d ago

WILDTRACE: Benchmarking Natural Evidence Trails in Long-Context Reasoning

WILDTRACE benchmark evaluating LLM long-context reasoning requiring evidence integration across distant document passages.

Ax Guanquan Wang, Yoshimasa Tsuruoka 16d ago

Shortcut Trajectory Planning for Efficient Offline Reinforcement Learning

Shortcut Trajectory Planning: consistency-based approach reducing inference cost in diffusion-based offline RL planners.

Ax Cedric Caruzzo, Donggeun Yoo, Tae Soo Kim 16d ago

Deceptive Grounding: Entity Attribution Failure in Clinical Retrieval-Augmented Generation

Identifies deceptive grounding failure in clinical RAG systems where correct citations mask incorrect entity attribution.

Ax Shirley Yu, Ruben Martins 16d ago

Diversifying to Verify: When Task-Equivalent Programs Differ in Verifiability

Diversify2Verify: LLM pipeline generating diverse program implementations to improve automated verifiability of Why3 code.

Ax Victor J. B. Jung, Gagandeep Singh, Joseph Melber, Kristof Denolf, Francesco Conti, Luca Benini 16d ago

STEEL: Sparsity-Aware Fused Attention for Energy-Efficient Long-Sequence Inference on AMD's XDNA NPU

STEEL: sparsity-aware fused attention optimization for energy-efficient long-sequence LLM inference on AMD NPUs.

Ax Xinyu Zhu, Zhe Xu, Xiaohan Wei, Yunchen Pu, Fei Tian, Chonglin Sun, Kaushik Rangadurai, Hua Zhi, Frank Shyu, Sandeep Pandey, Luke Simon, Yu Meng, Xi Liu 16d ago

Self-Guided Test-Time Training for Long-Context LLMs

Self-guided test-time training method for improving long-context LLM reasoning and evidence utilization during inference.

Ax The Soofi-Team: Benedikt Droste, David Fitzek, Ruben Härle, Lukas Helff, Maximilian Idahl, Alex Jude, Abbas Goher Khan, Maurice Kraus, Timm Ruland, Richard Rutmann, Sebastian Sztwiertnia, Markus Frey, Daniil Gurgurov, Jan Pfister, Tom Röhr, Sebastian von Rohrscheidt, Jörg Bienert, Nicolas Flores-Herr, Simon Gottschalk, Andreas Hotho, Kristian Kersting, Joachim Köhler, Alexander Löser, Wolfgang Nejdl, Simon Ostermann, Jan Plogsties, Patrick Putzky, Mehdi Ali, Michael Fromm, Max Lübbering 16d ago

A Sovereign, Open-Source Foundation Model for German and English

Soofi S 30B-A3B: open-source Mixture-of-Experts hybrid Mamba-Transformer foundation model for German and English with efficient inference.

Ax Spiros Baxevanakis, Peng-Jian Yang 16d ago

Test-Time Scaling for Small VLMs on Multilingual Visual MCQ

Examines test-time scaling techniques for small open-source vision-language models on multilingual visual QA benchmarks.

Ax Anil Osman Tur, Tonje Knutsen Sordalen, Kim Tallaksen Halvorsen, Cigdem Beyan 16d ago

Parameter-Efficient Vision-Language Adaptation with Continuous Metadata Conditioning for Animal Re-Identification

Parameter-efficient CLIP adaptation framework for animal re-identification with continuous metadata conditioning.

Ax Charles Edward Gagnon, Steven H. H. Ding, Philippe Charland, Benjamin C. M. Fung 16d ago

Practical Source Code Recovery from Binary Functions Using Anchor-Based Retrieval and LLM Reasoning

Pipeline combining reverse engineering, code retrieval, and LLM reasoning to recover source code from stripped binaries.

Ax Pan Li 16d ago

All Explanations are Wrong, But Many Are Useful: Exploring the Rashomon Explanation Set with Large Language Models

Explores using LLMs to generate multiple explanations for ML models, arguing explainability and prediction are complementary rather than trade-off objectives.

Ax Filippo Ziliotto, Luciano Serafini, Lamberto Ballan, Tommaso Campari 16d ago

What VGGT Knows About Overlap: Probing Geometric Foundation Models for Co-Visibility

Probes VGGT geometric foundation model for emergent co-visibility encoding useful in 3D reconstruction and robotic localization.

Ax Xiangxin Zhao, Han Li, Shuaiting Li, Tianyi Zhao, Earl T. Barr, Federica Sarro, He Ye 16d ago

Failure as a Process: An Anatomy of CLI Coding Agent Trajectories

Large-scale study analyzing failure trajectories of LLM coding agents in terminal environments as a temporal process rather than an outcome.

Ax Junfei Zhan, Haoxun Shen, Mingang Guo, Zixuan Huang, Tengjiao He 16d ago

Seeing is Free, Speaking is Not: Uncovering the True Energy Bottleneck in Edge VLM Inference

Energy profiling of on-device VLM inference reveals language generation, not visual processing, as the primary energy bottleneck.

Ax Jiawen Li, Tian Guan, Huijuan Shi, Xitong Ling, Mingxi Fu, Anjia Han, Chao He, Yonghong He 16d ago

ALICE: Learning a General-Purpose Pathology Foundation Model from Vision, Vision-Language, and Slide-Level Experts

ALICE unified pathology foundation model using multi-stage distillation from eight vision and vision-language teacher models.

Ax Tianyou Jiang, Ziyu Zhou 16d ago

TCLA: Training-Free Class-wise Logit Adaptation for Medical Vision-Language Models

TCLA: Training-free class-wise logit adaptation for medical vision-language models handling domain shift and class bias.

Ax Yujie Pang, Zudong Li 16d ago

PAC-ACT: Post-training Actor-Critic for Action Chunking Transformers

PAC-ACT post-training method using actor-critic for action chunking transformers in precision industrial robot manipulation.

Ax Nirjhar Das, Md. Al-Mamun Provath 16d ago

Task-Specific Multimodal Question Answering Agents via Confidence Calibration and Incremental Reasoning for QANTA 2026

Task-specific multimodal QA agents with confidence calibration and incremental reasoning for QANTA 2026 challenge.

Ax Cláudio Lúcio do Val Lopes, Lucca Machado da Silva 16d ago

Semantic Pareto-DQN: A Multi-Objective Reinforcement Learning Framework for Financial Anomaly Detection

Semantic Pareto-DQN: Multi-objective reinforcement learning framework for financial anomaly detection addressing class imbalance.

Ax Katherine Swinea, Kshitiz Aryal, Lopamudra Praharaj, Maanak Gupta 16d ago

VEXAIoT: Autonomous IoT Vulnerability EXploitation using AI Agents

VEXAIoT applies LLM agents to autonomous IoT vulnerability exploitation and penetration testing in constrained environments.

Ax Shravan Murlidaran, Miguel P. Eckstein 16d ago

Evolution of Accuracy and Visual-Cognitive Errors in a Decade of Vision-Language AI Models

Introduces the Complex Social Behavior dataset and evaluates vision-language model accuracy and error types over a decade of progress.

Ax Yiming Zhang, Zhonghan Zhao, Wenwei Zhang, Haiteng Zhao, Tianyang Lin, Yunhua Zhou, Demin Song, Kuikun Liu, Haochen Ye, Haian Huang, Yuzhe Gu, Haijun Lv, Qipeng Guo, Bin Liu, Gaoang Wang, Kai Chen 16d ago

Scalable Visual Pretraining for Language Intelligence

Scalable visual pretraining approach for foundation models incorporating figures, equations, and layouts beyond text conversion.

Ax Jinwei He, Feng Lu 16d ago

IFAR: Multi-Perspective and Multi-Level Causal Discovery with LLMs

IFAR framework using LLMs for multi-perspective causal discovery with DeepAbduction dataset for abductive reasoning evaluation.

Ax Maheep Chaudhary, Fazl Barez 16d ago

Beyond Black-Box Obfuscation: Mechanistic Analysis and Defense of White-Box Monitors

Mechanistic analysis of white-box LLM monitor evasion strategies and proposed defenses through red-team experiments.

Ax Sylee Dandekar, Shripad Deshmukh, Frank Chiu, W. Bradley Knox, Scott Niekum 16d ago

A Descriptive and Normative Theory of Human Beliefs in RLHF

Study on how human beliefs about agent capabilities affect preference generation in RLHF, proposing theoretical framework beyond reward functions.

Ax Zhenxiao Fu, Lei Jiang, Yilun Xu, Gang Huang, Fan Chen 16d ago

QAgent: An LLM-based Multi-Agent System for Autonomous OpenQASM programming

QAgent: Multi-agent LLM framework for autonomous OpenQASM quantum circuit code generation with domain-specific planning and iterative synthesis.

Ax Chenhua Shi, Bhavika Jalli, Gregor Macdonald, John Zou, Wanlu Lei, Mridul Jain, Joji Philip 16d ago

Leveraging Multi-Agent System (MAS) and Fine-Tuned Small Language Models (SLMs) for Automated Telecom Network Troubleshooting

Multi-agent system with fine-tuned small LMs for telecom network troubleshooting. Combines multiple agents and SLMs for domain automation.

Ax Shashank Kirtania, Param Biyani, Priyanshu Gupta, Yasharth Bajpai, Roshni Iyer, Sumit Gulwani, Gustavo Soares 16d ago

Improving Language Agents through BREW: Bootstrapping expeRientially-learned Environmental knoWledge

BREW framework distills agent interaction trajectories into a retrievable knowledge base, enabling agents to learn from experience across sessions.

Ax Derrick Goh Xin Deik, Quanyu Long, Zhengyuan Liu, Nancy F. Chen, Wenya Wang 16d ago

Programming over Thinking: Efficient and Robust Multi-Constraint Planning

LLM agents solve multi-constraint planning by programming candidate solutions rather than pure reasoning. Efficient constraint satisfaction via code generation.

Ax Zirong Chen, Hongchao Zhang, Meiyi Ma 16d ago

PACE: A Personalized Adaptive Curriculum Engine for 9-1-1 Call-taker Training

Personalized adaptive curriculum engine for 9-1-1 call-taker training. Domain-specific education application with limited tech generalizability.

Ax Yi Huang, Bowen Zheng, Yunxi Dong, Hong Tang, Huan Zhao, S. M. Rakibul Hasan Shawon, Hualiang Zhang 16d ago

A Self-Evolving Agentic Framework for Metasurface Inverse Design

Self-evolving agentic framework for metasurface inverse design coupling coding agent with physics-based evaluator. Agent autonomously generates optimization code.

Ax Zikun Ye, Hema Yoganarasimhan 16d ago

Rectification Difficulty and Optimal Sample Allocation in LLM-Augmented Surveys

Framework for allocating human verification budget when using LLM predictions in surveys. Budget optimization with variable LLM accuracy.

Ax Carissa Cullen, Harry Garland, Alexander Roman, Louis Thomson, Christos Ziakas, Elliott Thornley 16d ago

Towards Shutdownable Agents: Generalizing Stochastic Choice in RL Agents and LLMs

DReST reward function trains agents to lack preferences over trajectory length, promoting shutdownable behavior. Safety-focused agent training method.

Ax Darsh Kachroo, Arjun Prasaath Anbazhagan, Adriana Caraeni, Brennan Lagasse, Kevin Zhu 16d ago

HiPO: Hierarchical Preference Optimization for Adaptive Reasoning in LLMs

Hierarchical Preference Optimization improves DPO for complex reasoning by providing fine-grained feedback on solution subsections. Extends preference learning to multi-step reasoning.

Ax Wei Duan, Junyu Xuan, En Yu, Xiaoyu Yang, Jie Lu 16d ago

Heterogeneous Information-Bottleneck Coordination Graphs for Multi-Agent Reinforcement Learning

Information-bottleneck method for learning coordination graph topology in multi-agent RL. Theoretical approach to agent communication capacity allocation.

Ax Carmen Quiles-Ramírez, Leticia L. Rodríguez, Nicolás Martorell, Natalia Díaz-Rodríguez 16d ago

Explaining is Harder Than Predicting Alone: Evaluating Concept-based Explanations of MLLMs as ICL Visual Classifiers

Evaluates concept-based explainability of multimodal LLMs in few-shot in-context learning. Studies transparency of MLLM reasoning processes.

Ax Jiakang Li, Guanyu Zhu, Can Jin, Chenxi Huang, Dexu Yu, Ronghao Chen, Yang Zhou, Hongwu Peng, Xuanqi Lan, Dimitris N. Metaxas, Youhua Li 16d ago

Latent Reward Steering: An Adaptive Inference-Time Framework that Implicitly Promotes Cognitive Behaviors in Reasoning LLMs

Latent Reward Steering adaptively promotes cognitive behaviors in reasoning LLMs at inference time. Addresses reasoning behavior control without explicit guidance.

Ax Jayanta Dey, Shikhar Srivastava, Itamar Lerner, Christopher Kanan, Dhireesha Kudithipudi 16d ago

SHARP: Sleep-based Hierarchical Accelerated Replay for Long Range Non-Stationary Temporal Pattern Recognition

SHARP learns long-range temporal patterns in streaming settings via hierarchical replay. Temporal sequence modeling for specialized domains.