Better Eyes, Better Thoughts: Why Vision Chain-of-Thought Fails in Medicine
Analysis showing chain-of-thought prompting underperforms direct answering in medical vision-language models due to perception bottlenecks in domain-specific tasks.
Memory-efficient continual learning method using prototypical exemplar condensation to reduce storage requirements while maintaining performance.
Parallel framework combining imitation and reinforcement learning for autonomous driving, addressing limitations of sequential fine-tuning approaches.
Method to improve pretrained generative robot policies by replacing sampled noise with optimized constant noise vectors for downstream reward optimization.
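The core idea in the entry above — replacing per-rollout sampled noise with a single optimized constant noise vector — can be illustrated with a derivative-free sketch. This is a hedged illustration only: the paper's actual optimizer and policy interface are unknown, and `policy_reward` plus the hill-climbing loop are invented here for demonstration.

```python
import random

def optimize_constant_noise(policy_reward, dim, iters=300, sigma=0.1, seed=0):
    """Hill-climb one fixed noise vector z to maximize the downstream
    reward of a frozen generative policy, instead of resampling fresh
    noise on every rollout. `policy_reward(z)` is assumed to run the
    pretrained policy with noise z and return a scalar reward."""
    rng = random.Random(seed)
    z = [rng.gauss(0.0, 1.0) for _ in range(dim)]  # start from ordinary sampled noise
    best = policy_reward(z)
    for _ in range(iters):
        cand = [zi + rng.gauss(0.0, sigma) for zi in z]  # local perturbation
        r = policy_reward(cand)
        if r > best:  # keep only improving candidates
            z, best = cand, r
    return z, best

# Toy reward: prefer noise vectors close to the origin.
z_star, r_star = optimize_constant_noise(lambda z: -sum(v * v for v in z), dim=4)
```

The point of the constant-noise view is that the optimization target becomes a plain vector, so any black-box or gradient-based search can be applied without touching the policy weights.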
Mid-training adaptation strategy for LLMs to improve automatic summarization of radiology reports, exploring domain-specific pre-training approaches.
RAM: motion capture system for 3D human pose reconstruction in unconstrained video with occlusion handling and temporal smoothing.
ChronoCon: contrastive learning approach for disease progression assessment from longitudinal medical imaging without explicit severity annotations.
CAIAMAR: multi-agent framework for context-aware image anonymization in street-level imagery using agentic reasoning.
Kill-chain canary methodology for tracking prompt injection attacks across multi-agent LLM systems with stage-level diagnostics.
System for making mathematical theorems interactive by grounding LLM-generated explanations in formal representations enabling execution and stepping.
Framework for eliciting and verbalizing LLM assumptions to explain and mitigate sycophantic behavior in model outputs.
Multi-stage LLM-assisted workflow for scientific algorithm development separating theory extraction, formal specification, and code generation.
Method for LLM personalization using a small portfolio of models capturing diverse user preferences without per-user models.
Distributional reinforcement learning approach for decision-making in healthcare, accounting for uncertainty across heterogeneous populations.
ALTO: system for adaptive LoRA hyperparameter tuning and orchestration across heterogeneous LLM fine-tuning workloads in multi-tenant environments.
DiffHDR: video diffusion model approach for converting low-dynamic-range videos to high-dynamic-range format.
WisdomInterrogatory (LuWen): open-source Chinese legal language model built on Baichuan foundation model for legal domain applications.
System for safe capability evolution in embodied agents with compatibility checking and runtime rollback mechanisms.
Training-free open-vocabulary semantic segmentation framework (OV-Stitcher) leveraging pretrained vision-language models without additional training.
HyperMem: hypergraph-based memory architecture for conversational agents enabling long-term context tracking and high-order associations.
Quantum-inspired ARIMA methodology combining quantum autocorrelation with variational quantum circuits for time series analysis.
Vision-language benchmark (CrashSight) for evaluating traffic crash scene understanding from infrastructure perspective.
Physics-aligned simulator (SIM1) for generating synthetic data in deformable object robotic manipulation tasks.
Framework combining LLMs with graph neural networks for text-attributed graph learning in low-resource settings using GNN feedback.
Bayesian optimization method (MG-TuRBO) for high-dimensional traffic simulation calibration, comparing genetic algorithms with Bayesian approaches.
QuanBench+: unified benchmark for LLM quantum code generation across Qiskit, PennyLane, Cirq with 42 aligned executable tasks.
Benchmark evaluating robustness of LLM reasoning with 14 perturbation techniques applied to mathematical reasoning tasks.
Silhouette loss function for learning discriminative representations with explicit geometric properties in embedding space.
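For context, the classic silhouette coefficient that such a loss presumably builds on scores each point as s = (b - a) / max(a, b), where a is the mean intra-cluster distance and b is the mean distance to the nearest other cluster. Below is a minimal NumPy sketch of a silhouette-style objective; the paper's actual (likely differentiable) formulation is not reproduced here.

```python
import numpy as np

def silhouette_loss(X, labels):
    """Negative mean silhouette coefficient over a batch of embeddings.
    Minimizing it favors tight, well-separated clusters."""
    D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)  # pairwise distances
    scores = []
    for i in range(len(X)):
        same = labels == labels[i]
        same[i] = False  # exclude the point itself
        if not same.any():
            continue  # silhouette is undefined for singleton clusters
        a = D[i, same].mean()  # mean intra-cluster distance
        b = min(D[i, labels == c].mean()  # nearest other cluster
                for c in np.unique(labels) if c != labels[i])
        scores.append((b - a) / max(a, b))
    return -float(np.mean(scores))
```

Scores near 1 (loss near -1) mean embeddings sit far closer to their own cluster than to any other, which is the geometric property the entry refers to.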
Distillation framework compressing genomic foundation models for efficient mRNA representation learning.
Quantum-classical hybrid molecular generator using VAE and quantum computing for interpretable drug discovery.
DRTO combines token-level RLHF with distributional robustness to improve LLM resilience to input perturbations and formatting changes.
Automated label function generation for data annotation using LLMs with structured exploration-exploitation strategy.
Analysis of cross-modal alignment between vision and language encoders using functional map framework from computational geometry.
TinyML Z-score anomaly detection system running on resource-constrained microcontrollers using power side-channel data.
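Z-score thresholding itself is standard; the sketch below shows the basic detection rule on a power trace. The threshold value and the absence of windowing are illustrative assumptions, not the deployed system's configuration.

```python
import math

def zscore_anomalies(samples, threshold=3.0):
    """Return indices of power readings that deviate from the sample
    mean by more than `threshold` standard deviations."""
    n = len(samples)
    mean = sum(samples) / n
    std = math.sqrt(sum((x - mean) ** 2 for x in samples) / n)
    if std == 0.0:
        return []  # constant trace: nothing to flag
    return [i for i, x in enumerate(samples) if abs(x - mean) / std > threshold]

# A flat power trace with one spike at index 15.
print(zscore_anomalies([1.0] * 15 + [10.0]))  # → [15]
```

On a microcontroller this would typically run over a fixed-size ring buffer, possibly in integer arithmetic; the float version above is for clarity only.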
Dual-branch reconstruction method for multivariate time series anomaly detection using autoregressive flow-based density estimation.
CSAttention: sparse attention mechanism for accelerating LLM inference by reducing KV-cache bottlenecks through centroid-scoring without retraining.
Flow-matching generative model for CFD surrogate modeling on unstructured meshes, as an alternative to conventional deep learning surrogates.
Framework evaluating when LLMs should act versus escalate decisions using uncertainty estimation across five real-world domains.
Machine learning system predicting user engagement in digital mental health interventions using explainable ML methods.
AlphaLab: autonomous research system using frontier LLMs as agents to automate full experimental cycles in optimization domains without human intervention.
Analysis of hallucination phase transitions in Whisper ASR models using spectral sensitivity theorem and eigenspectra analysis.
Multi-task learning for wireless interference detection and identification using adversarial training methods.
Federated learning approach using exemplar replay to reduce catastrophic forgetting in continual learning with dynamic heterogeneity.
StructRL: recovers dynamic programming structure from distributional RL learning dynamics. Bridges data-driven and structured approaches for stable learning.
Bayesian inference for spiking neural networks in speech processing. Explores weight uncertainty and loss landscape smoothing for temporal tasks.
Evidential Transformation Network: adapts pretrained models for post-hoc uncertainty estimation. Efficient alternative to ensembles/MC dropout for deployed models.
VOLTA: benchmark comparing uncertainty quantification methods for deep learning. Evaluates 10 UQ baselines across modalities and distribution shifts.
Game-theoretic analysis of creator incentives in multi-agent recommender systems. Cooperative game formulation for fair collaboration in bandit problems.
PRAGMA: foundation models for banking event sequences. Transformer-based architecture with self-supervised pretraining on financial transaction data.
Skip-Connected Policy Optimization (SKPO) for reinforcement learning with reasoning tasks. Improves upon GRPO by addressing high-variance advantage estimation.