Julio Candanedo

The Diffusion-Attention Connection

A theoretical connection between Transformers, diffusion maps, and magnetic Laplacians through Markov geometry.

Hua-Dong Xiong (School of Psychological and Brain Sciences, Georgia Tech), Li Ji-An (Department of Psychology, New York University), Jiaqi Huang (Department of Cognitive Science, Indiana University Bloomington, Honda Research Institute), Robert C. Wilson (School of Psychological and Brain Sciences, Georgia Tech, Center of Excellence for Computational Cognition, Georgia Tech), Kwonjoon Lee (Honda Research Institute), Xue-Xin Wei (Departments of Neuroscience and Psychology, The University of Texas at Austin)

Human-like Working Memory Interference in Large Language Models

An analysis of working-memory limitations in LLMs, with comparisons to biological systems.

Vijay Lingam, Aditya Golatkar, Anwesan Pal, Ben Vo, Narayanan Sadagopan, Alessandro Achille, Jun Huan, Anoop Deoras, Stefano Soatto

ExecTune: Effective Steering of Black-Box LLMs with Guide Models

A Guide-Core Policies framework for black-box LLM agents, in which guide models generate structured strategies that core models execute, reducing inference costs.

Smita Deb, Shirin Panahi, Mulugeta Haile, Ying-Cheng Lai

Vestibular reservoir computing

Physical reservoir computing inspired by the biological vestibular system, addressing hardware complexity with a designed uncoupled topology.

Zunhai Su, Hengyuan Zhang, Wei Wu, Yifan Zhang, Yaxiu Liu, He Xiao, Qingyao Yang, Yuxuan Sun, Rui Yang, Chao Zhang, Keyu Fan, Weihao Ye, Jing Xiong, Hui Shen, Chaofan Tao, Taiqiang Wu, Zhongwei Wan, Yulei Qian, Yuchen Xie, Ngai Wong

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

A survey of the attention-sink phenomenon in transformers, covering utilization, interpretation, and mitigation strategies.