How long short-term memory (LSTM) networks, synthetic data, and fine-tuning improve the classification of raw EEG data
Pipeline combining LSTM, synthetic data, and fine-tuning for EEG classification on implicit visual stimuli tasks.
Analysis of distributional reinforcement learning for complex domains like healthcare, addressing heterogeneous groups under uncertainty.
Tutorial on using flow- and score-based generative models for decision-making under distributional shift in operations research.
Neural architectures for learning to approximate Wasserstein-2 distances using Kuratowski embedding theorem.
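The Kuratowski embedding this summary refers to is the standard result that any metric space embeds isometrically into a space of bounded functions; a sketch of the statement (notation chosen here, not taken from the paper):

```latex
% Kuratowski embedding: a metric space (X, d) with base point x_0 embeds
% isometrically into l^\infty(X), the bounded functions on X with sup norm.
\[
  \iota : X \to \ell^{\infty}(X), \qquad
  \iota(x) = \bigl(\, a \mapsto d(x, a) - d(x_0, a) \,\bigr).
\]
% Isometry follows from the triangle inequality:
\[
  \lVert \iota(x) - \iota(y) \rVert_{\infty}
  = \sup_{a \in X} \lvert d(x, a) - d(y, a) \rvert
  = d(x, y),
\]
% with the supremum attained at a = x or a = y.
```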
System for generating editable design variations using decoder-only language model with Creative Markup Language representation.
Theoretical analysis of Q-value iteration convergence in multi-agent Stackelberg games using control-theoretic perspective.
Research on aligning LLMs with human preferences using relative density ratio optimization without assuming specific preference models, improving statistical consistency.
Study questioning necessity of prompt selection in task-free online continual learning for non-stationary data streams.
Ablation framework to estimate contributions of central, peripheral, and temporal visual information to human decision-making in Atari games.
TinyNina: edge-AI framework for satellite super-resolution applied to NO2 air quality monitoring with resource constraints.
DP-OPD: differentially private on-policy distillation method for compressing LLMs on sensitive data while maintaining privacy guarantees.
MAVEN: mesh-aware volumetric encoding network for simulating 3D flexible deformation using graph neural networks on mesh structures.
Discrete Prototypical Memories approach for federated time series foundation models using LLMs while preserving data privacy.
External validation study on ECG biometrics using Inception-v1 with ArcFace on MIMIC and HEEDB datasets.
Isokinetic Flow Matching introduces pathwise acceleration regularization to improve few-step sampling in flow-based generative models.
SLaB: sparse/low-rank/binary decomposition framework for efficient LLM compression maintaining performance at high compression ratios.
Multi-objective controllable language-model framework enabling personalized alignment with varying human preferences beyond fixed reward optimization.
GAIN: multiplicative modulation technique for domain adaptation in LLMs, preventing catastrophic forgetting through feature re-emphasis.
Reproducibility study on spurious correlations and shortcut learning in DNNs, comparing frameworks for ensuring models use causally relevant features.
Revisits the learning-from-equivalence-queries model for modern ML systems such as generative models and recommendation systems with periodic updates.
FlashSAC: off-policy reinforcement learning algorithm for stable, fast robot control in high-dimensional action spaces.
Detection method for free-riders in federated learning via simulated attack patterns, improving the WEF-based approach.
Deep learning approach for clinical mortality prediction from incomplete multimodal Electronic Health Records using point cloud paradigm.
Method to mitigate reward hacking in Best-of-N sampling for language models using pessimism, addressing inference-time compute scaling challenges.
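A minimal sketch of pessimistic Best-of-N selection: instead of picking the candidate with the highest estimated reward, pick the one maximizing a lower-confidence-bound score. The function names, the penalty form, and the `beta` weight are illustrative assumptions, not the paper's exact method:

```python
def best_of_n_pessimistic(generate, reward_mean, reward_std, n=8, beta=1.0):
    """Select the candidate maximizing a pessimistic (penalized) reward.

    generate()      -> one sampled candidate response (hypothetical interface)
    reward_mean(c)  -> estimated reward for candidate c
    reward_std(c)   -> uncertainty of that estimate
    beta            -> pessimism strength; beta=0 recovers plain Best-of-N
    """
    candidates = [generate() for _ in range(n)]
    return max(candidates, key=lambda c: reward_mean(c) - beta * reward_std(c))

# Toy demo: candidate 5.0 has the highest raw reward but also high
# uncertainty, so the penalized score prefers the safer candidate 3.0.
samples = iter([1.0, 5.0, 3.0])
pick = best_of_n_pessimistic(lambda: next(samples),
                             reward_mean=lambda c: c,
                             reward_std=lambda c: {5.0: 4.0}.get(c, 0.0),
                             n=3, beta=1.0)
print(pick)  # 3.0: the high-reward but high-uncertainty 5.0 is penalized to 1.0
```

With `beta=0` the penalty vanishes and the selection reduces to ordinary Best-of-N, which is the reward-hacking-prone baseline the method addresses.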
Novel Anticipatory Reinforcement Learning framework for non-Markovian decision processes with jump-diffusions and structural breaks, designed for single trajectory learning.
Batch Loss Score metric for dynamic data pruning using exponential moving averages, accelerating deep learning training.
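The core mechanism this line names can be sketched as an exponential moving average of per-example loss used as a pruning score; the class name, `alpha`, and the keep-hardest rule below are illustrative assumptions, not the paper's exact Batch Loss Score:

```python
class EMALossTracker:
    """Track per-example training loss with an exponential moving average,
    then keep only the hardest (highest-EMA-loss) fraction of the dataset."""

    def __init__(self, n_examples, alpha=0.9):
        self.alpha = alpha
        self.ema = [0.0] * n_examples
        self.seen = [False] * n_examples

    def update(self, indices, losses):
        # Smooth each example's loss across epochs; initialize on first sight.
        for i, loss in zip(indices, losses):
            if self.seen[i]:
                self.ema[i] = self.alpha * self.ema[i] + (1 - self.alpha) * loss
            else:
                self.ema[i], self.seen[i] = loss, True

    def keep_indices(self, keep_frac=0.7):
        # Rank examples by smoothed loss, descending, and keep the top fraction.
        order = sorted(range(len(self.ema)), key=lambda i: self.ema[i], reverse=True)
        return order[: max(1, int(keep_frac * len(order)))]

tracker = EMALossTracker(n_examples=5)
tracker.update([0, 1, 2, 3, 4], [0.1, 2.0, 0.5, 1.5, 0.05])
tracker.update([0, 1, 2, 3, 4], [0.1, 1.8, 0.6, 1.4, 0.05])
print(tracker.keep_indices(keep_frac=0.6))  # [1, 3, 2]
```

The EMA smooths out batch-to-batch noise, so a single easy batch does not prematurely evict an example that is usually hard.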
Explainable ML models for sepsis prediction using Romanian EHR dataset with 12,286 hospitalizations and 600 lab test types.
Quantization method for LLMs combining mixed-precision and low-rank decomposition for efficient INT computation on NPU devices.
Sampling parallelism approach for efficient Bayesian neural networks and uncertainty quantification in risk-sensitive applications.
Mechanistic analysis decomposing GPT-2 Small's final MLP into a legible exception handler, with 27 named neurons routing its decisions.
Method using task reformulation to enable LLMs to learn from difficult problems via reinforcement learning from verifiable rewards.
Complete pipeline for federated unlearning with evaluation framework, enabling models to forget deleted data in distributed learning.
Algorithms for automatic concept selection in interpretable reinforcement learning policies without manual domain expertise.
Research on how generator access constraints affect autoregressive post-training and learning from rollouts vs prefix queries.
Python toolkit for intersectional fairness analysis in clinical ML models, addressing compounded disparities beyond single-axis comparisons.
Empirical robustness analysis of TabPFN's attention mechanisms for in-context learning on tabular data, examining noise immunity without retraining.
DSPy framework for optimizing LLM prompt engineering through declarative learning instead of manual trial-and-error, improving scalability and reproducibility.
Formalizes data attribution methods for adaptive learning settings where training data is generated by models themselves, addressing feedback loop in online/RL systems.
Investigation into interpretability challenges of latent reasoning models that operate without explicit natural language reasoning, examining two approaches.
Hierarchical Instance-Conditioned Mixture-of-Experts architecture for object detection using sparse routing at instance level rather than image/patch level.
Graph neural networks with contrastive learning for predicting power outages from extreme weather events.
Novel stratification-based semantics for Signal Temporal Logic with applications to reinforcement learning.
CNN-attention hybrid model for decoding hand kinematics from EEG in brain-computer interfaces.
Training method enabling Code LLMs to simulate program execution step-by-step, improving competitive programming performance.
Multi-agent research showing emergence of compositional communication protocols for representing latent physical properties without explicit supervision.
IPSL-AID: generative diffusion model for climate downscaling from global to regional resolutions.
SpikeVPR: neuromorphic approach using event-based cameras and spiking neural networks for energy-efficient visual place recognition.
Cross-Stage Attention Residuals mechanism for medical image segmentation using selective aggregation of encoder-decoder outputs.
Lossless compression method for LLMs enabling fast inference on Ascend NPUs, addressing weight data transfer bottleneck.