EvoLen: Evolution-Guided Tokenization for DNA Language Models
EvoLen: evolution-guided tokenization approach for DNA language models. Addresses fundamental tokenization design challenges in biological sequence modeling.
Experience replay for LLM post-training RL, formalizing optimal buffer design as a trade-off between sample efficiency and data freshness.
Tensor decomposition method quantifying uncertainty in LLM-based multi-agent systems accounting for communication and role dependencies.
CLOVER framework for multi-agent RL cooperation conditioning value decomposition on realistic wireless communication graphs.
LottaLoRA training paradigm showing that frozen random backbones with trained LoRA adapters recover 96-100% of baseline performance across diverse tasks.
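For reference, the generic LoRA parameterization the entry above relies on: the backbone weight stays frozen while only a low-rank update B·A is trained, and zero-initializing B makes the adapted layer start out identical to the frozen backbone. A minimal sketch (illustrative dimensions and names, not the paper's code):

```python
import numpy as np

rng = np.random.default_rng(0)

d, r = 8, 2  # hidden size and LoRA rank (illustrative values)

# Frozen random backbone weight, never updated during fine-tuning.
W = rng.standard_normal((d, d))

# Trainable low-rank adapter: B starts at zero so the adapted layer
# initially matches the frozen backbone exactly.
A = rng.standard_normal((r, d)) * 0.01
B = np.zeros((d, r))

def adapted_forward(x, scale=1.0):
    """Forward pass through the frozen weight plus the low-rank update."""
    return x @ (W + scale * (B @ A)).T

x = rng.standard_normal((1, d))
# With B = 0 the adapter is a no-op: output equals the frozen layer's.
assert np.allclose(adapted_forward(x), x @ W.T)
```

Only A and B (2·d·r parameters) would receive gradients, versus d² for the full weight.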
Adversarial sensor error framework for robust wind turbine fleet control against measurement errors and hacking.
Adaptive simulation experiment framework using pairwise comparisons to optimize LLM policies for operations management tasks.
Prompt optimization method decomposing reward variance into response and prompt variance to identify task amenability to optimization.
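The decomposition in the entry above is an instance of the law of total variance: total reward variance splits into the mean within-prompt (response) variance plus the variance of per-prompt mean rewards. A toy numerical check, assuming equal-size groups of sampled responses per prompt (all names illustrative):

```python
import numpy as np

rng = np.random.default_rng(2)

# Toy rewards: 5 prompts x 200 sampled responses per prompt.
prompt_means = rng.normal(0, 1, size=5)
rewards = prompt_means[:, None] + rng.normal(0, 0.5, size=(5, 200))

# Law of total variance (exact for equal group sizes, ddof=0):
# total = mean within-prompt variance + variance of per-prompt means.
response_var = rewards.var(axis=1).mean()  # response variance
prompt_var = rewards.mean(axis=1).var()    # prompt variance
total_var = rewards.var()

assert np.isclose(total_var, response_var + prompt_var)
```

A task dominated by prompt variance leaves room for prompt optimization; one dominated by response variance does not.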
Multi-agent actor-critic reinforcement learning for disaster resilience controlling power, communication, and emergency response systems.
Evaluation metrics for SVG generation based on element-level structural analysis with leave-one-out scoring.
4-bit floating-point format (HiFloat4) for efficient language model pre-training on Ascend NPU hardware.
Guidance method for consistency models using joint flow distribution learning to enable classifier-free guidance without a separate teacher model.
Training curriculum method for discrete flow-based image generation models to improve one-step sampling stability and quality.
Analysis of LoRA adapter spectral geometry to identify fine-tuning objectives and predict harmful model behavior in language models.
Safety steering mechanism for multimodal LLMs using dictionary-aligned concept control to prevent unsafe outputs without retraining.
Theoretical analysis of finite-sample properties and identifiability bounds for nonlinear Independent Component Analysis algorithms.
Survival analysis benchmark for predicting student dropout in learning analytics using OULAD dataset with dynamic and static representations.
Demonstration of power-law scaling of classification error with the number of classes, and how chain-of-thought decomposition reduces error through task splitting.
Temporal modeling framework for predicting student dropout using LMS data and logistic regression with counterfactual policy simulation.
Practical analysis of chain-of-thought distillation from students to teachers, revisiting capacity gap assumptions and baseline comparisons.
Conformal prediction framework for transformers providing uncertainty quantification and calibration for trustworthy LLM deployment.
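For context, the standard split-conformal recipe that such frameworks build on: compute nonconformity scores on a held-out calibration set, take a finite-sample-corrected quantile, and widen predictions by that margin. A generic textbook sketch with an assumed toy predictor, not the paper's transformer-specific method:

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy 1-D regression calibration data; the model is a deliberately
# crude constant predictor -- conformal coverage holds regardless.
x_cal = rng.uniform(0, 1, 500)
y_cal = x_cal + rng.normal(0, 0.1, 500)
predict = lambda x: np.full_like(x, 0.5)

# Split conformal: absolute-residual scores on the calibration set,
# then the ceil((n+1)(1-alpha))/n empirical quantile.
alpha = 0.1
scores = np.abs(y_cal - predict(x_cal))
n = len(scores)
q = np.quantile(scores, np.ceil((n + 1) * (1 - alpha)) / n)

# Interval [f(x) - q, f(x) + q] has >= 90% marginal coverage
# under exchangeability of calibration and test points.
x_new = np.array([0.3])
lo, hi = predict(x_new) - q, predict(x_new) + q
```

The guarantee is distribution-free; only exchangeability is assumed, which is what makes the approach attractive for black-box LLMs.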
Analysis of causal inference applications in graph representation learning and risks of aggregating graph elements.
Adaptive Thompson sampling for high-dimensional Bayesian optimization addressing sparse candidate point grids.
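For reference, the vanilla Bernoulli-bandit form of Thompson sampling (the paper's high-dimensional Bayesian-optimization variant replaces these Beta posteriors with a surrogate model over a candidate grid; everything below is illustrative):

```python
import numpy as np

rng = np.random.default_rng(3)

# Three arms with unknown success probabilities; Beta(1,1) priors.
true_p = [0.3, 0.5, 0.7]
wins = np.ones(3)
losses = np.ones(3)

for _ in range(2000):
    theta = rng.beta(wins, losses)   # sample one value per posterior
    arm = int(np.argmax(theta))      # play the most promising sample
    reward = rng.random() < true_p[arm]
    wins[arm] += reward
    losses[arm] += 1 - reward

pulls = wins + losses  # play counts concentrate on the best arm
```

Sampling from the posterior, rather than maximizing its mean, is what balances exploration and exploitation.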
Using synthetic data for ML-based childhood vaccination prediction in nomadic populations in Kenya.
Dynamic policy optimization bridging SFT and RL for LLMs, addressing bias-variance tradeoff in post-training through adaptive loss weighting.
Empirical study on effectiveness of advanced optimizers for multi-task learning, identifying overlooked factors in optimization approaches.
Analysis of calibration and paraphrase sensitivity in medical vision-language models using predictive entropy and uncertainty quantification.
SeqComm-DFL: Multi-agent coordination via sequential communication and decision-focused learning for value-aware message generation.
WOMBET: World model-based framework for experience transfer in robotics RL, generating and utilizing prior data for sample efficiency.
Hierarchical implicit flow Q-learning for offline goal-conditioned reinforcement learning with improved policy expressiveness.
SentryFuse: Framework for efficient multimodal model compression on edge devices with sensor dropout robustness via zero-shot pruning.
Graph neural network architecture addressing heterophilic graphs using switchable attention mechanism for monophily-aware learning.
Named entity identification and anonymization system for cybercrime datasets on Telegram with speech-to-text transcription.
Solution to NeurIPS 2023 LLM Efficiency Challenge: Fine-tuning LLaMA 70B on single A100 GPU within 24-hour constraint.
U-Cast: Efficient probabilistic weather forecasting model using a standard U-Net architecture, simplifying state-of-the-art approaches.
PDYffusion: Diffusion model for long-horizon spatiotemporal prediction incorporating physics-based constraints and uncertainty quantification.
PML-MA method for partial multi-label learning using feature-label modal alignment to handle noisy labels.
Temporal Patch Shuffle data augmentation for time series forecasting, preserving temporal coherence and improving generalization.
Graph-based embeddings integrated into event sequence models for user-item interactions in fraud detection and recommendation systems.
GeoPAS geometric probing approach for automated algorithm selection in continuous black-box optimization.
EquiformerV3, advancing SE(3)-equivariant graph attention Transformers for efficiency, expressivity, and 3D atomistic modeling.
Score-driven rating system extending classical Elo rating to accommodate diverse game outcomes and rankings.
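For reference, the classical Elo update that the score-driven system generalizes (standard textbook formula; the K-factor value is illustrative):

```python
def elo_update(r_a, r_b, score_a, k=32):
    """Classical Elo: expected score from a logistic curve, then a
    K-factor step toward the observed result (1 win, 0.5 draw, 0 loss)."""
    expected_a = 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400))
    r_a_new = r_a + k * (score_a - expected_a)
    r_b_new = r_b + k * ((1 - score_a) - (1 - expected_a))
    return r_a_new, r_b_new

# Equal-rated players, A wins: A gains exactly k/2 = 16 points.
print(elo_update(1500, 1500, 1.0))  # -> (1516.0, 1484.0)
```

The update is zero-sum and handles only binary or drawn outcomes, which is the limitation the score-driven extension targets.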
CORA, conformal risk-controlled GUI agents using vision-language models with formal safety guarantees for mobile automation.
Truncated rectified flow policy for maximum-entropy RL enabling one-step multimodal action distribution sampling.
Distillation-process dataset augmented with simulations for deep learning-based anomaly detection in chemical batch processes.
Generalization and scaling theory for Mixture-of-Experts Transformers with covering-number bounds and routing-overhead analysis.
GNN-based deep reinforcement learning scheduler for cloud workflow DAGs optimizing completion time and energy consumption.
Statistical analysis of the ancient I-Ching King Wen sequence, finding no improvement to neural network training.
DiffHLS framework using GNNs and LLM code embeddings for high-level synthesis quality prediction via differential learning.
Investigation of LLM pretraining geometry and common minima to improve downstream generalization without changing the loss function.