Isolater - Feed

Ax David Chanin, Adri\`a Garriga-Alonso 23d ago

Sparse but Wrong: Incorrect L0 Leads to Incorrect Features in Sparse Autoencoders

Study of L0 hyperparameter effects on sparse autoencoders for LLM feature extraction and interpretability.

Ax Mrinmay Sen, Ankita Das, Sidhant Nair, C Krishna Mohan 23d ago

FedDAF: Federated Domain Adaptation Using Model Functional Distance

FedDAF federated domain adaptation approach addressing domain shifts and limited labeled data across clients.

Ax Hongkang Li, Songtao Lu, Xiaodong Cui, Pin-Yu Chen, Meng Wang 23d ago

How Can Mamba Learn In Context with Outliers and Generalize Provably?

Theoretical analysis of in-context learning in Mamba models with outliers and generalization guarantees.

Ax Yifei Sun 23d ago

Solver-Integrated Adversarial Attacking and Training of Neural Operators

Research on adversarial robustness and generalizability of neural operators as PDE surrogates from solver perspective.

Ax Amin Omidvar 23d ago

SmartMixed: A Two-Phase Training Strategy for Adaptive Activation Function Learning in Neural Networks

Two-phase training strategy enabling per-neuron adaptive activation function selection while maintaining inference efficiency.

Ax Thaweerath Phisannupawong, Joshua Julian Damanik, Han-Lim Choi 23d ago

LLM4Delay: Flight Delay Prediction via Cross-Modality Adaptation of Large Language Models and Aircraft Trajectory Representation

LLM-based framework for flight delay prediction integrating aeronautical data and aircraft trajectory representations.

Ax Eugenio Varetti, Matteo Torzoni, Marco Tezzele, Andrea Manzoni 23d ago

Adaptive digital twins for predictive decision-making: Online Bayesian learning of transition dynamics

Bayesian online learning approach for adaptive digital twin state transitions in civil engineering.

Ax Zhuo Zhang, Xi Yang, Ying Miao, Xiaobin Hu, Yifu Gao, Yong Yang, Canqun Yang, Boocheong Khoo 23d ago

PGOT: A Physics-Geometry Operator Transformer for Complex PDEs

Transformer architecture for solving complex PDEs on unstructured meshes with geometric feature preservation.

Ax Gang Liao, Hongsen Qin, Ying Wang, Alicia Golden, Michael Kuchnik, Yavuz Yetim, Jia Jiunn Ang, Chunli Fu, Yihan He, Samuel Hsia, Zewei Jiang, Dianshi Li, Uladzimir Pashkevich, Varna Puvvada, Feng Shi, Matt Steiner, Ruichao Xiao, Liyuan Li, Nathan Yan, Xiayu Yu, Zhou Fang, Roman Levenstein, Kunming Ho, Haishan Zhu, Alec Hammond, Richard Li, Ajit Mathews, Kaustubh Gondkar, Abdul Zainul-Abedin, Ketan Singh, Hongtao Yu, Wenyuan Chi, Barney Huang, Sean Zhang, Noah Weller, Zach Marine, Wyatt Cook, Carole-Jean Wu, Gaoxiang Liu 23d ago

KernelEvolve: Scaling Agentic Kernel Coding for Heterogeneous AI Accelerators at Meta

Agentic framework for automated kernel code generation across heterogeneous AI accelerators at scale for recommendation models.

Ax Itai Morad, Nir Shlezinger, Yonina C. Eldar 23d ago

SGD-Based Knowledge Distillation with Bayesian Teachers: Theory and Guidelines

Theoretical analysis of knowledge distillation convergence with Bayesian teacher networks using SGD.

Ax Ramnath Kumar, Kyle Ritscher, Junmin Judy, Lawrence Liu, Cho-Jui Hsieh 23d ago

FlexAct: Why Learn when you can Pick?

Research on learning discrete activation functions using Gumbel-Softmax for task-specific neural networks.

Ax Javier Porras-Valenzuela, Samar Hadou, Alejandro Ribeiro 23d ago

A Constrained Optimization Perspective of Unrolled Transformers

Constrained optimization framework training transformers as descent algorithms with layerwise loss decrease guarantees.

Ax Zhiheng Jiang, Yunzhe Wang, Ryan Marr, Ellen Novoseller, Benjamin T. Files, Volkan Ustun 23d ago

GraphAllocBench: A Flexible Benchmark for Preference-Conditioned Multi-Objective Policy Learning

GraphAllocBench benchmark for multi-objective reinforcement learning with preference-conditioned policies and flexible resource allocation.

Ax Shicheng Fan, Kun Zhang, Lu Cheng 23d ago

TRACE: Trajectory Recovery for Continuous Mechanism Evolution in Causal Representation Learning

TRACE method for causal representation learning handling continuous mechanism transitions across domains.

Ax Gloria Felicia (University of Virginia), Zitha Sasindran (Indian Institute of Science Bangalore), Jinfeng He (Cornell University), Michael Eniolade (University of the Cumberlands), Hemant Kumar (University of Arizona), Milan Hussain Angati (California State University Northridge) 23d ago

StepShield: When, Not Whether to Intervene on Rogue Agents

StepShield benchmark for agent safety measuring detection timeliness of rogue agent behavior on 9,429 code-agent trajectories.

Ax Simon B\"uhrer, Andreas Plesner, Aczel Till, Roger Wattenhofer 23d ago

BitLogic: Training Framework for Gradient-Based FPGA-Native Neural Networks

BitLogic training framework for gradient-based neural networks deployable to GPU, FPGA, and ASIC with single codebase.

Ax Stefano Woerner, Seong Joon Oh, Christian F. Baumgartner 23d ago

Universal Algorithm-Implicit Learning

Meta-learning framework defining practical universality and generalizable learning across arbitrary task distributions.

Ax Joshua S. Schiffman 23d ago

Transformers converge to invariant algorithmic cores

Study extracting invariant algorithmic cores from transformers across independent training runs to identify necessary computations.

Ax Davide Tugnoli, Andrea De Lorenzo, Marco Virgolin, Giovanni Cin\`a 23d ago

Improving TabPFN's Synthetic Data Generation by Integrating Causal Structure

Improvement to TabPFN synthetic tabular data generation by incorporating causal structure and feature dependencies.

Ax Tycho F. A. van der Ouderaa, Mart van Baalen, Paul Whatmough, Markus Nagel 23d ago

Leech Lattice Vector Quantization for Efficient LLM Compression

LLM compression technique using Leech lattice vector quantization to overcome information-theoretic limits of scalar quantization.

Ax Huaiyang Wang, Xiaojie Li, Xiaohan Wang, Zhixia Zhang, Xiaodong Lu, Zixuan Huang, Jiajun Chai, Guojun Yin, Deqing Wang, Haoyi Zhou, Yaodong Yang, Jianxin Li, Yikun Ban 23d ago

Policy Improvement Reinforcement Learning

Policy Improvement RL method for post-training LLMs and agents that explicitly verifies policy improvements over baselines.

Ax Paul-Tiberiu Iordache, Elena Burceanu 23d ago

Fine-Tuning Regimes Define Distinct Continual Learning Problems

Analysis showing continual learning performance varies significantly based on fine-tuning regime (trainable parameter subspace).

Ax Yang Fu, Peng Qin, Liming Chen, Zihao Zhang, Hao Yu, Yifei Wang 23d ago

Joint Energy Management and Coordinated AIGC Workload Scheduling for Distributed Data Centers: A Diffusion-Aided Reward Shaping Approach

Reinforcement learning approach for scheduling AIGC workloads and managing energy in distributed data centers using diffusion-based reward shaping.

Ax Noah Farr, Aryaman Reddi, Carlo D'Eramo, Jan Peters 23d ago

Streaming Reinforcement Learning under Partial Observability with Real-Time Recurrent Learning

Streaming reinforcement learning method enabling online learning with partial observability using real-time recurrent backpropagation.

Ax Zili Zhang, Chengxu Yang, Shenglong Zhang, Chenyu Wang, Yufan Zhang, Tuo Dai, Zhouyang Li, Yuhong Ge, Chao Jin, Xin Jin, Yuliang Liu 23d ago

BigMac: Breaking the Pareto Frontier of Compute and Memory in Multimodal LLM Training

BigMac training pipeline for multimodal LLMs that improves compute-memory efficiency tradeoffs through nested encoder-generator computation.

Ax Zhiwei Li, Yong Hu 23d ago

SkillHone: A Harness for Continual Agent Skill Evolution Through Persistent Decision History

Framework for continual evolution of agent skills in LLM-based agents by maintaining persistent decision history across task changes.

Ax Ananth K Suresh, Arya Hariharan 23d ago

Reproducibility Study of "AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models"

Reproducibility study of AlphaEdit null-space constrained knowledge editing method for LLMs, validating and extending original results.

Ax Kathan Shah 23d ago

Token Geometry

Lightweight optimizer exploiting gradient geometry of embedding tables and LM-heads for improved training efficiency across finetuning and pretraining.

Ax Haotian Xie, Junlin Chen, Mingkai Zheng, Lishan Yang, Zhao Zhang 23d ago

PHOENIX: Resilient LLM Training with Hot-Swapping via Zero-Overhead Checkpoint

Fault-tolerant LLM training system using zero-overhead checkpointing and hot-swapping for resilience across hardware failures.

Ax Zijun Xie, Yuyang You, Yongzhi Li, Enlei Gong, Zeyu Chen, Quan Chen, Yanhua Cheng, Peng Jiang, Yadong Mu 23d ago

ACPO: Adaptive Credit Policy Optimization via Fine-Grained Surrogate Entropy

Credit assignment optimization for RL-based LLM reasoning using fine-grained surrogate entropy for token-level rewards.

Ax Teng-Ruei Chen 23d ago

How Much of the Routing Gap Is Real? Decomposing the Router-to-Oracle Gap into Reproducible Specialist Advantage and Single-Draw Label Noise

Analysis decomposing router-to-oracle performance gap in LLM routing, identifying label noise versus specialist advantage contributions.

Ax Jaeyeon Kim, Jewon Lee, Bo-Kyeong Kim 23d ago

Quantize the Target, Quantize the Drafter: Efficient Inference with Qwen3.5-4B

Efficient inference system combining quantization and speculative decoding for Qwen3.5-4B LLM on resource-constrained hardware.

Ax Jianyi Zhang, Hao Frank Yang, Ang Li, Xin Guo, Pu Wang, Haiming Wang, Yiran Chen, Hai Li 23d ago

MLLM-LLaVA-FL: Multimodal Large Language Model Assisted Federated Learning

Federated learning framework using multimodal LLMs (LLaVA) to address data heterogeneity across distributed clients.

Ax Seemanta Bhattacharjee, MD. Muhtasim Fuad, A. K. M. Fakhrul Hossain 23d ago

Classification of Financial Data Using Quantum Support Vector Machine

Application of quantum support vector machines to financial data classification using Dhaka Stock Exchange dataset.

Ax Joachim Tomasi, Sandrine Anthoine, Hachem Kadri 23d ago

Benign Overfitting with Quantum Kernels

Theoretical study of benign overfitting in quantum kernel methods for machine learning on quantum computers.

Ax Gojko Perovic, Nuno Ferreira Duarte, Atabak Dehban, Gon\c{c}alo Teixeira, Egidio Falotico, Jos\'e Santos-Victor 23d ago

HERB: Human-augmented Efficient Reinforcement learning for Bin-packing

Human-augmented reinforcement learning approach for 3D bin-packing in logistics, combining RL with human feedback to reduce training time.

Ax Alireza Furutanpey, Carmen Walser, Philipp Raith, Pantelis A. Frangoudis, Schahram Dustdar 23d ago

Leveraging Neural Graph Compilers in Machine Learning Research for Edge-Cloud Systems

Evaluation of neural network graph compilers across heterogeneous hardware platforms, showing how vendor-specific optimizations affect performance comparisons.

Ax Alina Wernick, Kristof Meding 23d ago

Position: EU AI Act's Research Exemptions Can Break the Publication Norms of Major AI Conferences

Position paper on EU AI Act research exemptions and potential conflicts with academic publication norms at major conferences.

Ax Gautam Jajoo, Atharva Pandey, Pranjal A Chitale, Saksham Agarwal 23d ago

MASCA: LLM based-Multi Agents System for Credit Assessment

Multi-agent LLM system for credit assessment that mirrors real-world decision-making processes in financial evaluation.

Ax Constantin Venhoff, Iv\'an Arcuschin, Philip Torr, Arthur Conmy, Neel Nanda 23d ago

Base Models Know How to Reason, Thinking Models Learn When

Analysis of reasoning behaviors in thinking language models using Sparse Autoencoders and model diffing techniques.

Ax Scott Thornton 23d ago

SecureCode: A Production-Grade Multi-Turn Dataset for Training Security-Aware Code Generation Models

Production-grade dataset of 2,185 multi-turn examples for training secure code generation models covering OWASP Top 10 and ML security.

Ax Joyjit Roy, Samaresh Kumar Singh 23d ago

Agentic AI for Commercial Insurance Underwriting with Adversarial Self-Critique

LLM-based agentic AI system for insurance underwriting with self-critique mechanisms for high-stakes decision support.

Ax Haitao Lin, Hanyang Yu, Jingshun Huang, He Zhang, Yonggen Ling, Ping Tan, Xiangyang Xue, Yanwei Fu 23d ago

PoseVLA: Universal Pose Pretraining for Generalizable Vision-Language-Action Policies

Vision-Language-Action model for robotic control that combines VLM representations with 3D pose understanding for embodied AI tasks.

Ax Zhenhang Shang, Yingzhe Yu, Kani Chen 23d ago

Fine-Tuning Integrity for Modern Neural Networks: Structured Drift Proofs via Norm, Rank, and Sparsity Certificates

Cryptographic method to verify fine-tuned neural network models haven't deviated from claimed update procedures without accessing parameters.

Ax Ziwei Su, Imon Banerjee, Diego Klabjan 23d ago

Model-based Bootstrap of Controlled Markov Chains

arXiv paper on model-based bootstrap for transition kernels in controlled Markov chains with applications to offline reinforcement learning.

Ax Sayak Dutta 23d ago

CARVE: Content-Aware Recurrent with Value Efficiency for Chunk-Parallel Linear Attention

arXiv research on CARVE, recurrent model with improved state management for memory-aware chunk-parallel linear attention.

Ax Lianghua Huang, Zhi-Fan Wu, Yupeng Shi, Wei Wang, Mengyang Feng, Junjie He, Chen-Wei Xie, Yu Liu, Jingren Zhou, Ang Wang, Bang Zhang, Baole Ai, Chen Liang, Cheng Yu, Chongyang Zhong, Jinwei Qi, Kai Zhu, Pandeng Li, Peng Zhang, Wenyuan Zhang, Xinhua Cheng, Yitong Huang, Yun Zheng, Yuxiang Bao, Yuzheng Wang, Zoubin Bi 23d ago

Wan-Streamer v0.2: Higher Resolution, Same Latency

arXiv paper on Wan-Streamer v0.2, audio-visual interaction model achieving higher resolution (640x368) while maintaining 200ms latency.

Ax Mouhamed Amine Bouchiha, Gregory Blanc 23d ago

TACTIC-KG: Toward Small Agent Teams for Cyber Threat Intelligence Knowledge Graph Construction

arXiv paper on TACTIC-KG using small LLM agent teams for constructing cybersecurity threat intelligence knowledge graphs from unstructured text.

Ax Zhifeng Kong, Sang-gil Lee, Jaehyeon Kim, Boxin Wang, Zihan Liu, Sungwon Kim, Yang Chen, Arushi Goel, Rajarshi Roy, Wenliang Dai, Zhuolin Yang, Yangyi Chen, Dongfu Jiang, Sreyan Ghosh, Tuomas Rintamaki, Andrew Tao, Jonathan Raiman, Mohammad Shoeybi, Bryan Catanzaro, Wei Ping 23d ago

Unified Audio Intelligence Without Regressing on Text Intelligence

arXiv paper introducing Nemotron Audex-30B-A3B, unified audio-text LLM maintaining text performance while adding audio understanding.

Ax Anthony Hu, V\'aclav Volhejn, Adrien Ramanana Rahary, Chris Mulder, Aditya Makkar, Alyx Liao, Am\'elie Royer, Manu Orsini, Adam Jelley, Eloi Alonso, Florian Laurent, Fredrik Nor\'en, James Swingos, Jan H\"unermann, Kent Rollins, Lucas Hosseini, Matthieu Le Cauchois, Maxim Peter, Pim de Witte, Tim Brown, Vincent Micheli, Moritz B\"ohle, Gabriel de Marmiesse, Viktoriia Sharmanska, Lucia Specia, Michael Black, Patrick P\'erez 23d ago

Multiplayer Interactive World Models with Representation Autoencoders

arXiv paper on multiplayer world models for dynamic environments with complex physics, conditioning on multiple agent action streams.