Ax Frieda Born, Tom Neuh\"auser, Lukas Muttenthaler, Brett D. Roads, Bernhard Spitzer, Andrew K. Lampinen, Matt Jones, Klaus-Robert M\"uller, Michael C. Mozer 5d ago

Context Sensitivity Improves Human-Machine Visual Alignment

Machine learning method improving alignment between human and model visual perception through context-sensitive embedding representations.

Ax Joel Niklaus, Atsuki Yamaguchi, Michal \v{S}tef\'anik, Guilherme Penedo, Hynek Kydl\'i\v{c}ek, Elie Bakouch, Lewis Tunstall, Edward Emanuel Beeching, Thibaud Frere, Colin Raffel, Leandro von Werra, Thomas Wolf 5d ago

How Can We Synthesize High-Quality Pretraining Data? A Systematic Study of Prompt Design, Generator Model, and Source Data

Systematic study of synthetic data generation for LLM pretraining, testing rephrasing strategies, generator models, and source data across one trillion tokens to identify optimal design choices.

Ax Davyd Naveriani, Albert Zeyer, Ralf Schl\"uter, Hermann Ney 5d ago

Diffusion Language Models for Speech Recognition

Exploration of masked and uniform-state diffusion language models for speech recognition rescoring and ASR hypothesis improvement.

Ax Buse \c{S}en, Yifan Hu, Daniel Kuhn 5d ago

Multistage Conditional Compositional Optimization

Multistage conditional compositional optimization framework combining stochastic programming with uncertainty handling. Applied to control and stopping problems.

Ax Vansh Kapoor, Jayakrishnan Nair 5d ago

MDPs with a State Sensing Cost

Markov decision processes with state sensing costs, balancing optimal actions against sensing/communication/computation expenses in decision-making.

Ax Mingkuan Feng, Jinyang Wu, Siyuan Liu, Shuai Zhang, Ruihan Jin, Feihu Che, Pengpeng Shao, Zhengqi Wen, Jianhua Tao 5d ago

Two-Stage Regularization-Based Structured Pruning for LLMs

Two-stage regularization-based structured pruning method for reducing LLM parameters while minimizing knowledge loss and retraining requirements.

Ax Amir Noorizadegan, Sifan Wang, Leevan Ling, Juan P. Dominguez-Morales 5d ago

A Practitioner's Guide to Kolmogorov-Arnold Networks

Comprehensive review of Kolmogorov-Arnold Networks covering theory, relationships to MLPs and kernel methods, and applications.

Ax Erle Zhu, Dazhi Jiang, Yuan Wang, Xujun Li, Jiale Cheng, Yuxian Gu, Yilin Niu, Aohan Zeng, Jie Tang, Minlie Huang, Hongning Wang 5d ago

Data-Efficient RLVR via Off-Policy Influence Guidance

Influence-guided data selection for RLVR with theoretical guarantees for improving LLM reasoning efficiency.

Ax Julian Kleutgens, Claudio Battiloro, Lingkai Kong, Benjamin Grewe, Francesca Dominici, Mauricio Tec 5d ago

Guided Transfer Learning for Discrete Diffusion Models

Transfer learning via classifier guidance for discrete diffusion models in small-data regimes, extending continuous diffusion techniques.

Ax Ioannis Anagnostides, Gabriele Farina, Maxwell Fishelson, Haipeng Luo, Jon Schneider 5d ago

Swap Regret Minimization Through Response-Based Approachability

Computationally efficient algorithms for swap regret minimization in online optimization with connections to correlated equilibrium and non-manipulability.