Ax Stefano Goria, Levent A. Mengütürk, Murat C. Mengütürk, Berkan Sesen 27d ago

Random-Bridges as Stochastic Transports for Generative Models

Framework for generative models that uses random bridges, stochastic processes conditioned on target distributions, as flexible transports between distributions.

Ax Angelika Romanou, Mark Ibrahim, Candace Ross, Chantal Shaib, Kerem Oktar, Samuel J. Bell, Anaelia Ovalle, Jesse Dodge, Antoine Bosselut, Koustuv Sinha, Adina Williams 27d ago

Brittlebench: Quantifying LLM robustness via prompt sensitivity

Framework for measuring LLM robustness to prompt variations, typos, and alternative phrasings in real-world inputs.

Ax Zijin Gu, Tatiana Likhomanenko, Vimal Thilak, Jason Ramapuram, Navdeep Jaitly 27d ago

Path-Constrained Mixture-of-Experts

Research on sparse Mixture-of-Experts architectures that proposes an expert-path perspective for understanding token routing patterns across layers.

Ax Yufei Xu, Fanxu Meng, Fan Jiang, Yuxuan Wang, Ruijie Zhou, Zhaohui Wang, Jiexi Wu, Zhixin Pan, Xiaojuan Tang, Wenjie Pei, Tongxuan Liu, Di Yin, Xing Sun, Muhan Zhang 27d ago

HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention

Hierarchical indexing system for efficient sparse attention in LLMs that reduces the indexer bottleneck in token-level sparse mechanisms.