Ax Bo Jiang, Sian Jin 3/31/2026

KVSculpt: KV Cache Compression as Distillation

KVSculpt: KV cache compression for long-context LLM inference treating compression as knowledge distillation, orthogonal to quantization and low-rank methods.

Ax Kieran Didi, Zuobai Zhang, Guoqing Zhou, Danny Reidenbach, Zhonglin Cao, Sooyoung Cha, Tomas Geffner, Christian Dallago, Jian Tang, Michael M. Bronstein, Martin Steinegger, Emine Kucukbenli, Arash Vahdat, Karsten Kreis 3/31/2026

Scaling Atomistic Protein Binder Design with Generative Pretraining and Test-Time Compute

Proteina-Complexa: fully atomistic protein binder generation method combining conditional generative modeling with structure-based optimization.