Ax Payel Bhattacharjee, Osvaldo Simeone, Ravi Tandon 2/20/2026

MARS: Margin-Aware Reward-Modeling with Self-Refinement

Margin-aware reward modeling framework with self-refinement for RLHF/RLAIF alignment pipelines, reducing reliance on human preference data through augmentation.

Ax Simon Lermen, Daniel Paleka, Joshua Swanson, Michael Aerni, Nicholas Carlini, Florian Tram\`er 2/20/2026

Large-scale online deanonymization with LLMs

Large-scale deanonymization attack using LLM agents with internet access to re-identify pseudonymous online profiles.

Ax Yiqing Xie, Emmy Liu, Gaokai Zhang, Nachiket Kotalwar, Shubham Gandhi, Sathwik Acharya, Xingyao Wang, Carolyn Rose, Graham Neubig, Daniel Fried 2/20/2026

Hybrid-Gym: Training Coding Agents to Generalize Across Tasks

Hybrid-Gym environment for training coding agents on diverse software engineering tasks beyond single GitHub issues.

Ax Sasha Behrouzi, Lichao Wu, Mohamadreza Rostami, Ahmad-Reza Sadeghi 2/20/2026

NeST: Neuron Selective Tuning for LLM Safety

NeST selective neuron tuning approach for parameter-efficient LLM safety alignment without full fine-tuning overhead.

Ax Kiana Farhadyar, Maren Hackenberg, Kira Ahrens, Charlotte Schenk, Bianca Kollmann, Oliver T\"uscher, Klaus Lieb, Michael M. Plichta, Andreas Reif, Raffael Kalisch, Martin Wolkewitz, Moritz Hess, Harald Binder 2/20/2026

A statistical perspective on transformers for small longitudinal cohort data

Transformer architecture applied to longitudinal cohort data modeling with attention mechanisms for temporal dependencies.

Ax Sorawit Saengkyongam, Juan L. Gamella, Andrew C. Miller, Jonas Peters, Nicolai Meinshausen, Christina Heinze-Deml 2/20/2026

Anti-causal domain generalization: Leveraging unlabeled data

Domain generalization approach leveraging unlabeled data in anti-causal settings where outcomes cause observed features.

Ax Etienne Lempereur, Nathana\"el Cuvelle--Magar, Florentin Coeurdoux, St\'ephane Mallat, Eric Vanden-Eijnden 2/20/2026

MGD: Moment Guided Diffusion for Maximum Entropy Generation

Diffusion-based method for generating high-dimensional samples from moment constraints with maximum entropy guarantees.

Ax Marcin P{\l}odzie\'n 2/20/2026

Quantum Scrambling Born Machine

Quantum generative modeling approach using fixed entangling unitaries with optimized single-qubit rotations.

Ax Mateusz Nowak, Xavier Cadet, Peter Chin 2/20/2026

ABCD: All Biases Come Disguised

Study of position and label biases in LLM multiple-choice question answering via synthetic benchmark evaluation.