Ax Aur Shalev Merin 29d ago

Temporal Credit Is Free

Recurrent network training without Jacobian propagation using hidden state temporal credit. Studies gradient normalization and online adaptation.

Ax Samuel Girard, Aurelien Bibaut, Arthur Gretton, Nathan Kallus, Houssam Zenati 29d ago

Fast Best-in-Class Regret for Contextual Bandits

Fast regret bounds for contextual bandits without realizability assumptions using pessimistic policy updates.

Ax Toufique Ahmed, Jatin Ganhotra, Avraham Shinnar, Martin Hirzel 29d ago

Investigating Test Overfitting on SWE-bench

Investigation of test overfitting in SWE-bench for code resolution, where models pass tests but miss important cases.

HN XYen0n 29d ago

SSH to any machine without IP

SSH tool for connecting to machines behind NAT/firewalls without port forwarding. Infrastructure utility unrelated to AI.