Ax Zhixu Du, Krishna Teja Chitty-Venkata, Murali Emani, Venkatram Vishwanath, Hai Helen Li, Yiran Chen 3/10/2026

Swimba: Switch Mamba Model Scales State Space Models

Mixture-of-experts approach for state space models with expert specialization while maintaining computational efficiency.

Ax Ruipeng Zhang, Hongzhan Yu, Ya-Chien Chang, Chenghao Li, Henrik I. Christensen, Sicun Gao 3/10/2026

Learning Quadruped Walking from Seconds of Demonstration

Imitation learning analysis for quadruped locomotion showing effectiveness in small data regimes via limit cycle structure.

Ax Zhiji Yang, Mei Huang, Xinyu Li, Xianli Pan, Qi Wang, Jianhua Zhao 3/10/2026

Interpretable Maximum Margin Deep Anomaly Detection

Deep SVDD improvement for anomaly detection addressing hypersphere collapse and interpretability via maximum margin approach.

Ax Woogyeol Jin, Taywon Min, Yongjin Yang, Swanand Ravindra Kadhe, Yi Zhou, Dennis Wei, Nathalie Baracaldo, Kimin Lee 3/10/2026

Entropy-Aware On-Policy Distillation of Language Models

On-policy distillation method using entropy-aware objectives for improved knowledge transfer between language models.

Ax Yair Ashlagi, Roi Livni, Shay Moran, Tom Waknine 3/10/2026

Margin in Abstract Spaces

Theoretical analysis of margin-based learning in metric spaces and generalization guarantees independent of parameter count.