Ax Tao Wang, Suhang Zheng, Xiaoxiao Xu 9d ago

RTMC: Step-Level Credit Assignment via Rollout Trees

Rollout-tree-based credit assignment method for multi-step agentic RL that exploits implicit state overlap among group rollouts to assign step-level advantages instead of one uniform advantage per trajectory.
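
To make the idea in this summary concrete, here is a minimal sketch of step-level credit from a rollout tree. It is not the paper's algorithm: it assumes that state overlap can be detected as identical state prefixes, that each rollout has a single terminal reward, and that a node's value is the mean reward of the rollouts passing through it. The function name `step_advantages` and the input layout are hypothetical.

```python
from collections import defaultdict


def step_advantages(rollouts):
    """Illustrative step-level advantages from a prefix tree of rollouts.

    rollouts: list of (states, reward) pairs, where `states` is a tuple of
    hashable state identifiers and `reward` is the terminal reward.
    Assumption (not from the source): two rollouts share a tree node
    when their state prefixes are identical.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)

    # Each prefix of a rollout is a tree node; accumulate terminal rewards
    # of every rollout that passes through that node.
    for states, reward in rollouts:
        for t in range(len(states) + 1):
            prefix = states[:t]
            totals[prefix] += reward
            counts[prefix] += 1

    # Node value: mean terminal reward over rollouts sharing the prefix.
    value = {prefix: totals[prefix] / counts[prefix] for prefix in totals}

    # Advantage of step t: value of the child node minus value of its parent.
    return [
        [value[states[: t + 1]] - value[states[:t]] for t in range(len(states))]
        for states, _ in rollouts
    ]


# Hypothetical group of three rollouts; the first two overlap on state "a".
group = [(("a", "b"), 1.0), (("a", "c"), 0.0), (("d", "e"), 0.0)]
advantages = step_advantages(group)
```

Under these assumptions, the per-step advantages of a rollout sum to its reward minus the group mean, so the uniform trajectory-level signal is redistributed across steps rather than copied to each one.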

Ax Wei Li, Hangjie Yuan, Zixiang Zhao, Borui Kang, Ziwei Liu, Tao Feng 9d ago

A Faster Path to Continual Learning

Optimization technique for continual learning that reduces the computational overhead of C-Flat while maintaining its ability to balance performance on new and old tasks.