Off-policy Reinforcement Learning with Model-based Exploration Augmentation
Published in Neural Information Processing Systems (NeurIPS), 2025
Likun Wang, Xiangteng Zhang, Yinuo Wang, Guojian Zhan, Wenxuan Wang, Haoyu Gao, Jingliang Duan, Shengbo Eben Li
