Bicriteria Policy Optimization for High Accuracy Reinforcement Learning
Published in ⚙️🧠 IEEE Transactions on Neural Networks and Learning Systems (TNNLS) (Minor Revision), 2025
Guojian Zhan, Xiangteng Zhang, Feihong Zhang, Letian Tao, Shengbo Eben Li