Bicriteria Policy Optimization for High Accuracy Reinforcement Learning

Published in ⚙️🧠 IEEE Transactions on Neural Networks and Learning Systems (TNNLS) (Minor Revision), 2025

Guojian Zhan, Xiangteng Zhang, Feihong Zhang, Letian Tao, Shengbo Eben Li