相似文献/References:
[1]张峰,钱辉,董春茹,等.随机状态下基于期望经验回放的Q学习算法[J].深圳大学学报理工版,2020,37(2):202.[doi:10.3724/SP.J.1249.2020.02202]
ZHANG Feng,QIAN Hui,DONG Chunru,et al.An expected experience replay based Q-learning algorithm with random state transition[J].Journal of Shenzhen University Science and Engineering,2020,37(3):202.[doi:10.3724/SP.J.1249.2020.02202]