7. Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
Wu Y, Mansimov E, Liao S, et al. Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation[J]. arXiv preprint arXiv:1708.05144, 2017.