Skip to Main Content


Skip Nav Destination

Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning

Neural Computation (2010) 22 (2): 342–376.
This article has been cited by the following articles in journals that are participating in Crossref Cited-by Linking.
  • Ryo IWAKI
  • Hiroki YOKOYAMA
  • Minoru ASADA
IEICE Transactions on Information and Systems (2018) E101.D (9): 2346.
  • Ivo Grondman
  • Lucian Busoniu
  • Gabriel A. D. Lopes
  • Robert Babuska
IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews) (2012) 42 (6): 1291.
Close Modal

or Create an Account

Close Modal
Close Modal