Skip Nav Destination
Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning
Neural Computation (2010) 22 (2): 342–376.
This article has been cited by the following articles in journals that are participating in Crossref Cited-by Linking.
- Ryo IWAKI
- Hiroki YOKOYAMA
- Minoru ASADA
IEICE Transactions on Information and Systems (2018) E101.D (9): 2346.
- Ivo Grondman
- Lucian Busoniu
- Gabriel A. D. Lopes
- Robert Babuska
IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews) (2012) 42 (6): 1291.