Skip Nav Destination
An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions
Neural Computation (2016) 28 (3): 563–593.
This article has been cited by the following articles in journals that are participating in Crossref Cited-by Linking.
- Yanning Li
- Yi Zhang
- Ruixin Wang
- Jiangfeng Fu
Sensors (2024) 24 (11): 3323.