Reward-Weighted Regression with Sample Reuse for Direct Policy Search in Reinforcement Learning
Neural Computation (2011) 23 (11): 2798–2832.
This article has been cited by the following articles in journals that are participating in Crossref Cited-by Linking.
- Young-Ha Yang
- Cheol-Soo Lee
Journal of Korea Robotics Society (2022) 17 (1): 1.
- Jiexin Wang
- Eiji Uchibe
- Kenji Doya
Frontiers in Neurorobotics (2017) 11
- Norikazu Sugimoto
- Voot Tangkaratt
- Thijs Wensveen
- Tingting Zhao
- Masashi Sugiyama
- Jun Morimoto
IEEE Robotics & Automation Magazine (2016) 23 (1): 96.
- Takahiro Hasegawa
- Takamitsu Matsubara
- Kenji Sugimoto
Transactions of the Institute of Systems, Control and Information Engineers (2016) 29 (8): 346.
- Jiexin Wang
- Eiji Uchibe
- Kenji Doya
Artificial Life and Robotics (2016) 21 (1): 125.
- Paweł Wawrzyński
- Ajay Kumar Tanwani
Neural Networks (2013) 41: 156.
- Tingting Zhao
- Hirotaka Hachiya
- Voot Tangkaratt
- Jun Morimoto
- Masashi Sugiyama
Neural Computation (2013) 25 (6): 1512.
- Masashi Sugiyama
- Makoto Yamada
- Marthinus Christoffel du Plessis
Wiley Interdisciplinary Reviews: Computational Statistics (2013) 5 (6): 465.