Skip to Main Content

Advertisement

Skip Nav Destination

Reward-Weighted Regression with Sample Reuse for Direct Policy Search in Reinforcement Learning

Neural Computation (2011) 23 (11): 2798–2832.
This article has been cited by the following articles in journals that are participating in Crossref Cited-by Linking.
  • Young-Ha Yang
  • Cheol-Soo Lee
Journal of Korea Robotics Society (2022) 17 (1): 1.
  • Norikazu Sugimoto
  • Voot Tangkaratt
  • Thijs Wensveen
  • Tingting Zhao
  • Masashi Sugiyama
  • Jun Morimoto
IEEE Robotics & Automation Magazine (2016) 23 (1): 96.
  • Takahiro Hasegawa
  • Takamitsu Matsubara
  • Kenji Sugimoto
Transactions of the Institute of Systems, Control and Information Engineers (2016) 29 (8): 346.
  • Jiexin Wang
  • Eiji Uchibe
  • Kenji Doya
Artificial Life and Robotics (2016) 21 (1): 125.
  • Paweł Wawrzyński
  • Ajay Kumar Tanwani
Neural Networks (2013) 41: 156.
  • Tingting Zhao
  • Hirotaka Hachiya
  • Voot Tangkaratt
  • Jun Morimoto
  • Masashi Sugiyama
Neural Computation (2013) 25 (6): 1512.
  • Masashi Sugiyama
  • Makoto Yamada
  • Marthinus Christoffel du Plessis
Wiley Interdisciplinary Reviews: Computational Statistics (2013) 5 (6): 465.
Close Modal

or Create an Account

Close Modal
Close Modal