Bernhard Schölkopf is Director at the Max Planck Institute for Intelligent Systems in Tübingen, Germany. He is coauthor of
John Platt is the Manager of the Knowledge Tools group at Microsoft Research, and Program Chair of the 2006 NIPS conference.
Thomas Hofmann is a Director of Engineering at Google's Engineering Center in Zurich and Adjunct Associate Professor of Computer Science at Brown University.
Logarithmic Online Regret Bounds for Undiscounted Reinforcement Learning
-
Published:2007
-
In Special Collection: CogNet
Peter Auer, Ronald Ortner, 2007. "Logarithmic Online Regret Bounds for Undiscounted Reinforcement Learning", Advances in Neural Information Processing Systems 19: Proceedings of the 2006 Conference, Bernhard Schölkopf, John Platt, Thomas Hofmann
Download citation file: