Csaba Szepesvári,
a Hungarian computer scientiest with research interests in applications of statistical techniques in AI, and Reinforcement Learning[1].
In 2006, together with Levente Kocsis, Csaba Szepesvári introduced UCT (Upper Confidence bounds applied to Trees), a new algorithm that applies bandit ideas to guide Monte-Carlo planning[4].
Csaba Szepesvári (1998). Reinforcement Learning: Theory and Practice. in Proceedings of the 2nd Slovak Conference on Artificial Neural Networks, zipped ps
Rémi Munos, Csaba Szepesvári (2008). Finite time bounds for sampling based fitted value iteration. Journal of Machine Learning Research, 9:815-857, 2008. pdf, pdf
a Hungarian computer scientiest with research interests in applications of statistical techniques in AI, and Reinforcement Learning [1].
Csaba Szepesvári worked at the Computer and Automation Research Institute of the Hungarian Academy of Sciences, and is actually Associate Professor [2] at the Department of Computing Science, University of Alberta and is principal investigator of the RLAI [3] group.
In 2006, together with Levente Kocsis, Csaba Szepesvári introduced UCT (Upper Confidence bounds applied to Trees), a new algorithm that applies bandit ideas to guide Monte-Carlo planning [4].
Table of Contents
Selected Publications
[6] [7]1994 ...
2005 ...
2010 ...
External Links
References
What links here?
Up one level