Volodymyr+Mnih

a Canadian research scientist at at Google DeepMind with expertise in deep learning, heading the team working on deep Q-networks (DQN) mastering [|Atari games]. DQNs were tested with games such as [|Pong], [|Space Invaders], [|Breakout] and [|Seaquest], receiving only the pixels and the game score as inputs, to surpass the performance of all previous algorithms and achieve a level comparable to that of a professional human games tester across a set of 49 games, using the same algorithm, network architecture and hyperparameters. Volodymyr Mnih holds a Ph.D. in machine learning from University of Toronto under supervision of Geoffrey E. Hinton, and a Master's degree in computing science fro University of Alberta where his advisor was Csaba Szepesvári. || toc =Selected Publications=
 * Home * People * Volodymyr Mnih**
 * [[image:VolodymyrMnih.jpg link="https://www.cs.toronto.edu/~vmnih/"]] ||~ || **Volodymyr Mnih**,
 * Volodymyr Mnih ||~ ||^ ||

2008

 * Volodymyr Mnih (**2008**). //Efficient Stopping Rules//. Masters thesis, University of Alberta, advisor: Csaba Szepesvári, [|pdf]

2010 ...

 * Volodymyr Mnih, Geoffrey E. Hinton (**2010**). //[|Learning to Detect Roads in High-Resolution Aerial Images]//. [|ECCV 2010]
 * Volodymyr Mnih (**2013**). //Machine Learning for Aerial Image Labeling//. Ph.D. thesis, University of Toronto, advisor Geoffrey E. Hinton, [|pdf]
 * Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, Martin Riedmiller (**2013**). //Playing Atari with Deep Reinforcement Learning//. [|arXiv:1312.5602]

2015 ...

 * Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin Riedmiller, Andreas K. Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, Demis Hassabis (**2015**). //[|Human-level control through deep reinforcement learning]//. [|Nature], Vol. 518
 * Volodymyr Mnih, Adrià Puigdomènech Badia, Mehdi Mirza, Alex Graves, Timothy Lillicrap, Tim Harley, David Silver, Koray Kavukcuoglu (**2016**). //Asynchronous Methods for Deep Reinforcement Learning//. [|arXiv:1602.01783v2]
 * Max Jaderberg, Volodymyr Mnih, Wojciech Marian Czarnecki, Tom Schaul, Joel Z. Leibo, David Silver, Koray Kavukcuoglu (**2016**). //Reinforcement Learning with Unsupervised Auxiliary Tasks//. [|arXiv:1611.05397v1]
 * Hado van Hasselt, Arthur Guez, Matteo Hessel, Volodymyr Mnih, David Silver (**2016**). //Learning values across many orders of magnitude//. [|arXiv:1602.07714v2], [|NIPS 2016]

=External Links=
 * [|Vlad Mnih - Homepage]
 * [|Volodymyr Mnih - Google Scholar Citations]

=References= =What links here?= include page="Volodymyr Mnih" component="backlinks" limit="40"
 * Up one level**