Yasuhiro Osaki
Yasuhiro Osaki is a Japanese computer scientist at the Department of Computer Science, Tokyo University of Agriculture and Technology, affiliated with the laboratory of Professor Yoshiyuki Kotani. His research interests include reinforcement learning and the application of TD(λ) based on Monte-Carlo simulations in computer games. The program committee of the 12th Game Programming Workshop 2007 gave the best presentation award to Yasuhiro Osaki for TD(λ)-MC: A reinforcement learning with Monte-carlo simulations [1] [2].
Selected Publications
Yasuhiro Osaki, Kazutomo Shibahara, Yasuhiro Tajima, Yoshiyuki Kotani (2007). Reinforcement Learning of Evaluation Functions Using Temporal Difference-Monte Carlo learning method. 12th Game Programming Workshop
Yasuhiro Osaki, Kazutomo Shibahara, Yasuhiro Tajima, Yoshiyuki Kotani (2008). An Othello Evaluation Function Based on Temporal Difference Learning using Probability of Winning. CIG'08, pdf
Yasuhiro Osaki, Yoshiyuki Kotani (2009). A Learning Method of Evaluation Function Based on Selective Simulations. 14th Game Programming Workshop
References
[1] Yasuhiro Osaki, Kazutomo Shibahara, Yasuhiro Tajima, Yoshiyuki Kotani (2007). Reinforcement Learning of Evaluation Functions Using Temporal Difference-Monte Carlo learning method. 12th Game Programming Workshop, pdf (Japanese)
[2] TD-Lambda from Wikipedia