A computer backgammon player that
learned to play by playing against
itself using Reinforcement Learning and Temporal Difference Learning.
After some tuning it has turned
into one of the very best backgammon
players in the world. It was created
by Gerald Tesauro of IBM.
His paper is
available online at http://www.research.ibm.com/xw-D953/tdl.html .

Log in or registerto write something here or to contact authors.