AI spends 7,000 hours beating Pokemon Red’s first gym, but still can’t find the second one after 50,000 hours

0
1024


One programmer has given an AI model 50,000 hours worth of training in how to play Pokemon Red, leading to an algorithm that’s capable of exploring the game and building a team to defeat the first gym leader – but not one that can find its way through Mt. Moon or know better than to keep buying Magikarp. Most of all, this exercise is a fascinating way to get an idea of how machine learning actually works.

As outlined in an extensive video by Peter Whidden, the AI is able to interact with the game through the usual control inputs on an emulator. It hits a button and looks at the screen to see what happened, the same as a human player. Whidden set learning sessions at two hours worth of game time apiece, though with emulation sped up those sessions could be completed in around six minutes of real-time – and the process was further sped up by running 40 testing sessions simultaneously.



Source link