Conclusion
AlphaZero is a pivotal use of neural networks and Monte Carlo tree search to create computers that become subject-matter experts without input from human experts. The amount of time or computing power to train these networks to the point of being an expert is enormous, but the results are beyond promising. The following video provides a surface-level recap of AlphaZero and how it is used to intelligently play complex deterministic games.
Again, to reiterate, our results shows that the AI trained to play Pente improved drastically in the first few iterations of training, and then the performance begins to slowly degrade over time.
Figure 1: Elo Rating vs Training Iteration
But why should anyone care about the results of this project or the results of AlphaZero? As the video summarizes, we are obligated to ask "why is this worth doing?" when the final version of a model like this requires a tremendous amount of computing power to create.
This question was answered two months ago when DeepMind published an article describing how they used essentially the same combination of algorithms to discover a new, much more efficient method of matrix multiplication than the previous best method known. In other words, methods like AlphaZero are having a big impact beyond just the rather eccentric use case of playing board games. Matrix multiplication isn't going anywhere, so the one-time use of a very large number of computing resources has created the opportunity to reduce energy usage and computing power needs for all future operations involving matrix operations.
Beyond the technical world, AlphaZero has potential use cases in all areas of graph exploration, whether to make better internet search engines, music recommendations, self-driving cars, directions to the grocery store, or any number of other uses. All of these potential use cases make it a valuable algorithm to understand.
Comments
Post a Comment