DeepMind AlphaGo — LLMpedia

DeepMind AlphaGo
Name	AlphaGo
Developer	DeepMind
Initial release	October 2015
Programming language	Python, C++
Operating system	Linux
Genre	Artificial intelligence, board game software
License	Proprietary

Contents

Background and development
Architecture and algorithms
Training and evaluation
Matches and competitive performance
Impact and legacy

DeepMind AlphaGo AlphaGo was a computer program developed by DeepMind that achieved superhuman play in the board game of Go. It combined techniques from Deep learning, Reinforcement learning, Monte Carlo tree search, and expert systems to defeat top human professionals and influenced research across artificial intelligence, machine learning, computer science, and game theory. The project generated public attention through high-profile matches in South Korea, China, and United Kingdom, and prompted responses from institutions such as Google, University College London, University of Cambridge, Stanford University, and Massachusetts Institute of Technology.

Background and development

Development began at DeepMind, a company founded by Demis Hassabis, Shane Legg, and Mustafa Suleyman, after DeepMind's acquisition by Google in 2014. The team drew on foundational work by researchers at University of Toronto and University of Montreal, including contributions from Geoffrey Hinton, Yoshua Bengio, and Yann LeCun on convolutional and deep neural networks. Early Go research referenced programs such as Monte Carlo tree search implementations, GoTools, and engines derived from Alpha-beta pruning traditions like Handtalk and projects at KTH Royal Institute of Technology. Public demonstrations and collaborations involved institutions including Oxford University, Imperial College London, Tsinghua University, and Peking University.

Architecture and algorithms

AlphaGo's architecture combined deep neural networks with Monte Carlo tree search. The system used policy networks and value networks inspired by architectures from Convolutional neural network research at NYU and Facebook AI Research, and implemented via frameworks influenced by TensorFlow and earlier libraries such as Theano and Torch. Training protocols incorporated supervised learning from game records from tournaments like the Go Congress and professional matches including Honinbo and Meijin events. Reinforcement learning elements referenced methods from Temporal difference learning and algorithms related to Q-learning and Policy gradient methods. Tree search enhancements drew on statistical techniques common to Monte Carlo methods, and the overall system bridged research lines connected to IBM Watson and Deep Blue.

Training and evaluation

AlphaGo was trained using large datasets of expert games from archives maintained by organizations like the International Go Federation and national associations in Japan, Korea, and China. Supervised learning used datasets curated from professional players such as Lee Sedol, Gu Li, Fan Hui, and Ke Jie, while reinforcement learning produced self-play matches reminiscent of studies at Bell Labs and theoretical frameworks from Alan Turing and Norbert Wiener. Evaluation occurred via internal leagues and matches run on servers comparable to KGS Go Server and tournaments analogous to the Ing Cup and Samsung Cup. Metrics for performance referenced Elo-style systems implemented in platforms operated by European Go Federation and assessments by researchers affiliated with Carnegie Mellon University and Princeton University.

Matches and competitive performance

AlphaGo's public breakthrough came with a 2015 series of games where it defeated European champion Fan Hui and later a widely publicized 2016 match in Seoul against Lee Sedol, generating coverage from outlets including The New York Times, BBC, The Guardian, and Reuters. Subsequent matches included games against Ke Jie in Wuzhen and exhibition matches in San Francisco and London. The program's victories prompted responses from professional organizations such as the Korean Baduk Association, Chinese Weiqi Association, and the Nihon Ki-in. Commentary and analysis involved professionals like Gu Li, Cho Hunhyun, Park Junghwan, and journalists from The Wall Street Journal and Nature.

Impact and legacy

AlphaGo accelerated adoption of deep learning research at institutions like Google DeepMind (the successor brand), influenced curricula at Harvard University and Yale University, and catalyzed commercial interest from companies such as Google, Microsoft, Amazon Web Services, and NVIDIA. It inspired successor systems including AlphaGo Zero, AlphaZero, and efforts at research centers like OpenAI, MIT Computer Science and Artificial Intelligence Laboratory, and ETH Zurich. The project spurred ethical and policy discussions at bodies such as the European Commission, United Nations Educational, Scientific and Cultural Organization, and think tanks including Brookings Institution and RAND Corporation. Its legacy extended to applications in healthcare collaborations with institutions like Moorfields Eye Hospital and industrial projects with partners including DeepMind Health and Google Health, and influenced prize competitions like those run by XPRIZE and standards efforts at ISO.

Category:Computer Go