编辑“︁AlphaGo”︁（章节）

== 算法 ==
AlphaGo使用[[蒙特卡洛树搜索]]（{{lang|en|Monte Carlo tree search}}），借助估值网络（value network）与走棋网络（policy network）这两种[[深度神经网络]]，通过估值网络来评估大量选点，并通过走棋网络选择落点<ref name=DeepMindnature2016/><ref name="googlego"/>。AlphaGo最初通过模仿人类玩家，尝试匹配职业棋手的过往棋局，其数据库中约含3000万步棋着。<ref name=MetzWired2016/>後來它达到了一定的熟练程度，它开始和自己对弈大量棋局，使用[[强化学习]]进一步改善它<ref name="googlego"/>。围棋无法仅通过寻找最佳棋步来解决；<ref name=":0">{{Cite web|title = Google AI algorithm masters ancient game of Go|url = http://www.nature.com/news/google-ai-algorithm-masters-ancient-game-of-go-1.19234|website = Nature News & Comment|access-date = 2016-01-31|archive-url = https://web.archive.org/web/20190502200837/http://www.nature.com/news/google-ai-algorithm-masters-ancient-game-of-go-1.19234|archive-date = 2019-05-02|dead-url = no}}</ref>游戏一盘平均约有150步，每一步平均约有200种可选的下法，<ref name="#1"/>这意味着有太多需要解决的可能性<ref name=":0" />。