博文

计算机玩各种棋达到了什么水平？

已有 6992 次阅读 2014-8-18 13:21 |个人分类:科普小兵|系统分类:科普集锦

IEEE Spectrum杂志2014年7月号发表题为“AIs Have Mastered Chess. Will Go Be Next?”（人工智能掌握了国际象棋。围棋会是下一个目标吗？）的文章（http://spectrum.ieee.org/robotics/artificial-intelligence/ais-have-mastered-chess-will-go-be-next）。文中附有计算机玩9种棋类游戏已经达到的“段位”。一种棋戏所可能含有的走法（状态）越多，计算机掌握它就越难。围棋的可能状态数达到10的172次方量级，故目前的围棋弈棋程序只达到了业余高手的水平。对此文感兴趣的还可参阅该杂志的中文版《科技纵横》（本所主办）7月号。

goPlaying1

Tic－TAC-TOE

Game positions: 10⁴
Computer strength: PERFECT

goPlaying2 OWARE

Game positions: 10¹¹
Computer strength: PERFECT

goPlaying3 CHECKERS（跳棋）

Game positions: 10²⁰
Computer strength: PERFECT

goPlaying4 OTHELLO

Game positions: 10²⁸
Computer strength: SUPERHUMAN

goPlaying5 9-BY-9 GO

Game positions: 10³⁸
Computer strength: BEST PROFESSIONAL

goPlaying6 CHESS（国际象棋）

Game positions: 10⁴⁵
Computer strength: SUPERHUMAN

goPlaying7 XIANGQI (CHINESE CHESS，中国象棋)

Game positions: 10⁴⁸
Computer strength: BEST PROFESSIONAL

goPlaying8 SHOGI (JAPANESE CHESS)

Game positions: 10⁷⁰
Computer strength: STRONG PROFESSIONAL

goPlaying9 19-BY-19 GO（围棋）

Game positions: 10¹⁷²
Computer strength: STRONG AMATEUR

A Go-Playing AI can repeatedly apply its MCTS algorithm until resources—time or memory—run out. Like many other search methods, MCTS constructs a game tree, in which each possible move creates branches of new possible moves, which are conventionally drawn pointing downward. For a basic example of this algorithm, imagine that a Go program is trying to decide on its next move. It would therefore repeat these four steps:

Tree descent: From the existing board position (the root node of the search tree), select a candidate move (a leaf node) for evaluation. At the very beginning of the search, the leaf node is directly connected to the root. Later on, as the search deepens, the program follows a long path of branches to reach the leaf node to be evaluated.
Simulation: From the selected leaf node, choose a random sequence of alternating moves until the end of the game is reached.
Evaluation and back propagation: Determine whether the simulation ends in a win or loss. Use that result to update the statistics for each node on the path from the leaf back to the root. Discard the simulation sequence from memory—only the result matters.
Tree expansion: Grow the game tree by adding an extra leaf node to it.

转载本文请联系原作者获取授权，同时请注明本文来自武夷山科学网博客。
链接地址：https://blog.sciencenet.cn/blog-1557-820376.html

上一篇：Use the Right Word摘抄系列（40）--“善”之表达
下一篇：从轻便拖把的诞生看创新链

收藏 IP: 168.160.20.*| 热度|

武夷山分享 http://blog.sciencenet.cn/u/Wuyishan 中国科学技术发展战略研究院研究员；南京大学信息管理系博导

博文

计算机玩各种棋达到了什么水平？

当前推荐数：20 推荐人：田瑞强 钟炳 李学宽 董焱章 刘洋 张利华 张忆文 庄世宇 杨文卿 陈学雷 戴德昌 杨正瓴 秦承志 唐常杰 李世春 白图格吉扎布 陈筝 蒋迅 冯大诚 aliala

该博文允许注册用户评论请点击登录评论 (9 个评论)

武夷山

全部作者的精选博文

全部作者的其他最新博文

全部精选博文导读

相关博文

武夷山分享 http://blog.sciencenet.cn/u/Wuyishan 中国科学技术发展战略研究院研究员；南京大学信息管理系博导

博文

计算机玩各种棋达到了什么水平？

当前推荐数：20 推荐人： 田瑞强 钟炳 李学宽 董焱章 刘洋 张利华 张忆文 庄世宇 杨文卿 陈学雷 戴德昌 杨正瓴 秦承志 唐常杰 李世春 白图格吉扎布 陈筝 蒋迅 冯大诚 aliala

该博文允许注册用户评论 请点击登录 评论 (9 个评论)

武夷山

全部作者的精选博文

全部作者的其他最新博文

全部精选博文导读

相关博文

当前推荐数：20 推荐人：田瑞强钟炳李学宽董焱章刘洋张利华张忆文庄世宇杨文卿陈学雷戴德昌杨正瓴秦承志唐常杰李世春白图格吉扎布陈筝蒋迅冯大诚 aliala

该博文允许注册用户评论请点击登录评论 (9 个评论)