武夷山分享 http://blog.sciencenet.cn/u/Wuyishan 中国科学技术发展战略研究院研究员;南京大学信息管理系博导

博文

计算机玩各种棋达到了什么水平?

已有 6992 次阅读 2014-8-18 13:21 |个人分类:科普小兵|系统分类:科普集锦

       IEEE Spectrum杂志2014年7月号发表题为“AIs Have Mastered Chess. Will Go Be Next?”(人工智能掌握了国际象棋。围棋会是下一个目标吗?)的文章(http://spectrum.ieee.org/robotics/artificial-intelligence/ais-have-mastered-chess-will-go-be-next)。文中附有计算机玩9种棋类游戏已经达到的“段位”。一种棋戏所可能含有的走法(状态)越多,计算机掌握它就越难。围棋的可能状态数达到10的172次方量级,故目前的围棋弈棋程序只达到了业余高手的水平。对此文感兴趣的还可参阅该杂志的中文版《科技纵横》(本所主办)7月号。


     goPlaying1

Tic-TAC-TOE

Game positions: 104
Computer strength: PERFECT


goPlaying2OWARE

Game positions: 1011
Computer strength: PERFECT


goPlaying3CHECKERS(跳棋)

Game positions: 1020
Computer strength: PERFECT


goPlaying4OTHELLO

Game positions: 1028
Computer strength: SUPERHUMAN


goPlaying59-BY-9 GO

Game positions: 1038
Computer strength: BEST PROFESSIONAL


goPlaying6CHESS(国际象棋)

Game positions: 1045
Computer strength: SUPERHUMAN


goPlaying7XIANGQI (CHINESE CHESS,中国象棋)

Game positions: 1048
Computer strength: BEST PROFESSIONAL


goPlaying8SHOGI (JAPANESE CHESS)

Game positions: 1070
Computer strength: STRONG PROFESSIONAL


goPlaying919-BY-19 GO(围棋)

Game positions: 10172
Computer strength: STRONG AMATEUR


A Go-Playing AI can repeatedly apply its MCTS algorithm until resources—time or memory—run out. Like many other search methods, MCTS constructs a game tree, in which each possible move creates branches of new possible moves, which are conventionally drawn pointing downward. For a basic example of this algorithm, imagine that a Go program is trying to decide on its next move. It would therefore repeat these four steps:

  1. Tree descent: From the existing board position (the root node of the search tree), select a candidate move (a leaf node) for evaluation. At the very beginning of the search, the leaf node is directly connected to the root. Later on, as the search deepens, the program follows a long path of branches to reach the leaf node to be evaluated.

  2. Simulation: From the selected leaf node, choose a random sequence of alternating moves until the end of the game is reached.

  3. Evaluation and back propagation: Determine whether the simulation ends in a win or loss. Use that result to update the statistics for each node on the path from the leaf back to the root. Discard the simulation sequence from memory—only the result matters.

  4. Tree expansion: Grow the game tree by adding an extra leaf node to it.



https://blog.sciencenet.cn/blog-1557-820376.html

上一篇:Use the Right Word摘抄系列(40)--“善”之表达
下一篇:从轻便拖把的诞生看创新链
收藏 IP: 168.160.20.*| 热度|

20 田瑞强 钟炳 李学宽 董焱章 刘洋 张利华 张忆文 庄世宇 杨文卿 陈学雷 戴德昌 杨正瓴 秦承志 唐常杰 李世春 白图格吉扎布 陈筝 蒋迅 冯大诚 aliala

该博文允许注册用户评论 请点击登录 评论 (9 个评论)

数据加载中...
扫一扫,分享此博文

Archiver|手机版|科学网 ( 京ICP备07017567号-12 )

GMT+8, 2024-4-20 04:34

Powered by ScienceNet.cn

Copyright © 2007- 中国科学报社

返回顶部