國立臺灣師範大學博碩士論文全文系統

簡易檢索 / 詳目顯示

回結果列表

研究生：	林猷琛 Lin, Yu-Chen
論文名稱：	利用他家資訊模組來改良麻將程式 Using Other Players' Information Models to Improve Mahjong Program
指導教授：	林順喜 Lin, Shun-Shii
學位類別：	碩士 Master
系所名稱：	資訊工程學系 Department of Computer Science and Information Engineering
論文出版年：	2020
畢業學年度：	108
語文別：	中文
論文頁數：	47
中文關鍵詞：	電腦對局、麻將、不完全資訊遊戲、機率性遊戲
英文關鍵詞：	Computer Game, Mahjong, Imperfect Information Game, Probabilistic Game
DOI URL：	http://doi.org/10.6345/NTNU202001457
論文種類：	學術論文
相關次數：	點閱：481 下載：30
分享至:	分享至facebook 分享至twitter

查詢本校圖書館目錄查詢臺灣博碩士論文知識加值系統勘誤回報

近年來，隨著設備及技術的發展，在完全資訊的眾多遊戲上，電腦已經超越人類。不同於圍棋、黑白棋、將棋，麻將是一個多人、機率性、不完全資訊的遊戲，還有更多的發展課題。這篇論文將會簡單介紹麻將的玩法，並簡述過去的麻將程式相關技術。
本論文採用部分Let’s Play Mahjong!論文的方法，將說明是如何實作出判斷手牌狀態、如何判斷鳴牌的抉擇，並以此來加強麻將程式。
本篇論文在進攻的選擇上，採用注重機率的棄牌選擇。在防守上則將他人的棄牌以及整個盤面紀錄，以此分析並將其資訊化成數值。最後將防守和進攻的選擇一同判斷後，選擇出最佳的棄牌。
在防守策略上並非純以統計或模擬的方式，而是根據玩家在棄牌時，會以最佳化自己的手牌做處理為原則，並依照在其他論文看到的理論進行推導，做出一個防守模組。
解讀數據上，將實驗時的統計解讀方式做了修正，避免運氣上的成分超過實際的實力差。

In recent years, with the development of equipments and techniques, computers have beat top human players in many perfect information games. Rather than Go, Othello, and Shogi, Mahjong is a multi-player, probabilistic, and imperfect information game, which still has many development issues to be addressed. This thesis would simply introduce the game rules of Mahjong and talk about the programs that made before.
We apply for partial ideas in the paper titled "Let's Play Mahjong!". We would explain how to judge the state of a hand, how to choose whether to chow a tile or not, and what ideas we get to improve our program.
On offense, we use probability-oriented policy for tile selection. On defense. We observe the discarded tiles of other players to analyze their information. Then we take advantage of these statistics information as well as the offense policy to choose a best tile to discard.
On the defense strategy, we don't simply choose a tile to discard by simulation or statistics. Instead, we defense by trying to discard a tile to optimize the player's hand. According to the concepts in other papers, we make a defense model.
Sometimes, a Mahjong program would get lots of score which is not proportional to its strength. In this thesis, we focus on the rate that our program will win or lose in order to avoid the element of luck.

第一章  緒論  1
第1節  研究背景  1
第2節  研究目的     3
第3節  論文架構     4

第二章  文獻探討  5
第1節  台灣麻將及比賽規則簡介  5
第2節  相關論文及程式介紹  10
第三章  防守模組的推導及介紹  18
第1節  非需求牌  19
第2節  非需求度  21

第四章  麻將程式的更新  23
第1節  進胡數的實作  23
第2節  偵測棄牌  28
第3節  吃、碰、槓的抉擇    29
第4節  進攻價值    31
第5節  防守價值    32

第五章  實驗結果  34
第1節  實驗環境及用詞解釋  34
第2節  排除分數的原因  35
第3節  實驗數據解析  37
第4節  比賽結果與解析  42

第六章  結論與未來展望  43
6.1　進攻策略檢討  43
6.2　防守策略檢討  44
6.3　其它增強可能  44

參考文獻  46
                                

[1] D. Silver, A. Huang, C. J. Maddison, A. Guez, L. Sifre, G. V. D. Driessche, J. Schrittwieser, I. Antonoglou, V. Panneershelvam, M. Lanctot, S. Dieleman, D. Grewe, J. Nham, N. Kalchbrenner, I. Sutskever, T. Lillicrap, M. Leach, K. Kavukcuoglu, T. Graepel, and D. Hassabis, Mastering the game of Go with deep neural networks and tree search, Nature, vol. 529, no. 7587, pp. 484–489, 2016.
[2] D. Silver, J. Schrittwieser, K. Simonyan, I. Antonoglou, A. Huang, A. Guez, T. Hubert, L. Baker, M. Lai, A. Bolton, Y. Chen, T. Lillicrap, F. Hui, L. Sifre, G. V. D. Driessche, T. Graepel, and D. Hassabis, Mastering the game of Go without human knowledge, Nature, vol. 550, no. 7676, pp. 354–359, 2017.
[3] Surag Nair, Alpha Zero General, https://github.com/suragnair/alpha-zero-general
[4] Noam Brown, Tuomas Sandholm, Superhuman AI for Heads-up No-limit Poker: Libratus Beats Top Professionals, Science, Vol.359, pp.418-424, 2018.
[5] Junjie Li, Sotetsu Koyamada, Suphx: Mastering Mahjong with Deep Reinforcement Learning, arXiv:2003.13590, 2020.
[6] Cameron B. Browne, Edward Powley, A Survey of Monte Carlo Tree Search Methods, IEEE Transactions on Computational Intelligence and AI in Games, Vol.4, pp.1-43, 2012.
[7] 吳俊緯，電腦麻將程式MahJongDaXia的設計與實作，2015，國立台灣師範大學資工所碩士論文。
[8] 維基百科，麻將基本的規則介紹，https://zh.wikipedia.org/wiki/%E5%8F%B0%E7%81%A3%E9%BA%BB%E5%B0%87
[9] 林宏正，麻將平台與人工智慧程式溝通協定，2011。
[10] 陳新颺，電腦麻將程式ThousandWind的設計與實作，2013，國立台灣師範大學資工所碩士論文。
[11] 林汸燁，電腦麻將程式 MahJongJr的設計與實作，2018，國立台灣師範大學資工所碩士論文。
[12] 林典餘，麻將人工智慧之研究，2008，國立交通大學資工所碩士論文。
[13] Naoki Mizukami and Yoshimasa Tsuruoka, Building a Computer Mahjong Player Based on Monte Carlo Simulation and Opponent Models, IEEE Conference on Computational Intelligence and Games(CIG), pp.275-283, 2015.
[14] Sanjiang Li and Xueqing Yan, Let's Play Mahjong!, Computing Research Repository(CoRR abs/1903.03294), 2019.