簡易檢索 / 詳目顯示

研究生: 黃南軒
Huang, Nan-Hsuan
論文名稱: 遠讀<魔戒>及其人物網絡的統計分析
Distant reading of “The Lord of the Rings” and the statistical analysis of its character network
指導教授: 陳啟明
Chen, Chi-Ming
口試委員: 馬偉雲
Ma, Wei-Yun
陳柏琳
Chen, Bo-Lin
陳啟明
Chen, Chi-Ming
口試日期: 2022/05/30
學位類別: 碩士
Master
系所名稱: 物理學系
Department of Physics
論文出版年: 2022
畢業學年度: 110
語文別: 中文
論文頁數: 53
中文關鍵詞: 最小展開聚類距離矩陣中心性語文探索與字詞計算社群網絡
英文關鍵詞: MSC, Distance matrix, Centrality, LIWC, Social network
DOI URL: http://doi.org/10.6345/NTNU202201181
論文種類: 學術論文
相關次數: 點閱:107下載:17
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 遠讀是一種用於文學研究的計算方法,是利用自動化的演算法去擷取並分析文學作品中的資訊,本研究開發出一種利用分群演算法、可視化方法、以及語言分析的工具去遠讀一本小說的處理方法,來獲得小說的情節與角色間的相互關係,以本研究為例,小說<魔戒>中的角色網絡就是利用此方法來處理的,小說人物的資訊是利用一種叫做專有名詞辨識的方法擷取出來的,接著利用一種叫做最小展開聚類的演算法去分析確切的角色網絡,並建構出社群的網絡結構。
    接著進行統計分析調查角色的作用還有信息流,整體角色網絡結構的可視化是利用一個叫做Gephi的軟體,藉由網絡的可視化更進一步展示出角色之間的關係,而主要角色的個性特徵則是利用一個叫做語文探索與字詞計算的分析工具去分析角色的對話內容,此外,為了瞭解小說的發展,上述的聚類與統計分析是和故事線一併進行的,最後,本研究利用了語文探索與字詞計算的工具去調查了小說的情感基調隨著小說隨故事線的演進。

    Distant reading is a computational approach in literary studies that applies automated algorithms to extract and analyze literary information. In this work, I developed an approach for the distant reading of a novel by combining clustering algorithms, visualization methods, and language analytic tools to obtain a perspective of its plot and visualize the interaction between characters. As an example, the character network in “The Lord of the Rings” was analyzed with this approach. The information of characters in the novel was extracted by named entity recognition (NER). The identified character network was then analyzed by the minimum span clustering (MSC) algorithm to construct the community structure of the network. Statistical analyses were performed to investigate the role of characters as well as the flow of information. The overall structure of the character network was visualized by the software Gephi to further reveal the relationship between characters. The personality traits of the main characters were analyzed from their dialogues using the linguistic and word count (LIWC) analyzer. Furthermore, to obtain a comprehension of the novel’s development, the above clustering and statistical analyses were carried out along with the storyline. Finally, I used LIWC to investigate the emotional tone of the novel plot as the story progresses.

    目次 第一章、前人研究與研究動機.......................1 第二章、資料收集方法與演算法.....................3 2.1 小說文字檔預處理.....................3 2.2 Distance 與 Distance matrix..........4 2.3 Minimum span clustering (MSC) ........6 2.4 Gephi軟體與Centrality ..............9 2.5 LIWC介紹............................13 2.6 人物對話擷取方法....................15 第三章、結果與討論..............................18 3.1 小說全文人物網絡....................18 3.2 六冊分冊人物網絡與Centrality........23 3.3 小說章節SPI值分析..................40 3.4 角色對話與人物特性分析..............44 第四章、其他問題與討論..........................50 第五章、結論....................................52 第六章、參考資料................................53

    Ardanuy, M. C. and C. Sporleder (2015). "Clustering of novels represented as social networks." Linguistic Issues in Language Technology 12(4).

    Chang, Y. F. and C. M. Chen (2011). "Classification and Visualization of the Social Science Network by the Minimum Span Clustering Method." Journal of the American Society for Information Science and Technology 62(12): 2404-2413.

    Chang, Y. F. and C. M. Chen (2014). "Visualizing the clustering of financial networks and profitability of stocks." Journal of Complex Networks 3(2): 303-318.

    Chen, R.H.-G and Chen, C.-C and Chen, C.-M (2019). "Unsupervised cluster analyses of character networks in fiction:
    Community structure and centrality." Knowledge-Based Systems: 11.

    Curry I. (2009)“A Hybrid AI Approach to the Classification of Emotive Text” International Conference on Artificial Intelligence (as part of WORLDCOMP'09)

    Hu, G.-M., et al. (2015). "Clustering and visualizing similarity networks of membrane proteins." Proteins: Structure, Function, and Bioinformatics 83(8): 1450-1461.

    James, W. P. and Martha, E. F. (1999) “Linguistic Inquiry and Word Count (LIWC)”

    Semi, M. and Juyong, P. (2019). "Modeling narrative structure and dynamics with networks, sentiment analysis, and topic modeling." 20.

    下載圖示
    QR CODE