簡易檢索 / 詳目顯示

研究生: 郭珮涵
Kuo, Pei-Han
論文名稱: 以人工智慧輔助中文期刊參考文獻剖析之研究─以人文社會科學領域為例
Artificial Intelligence-Facilitated Reference Parsing from Chinese Journals—A Case Study of Social Sciences and Humanities
指導教授: 曾元顯
Tseng, Yuan-Hsien
口試委員: 曾元顯
Tseng, Yuan-Hsien
陳舜德
Chen, Shun-Der
林頌堅
Lin, Sung-Chien
口試日期: 2024/06/07
學位類別: 碩士
Master
系所名稱: 圖書資訊學研究所圖書資訊學數位學習碩士在職專班
Graduate Institute of Library and Information Studies_Online Continuing Education Master's Program of Library and Information Studies
論文出版年: 2024
畢業學年度: 112
語文別: 中文
論文頁數: 82
中文關鍵詞: 人工智慧自然語言處理命名實體識別大型語言模型參考文獻剖析
英文關鍵詞: Artificial Intelligence, Natural Language Processing, Named Entity Recognition, Large Language Models, Bibliographic Reference Parsing
DOI URL: http://doi.org/10.6345/NTNU202400662
論文種類: 學術論文
相關次數: 點閱:348下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 隨著科學論文發表數量的快速增長,引用來源的多樣性和格式差異增加了參考文獻剖析的難度。本研究旨在探討如何自動化擷取科學論文中的參考文獻,並利用人工智慧工具進行剖析,藉以簡化工作流程,降低人力和時間成本,並提升圖書館的知識傳播效能。本文提出了從中文期刊文章檔案中自動化擷取參考文獻的方法,並評估使用人工智慧工具剖析參考文獻的可行性。
    本研究實驗分為三個部分,第一部分設計程式,擷取期刊文章中的參考文獻章節;第二部分評估不同人工智慧工具在參考文獻剖析任務中的效能;第三部分根據第二部分的實驗結果修正實驗方法,並評估和比較修正後的成果。實驗結果如下:
    1. 在參考文獻擷取實驗中,基於規則方法的程式能夠自動擷取文章中的參考文獻內容,用於建立資料集作為後續研究基礎。
    2. 在參考文獻剖析實驗中,本研究比較了spaCy和ChatGPT兩種基於Transformer架構的人工智慧工具的效能。實驗結果顯示,ChatGPT在各欄位的F1-score表現優於spaCy,具有較高的準確性和穩定性。
    3. 在第三部分實驗中,選擇了第二部分中效能較佳的ChatGPT進行提示修正。實驗結果顯示,經過提示調整後,ChatGPT在各欄位的F1-score表現均有所提升。
    本研究結果顯示了使用人工智慧工具自動化剖析參考文獻的可行性,並展現了大型語言模型在這一任務中的潛力和優勢。未來研究可以進一步嘗試結合多種人工智慧工具,探討利用不同模型優勢提升參考文獻剖析的準確性,同時探討減低剖析成本的可能性。

    With the rapid growth in the number of scientific publications, the diversity of citation styles has increased the difficulty of reference parsing. This thesis aims to discuss how to automate the extraction and parsing of references from scientific papers using artificial intelligence tools, thereby simplifying workflows, reducing time costs, and enhancing the efficiency of knowledge dissemination in libraries. This paper proposes a method for extracting references from Chinese journal articles and evaluates the feasibility of parsing these references using AI tools.
    The study is divided into three parts. The first part involves extracting reference sections from journal articles. The second part assesses the performance of different AI tools in the task of reference parsing. The third part modifies the experimental methods based on the results of the second part and evaluates and compares the outcomes after these adjustments. The experimental results are as follows:
    1. In the first experiment, the rule-based program successfully extracted reference content from the articles in their entirety.
    2. The second experiment compared the performance of two AI tools, spaCy and ChatGPT, both based on the Transformer architecture, in reference parsing. Results showed that ChatGPT outperformed spaCy in terms of F1-score, indicating higher accuracy and stability.
    3. In the third experiment, ChatGPT, which demonstrated better performance in the second part, was selected for model adjustments. We optimized the prompt, and the results indicated that after adjustments, ChatGPT's F1-score performance improved across all fields.
    In summary, the results of this study demonstrate the feasibility of parsing references using AI tools and reveal the potential of large language models in this task. Future research could explore further integration of various artificial intelligence tools to enhance the accuracy of this task, as well as possibilities for reducing the costs.

    摘要 i Abstract ii 目次 iii 表次 iv 圖次 v 第一章 緒論 1 第一節 研究背景與動機 1 第二節 研究目的與問題 3 第三節 研究範圍 4 第四節 研究限制 5 第二章 文獻探討 6 第一節 參考文獻剖析工具 6 第二節 命名實體識別工具 9 第三節 大型語言模型的應用 14 第三章 研究方法 17 第一節 研究設計 17 第二節 研究工具 19 第三節 研究資料 21 第四節 研究實施與步驟 23 第四章 實驗結果與分析 36 第一節 參考文獻擷取 36 第二節 參考文獻剖析 41 第三節 實驗設計調整 64 第五章 結論 75 第一節 結論 75 第二節 未來研究建議 76 參考文獻 78

    馬行遠、李韋杰 、劉昭麟(2022 年 11 月 21-22 日)。中文醫療文件的命名實體辨識報告。The 34th Conference on Computational Linguistics and Speech Processing。台北市,台灣。
    陳光華 (2003) 。引文索引與臺灣學術期刊之經營。人文與社會科學簡訊,10:3卷,68-81。
    曾淑賢、鄭秀梅、羅金梅(2013)。臺灣連結世界・世界認識臺灣―「臺灣人文及社會科學引文索引資料庫」建置經驗。國家圖書館館刊,102(2),139-171。
    BibPro: A Citation Parser Based on Sequence Alignment Techniques | IEEE Conference Publication | IEEE Xplore. (n.d.). Retrieved 19 December 2023, from https://ieeexplore.ieee.org/document/4483078
    Blair-Stanek, A., Holzenberger, N., & Van Durme, B. (2023). Can GPT-3 Perform Statutory Reasoning? Proceedings of the Nineteenth International Conference on Artificial Intelligence and Law, 22–31. https://doi.org/10.1145/3594536.3595163
    Blecher, L., Cucurull, G., Scialom, T., & Stojnic, R. (2023). Nougat: Neural Optical Understanding for Academic Documents (arXiv:2308.13418). arXiv. https://doi.org/10.48550/arXiv.2308.13418
    Brzustowicz, R. (2023). From ChatGPT to CatGPT: The Implications of Artificial Intelligence on Library Cataloging. Information Technology and Libraries, 42(3), Article 3. https://doi.org/10.5860/ital.v42i3.16295
    C. -C. Chen, K. -H. Yang, H. -Y. Kao, & J. -M. Ho. (2008). BibPro: A Citation Parser Based on Sequence Alignment Techniques. 22nd International Conference on Advanced Information Networking and Applications - Workshops (Aina Workshops 2008), 1175–1180. https://doi.org/10.1109/WAINA.2008.125
    Chen, Y., Lasko, T. A., Mei, Q., Denny, J. C., & Xu, H. (2015). A study of active learning methods for named entity recognition in clinical text. Journal of Biomedical Informatics, 58, 11–18. https://doi.org/10.1016/j.jbi.2015.09.010
    Cho, S., Jeong, S., Seo, J. yeon, & Park, J. (2023). Discrete Prompt Optimization via Constrained Generation for Zero-shot Re-ranker. In A. Rogers, J. Boyd-Graber, & N. Okazaki (Eds.), Findings of the Association for Computational Linguistics: ACL 2023 (pp. 960–971). Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.findings-acl.61
    Choi, J. H., Hickman, K. E., Monahan, A., & Schwarcz, D. (2023). ChatGPT Goes to Law School (SSRN Scholarly Paper 4335905). https://doi.org/10.2139/ssrn.4335905
    Cioffi, A., & Peroni, S. (2022). Structured references from PDF articles: Assessing the tools for bibliographic reference extraction and parsing (arXiv:2205.14677). arXiv. https://doi.org/10.48550/arXiv.2205.14677
    Councill, I., Giles, C. L., & Kan, M.-Y. (2008). ParsCit: An Open-source CRF Reference String Parsing Package. In N. Calzolari, K. Choukri, B. Maegaard, J. Mariani, J. Odijk, S. Piperidis, & D. Tapias (Eds.), Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC’08). European Language Resources Association (ELRA). http://www.lrec-conf.org/proceedings/lrec2008/pdf/166_paper.pdf
    Dai, S., Shao, N., Zhao, H., Yu, W., Si, Z., Xu, C., Sun, Z., Zhang, X., & Xu, J. (2023). Uncovering ChatGPT’s Capabilities in Recommender Systems. Proceedings of the 17th ACM Conference on Recommender Systems, 1126–1132. https://doi.org/10.1145/3604915.3610646
    Fantechi, A., Gnesi, S., Livi, S., & Semini, L. (2021). A spaCy-based tool for extracting variability from NL requirements. 32–35. https://doi.org/10.1145/3461002.3473074
    Gemini Team, R. Anil, S. Borgeaud, Y. Wu, J.-B. Alayrac, J. Yu, R. Soricut, J. Schalkwyk, A. M. Dai, A. Hauth, et al. Gemini: a family of highly capable multimodal models. arXiv preprint arXiv:2312.11805, 2023.
    Guangshang, G. a. O. (2022). Survey on Attention Mechanisms in Deep Learning Recommendation Models. Computer Engineering and Applications, 58(9), 9. https://doi.org/10.3778/j.issn.1002-8331.2112-0382
    Hadi, M. U., Al-Tashi, Q., Qureshi, R., Shah, A., Muneer, A., Irfan, M., Zafar, A., Shaikh, M., Akhtar, N., Wu, J., & Mirjalili, S. (2023). Large Language Models: A Comprehensive Survey of its Applications, Challenges, Limitations, and Future Prospects. https://doi.org/10.36227/techrxiv.23589741.v3
    Hu, C., Gong, H., & He, Y. (2022). Data driven identification of international cutting edge science and technologies using SpaCy. PLOS ONE, 17(10), e0275872. https://doi.org/10.1371/journal.pone.0275872
    Islam, S., Elmekki, H., Elsebai, A., Bentahar, J., Drawel, N., Rjoub, G., & Pedrycz, W. (2023). A comprehensive survey on applications of transformers for deep learning tasks. Expert Systems with Applications, 241, 122666. https://doi.org/10.1016/j.eswa.2023.122666
    Keretna, S., Lim, C. P., & Creighton, D. (2014). A hybrid model for named entity recognition using unstructured medical text. 2014 9th International Conference on System of Systems Engineering (SOSE), 85–90. https://doi.org/10.1109/SYSOSE.2014.6892468
    Khabsa, M., & Giles, C. L. (2014). The number of scholarly documents on the public web. PloS One, 9(5), e93949. https://doi.org/10.1371/journal.pone.0093949
    Lin, T., Wang, Y., Liu, X., & Qiu, X. (2022). A survey of transformers. AI Open, 3, 111–132. https://doi.org/10.1016/j.aiopen.2022.10.001
    Liu, P., Guo, Y., Wang, F., & Li, G. (2022). Chinese named entity recognition: The state of the art. Neurocomputing, 473, 37–53. https://doi.org/10.1016/j.neucom.2021.10.101
    Lund, B., & Ting, W. (2023). Chatting about ChatGPT: How May AI and GPT Impact Academia and Libraries? (SSRN Scholarly Paper 4333415). https://doi.org/10.2139/ssrn.4333415
    Matelsky, J. K., Parodi, F., Liu, T., Lange, R. D., & Kording, K. P. (2023). A large language model-assisted education tool to provide feedback on open-ended responses (arXiv:2308.02439). arXiv. https://doi.org/10.48550/arXiv.2308.02439
    Nguyen, M. V., Lai, V. D., Pouran Ben Veyseh, A., & Nguyen, T. H. (2021). Trankit: A Light-Weight Transformer-based Toolkit for Multilingual Natural Language Processing. In D. Gkatzia & D. Seddah (Eds.), Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations (pp. 80–90). Association for Computational Linguistics. https://doi.org/10.18653/v1/2021.eacl-demos.10
    Nov, O., Singh, N., & Mann, D. (2023). Putting ChatGPT’s Medical Advice to the (Turing) Test: Survey Study. JMIR Medical Education, 9, e46939. https://doi.org/10.2196/46939
    OpenAI. (2023). GPT-4 Technical Report (arXiv:2303.08774). arXiv. http://arxiv.org/abs/2303.08774
    Panda, S., & Kaur, N. (2023). Exploring the viability of ChatGPT as an alternative to traditional chatbot systems in library and information centers. Library Hi Tech News, 40(3), 22–25. https://doi.org/10.1108/LHTN-02-2023-0032
    Perera, N., Dehmer, M., & Emmert-Streib, F. (2020). Named Entity Recognition and Relation Detection for Biomedical Information Extraction. Frontiers in Cell and Developmental Biology, 8, 673. https://doi.org/10.3389/fcell.2020.00673
    Prasad, A., Kaur, M., & Kan, M.-Y. (2018). Neural ParsCit: A deep learning-based reference string parser. International Journal on Digital Libraries, 19(4), 323–337. https://doi.org/10.1007/s00799-018-0242-1
    Rodrigues Alves, D., Colavizza, G., & Kaplan, F. (2018). Deep Reference Mining From Scholarly Literature in the Arts and Humanities. Frontiers in Research Metrics and Analytics, 3, 21. https://doi.org/10.3389/frma.2018.00021
    Saha, S., & Ekbal, A. (2013). Combining multiple classifiers using vote based classifier ensemble technique for named entity recognition. Data & Knowledge Engineering, 85, 15–39. https://doi.org/10.1016/j.datak.2012.06.003
    Sasaki, Y., Tsuruoka, Y., McNaught, J., & Ananiadou, S. (2008). How to make the most of NE dictionaries in statistical NER. BMC Bioinformatics, 9 Suppl 11(Suppl 11), S5. https://doi.org/10.1186/1471-2105-9-S11-S5
    Son, G., Jung, H., Hahm, M., Na, K., & Jin, S. (2023). Beyond Classification: Financial Reasoning in State-of-the-Art Language Models (arXiv:2305.01505). arXiv. https://doi.org/10.48550/arXiv.2305.01505
    Song, M., Yu, H., & Han, W.-S. (2015). Developing a hybrid dictionary-based bio-entity recognition technique. BMC Medical Informatics and Decision Making, 15(1), S9. https://doi.org/10.1186/1472-6947-15-S1-S9
    Tang, R., Han, X., Jiang, X., & Hu, X. (2023). Does Synthetic Data Generation of LLMs Help Clinical Text Mining? (arXiv:2303.04360). arXiv. https://doi.org/10.48550/arXiv.2303.04360
    Tkaczyk, D., Collins, A., Sheridan, P., & Beel, J. (2018). Machine Learning vs. Rules and Out-of-the-Box vs. Retrained: An Evaluation of Open-Source Bibliographic Reference and Citation Parsers (arXiv:1802.01168). arXiv. https://doi.org/10.48550/arXiv.1802.01168
    Touvron, H., Lavril, T., Izacard, G., Martinet, X., Lachaux, M.-A., Lacroix, T., Rozière, B., Goyal, N., Hambro, E., Azhar, F., Rodriguez, A., Joulin, A., Grave, E., & Lample, G. (2023). LLaMA: Open and Efficient Foundation Language Models (arXiv:2302.13971). arXiv. https://doi.org/10.48550/arXiv.2302.13971
    Trautmann, D., Petrova, A., & Schilder, F. (2022). Legal Prompt Engineering for Multilingual Legal Judgement Prediction (arXiv:2212.02199). arXiv. https://doi.org/10.48550/arXiv.2212.02199
    Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I. (2017). Attention is All you Need. Advances in Neural Information Processing Systems, 30. https://papers.nips.cc/paper_files/paper/2017/hash/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html
    Yang, K., Ji, S., Zhang, T., Xie, Q., & Ananiadou, S. (2023). On the Evaluations of ChatGPT and Emotion-enhanced Prompting for Mental Health Analysis.
    Zhang, B., Yang, H., & Liu, X.-Y. (2023). Instruct-FinGPT: Financial Sentiment Analysis by Instruction Tuning of General-Purpose Large Language Models (SSRN Scholarly Paper 4489831). https://doi.org/10.2139/ssrn.4489831
    Zhang, X., Zou, J., Le, D. X., & Thoma, G. R. (2011). A structural SVM approach for reference parsing. BMC Bioinformatics, 12(3), S7. https://doi.org/10.1186/1471-2105-12-S3-S7
    Zhao, W. X., Zhou, K., Li, J., Tang, T., Wang, X., Hou, Y., Min, Y., Zhang, B., Zhang, J., Dong, Z., Du, Y., Yang, C., Chen, Y., Chen, Z., Jiang, J., Ren, R., Li, Y., Tang, X., Liu, Z., … Wen, J.-R. (2023, March 31). A Survey of Large Language Models. https://arxiv.org/abs/2303.18223v13
    Zhao, J., Huang, F., Lv, J., Duan, Y., Qin, Z., Li, G. & Tian, G.. (2020). Do RNN and LSTM have Long Memory?. Proceedings of the 37th International Conference on Machine Learning, 119(113), 65-11375.

    無法下載圖示 本全文未授權公開
    QR CODE