簡易檢索 / 詳目顯示

研究生: 許育聞
Yu-Wen Shu
論文名稱: 會議與期刊文獻對預測主題趨勢之比較研究—以「資訊檢索」領域為例
A comparison study on conference papers and journal articles for predicting topic trends – using「Information Retrieval」as an example
指導教授: 曾元顯
Tseng, Yuen-Hsien
學位類別: 碩士
Master
系所名稱: 圖書資訊學研究所
Graduate Institute of Library and Information Studies
論文出版年: 2009
畢業學年度: 97
語文別: 中文
論文頁數: 192
中文關鍵詞: 主題趨勢預測會議文獻共字分析自動化歸類資訊檢索
英文關鍵詞: topic trends predict, conference paper, co-word analysis, automatic categorization, information retrieval
論文種類: 學術論文
相關次數: 點閱:212下載:12
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 多數進行主題趨勢探測的學者,皆採用期刊文獻作為主要的分析素材,鮮少針對其他類型的文獻進行研究,然而在學術傳播中會議論文的重要性不可小覷,因此本研究以資訊檢索(Information Retrieval)領域為範圍,針對會議文獻與期刊文獻分別進行主題趨勢預測,以觀察不同類型的文獻進行主題趨勢預測時的差異性。
    本研究收集1990年至2007年資訊檢索領域具代表性的SIGIR會議文獻及五本核心期刊中收錄主題為「資訊檢索」的期刊文獻,五本核心期刊分別是:Information Processing & management、JASIST&JASIS、Journal of Information science、Journal of Documentation、Information Retrieval,主題歸類的部份是以主題整併和自動化歸類兩種方式進行。為了確保預測的準確性,本研究以相同文獻類型和相異文獻類型分別進行預測,以比較其預測上的準確性,最後分別改變預測集和驗證集之年代範圍以比較其差異性。
    研究發現會議文獻和期刊文獻在主題詞彙的用法上有所差異,且各自有較關注探討的主題。會議文獻大部分比期刊文獻較早出現,然而在主題預測上,會議文獻並未佔有優勢,當預測的主題範圍較廣時,期刊文獻預測之效果較佳,且相同類型文獻預測效果優於交叉預測之效果。
    最後提出之建議為:期刊文獻之控制詞彙尚未完善,許多單複數詞彙和縮寫詞彙尚需統整;主題預測的部份,若要瞭解較廣泛的領域趨勢,以期刊文獻預測的效果較佳,而要了解細部領域的趨勢則是以相同的文獻類型進行預測較佳;在後續研究的部份,可以針對像是專利或部落格等其他類型的灰色文獻進行研究,或是以文獻之作者群進行社會網絡分析也是一個可行的研究方向。

    Many scholars who study topic trends use journal articles as primary texts for analysis and hardly pay attention to other types of documents. However, the importance of conference papers cannot be neglected in the academic field of Scholarly Communication. Hence, the research focusing on Informal Retrieval puts topic trends into practice in two kinds of literature, conference papers and journal articles, and observes the discrepant results of those in different types of documents.
    The research collects representative researches on “Information Retrieval” in SIGIR conference papers and five core journals: Information Processing & Management, JASIST&JASIS, Journal of Information Science, Journal of Documentation, and Information Retrieval. The methods of categorizing documents rely on topics of journal articles given in databases, session titles of conference papers, and then the previous articles and papers in automatic categorization. In order to ensure the accuracy of prediction, and prediction is experimented in two groups, the same and the different types of literature. Then, the research changes periods of prediction and validation set to compare the results.
    The research finds that conference papers and journal articles differ not only in the uses of topic vocabulary but also in the topics of their concerns. Although most conference papers publish earlier than journal articles, the latter possesses more advantages in topic prediction. When the scope of the predicted topic is wider, the predicted results of journal articles are better. The predicted results of documents from the same type also generate superior outcomes than those from the different type. Suggestions are proposed in the end of the research. Control terms of publication papers are defective because plenty of singular/plural vocabulary and abbreviations need arranging. In the part of topic prediction, if understanding trends in wilder fields is needed, the prediction of topic trends in journal articles leads to better effects. The result of using journal articles to predict topic trends is better. To understand trends in detailed field, the prediction of topic trends in same type of documents is more effective. Finally, further studies on Information Retrieval is recommended to study other types of gray literature, such as patents or articles on blogs, or make an social network analysis on authors of documents.

    摘要 ii 目次 iv 表次 vi 圖次 xii 附錄表次 xiv 第一章 緒 論 1 第一節 研究動機與背景 1 第二節 研究目的與研究問題 4 第三節 研究範圍與限制 5 第四節 名詞解釋 6 第二章 文獻探討 7 第一節 會議文獻與期刊文獻 7 第二節 主題趨勢之相關研究 17 第三章 研究方法 27 第一節 研究概念 27 第二節 研究假設 28 第三節 研究方法 28 第四節 研究流程與實驗規劃 36 第五節 研究對象與工具 47 第四章 研究結果與分析 53 第一節 文獻主題判斷與歸類 53 第二節 文獻主題與年代分佈 65 第三節 會議與期刊文獻主題趨勢預測差異 100 第四節 小結 145 第五章 結論與建議 153 第一節 結論 153 第二節 建議 157 參考文獻 161 附錄 167 附錄 一 第一種和第二種主題整併下主題年代序列表 167 附錄 二 主題萃取系統歸類結果 182 附錄 三 系統自動萃取主題之文獻數量年代排序 188

    Allen, R. S. (1995). The magnitude of conference proceedings published in physics journals. Special Libraries 86(2), 136-144.
    Åström, F. (2007). Changes in the LIS research front: Time-sliced cocitation analyses of LIS journal articles, 1990-2004. Journal of the American Society for Information Science and Technology, 58(7), 947-957.
    Baker, D. R. (1990). Citation Analysis: A methodological review. Social Work Research & Abstracts, 26, 3-10.
    Borgman, C. L., & Furner, J. (1990). Scholarly communication and bibliometrics. Annual Review of Information Science and Technology, 36, 1-45.
    Cai, K.-Y., & Card, D. (2007). An analysis of research topics in software engineering – 2006 Journal of Systems and Software, 81(6), 1051-1058.
    Callon, M., Courtial, J. P., & Laville, F. (1991). Co-word analysis as a tool for describing the network of interactions between basic and technological research: The case of polymer chemistry. Scientometrics, 22(1), 155-205.
    Chen, C. (2005). CiteSpace II: Detecting and visualizing emerging trends and transient patterns in scientific literature. Journal of the American Society for Information Science and Technology, 57(3), 359-377.
    Courtial, J. P. (1994). A cowaord analysis of scientometrics. SCIENTOMETRICS, 31(3), 251-260
    C.W.Hanson, & Janes, M. (1960). Lack of index in reports of conference. Journal of the Document, 16(2), 65-70.
    C.W.Hanson, J., M. (1961). Coverage by abstract journals of conference papers. Journal of the Document, 17(3), 143-149.
    Ding, Y., Chowdhury, G., & Foo, S. (1999a). Mapping intellectual struture of information retrieval: An author cocitation analysis, 1987-1997. Paper presented at the 7th International Conference of the International Society for Scientometric and Informetrics, Colima, Mexico.
    Ding, Y., Chowdhury, G., & Foo, S. (1999b). Mapping the development in information retrieval speciality: A bibliometric analysis via journals. Paper presented at the 7th International Conference of the International Society for Scientometric and Informetrics, Colima, Mexico.
    Ding, Y., Chowdhury, G. G., & Foo, S. (2001). Bibliometric cartography of information retrieval research by using co-word analysis. Information Processing & Management, 37(6), 817-842
    Drott, M. C. (1995). Reexamining the role of conference papers in scholarly communication. Journal of the American Society for Information Science, 6(4), 301.
    Funk, M. E. (1988). The usefulness of monographic proceedings. Buletin of Medical Library Association, 6(1), 14-21.
    Gardner, J. (1980). The conference as an integral component in the science and technology information dissemination network- with some thought on the role of librarianship as a facilitator between the scientist / Engineer and the Printed Work. Paper presented at the Conference Literature in Science and Technology: Its Role in the Distribution in Information.
    Garvey, W. D., Lin, N., Nelson, C. E., & Tomita, K. (1972). Research studies in patterns of scientific communicationⅡ: The role of nationa meeting in scientific and technical communication. Information Storage and Retrieval, 8(4), 164.
    Garvey, W. D., Lin, N., Nelson, C. E., & Tomita, K. (1972). Research studies in patterns of scientific communication Ⅲ: Information – exchange processes associated with the production of journal articles. Information Storage and Retrieval, 8(5), 207-221.
    Garvey, W. D., Lin, N., Nelson, C. E., & Tomita, K. (1972). Research studies in patterns of scientific communication Ⅱ: The role of national meeting in scientific and technical communication. Information Storage and Retrieval, 8(4), 159-169.
    HE, Q. (1999). Knowledge discovery through co-word analysis LIBRARY TRENDS, 48(1), 133-159.
    Hjerland, B. (2003). Fundamentals of knowledge organization. Knowledge organization, 30(2), 87-111.
    Kean, P., & Ronayne, J. (1972). Preliminary communications in chemistry. Journal of Chemical Documentation, 12(4), 218-220.
    Mills, P. R. (1973). Characteristics of unpblished conference proceedings. Journal of Documentation, 29(1), 36-50.
    Montesi, M., & Owen, J. M. (2008). From conference to journal publication:How conference papers in software engineering are extended for publication in journals. Journal of the American Society for Information Science and Technology, 59(5), 816-829.
    Mooers, C. N. (1951). Zatocoding applied to mechanical organization of knowledge. American Documentation, 2(1), 20-32.
    Pelzer, N. L., & Wiese, W. H. (2003). Bibliometric study of grey literature in core veterinary medical journals. Journal of the Medical Library Association 91(4), 434-441.
    Perry, B. (1995). Grey literature in the internatonal monetary fund. Paper presented at the Second International Conference on Grey Literature.
    Persson, O. (1994). The intellectual base and research fronts of JASIS 1968-1990. Journal of American Society for Information Science, 45(1), 31-38.
    Rokaya, M., Atlam, E., Fuketa, M., Dorji, T. C., & Aoe, J.-i. (2008). Ranking of field association terms using Co-word analysis INFORMATION PROCESSING & MANAGEMENT 44(2), 738-755
    Smeaton, A. F., Keogh, G., Gurrin, C., McDonald, K., & Sødring, T. (2002). Analysis of papers from twenty-five years of SIGIR conferences: What Have We Been Doing for the Last Quarter of a Century ? ACM SIGIR Forum 36(2), 39 - 43
    Subramanyam, K. (1981). Scientific and technical information resources. New York: Marcel Dekker.
    Tseng, Y.-H., Lin, Y.-I., Kuo, C.-H., & Lee, Y.-Y. (2008). Which kinds of trend metrics are more effective for emerging trend detection? Paper presented at the Proceedings of the International Symposium on Webometrics and Web Mining, 國立政治大學公企中心.
    Tseng, Y.-H., Lin, C.-J., & Lin, Y.-I. (2007). Text mining techniques for patent analysis. Information Processing and Management, 43(5), 1216-1247
    Tsay, M.-Y., Jou, S.-J., & Ma, S.-S. (2000). A bibliometric study of semiconductor Literature, 1978-1997. Scientometrics, 49(3), 491-509.
    van Raan, A. F. J., & Tijssen, R. J. W. (1993). The neural net of neural network research. Scientometrics, 26(1), 169-192.
    White, H. D., & Griffith, B. C. (1981). Author cocitation : A literature measure of intellectual structure. Journal of the American Society for Information Science, 32, 163-171.
    White, H. D., & McCain, K. W. (1998). Visualizing a discipline: An author co-citation analysis of information science, 1972-1995. Journal of the American Society for Information Science, 49, 327-355.
    Woibel. (1995). Metadatea the foundations of resources description. D-LIB Magazine.
    Zhao, D., & Strotmann, A. (2007). Can citation analysis of web publications better detect research fronts? Journal of the American Society for Information Science and Technology, 58 (9), 1285 - 1302.

    中文部份:
    昌炎新(2006)。核心期刊的淵源以及功效分析。武漢科技大學學報,8(5),21-23。
    周靜怡、孫坦、陳濤(2007)。共詞可視化:以人類基因組領域為例。情報學報,26(4),532~537。
    邱炯友(2006)。學術傳播與期刊出版。臺北市:遠流。
    許雅婷(2006)。資訊檢索文獻老化現象之研究-兼論同時法與歷時法之特質。台北市:國立政治大學圖書資訊與檔案學研究所 。
    郭華(2006)。論重要的科技信息資源-會議文獻。圖書館工作與研究(1),25-27。
    張正宏(2008)。運用關鍵詞建構知識管理研究網絡。台北市:國防管理學院資訊管理學系。
    曾元顯(2007)。農業研究前沿探勘模式與系統之開發(研究計畫,STPI-C-960906 )。台北:財團法人國家實驗研究院科技政策研究與資訊中心。
    曾元顯、林堬一(2006)。文字探勘技術在教育評鑑研究發展趨勢分析之應用。在教育評鑑國際學術研討會,國立台灣師範大學。
    黃淑娟、蔣嘉寧、黃擎天、宋雪芳(1998)。由文獻分析檢視會議文獻的傳播。教育資料科與圖書館學,35(4),340。原文引自Felix Liebensy, “Lost Information: Unpublished Conference Papers,”in International Conference on Scientific Information(Washington: National Academy of Science-National Research Council, 1959), pp.475-495.
    蔡佳玲(1990)。我國社會科學學術期刊論文引用灰色文獻之研究。國立政治大學圖書資訊研究所碩士論文。
    蔡明月(1997)。學術傳播與書目計量學。教育資料與圖書館學,35(1),39。
    蔡明月(2003)。資訊計量學與文獻特性。臺北市:編譯館。
    蔡明月、劉瓊芳(2007)。1992-2005 資訊計量學研究及其發展演變。圖書與資訊學刊,61,42-56。
    蔡明月、賴芊卉(2007)。資訊科學引用與被引用文獻分散現象與主題變化研究:1985-2005。圖書資訊學研究,1(2),1-31。
    吳嘉雯(1998)。半導體文獻雙被引現象分析。台北縣:淡江大學教育資料科學學系。
    傅雅秀(1996)。從科學傳播的觀點探討中央研究院生命科學專家的資訊尋求行為。圖書館學刊,11,133-163。
    黃秀琴(1998)。會議文獻作者生產力與其延續出版品之研究:以國防科技學術研討會為例 。台北縣:淡江大學教育資料科學研究所碩士論文。
    黃淑娟、蔣嘉寧、黃擎天(1981)。會議文獻之探討。教育資料科學,18(4),88。
    黃惠美(2007)。臺灣地區圖書資訊學文獻高生產作者的引用圖像—以作者雙被引分析為例。中華民國圖書館學會電子報,5。
    楊世瑩(2005)。SPSS統計分析實務。臺北市:旗標。
    鄭琚媛(2004)。臺灣地區生命科學國際會議文獻生產力與延續出版之研究。台北縣:淡江大學資訊與圖書館學研究所碩士論文。
    羅思嘉(1990)。引用文獻分析與學術傳播研究。中國圖書館學會會報,66,73-85。
    羅思嘉、陳光華、林純如(1990)。圖書資訊學學術文獻主題分類體系之研究。臺灣大學圖書資訊學刊,16,185-208。

    資料庫與網路資源:
    Science Citation Index®. Retrieved 6/1, 2008, from http://scientific.thomson.com/products/sci/
    The 30th Annual International ACM SIGIR Conference 23-27 July 2007, Amsterdam. Retrieved 6/3, 2007, from http://www.sigir2007.org/history.html
    中國大百科全書智慧藏。智慧藏學習科技股份有限公司編著。檢索日期:2008,4/26。檢自:http://dblink.ncl.edu.tw/web/Content.asp?ID=377&Query=1

    下載圖示
    QR CODE