簡易檢索 / 詳目顯示

研究生: 李慧萱
Hui-Shuan Lee
論文名稱: 華語作文分級系統
Automated Chinese Essay Scoring System for CFL
指導教授: 張國恩
Chang, Ko-En
宋曜廷
Sung, Yao-Ting
張道行
Chang, Tao-Hsing
學位類別: 碩士
Master
系所名稱: 資訊教育研究所
Graduate Institute of Information and Computer Education
論文出版年: 2013
畢業學年度: 101
語文別: 中文
論文頁數: 92
中文關鍵詞: 華語作文評閱系統文法特徵文法剖析器貝氏機器學習
英文關鍵詞: Automated Chinese essay scoring system, grammar feature, grammar parser, Bayesian theoremmachine learning
論文種類: 學術論文
相關次數: 點閱:246下載:16
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 有感於世界對於華語文學習的需求與日俱增,但在華語學習環境中,卻沒有像英文托福考試使用的e-Rater這類的工具,可以幫助華語文教師或學生進行教學或學習,因此研製一個給華語文領域使用的作文分級系統,希望能對此有所助益。
    本論文之研究使用Stanford parser作為文法剖析器,開發出數個文法相關特徵,並以貝氏機率為機器學習之模型,實作出華語作文分級系統。
    本研究所開發出的系統可達到93%的正確性,對於華語作文以分數作為分級的方法,可達到不錯的效果,亦可在華語老師的教學上或華語測驗裡實際使用。

    Due to the learning boom of CFL, the needs of learning equipment increased. However, there is no such tool like e-Rater for TOEFL in the CFL learning field for Chinese teaching instructors and students to use. In this study, we tent to build up an automated essay scoring system for CFL learners leading to a better CFL learning environment.
    The system we developed in the study used Stanford Parser as a grammar parser to analyze and parse sentences to design some grammar features that could fit the system. We used Bayesian theorem as a machine learning model. By integrating features to the model, we built up a Chinese essay scoring system for CFL.
    The system could reach to 93% on the adjacent accuracy in rating the scores of essays and could literally use for the practical needs in CFL teaching or test.

    第一章緒論 1 第一節研究動機 1 第二節 研究目的 5 第三節 研究限制 6 第二章文獻探討 7 第一節外語寫作學習 7 第二節AES自動評分系統 11 第三節文法剖析技術 13 第四節CRIE與Coh-Metrix 17 第三章系統設計 19 第一節資料前處理 21 第二節文法剖析 22 第三節文章特徵 25 第四節 評分系統 29 第四章研究方法 35 第一節 研究工具 35 第二節 研究設計 45 第三節 研究目的與結果 50 第四節 研究討論 74 第五章結論與未來發展 77 第一節 結論 77 第二節 未來發展 79 參考文獻 81 附錄一 Coh-Metrix2.0 文本分析工具特徵整理表 89  

    一、 中文部分
    仇小屏(2005) 。論新詩以句構篇之類型及其特色。成大中文學報第12期,1-22。
    宋曜廷、熊玉雯、李慧萱(2012)。華語文寫作測驗評分規準。國立臺灣師範大學華語文與科技研究中心技術手冊,未發表。
    林千翔、張嘉惠及陳貞伶(2010)。結合長詞優先與序列標記之中文斷詞研究。Computational Linguistics and Chinese Language Processing。Vol. 15, No. 3-4 Sep/Dec 2010, 161-180。
    林信宏(2009)。基於貝氏機器學習法之中文自動作文評分系統A Bayesian Based Chinese Essay Scoring System。國立交通大學資訊科學與工程研究所碩士論文。
    林宗宏(2005)。漢語複雜句之句法分析。國科會計畫報告書2005.8-2006.7。
    許菱祥(1986)。中文文法。大中國圖書公司。
    陳稼興、謝佳倫、許芳誠(2000)。以遺傳演算法為基礎的中文斷詞研究。電子商務學報,2(2),27-44。
    陳行葳(2005)。傳統與限制式命題作文教學之研究傳統與限制式命題作文教學之研究。國立高雄師範大學國文學所碩士論文。
    陳鳳如。國中學生寫作能力的學習潛能評估之分析研究。師大學報:教育類,52(2),73-94(2007)。
    張晏晟(2008)。擴展反應型論述反應之自動化評估方法-以教師教學能力為例。國立臺灣師範大學資訊教育研究所碩士論文。
    胡夢珂(2011)。使用支援向量機進行中文文本可讀性分類-以國小國語課文為例。國立臺灣師範大學資訊教育研究所碩士論文。
    粘志鵬(2005)。基於支援向量機之中文自動作文評分系統。國立交通大學資訊科學與工程研究所碩士論文。
    鄭昭明(2004) 第二語言的學習。華語文教學研究(Journal of Chinese Language Teaching) 2004.6.1-1,159-169。
    張道行,譚克平及李嘉晃(2007)。如何發展中文的寫作自動評分技術?以ACES 為例。東亞教育評鑑論壇。
    黃政傑(1997)。教學原理。師大書院。
    黃幀祥(2011)。使用潛在語義分析建構可讀性分類模型-以國小社會科課文為例。國立臺灣師範大學資訊教育研究所碩士論文。
    劉珣,張凱,劉社會,陳曦,左珊丹,施家煒(2008)。新實用漢語課本:繁體本。北京:北京語言大學出版社。
    孫懿芬及丁國雲(2009)。現代華語泰國版。聯經出版事業有限公司。
    王淑美,盧翠英及陳夜寧(2011)。實用視聽華語。國立臺灣師範大學、中正書局。
    黎瓊麗(2004)。技職院校大專學生英文寫作錯誤之分析與探討─以美和技術學院為例。大葉學報13-2,民國93年。
    國民中學學生基本學力測驗推動工作委員會(2012) 國民中學學生基本學力測驗寫作評分規準一覽表99.08版。
    國立臺灣師範大學數位學習實驗室(2012)。取自 http://140.122.77.149crie 1.0文本可讀性指標自動化分析系統。

    盧慧娟及呂羅雪(2001)。篇章結構與句型結構之相關性研究:簡化原則在西班牙與閱讀教學應用。國科會研究報告2001-8~2002-7。

    二、 英文部分

    ACTFL (2012). ACTFL proficiency guildline2012-writing.
    Attali, Y. & Burstein, J. (2006).Automated scoring with e-rater v.2.0. The Journal of Technology, Learning and Assessment, 4(3).
    Borg S.(1999). Studying teacher cognition in second language grammar teaching, System, Volume 27, Issue 1, March 1999, Pages 19-31.
    Burstein, J., Kukich, K., Wolff, S., Lu, C., Chodorow M., Braden-Harder, L.,& Harris M. D. (1998). Automated scoring using a hybrid feature identification technique. Paper presented at the 36th Annual Meeting of the Association of Computational Linguistics, Montreal, Canada.
    Buckingham T. and Pech W.C., (1976) .v An Experience Approach to Teaching Composition. TESOL Quarterly , Vol. 10, No. 1 (Mar., 1976), pp. 55-65.
    Jing Chen, Sheida White, Michael McCloskey, JalehSoroui, Young Chun. (2010) Effects of computer versus paper administration of an adult functional writing assessment.
    Wanxiang Che, Valentin I. Spitkovsky, and Ting Liu. (2012). A Comparison of Chinese Parsers for Stanford Dependencies. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL 2012).
    Chen X., Huang C., Li M. and Kit C..(2009) Better Parser Combination . 第十屆全國人機語音通訊學術會議,中國.
    Crossley, S.C. & Greenfield, J. & McNamara, D.S.(2008). Assessing text readability using cognitively based indices. TESOL Quarterly, 42(3), 475-493.
    ETS (2012). Test of English as Foreign Language Official Guild 2012. ETS.

    ETS (2012). ETS iBT/Next generation TOEFL Test - independent writing rubrics-scoring standard. ETS.
    Hayes, J. R., & Flower, L. S. (1980). Identifying the organization of writing processes. In L. W. Gregg & E. R. Steinberg (Eds.), Cognitive processes in writing (pp.3-30). Hillsdale, NJ: Lawrence Erlbaum.
    Kachru Y..(2006) Pedagogical Grammars: Second Language. Encyclopedia of Language & Linguistics (Second Edition), Elsevier, Oxford, 2006, Pages 248-254.
    Krapels, A. R.(1991) An overview of second language writing process research. Second language writing: Research insights for classroom.(37-56). Cambridge: Cambridge University Press.
    Martin East(2009). Evaluating the reliability of a detailed analytic scoring rubric for foreign language writing . Assessing Writing 14(2009)88-115.
    Miyuki Sasaki(1993). Relationships among second language proficiency, foreign language aptitude, and intelligence: A structural equation modeling approach, Language Learning : A journal of research in language studies, 1993-43-3, 313-344.
    Landauer, T. K., Laham, D. & Foltz, P. W.,(2000).The intelligent essay assessor. IEEE Intelligent System, 15, 27-31.
    Lafferty, John, A. McCallum, and F. Pereira, (2001). Conditional random field: Probabilistic models for segmenting and labeling sequence data. ICML 18.
    Petrov S.& Klein D..(2007). Improved inference for unlexicalized parsing. Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Proceedings of the Main Conference, pages 404–411, Rochester, New York, April. Association for Computational Linguistics.
    Richards, Jack C.(1970). A Non-Contrastive Approach to Error Analysis. TESOL Convention, San Francisco, March 1970.
    Shermis M.D., J. Burstein, and C. Leacock.(2006). Applications of computers in assessment and analysis of writing. Handbook of writing research, 403–416.
    Shermis M.D., Shneyderman A. and Attalic Y.,(2008). How important is content in the ratings of essay assessments? Assessment in Education: Principles, Policy & Practice, 15, No. 1, 191–105.
    Tseng, Chang, Andrew, Jurafsky and Manning.(2005) A Conditional Random Field Word Segmenter for Sighan Bakeoff 2005. Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing.
    Wang W. and Wen Q.,(2002). L1 use in the L2 composing process: An exploratory study of 16 Chinese EFL writers. Journal of Second Language Writing 11(2002)225-246.
    Yates R. and Kenkel J..(2002) Responding to sentence-level errors in writing. Journal of Second Language Writing 11(2002)29-47.
    Zhao, Huang and Li,(2006). An improved Chinese word segmentation system with conditional random field. Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pages 162–165.
    McCallum A. K. and Nigam K,. (1998). Employing EM in pool-based active learning for text classification. Proceedings of ICML-98, 15th International Conference on Machine Learning, pages 350-358.
    AP(2012). Refer to AP Chinese and culture http://www.collegeboard.com/student/testing/ap/sub_chineselang.html
    Bayesian Essay Test Scoring system(2012). Refer to Bayesian Essay Test Scoring sYstem – BETSY http://echo.edres.org:8080/betsy/
    CEFR(2012). Refer to CEFR-Guide for the Development of Language Education Policies in Europe. http://www.actfl.org/i4a/pages/index.cfm?pageid=1
    The Stanford Natural Language Processing Group( 2012 ). Refer to The Stanford Parser: A statistical parser. http://nlp.stanford.edu/software/lex-parser.shtml#About

    下載圖示
    QR CODE