簡易檢索 / 詳目顯示

研究生: 翁相擇
Weng, Xiang-Zer
論文名稱: 以二次規劃方式控制初始試題機率達成試題曝光率之最佳化
Controlling the Probability of the Initial Items to Decrease Item Exposure Rates
指導教授: 何榮桂
Ho, Rong-Guey
學位類別: 碩士
Master
系所名稱: 資訊教育研究所
Graduate Institute of Information and Computer Education
論文出版年: 2002
畢業學年度: 90
語文別: 英文
論文頁數: 97
中文關鍵詞: 電腦化適性測驗試題曝光率非線性規劃二次規劃
英文關鍵詞: Computerized adaptive testing (CAT), item exposure rates, item response theory (IRT), nonlinear programming, quadratic programming
論文種類: 學術論文
相關次數: 點閱:191下載:4
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 現代測驗理論確實在理論上較古典測驗理論更為人所信服,但是現代測驗理論卻遲遲未能邁入實用的階段。直至近年來,托福(TOEFL)測驗及GRE測驗開始採用電腦化適性測驗之後,因為電腦化適性需要當代測驗理論的配合,方確立現代測驗理論在實際面的價值。而現代測驗理論進入實用階段之後,首需考量的課題便是試題曝光率的問題。因為對於電腦化適性測驗而言,建置題庫需要花費相當大的成本,因此,為使題庫可以長久使用,題庫的保密性便成為重要的課題。而題庫保密性是否良好,其中之一便需考量試題曝光率的控制。
    本研究重點便在於試題曝光率的控制。主要的構想即是分析題庫中每一題試題出現的期望次數,找出初始試題出現之機率,以使得試題曝光率達到最佳化,也即先分析所有初始試題其所有可能的情況,以推導出題庫中每一題試題出現的期望次數。然後利用非線性規劃的技術,計算出每一題初始試題出現之機率,以使得試題曝光率達到最佳化。最後再進行模擬研究以驗證此分析是否正確,且試題曝光率較未控制初始試題出現機率時之為佳。經由本研究所發展之系統,預期可以找出題庫中使用率過低的試題亦或是永遠也用不到之試題,以控管題庫的品質。此外利用本研究所發展之系統,可以找出不適合作為初始試題的題目。
    關鍵字:電腦化適性測驗、試題曝光率、非線性規劃、二次規劃

    This study proposed a new approach that controls the probabilities of the initial items with quadratic programming to decrease item exposure rates. The purpose of propose method is expected to accomplish the CAT without changing any process in general CAT process. In most researches about item exposure rates control, these ways change some steps in general CAT process that often leads to pick up an item which is not most informative as next item, so these ways improve item exposure rates, but lengthen the test length. And these ways lose the sprit, efficiency, which CAT wants.
    In this research, the way we propose indeed not only improves the item exposure rates obviously, but also promises that will shorten the test length and test overlap rate. In short, the way is analyzing all the possible responses, and counting the expected values of each item in the item bank for each initial item. Then, by the expected values of each item in the item bank for each initial item that we find, we make use of quadratic programming to find the optimum probabilities of initial items to decrease item exposure rates. Though, the degree of improvements for item exposure rates is not better than most ways that were often used, but we make it possible to shorten the test length and improve the item exposure rates.
    Keywords: Computerized adaptive testing (CAT), item exposure rates, item response theory (IRT), nonlinear programming, quadratic programming

    Abstract……………………………………………………………… I Chapter 1 Introduction………………………………………………………. 1 1.1 Research Motivation………………………………………… 1 1.2 Research Purposes………………………………………. 2 Chapter 2 Literature Review…………………………………… 4 2.1 Item Exposure Rates……………………………………… 4 2.1.1 The Sympson and Hetter Approach…………… 4 2.1.2 The Davey and Parshall Approach…………… 6 2.1.3 The Stocking and Lewis Approach…………… 6 2.1.4 The 5-4-3-2-1 Algorithm……………………… 7 2.1.5 The a-Stratified Approach………..………. 7 2.1.6 Reviews of Literatures………..……………………… 8 2.2 Mathematic Programming……………..………………… 9 2.2.1 Linear Programming…………………………… 9 2.2.2 Nonlinear Programming……………………… 9 2.2.3 Quadratic Programming……………………… 10 2.2.4 Summarization of Mathematic Programming… 11 Chapter 3 Methodology……………………………………………………. 12 3.1 Definitions……..……………………………………… 12 3.2 Equations…………..…………………………………… 14 3.3 Proofs……………………………………………………. 26 Chapter 4 Results and Discussion…………………………….. 39 4.1 Results……..…………………………………………… 39 4.1.1 Theory Results……..………………………. 39 4.1.2 Theory Simulation Results…..…………… 39 4.1.2.1 Theory Simulation Background… 40 4.1.2.2 Improvements of Variance Value in Theory…… 42 4.1.2.3 Improvements of Test Length in Theory………. 42 4.1.2.4 Improvements of Test Overlap Rate in Theory………… 44 4.1.2.5 Improvements of Degree of Item Exposure Uniform in Theory.45 4.1.2.6 Summary of Theory Simulation Results…… 47 4.1.3 Real World Simulation Results…………… 47 4.1.3.1 Real World Simulation Background............................................ 47 4.1.3.2 Improvements of Variance Value in Practice…… 54 4.1.3.3 Improvements of Test Length in Practice…………56 4.1.3.4 Improvements of Test Overlap Rate in Practice……… 57 4.1.3.5 Improvements of Degree of Item Exposure Uniform in Practice…… 58 4.1.3.6 Summary of Real World Simulation Results…….……………. 59 4.2 Discussions…..…………………….. 54 Chapter 5 Conclusions and Suggestions…………… 56 5.1 Conclusions……..…………………………… 56 5.2 Suggestions……………..………………… 57 5.2.1 Ability Estimation with Shorter Test Length…………… 57 5.2.2 Theory about Multiple Choices or Multiple Dimensions Item Bank.. 57 5.2.3 Contents Consideration with Security….. 58 5.2.4 Combinations with other Ways……………. 58 References……..…………………………………………………… 60 Appendix…….……………………………………………………… 67 Theorem 1 ……………………………………………………….… 67 Theorem 2 ……………………………………………………….… 68 Theorem 3 ……………………………………………………….… 70 Theorem 4 ……………………………………………………….… 71 Theorem 5 ……………………………………………………….… 73 Theorem 6 ……………………………………………………….… 75 Theorem 7 ……………………………………………………….… 78 Theorem 8 ……………………………………………………….… 79 Theorem 9 ……………………………………………………….… 81 Theorem 10 ………………………………………………………… 85 Theorem 11 ………………………………………………………… 88 Theorem 12 ………………………………………………………… 90

    Chang, S. W., & Twu, B. Y. (1998). A comparative study of item exposure control methods in computerized testing. ACT research report series; 98-3.
    Chang, H. H., & Ying, Z. (1996). A global information approach to computerized adaptive testing. Applied Psychological Measurement, 20, 213-229.
    Chang, H. H., & Ying, Z. (1999). a-Straitified multistage computerized adaptive testing. Applied Psychological Measurement, 23, 263-278.
    Davey, T., & Parxhall, C. G. (1995). New algorithms for item selection and exposure control with computerized adaptive testing. Paper presented at the annual meeting of the American Educational Research Association. April, 1995, San Franciso, CA.
    Load, F. M. A theory of test scores. Psycometric Monograph, 1952, NO. 7.
    McBride, J. R., & Martin, J. T. (1983). Reliability and validity of adaptive ability tests in a military setting. In D. J. Weiss (Ed.) New horizons in testing: Latent trait test theory and computerized adaptive testing (pp. 223-236). New York, Academic Press.
    Sympson, J. B., & Hetter, R. D. (1985). Controlling item-exposure rates in computerized adaptive testing, as described in Wainer, et al., (1990)
    Stocking, M. L. & Lewis, C. (1995a). A new method of controlling item exposure in computerized adaptive testing (Research Report 95-25). Princeton, NJ: Educational Testing Service.
    Stocking, M. L. & Lewis, C. (1995b). Controlling item exposure conditional on ability in computerized adaptive testing (Research Report 95-24). Princeton, NJ: Educational Testing Service.

    QR CODE