研究生: |
楊鎧溶 Yang, Kai-Jung |
---|---|
論文名稱: |
基於自然語言技術之急診病患檢傷階段再住院預測研究 Prediction of Hospital Readmission for the Emergency Department at Triage Stage Based on Natural Language Processing Techniques |
指導教授: |
吳怡瑾
Wu, I-Chin |
口試委員: |
陳子立
Chen, Tzu-Li 唐牧群 Tang, Muh-Chyun 吳怡瑾 Wu, I-Chin |
口試日期: | 2023/07/12 |
學位類別: |
碩士 Master |
系所名稱: |
圖書資訊學研究所 Graduate Institute of Library and Information Studies |
論文出版年: | 2023 |
畢業學年度: | 111 |
語文別: | 中文 |
論文頁數: | 96 |
中文關鍵詞: | 急診室 、延伸主訴 、自然語言模型 、再住院預測 、死亡預測 |
英文關鍵詞: | Emergency Department, Expanded CCs, Natural Language Model, Prediction of Hospital Readmission, Prediction of Hospital Death |
研究方法: | 實驗設計法 |
DOI URL: | http://doi.org/10.6345/NTNU202300769 |
論文種類: | 學術論文 |
相關次數: | 點閱:148 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
在急診早期檢傷階段中,病患主訴語(chief complaints)為後續檢傷與醫療初步判定之重要依據。本研究以台北馬偕醫院2011到2018的八年度急診病患就診資料,將檢傷階段可取得之主訴語、年齡、檢傷分級、到院模式…等資料經過非結構後與結構化資料預處理、關鍵字分析、自然語言處理模型,機器學習程序,以進行急診住院、再住院、再入院、死亡預測的實證分析研究。研究在機器學習的程序上,首先採用類神經網路之Word2vec詞嵌入語言模型,由主訴篩選住院相關之重要語意關聯詞,研究進而透過BERT模型進行後續住院、再住院、再入院、死亡預測的研究。研究採用Word2vec與BERT自然語言處理模型進行預測研究,預期可以協助醫院及早準備重症病患相關醫療資源。實驗結果顯示(1) BERT模型預測效力優於Word2vec模型;(2)採用主訴語可以有好的急診住院、再住院、再入院、死亡預測力,於住院、再住院、再入院、死亡預測方面,BERT模型單純採用主訴進行預測, AUC分別為0.9446、0.9877、0.9883、1.0000;(3)考慮結構化變數以產生本研究提出之延伸主訴(Expanded CCs)概念將可提升急診再住院預測效果,BERT模型於住院、再住院、再入院、死亡預測方面的AUC分別為0.9611、0.9949、0.9947、1.0000;(4) 在死亡預測方面,不論是否不平衡處理,單純採用主訴的情況下,Word2vec在維度50的0.8750 AUC優於維度200的0.8394 AUC;若考慮檢傷階段的重要結構化變數,Word2vec在維度50的0.7730 AUC優於維度200的0.7325 AUC,採用單純主訴的各項評估值優於考慮重要結構化變數之Expanded CCs的各項評估值。本研究提出之架構與延伸主訴概念可提供急診預測相關研究的參考。
In the early stage of emergency department triage, the patient's chief complaints are important for subsequent injury assessment and preliminary medical diagnosis. This study analyzed the eight-year emergency department data from Taipei Mackay Memorial Hospital from 2011 to 2018, including data such as chief complaints, age, injury severity score, and mode of arrival, which were processed through non-structured and structured data pre-processing, keyword analysis, natural language processing models, and machine learning algorithms to perform empirical analysis of emergency hospitalization, readmission, reentry, and death prediction. The study first used the Word2vec word embedding language model of neural networks to select important semantic keywords related to hospitalization from chief complaints, and then used the BERT model to perform subsequent hospitalization, readmission, reentry, and death prediction research. The study expected that the use of Word2vec and BERT natural language processing models in prediction research could help hospitals prepare early for critically ill patients. The experimental results showed that (1) the BERT model had better prediction performance than the Word2vec model; (2) the use of chief complaints can have good prediction performance for emergency hospitalization, readmission, reentry, and death, and the AUC of the BERT model in hospitalization, readmission, reentry, and death prediction were 0.9446, 0.9877, 0.9883, and 1.0000, respectively; (3) considering structured variables can improve the prediction performance of emergency readmission, and the BERT model's AUC in hospitalization, readmission, reentry, and death prediction were 0.9611, 0.9949, 0.9947, and 1.0000, respectively, when using the Expanded CCs concept proposed in this study; (4) in death prediction, regardless of whether it is imbalanced processing, the AUC of Word2vec in dimension 50 (0.8750 AUC) is better than that in dimension 200 (0.8394 AUC) when only considering chief complaints. If important structured variables in the injury assessment stage are considered, the AUC of Word2vec in dimension 50 (0.7730 AUC) is better than that in dimension 200 (0.7325 AUC), and the evaluation values of using only chief complaints are better than those of considering Expanded CCs with important structured variables. The framework and Expanded CCs concept proposed in this study can provide a reference for emergency prediction research.
中華民國統計資訊網。取自https://reurl.cc/Gxq0Qv
Alsharawneh, A., Hasan, A. A. (2021). Cancer related emergencies with the chief complaint of pain: Incidence, ED recognition, and quality of care. International Emergency Nursing, 56, 100981.
Clifford, C. T., Pour, T. R., Freeman, R., Reich, D. L., Glicksberg, B. S., Levin, M. A., & Klang, E. (2021). Association between COVID-19 Diagnosis and Presenting Chief Complaint from New York City Triage Data. American Journal of Emergency Medicine, 46, 520-524.
Davazdahemami, B., Peng, P., & Delen, D. (2022). A Deep Learning Approach for Predicting Early Bounce-backs to The Emergency Departments. Healthcare Analytics, 2, 100018.
Delevaux, J. E., Djahnine, A., Talbot, F., Richard, A., Gouttard, S., Mansuy, A., Douek, P., Mohamed, S. S., Boussel, L. (2023). BERT-based Natural Language Processing Analysis of French CT Reports: Application to the Measurement of the Positivity Rate for Pulmonary Embolism. Research in Diagnostic and Interventional Imaging, 6, 100027.
Devlin, J., Chang, M. W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Google AI Language. https://arxiv.org/pdf/1810.04805.pdf
Gordon, A. J., Banerjee, I., Block, J., Winstead-Derlega, C., Wilson, J. G., Mitarai, T., Jarrett, M., Sanyal, J., Rubin, D. L., Wintermark, M., & Kohn, M. A. (2022). Natural Language Processing of Head CT Reports to Identify Intracranial Mass Effect: CTIME Algorithm. American Journal of Emergency Medicine, 51, 388-392.
Kalyan, K. S., & Sangeetha, S. (2020). SECNLP: A Survey of Embeddings in Clinical Natural Language Processing. Journal of Biomedical Informatics, 101, 103323.
Lee, H. Y.(2019)。 ELMO, BERT, GPT。取自https://www.youtube.com/watch?v=UYPa347-DdE
Lee, M.(2019)。進擊的BERT:NLP界的巨人之力與遷移學習。https://reurl.cc/RXN8vz
Lord, K., Rothenberg, C., Parwani, V., Finn, E., Khan, A., Sather, J., Ulrich, A., Chaudhry, S., & Venkatesh, A. (2021). Association between Emergency Department Chief Complaint and Adverse Hospitalization Outcomes: A Simple Early Warning System?. The American Journal of Emergency Medicine, 45, 548-550.
Mikolov, T., Chen, K., Corrado, G. S., & Dean, J. (2013). Efficient Estimation of Word Representations in Vector Space. Google AI Language. https://arxiv.org/pdf/1301.3781.pdf
Parker, C. A., Liu, N., Wu, S. X., Shen, Y., Lam, S. S. W., & Ong, M. E. H. (2019). Predicting Hospital Admission at The Emergency Department Triage: A Novel Prediction Model. The American Journal of Emergency Medicine, 37(8), 1498-1504.
Perotte, R., Sugalski, G., Underwood, J. P., & Ullo, M. (2021). Characterizing COVID-19: A Chief Complaint Based Approach. American Journal of Emergency Medicine, (45), 398-403.
Roquette, B. P., Nagano, H., Marujo, E. C., Maiorano, A. C. (2020). Prediction of Admission in Pediatric Emergency Department with Deep Neural Networks and Triage Textual Data. Neural Networks, 126, 170-177.
Shahi, N., Shahi, A. K., Phillips, R., Shirek, G., Lindberg, D. M., & Moulton, S. L. (2021). Using Deep Learning and Natural Language Processing Models to Detect Child Physical Abuse. Journal of Pediatric Surgery, 56(12), 2326-2332.