研究生: |
李振遠 Chen Yuan Lee |
---|---|
論文名稱: |
基於背景模型的姿勢判斷系統 Arm Gesture Recognition Based on Background Model |
指導教授: |
李忠謀
Lee, Chung-Mou |
學位類別: |
碩士 Master |
系所名稱: |
資訊工程學系 Department of Computer Science and Information Engineering |
論文出版年: | 2011 |
畢業學年度: | 99 |
語文別: | 中文 |
論文頁數: | 44 |
中文關鍵詞: | 姿勢辨識 、高斯混合背景模型 、輪廓圖像 、連通元件 、支持向量機 、人臉偵測 |
英文關鍵詞: | gesture recognition, mixture of Gaussian background model, silhouette image, connected component, Support Vector Machine, face detection |
論文種類: | 學術論文 |
相關次數: | 點閱:212 下載:2 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
姿勢辨識在電腦視覺領域中,特別是針對人體部分是項越來越重要的議題,涵蓋的範圍可分為:手部與手臂的姿勢辨識、頭部與臉部姿勢辨識、整個身體姿勢辨識等種類。在姿勢辨識的問題中,一個很大的瓶頸在於如何在複雜環境下取得所需要的特徵資訊,並且選擇適當的方法將這些資訊完成姿勢辨識。本論文主要目標是在真實的教室裡並且只有一台攝影機拍攝下,能即時(real-time)辨識出講者的手臂姿勢來達到控制投影片的效果,所提出的方法能讓講者在教室投影機照射下,穩定並不受投影機照射並且背景隨著投影片的換頁變化影響下抓取需要的資訊來進行辨識。本論文使用高斯混合背景模(Mixture of Gaussian background model)來擷取出前景(foreground)的輪廓(silhouette)影像,並使用連通元(connected component)將前景輪廓的特徵資訊截取出來,並套入支持向量機(Support Vector Machine,SVM)對手臂動作進行分類。此外,搭配人臉偵測(face detection)方法能分辨出左右手,達到不同手部動作來控制投影片的效果。
Gesture recognition, recognize what poses a human body appears, has become an important issue in computer vision in recent. In general, gesture recognition considers different parts of human body, including head, hand and arm, and the whole body. In order to deal with gesture recognition, we need to well extract body silhouette even in a complex environment, to adopt features for gesture representation, and to design a proper classifier for recognition. In this thesis, our goal is to design a real-time presentation control system in a real classroom by recognizing the lecturer’s arm gestures only with single camera. Our proposed system is robust to strong lighting of projector and slide change in the projection screen. We first employ the mixture of Gaussian background model to segment the body silhouette of foreground. Then, the extracted feature of the body silhouette is classified as arm gestures by Support Vector Machine (SVM). In addition, the adaboosting approach of face detection helps our system to understand the left and the right hand to involve more hand actions for presentation control.
[1] J. K. Aggarwal and Q. Cai, “Human motion analysis: A review,” Comput. Vis. Image Understanding, vol. 72, pp. 428–440, 1999.
[2] D. M. Gavrila, “The visual analysis of human movement: A survey,”Comput. Vis. Image Understanding, vol. 72, pp. 82–98, 1999.
[3] T. C. C. Henry, E. G. R. Janapriya, and L. C. deSilva, “An automatic system for multiple human tracking and actions recognition in office environment,” in Proc. ICASSP, 2003, vol. 3, pp. 45–48.
[4] J. Krumm, S. Harris, B. Meyers, B. Brumitt, M. Hale, and S. Shafer, “Multi-camera multi-person tracking for easy living,” in Proc. 3rd IEEE Int. Workshop Visual Surveillance, Jul. 2000, pp. 3–10.
[5] S. Dagtas, W. A. Khatib, A. Ghafoor, and R. L. Kashyap, “Models for motion-based video indexing and retrieval,” IEEE Trans. Image Process., vol. 9, no. 1, pp. 88–101, Jan. 2000.
[6] J. Xiaofei, and L. Honghai, “Advances in view-invariant human motion analysis: A review,” Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions, vol. 40, no. 1, pp. 13-24, Jan. 2010.
[7] S. Yuping, and H. Foroosh, “View-invariant action recognition from point triplets,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 31, no. 10, pp. 1898-1905, Oct. 2009.
[8] T. B. Moeslund and F. Bajers, “Summaries of 107 computer visionbased human motion capture papers,” Univ. Aalborg, Aalborg, Denmark, Tech. Rep. L1A99–01, 1999.
[9] T. B. Moeslund and E. Granum, “A survey of computer vision-based human motion capture,” Comput. Vis. Image Understand., vol. 81, pp. 231–268, 2001.
[10] D. Demirdjian and T. Darrell, “3-D articulated pose tracking for untethered diectic reference,” in Proc. Int. Conf. Multimodal Interfaces,Pittsburgh, PA, 2002.
[11] P. Fua, A. Gruen, N. D’Apuzzo, and R. Plankers, “Markerless full body shape and motion capture from video sequences,” Int. Arch. Photogramm. Rem. Sens., vol. 34, pp. 256–261, May 2002.
[12] R. Plankers and P. Fua, “Tracking and modeling people in video sequences,” Comput. Vis. Image Understand., vol. 81, Mar. 2001.
[13] R. Cucchiara, C. Grana, M. Piccardi, and A. Prati, “Detecting moving objects, ghosts, and shadows in video streams,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 25, no. 10, pp. 1337-1342, 2003.
[14] B. Shoushtarian, and H. E. Bez, “A practical adaptive approach for dynamic background subtraction using an invariant colour model and object tracking,” Pattern Recognition Letters, vol. 26, no. 1 pp. 5-26, 2005.
[15] C. R. Wren, A. Azarbayejani, T. Darrell, and A. P. Pentland, “Pfinder: Real-time tracking of the human body,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 19, no. 7, pp. 780-785, 1997.
[16] Qi Zang and Reinhard Klette,“Parameter analysis for mixture of Gaussians model,” Communication and Information Technology Research Technical Report 188, 2006.
[17] C. Stauffer, and W. E. L. Grimson, “Adaptive background mixture models for real-time tracking,” IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 246-252, 1999.
[18] P. KaewTraKulPong, and R. Bowden, “An improved background mixture model for real-time tracking with shadow detection,” Proc. 2nd European Workshop on Advanced Video Based Surveillance Systems, vol. 25, 2001.
[19] S. E. Chen, “QuickTime VR – An image based approach to virtual environment navigation,” Proc. SIGGRAPH 95, pp. 29-38, 1995.
[20] Y. Ren, C. S. Chua, and Y. K. Ho, “Statistical background modeling for non-stationary camera,” Pattern Recognition Letters, vol. 24, pp. 183-196, 2003.
[21] M. Singh, A. Basu, and M.K. Mandal, “Human activity recognition based on silhouette directionality,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 18, noO. 9, 2008
[22] Correa Hernandez, P. , Czyz, J. , Marques, F. , Umeda, T. , Marichal, X. , Macq, B. “Bayesian approach for morphology-based 2-D human motion capture,” IEEE Transactions on Multimedia, volL. 9, no. 4, 2007
[23] D. Chen, C. Fookes, “Labelled silhouettes for human pose estimation,” 10th International Conference on Information Sciences Signal Processing and their Applications (ISSPA), 2010
[24] Lim Siew Hooi , G. Sainarayanan, Liau Chung Fan, “Human pose modelling and body tracking from monocular video sequences,” International Conference on Intelligent and Advanced Systems, 2007
[25] A. Datta, M. Shah, N. Da Vitoria Lobo, “Person-on-person violence detection in video data,” 16th International Conference on Pattern Recognition, 2002
[26] E. Peng , L. Li, “Acquiring human skeleton proportions from monocular images without posture estimation,” 10th International Conference on Control, Automation, Robotics and Vision, 2008
[27] Jianhao Ding, Yigang Wang, Lingyun Yu, “Extraction of human body skeleton based on silhouette images,” Second International Workshop on Education Technology and Computer Science (ETCS), 2010
[28] Meng Li , Tao Yang , Runping Xi , Zenggang Lin, “Silhouette-based 2-D human pose estimation,” Fifth International Conference on Image and Graphics, 2009
[29] Fei Xie, Guili Xu, Yuehua Cheng, Yupeng Tian, “An improved thinning algorithm for human body recognition,” IEEE International Workshop on Imaging Systems and Techniques, 2009
[30] Chia-Feng Juang, Chia-Ming Chang, Jiuh-Rou Wu, D. Lee, “Computer vision-based human body segmentation and posture estimation,” IEEE Transactions on Systems, Man and Cybernetics, Part A: Systems and Humans, vol. 39, no. 1, 2009
[31] B.-W. Min, H.-S. Yoon, J. Soh, Y.-M. Yang, and T. Ejima, “ Hand gesture recognition using hidden Markov models,” IEEE International Conference on Systems, Man, and Cybernetics, 'Computational Cybernetics and Simulation'., , Florida, USA, pp. 4232-4235, Oct. 1997.
[32] J. Lafferty, A. McCallum, and F. Pereira, “Conditional random fields: Probabilistic models for segmenting and labeling sequence data,” International Conference on Machine Learning, Williams College, Williamstown, MA, USA, pp. 282-289, June 2001.
[33] Rafael C. Gonzales and Richard E. Woods, Digital Image Processing. 3rd ed. Prentice Hall, Inc. 2008.
[34] Corinna Cortes and V. Vapnik, Support-Vector Networks. Kluwer Academic, Boston. 1995.