簡易檢索 / 詳目顯示

研究生: 郭秉璋
Bing-Chang, Kuo
論文名稱: 基於Histogram of Oriented Gradients之課堂舉手辨識研究
Histogram of Oriented Gradients based Arm Gesture Recognition Research
指導教授: 李忠謀
學位類別: 碩士
Master
系所名稱: 資訊工程學系
Department of Computer Science and Information Engineering
論文出版年: 2012
畢業學年度: 100
語文別: 中文
論文頁數: 44
中文關鍵詞: 姿勢辨識人物分割k-meansHOG
英文關鍵詞: Gesture recognition, People segmentation, dynamic k-means, HOG
論文種類: 學術論文
相關次數: 點閱:136下載:5
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 在現今教學環境中,以科技融入教學的e化教室越來越普遍,教師希望可以透過各類的e化工具來了解學生的學習情況。而在種種學生的動作中,舉手是最容易在課堂上與老師互動的,因此本研究提供了學生舉手辨識系統,希望借由該系統讓老師可以即時快速的掌握學生的情況。本研究使用少量攝影機在多人且複雜背景的情況下進行辨識,希望可以降低環境因素的影響。為了達到此一目的,本研究主要分為二個部份:複雜背景人物辨識分割與舉手判斷。

    在複雜背景人物辨識與分割部分,使用了k-means clustering與motion的結合,將膚色取出來加上motion資訊能夠完整的抓出人物。在舉手判斷的部份,使用了Histogram of Oriented Gradient(HOG)製作出edge的feature進行舉左手、正常狀態、舉右手的辨識。實驗中也交叉比對了不同人物在場景的辨識、光線影響圖片的辨識。其正確率在同一日當model的情況下平均高達91%,而在不同日的情況下也有7-8成的水準。

    In recently classroom environment, there are more and more teachers using electronic devices to help them easy to understand what students think and how students act. In all gestures, raising hand is the most popular way that students interacting with teachers. In this research, we provided a raising hand recognition system to help teacher to handle all students’ behavior. We use camera in complex background and monitor multiple people. To propose a system that can satisfy all environments and will not re-train after changing environments, we separate the system into two parts: people segmentation and gesture recognition.

    In people segmentation part, we use k-means clustering to extract skin color and then use motion to remove skin-liked background. In gesture recognition part, we use histogram of oriented gradient to get the gesture feature and then use SVM to classify. Finally in experimental part, we test 3 scenes to verify our method. When we use the same case to train and test, the correct rate is average 91%. Even we use different day for training/testing, the correct rate can also reach 80%.

    圖目錄 III 表目錄 IV 第一章 簡介 7 1.1 序論 7 1.2 系統目的 10 1.3 研究困難 11 1.4 論文結構 12 第二章 文獻探討 13 2.1 教育研究 13 2.2 人物的分割與多攝影機的校正 14 2.3 人物肢體辨識 14 2.4 過去影像處理技術討論 17 第三章 系統架構與運作流程 21 3.1 人物辨識系統 21 3.2 舉手辨識系統 26 第四章 實驗結果與分析 31 4.1 實驗目的 32 4.2 實驗環境 32 4.3 實驗方法與評分方式 33 4.4 Data sets 34 4.5 實驗結果 36 4.6 分析 39 第五章 結論 43 參考文獻 A.1 

    [Dav05] David A. Sadlier and Noel E. O’Connor, “Event Detection in Field Sports Video Using Audio-Visual Features and a Support Vector Machine,” IEEE Transactions on Circuits and Systems for Video Technology, pp. 1225-1233, 2005.
    [Gua09] Guangyu Zhu, Ming Yang, Kai Yu, Wei Xu, and Yihong Gong, “Detecting video events based on action recognition in complex scenes using spatio-temporal descriptor,” Proceedings of the seventeen ACM international conference on Multimedia, pp. 165-174, 2009
    [Han09] J. Han, G. Awad, and A. Sutherland, “Automatic Skin Segmentation and Tracking in Sign Language Recognition,” Computer Vision, IET, pp. 24-35, 2009.
    [Jin10] Jina Lee and Stacy C. Marsella, “Predicting Speaker Head Nods and the Effects of Affective Information,” IEEE Transactions on Multimedia, pp. 552-562, 2010.
    [Jua05] Juan P. Wachs, Helman Stern, and Yael Edan, “Cluster Labeling and Parameter Estimation for the Automated Setup of a Hand-Gesture Recognition System,” IEEE Transactions on Systems, pp. 932-944, 2005.
    [Lal01] Lalit Gupta and Suwei Ma, “Gesture-Based Interaction and Communication: Automated Classification of Hand Gesture Contours,” IEEE Transactions on Systems, Man, and Cybernetics – Part C: Applications and Reviews, pp. 114-120, 2011.
    [Lil00] Lily Lee, Raquel Romano, and Gideon Stein, “Monitoring Activities from Multiple Video Stream: Establishing a Common Coordinate Frame,” IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 758-767, 2000.
    [Nav05] Navneet Dalal and Bill Triggs, “Histograms of Oriented Gradients for Human Detection,” IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 886-893, Vol. 1, 2005
    [Qin08] Qing Chen, Nicolas D. Georganas, “Hand Gesture Recognition Using Haar-Like Features and a Stochastic Context-Free Grammar,” IEEE Transactions on Instrumentation and Measurement, pp. 1562-1571, 2008.
    [Sar08] Sargin, M.E., Yemez, Y., Erezin, E., and Tekalp, A.M., “Analysis of Head Gesture and Prosody Patterns for Prosody-Driven Head-Gesture Animation,” IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 1330-1345, 2008.
    [Sus07] Sushmita Mitra and Tinku Acharya, “Gesture Recognition: A Survey,” IEEE Transactions on Systems, Man, and Cybernetics—Part C: Applications and Reviews, pp.311-324, 2007.
    [Wed10] Wendy L. Bjorklund and Diana L. Rehling, “Student Perceptions of Classroom Incivility,” College Teaching, pp. 15-18, 2010.
    [Wei06] Weiming Hu, Min Hu, Xue Zhou, and Tieniu Ten, “Principal Axis-Based Correspondence between Multiple Cameras for People Tracking,” IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 663-671, 2006.
    [Xia09] Xiaogang Wang, Kinh Tieu, and Grimson, E.L., “Correspondence-Free Activity Analysis and Scene Modeling in Multiple Camera Views,” IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 56-71, 2009.
    [Yu09] Yu-Ting Pai, Li-Te Lee, Shanq-Jang Ruan, Yen-Hsiang Chen, Saraju P. Mohanty, and Elias Kougianos, “Honeycomb Model Based Skin Color Detector for Face Detection,” International Conference on Mechatronics and Machine Vision in Practice, pp. 11-16, 2008.
    [1] 吳怡君,「小筆電融入國小五年級英語科教學之研究」,國立台灣師範大學,碩士論文,2010年。
    [2] 紀宜岑,「小筆電於國小教學應用之個案研究」,國立台灣師範大學,碩士論文,2009年。
    [3] T客邦,「身體就是控制器,微軟Kinect是怎麼做到的?」,取自“http://www.techbang.com.tw/posts/2936-get-to-know-how-it-works-kinect.”
    [4] 「中華大學成功研發智慧型教室~人臉辨識自動點名系統」,取自“http://ctee.com.tw/News/view.aspx?newsid=3772”
    [5] 康寧醫護暨管理專科學校,「幼兒行為觀察紀錄法」,取自“http://www.knjc.edu.tw/teaching/echild/%E6%88%91%E5%80%91%E7%9A%84%E5%B8%AB%E8%B3%87/%E6%95%99%E5%B8%AB%E5%9F%BA%E6%9C%AC%E8%B3%87%E6%96%99/%E8%89%AF%E8%AA%A0/%E8%A1%8C%E7%82%BA%E8%A7%80%E5%AF%9F/%E5%B9%BC%E5%85%92%E8%A1%8C%E7%82%BA%E8%A7%80%E5%AF%9F%E8%A8%98%E9%8C%84%E6%B3%95.ppt”.
    [6] 永吉國中 鄭玉玲老師,「教學輔導之教學觀察」,取自http://elearning.ice.ntnu.edu.tw/pfo_down.asp?psn=42096。
    [7] 成大醫學院神經科 蔡景仁,「醫學生的不發問、怕被問和怕被指正」,醫學教育通訊,第23期,2002年。
    [9] 台大資訊工程學系 陳昱廷,「淺談行人偵測」,取 自http://140.113.87.112/vol_13/tech2.htm
    [10] “Histogram of oriented gradient”,取自http://en.wikipedia.org/wiki/Histogram_of_oriented_gradients_(HOG)
    [11] Chih-Chung Chang and Chih-Jen Lin, “LIBSVM,” 取自http://www.csie.ntu.edu.tw/~cjlin/libsvm/

    下載圖示
    QR CODE