研究生: |
郭俊麟 Jiun-Lin Guo |
---|---|
論文名稱: |
以Graph cut演算法為基礎的連續影像人臉偵測系統 A Video Face Detection Method Using the Graph Cut Algorithm |
指導教授: |
方瓊瑤
Fang, Chiung-Yao |
學位類別: |
碩士 Master |
系所名稱: |
資訊工程學系 Department of Computer Science and Information Engineering |
論文出版年: | 2012 |
畢業學年度: | 101 |
語文別: | 中文 |
論文頁數: | 96 |
中文關鍵詞: | 圖分割 、膚色偵測 、人臉偵測 、教室 、前景擷取 |
英文關鍵詞: | graph cut, skin color detection, face detection, classroom, background subtraction |
論文種類: | 學術論文 |
相關次數: | 點閱:202 下載:46 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
本論文提出一套可在各種教室情境中運作的人臉偵測系統,偵測對象為教室中的多名學生,主要應用在自動教室觀察與記錄系統中。本研究採用顏色做為人臉偵測時的特徵,且利用graph cut技術做為人臉偵測時主要的方法。
以顏色為特徵的人臉偵測有著較不受頭部轉動和傾斜影響的優點,因為在頭部轉動和傾斜時,膚色依然在人臉中佔有一定比例的面積;至於眼睛、嘴巴和鼻子等其他人臉特徵在頭部轉動和傾斜時(尤其是轉動)在影像中較不穩定。這個顏色特徵的優點對於在教室中進行自動人臉偵測來觀察與記錄學生的行為有很大的幫助,因為在課堂中學生頭部的姿勢變化常常都是有意義的,如疲憊時打瞌睡、表示贊同時點頭或心不在焉時將人臉轉向他處等,而這些變化也往往是教學觀察者們(教師、研究人員)所關心的現象。因此,本系統若能夠在各種頭部姿勢狀態下做人臉偵測,就能夠更進一步地去分析這些姿勢變化和其所代表的意義。
利用顏色特徵來偵測人臉必須選擇一個適當的色彩空間,並且決定人臉的膚色在該色彩空間中的範圍。然而,這類作法常會遇到兩個問題,一是不同的光線以及人種需定義不同的膚色範圍,二是在教室中有許多物體顏色接近膚色(如原木色課桌椅),會降低人臉偵測的正確率。針對第一個問題,本研究提出一個動態的膚色範圍定義方式;而為了解決第二個問題,本研究提出一個穩定的方法在影像中擷取前景(即學生的部分)。此方法結合單點建模與graph cut的技術,可以得到完整不破碎的前景,在前景的範圍內擷取膚色,避免類膚色背景的干擾。
另一方面,利用膚色在Hue色彩空間中高度集中的特質,本研究再次以graph cut技術優化膚色區域的偵測結果,統計收集到的膚色像素、動態更新膚色範圍,以提高偵測的穩定性。
在實驗時,本研究架設單一攝影機來擷取影像,每張影像中均包含4~6位學生。本研究假設初始教室沒有學生,系統首先進行背景建置,待學生進入教室,系統偵測到影像中有前景出現後,便會開始進行人臉偵測。實驗結果顯示,本研究提出的人臉偵測技術,較不受各種頭部轉動和傾斜角度之影響,並且能夠在低解析度影像下,維持高準確率。
We propose a face detection system which is used in classrooms with various environments. The targets are several students in class, whose behaviors are to be observed and recorded in a classroom observation system. The feature chosen for detection is “color”, and the kernel method is the well-known graph cut algorithm.
Color feature is robust against head pose changing because the area of skin regions changes little during head rotating or tilting, while other features like eyes, noses and mouth are unstable under these conditions. This character is useful for observing students’ behaviors in class since it’s usually meaningful when a student change his head pose. For example, one may doze off if he is tired, nod his head to show his agreement, or turn his head out when he distracted. These behaviors are also important events which educational researchers concern. As a result, if we can perform face detection under various head poses, then the results can be used to detect such behaviors mentioned above for further researches.
To detect faces with color feature, we must choose a proper color space first, and determine the range of “skin color”. However, this kind of methods have two problems. First, the range changes with different lighting conditions and human races. Second, there are many non-human object with skin like color which affects on the precision a lot. For the first problem, we propose a dynamic learning scheme to change the skin color range frame by frame. And to solve the second problem, we propose a robust background subtraction method to eliminate non-human object. This method combining pixel based background modeling and the graph cut algorithm extracts complete foreground region from the input frame and thus avoid the effect of skin like background pixels.
On the other side, since the hue values of skin color pixels are distributed concentratedly in hue color band, we apply graph cut to improve the result of skin color detection, and then collect the skin pixels for learning new skin color range.
In the experiments, we set up a single camera, and there are 4~6 students in the image. Assuming an empty classroom in the beginning, the system constructs the background model first. Then, when objects appear in the image, the system will start to perform face detection. According to the experimental result, the technique proposed is robust under various head poses, and retain high precision in the low resolution images.
L. Cheng, M. Gong, D. Schuurmans, and T. Caelli, "Real-Time Discriminative Background Subtraction," IEEE Transactions on Image Processing, vol. 20, no. 5, pp. 1401-1414, 2010.
Z. Wu and R. Leahy, "An Optimal Graph Theoretic Approach to Data Clustering: Theory and its Application to Image Segmentation," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 15, no. 11, pp. 1101-1113, 1993.
O. Juan and Y. Boykov, "Interactive Graph Cuts," IEEE Computer Society Conference on Computer Vision and Pattern Recognition, New York, pp. 1023-1029, 2006.
P. Kohli, "Dynamic Graph Cuts for Efficient Inference in Markov Random Fields," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 29, no. 12, pp. 2079-2088, 2007.
Y. Boykov, "An Experimental Comparison of Min-cut/Max-flow Algorithms for Energy Minimization in Vision," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 26, no. 9, pp. 1124-1137, 2004.
P. Kae and R. Bow, "An Improved Adaptive Background Mixture Model for Real-Time Tracking with Shadow Detection," Proceedings of the 2nd European Workshop on Advanced Video-Based Surveillance Systems, London, 2001.
O. Carmichael and M. Hebert, "Shape-Based Recognition of Wiry Objects," IEEE Transactions on Pattern Analysis and Machine Intelligence, pp.1537-1552, vol.26, no.12, Dec. 2004.
C. Stauffer and W. E. L. Grimson, "Learning Patterns of Activity using Real-Time Tracking," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 22, no. 8, pp. 747-757, 2000.
A. M. Elgammal, R. Duraiswami, D. Harwood, and L. S. Davis, "Background and Foreground Modeling using Nonparametric Kernel Density Estimation for Visual Surveillance," Proceedings of the IEEE, vol. 90, no. 7, pp. 1151-1163, 2002.
A. M. Elgammal, D. Harwood, and L. S. Davis, "Non-Parametric Model for Background Subtraction," Proceedings of the 6th European Conference on Computer Vision-Part II (ECCV '00), pp. 751-767, London, 2000.
K. Kim, T. H. Chalidabhongse, D. Harwood, and L. Davis, "Real-Time Foreground-Background Segmentation using Codebook Model," RealTime Imaging, vol. 11, no. 3, pp. 172-185, 2005.
R. Lienhart, L. Liang, and A. Kuranov, "A Detector Tree of Boosted Classifier for Real-Time Object Detection and Tracking," Proceedings of International Conference on Multimedia and Expo (ICME '03), vol.2, pp. II-277-280, Baltimore, Maryland, 2003.
L. M. Huang, C. S. Ouyang, and S. J. Lee, "A Neuro-Fuzzy Approach for Multiple Human Objects Segmentation," Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, vol. 3, pp. 2815-2820, Washington, 2003.
Y. Yacoob and L. S. Davis, "Detection and Analysis of Hair," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 28, no. 7, pp. 1164-1169, 2006.
K. C. Lee, D. Anguelov, B. Sumengen, and S. B. Gokturk, "Unsupervised Learning from Local Features for Video-Based Face Recognition," Proceedings of the 8th IEEE International Conference on Automatic Face & Gesture Recognition (FG '08), pp. 1-6, Amsterdam, The Netherlands, 2008.
P. Viola and M. J. Jones, "Robust Real-Time Face Detection," International Journal of Computer Vision, vol. 2, no. 57, pp. 137-154, 2004.
A. H. Gee and R. Cipolla, "Determining the Gaze of Faces in Images," Image and Vision Computing, vol. 12, pp. 639-647, 1994.
J. G. Wang and E. Sung, "EM Enhancement of 3D Head Pose Estimated by Point at Infinity," Image and Vision Computing, vol. 25, no. 12, pp. 1864-1874, 2007.
P. Campadelli and R. Lanzarotti, "A Face Recognition System Based on Local Feature Analysis," Proceedings of the 4th international conference on Audio- and video-based biometric person authentication(AVBPA'03), pp. 68-73, Springer-Verlag Berlin, Heidelberg, 2003.
E. Hjelmås and B. K. Low, "Face Detection: A Survey," Computer Vision and Image Understanding, vol. 83, no. 3, pp. 236-274, 2001.
J. Tu, Y. Fu, and T. S. Huang, "Locating Nose-Tips and Estimating Head Poses in Images by Tensorposes," IEEE Transactions on Circuits and Systems for Video Technology, vol. 19 , no. 1, pp. 90-102, 2009.
P. Martins and J. Batista, "Single View Head Pose Estimation," Proceedings of the 15th IEEE International Conference on Image Processing (ICIP '08), pp. 1652-1655, San Diego, California, USA, 2008.
Q. Li, Y. Mao, Z. Wang, and W. Xiang, "Robust Real-Time Detection of Abandoned and Removed Objects," Proceedings of the 5th International Conference on Image and Graphics (ICIG '09), pp. 156-161, Xi'an, China, 2009.
S. C. Huang, "An Advanced Motion Detection Algorithm with Video Quality Analysis for Video Surveillance Systems," IEEE Transactions on Circuits and Systems for Video Technology, vol. 21, no. 1, pp. 1-14, 2011.
M. Vargas, J. M. Milla, S. L. Toral, and F. Barrero, "An Enhanced Background Estimation Algorithm for Vehicle Detection in Urban Traffic Scenes," IEEE Transactions on Vehicular Technology, vol. 59, no. 8, pp. 3694-3709, 2010.
W. Wang, J. Yang, and W. Gao, "Modeling Background and Segmenting Moving Objects from Compressed Video," IEEE Transactions on Circuits and Systems for Video Technology, vol. 18, no. 5, pp. 670-681, 2008.
K. Huang, S. Wang, T. Tan, and S. J. Maybank, "Human Behavior Analysis Based on a New Motion Descriptor," IEEE Transactions on Circuits and Systems for Video Technology, vol. 19, no. 12, pp. 1830-1840, 2009.
H. Wang, Q. Lu, X. Wang, G. Lv, and L. Xu, "Method for video incident detection based on biological visual Mechanism," Proceedings of IEEE International Conference on Automation and Logistics (ICAL '10), pp. 114-119, Hong Kong, 2010.
L. Wang, L. Wang, M. Wen, Q. Zhuo, and W. Wang, "Background Subtraction using Incremental Subspace Learning," Proceedings of IEEE International Conference on Image Processing(ICIP '07), vol. 5, pp. 45-48, San Antonio, Texas, USA, 2007.
Y. Liu and Y. Li, "Robust Moving Object Region Extraction Using a Column and Row Projection Algorithm," Proceedings of the 3rd International Conference on Advanced Computer Theory and Engineering (ICACTE '10), vol. 6, pp. 168-171, Chengdu, China, 2010.
C. C. Chiu, M. Y. Ku, and L. W. Liang, "A Robust Object Segmentation System Using a Probability-Based Background Extraction Algorithm," IEEE Transactions on Circuits and Systems for Video Technology, vol. 20, no. 4, pp. 518-528, 2010.
Y. Mu, H. Zhang, H. Wang, and W. Zuo, "Automatic Video Object Segmentation using Graph Cut," Proceedings of IEEE International Conference on Image Processing(ICIP '07), vol. 3, pp. III-377-III-380, San Antonio, Texas, USA, 2007.
J. Park, A. Tabb, and A. C. Kak, "Hierarchical Data Structure for Real-Time Background Subtraction," Proceedings of IEEE International Conference on Image Processing(ICIP '06), pp. 1849-1852, 8-11, Atlanta, GA, USA, 2006.
X. N. Zhang, J. Jiang, Z. H. Liang and C. L. Liu, "Skin Color Enhancement Based on Favorite Skin Color in HSV Color Space," IEEE Transactions on Consumer Electronics, vol. 56, no. 3, pp.1789-1793, 2010.
D. Chai and K. N. Ngan, "Face Segmentation Using Skin-Color Map in Videophone Applications," IEEE Transactions on Circuits and Systems for Video Technology, vol. 9, no. 4, pp. 551-564, 1999.
Z. Liu, J. Yang and N. S. Peng, "An Efficient Face Segmentation Algorithm Based on Binary Partition Tree," Proceedings of Signal Processing:Image Communication, vol. 20, no. 4, pp. 295-314, Shanghai, 2005.
H. Sahbi and N. Boujemaa, "Coarse to Fine Face Detection Based on Skin ColorAdaption," Proceedings of the International ECCV Workshop, pp. 112-120, Copenhagen, 2002.
T. M. Mahmoud, "A New Fast Skin Color Detection Technique," World Academy of Science, Engineering and Technology, pp. 501-505, 2008.
S. K. Singh, D. S. Chauhan, M. Vatsa and R. Singh, "A Robust Skin Color Based Face Detection Algorithm," Tamkang Journal of Science and Engineering, vol. 6, no. 4, pp. 227-234, 2003.
S. A. Phung, "A Novel Skin Color Model in YCbCr Color Space And Its Application to Human Face Detection," Proceedings of IEEE International Conference on Image Processing(ICIP '02), vol. 1, I-289 - I-292, New York, 2002.
L. Sigal, S. Sclaroff and V. Athitsos, " Skin Color-based Video Segmentation Under Time-varying Illumination," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 26, no. 7, pp. 862-877, 2004.
M. Ying, "Automatic Video Object Segmentation Using Graph Cut," Proceedings of IEEE International Conference on Image Processing(ICIP '07), vol. 3, pp. III-377 - III-380, Texas, Usa, 2007.
C. Rother, V. Kolmogorov, and A. Blake, "Grabcut—Interactive Foreground Extraction Using Iterated Graph Cuts," ACM Transactions on Graphics (SIGGRAPH), pp.309-314, New York, 2004.