Robust RGB-D face recognition using Kinect sensor

In this paper we propose a robust face recognition algorithm for low resolution RGB-D Kinect data. Many techniques are proposed for image preprocessing due to the noisy depth data. First, facial symmetry is exploited based on the 3D point cloud to obtain a canonical frontal view image irrespective o...

Full description

Bibliographic Details
Main Authors: Li, B., Xue, M., Mian, A., Liu, Wan-Quan, Krishna, A.
Format: Journal Article
Published: Elsevier BV 2015
Online Access:http://hdl.handle.net/20.500.11937/24404
_version_ 1848751419146895360
author Li, B.
Xue, M.
Mian, A.
Liu, Wan-Quan
Krishna, A.
author_facet Li, B.
Xue, M.
Mian, A.
Liu, Wan-Quan
Krishna, A.
author_sort Li, B.
building Curtin Institutional Repository
collection Online Access
description In this paper we propose a robust face recognition algorithm for low resolution RGB-D Kinect data. Many techniques are proposed for image preprocessing due to the noisy depth data. First, facial symmetry is exploited based on the 3D point cloud to obtain a canonical frontal view image irrespective of the initial pose and then depth data is converted to XYZ normal maps. Secondly, multi-channel Discriminant Transforms are then used to project RGB to DCS (Discriminant Color Space) and normal maps to DNM (Discriminant Normal Maps). Finally, a Multi-channel Robust Sparse Coding method is proposed that codes the multiple channels (DCS or DNM) of a test image as a sparse combination of training samples with different pixel weighting. Weights are calculated dynamically in an iterative process to achieve robustness against variations in pose, illumination, facial expressions and disguise. In contrast to existing techniques, our multi-channel approach is more robust to variations. Reconstruction errors of the test image (DCS and DNM) are normalized and fused to decide its identity. The proposed algorithm is evaluated on four public databases. It achieves 98.4% identification rate on CurtinFaces, a Kinect database with 4784 RGB-D images of 52 subjects. Using a first versus all protocol on the Bosphorus, CASIA and FRGC v2 databases, the proposed algorithm achieves 97.6%, 95.6% and 95.2% identification rates respectively. To the best of our knowledge, these are the highest identification rates reported so far for the first three databases.
first_indexed 2025-11-14T07:52:25Z
format Journal Article
id curtin-20.500.11937-24404
institution Curtin University Malaysia
institution_category Local University
last_indexed 2025-11-14T07:52:25Z
publishDate 2015
publisher Elsevier BV
recordtype eprints
repository_type Digital Repository
spelling curtin-20.500.11937-244042017-09-13T15:07:14Z Robust RGB-D face recognition using Kinect sensor Li, B. Xue, M. Mian, A. Liu, Wan-Quan Krishna, A. In this paper we propose a robust face recognition algorithm for low resolution RGB-D Kinect data. Many techniques are proposed for image preprocessing due to the noisy depth data. First, facial symmetry is exploited based on the 3D point cloud to obtain a canonical frontal view image irrespective of the initial pose and then depth data is converted to XYZ normal maps. Secondly, multi-channel Discriminant Transforms are then used to project RGB to DCS (Discriminant Color Space) and normal maps to DNM (Discriminant Normal Maps). Finally, a Multi-channel Robust Sparse Coding method is proposed that codes the multiple channels (DCS or DNM) of a test image as a sparse combination of training samples with different pixel weighting. Weights are calculated dynamically in an iterative process to achieve robustness against variations in pose, illumination, facial expressions and disguise. In contrast to existing techniques, our multi-channel approach is more robust to variations. Reconstruction errors of the test image (DCS and DNM) are normalized and fused to decide its identity. The proposed algorithm is evaluated on four public databases. It achieves 98.4% identification rate on CurtinFaces, a Kinect database with 4784 RGB-D images of 52 subjects. Using a first versus all protocol on the Bosphorus, CASIA and FRGC v2 databases, the proposed algorithm achieves 97.6%, 95.6% and 95.2% identification rates respectively. To the best of our knowledge, these are the highest identification rates reported so far for the first three databases. 2015 Journal Article http://hdl.handle.net/20.500.11937/24404 10.1016/j.neucom.2016.06.012 Elsevier BV restricted
spellingShingle Li, B.
Xue, M.
Mian, A.
Liu, Wan-Quan
Krishna, A.
Robust RGB-D face recognition using Kinect sensor
title Robust RGB-D face recognition using Kinect sensor
title_full Robust RGB-D face recognition using Kinect sensor
title_fullStr Robust RGB-D face recognition using Kinect sensor
title_full_unstemmed Robust RGB-D face recognition using Kinect sensor
title_short Robust RGB-D face recognition using Kinect sensor
title_sort robust rgb-d face recognition using kinect sensor
url http://hdl.handle.net/20.500.11937/24404