How far are we from solving the 2D & 3D face alignment problem? (and a dataset of 230,000 3D facial landmarks)

This paper investigates how far a very deep neural network is from attaining close to saturating performance on existing 2D and 3D face alignment datasets. To this end, we make the following 5 contributions: (a) we construct, for the first time, a very strong baseline by combining a state-of-the-art...

Full description

Bibliographic Details
Main Authors: Bulat, Adrian, Tzimiropoulos, Georgios
Format: Conference or Workshop Item
Published: 2017
Online Access:https://eprints.nottingham.ac.uk/44749/
_version_ 1848796989663215616
author Bulat, Adrian
Tzimiropoulos, Georgios
author_facet Bulat, Adrian
Tzimiropoulos, Georgios
author_sort Bulat, Adrian
building Nottingham Research Data Repository
collection Online Access
description This paper investigates how far a very deep neural network is from attaining close to saturating performance on existing 2D and 3D face alignment datasets. To this end, we make the following 5 contributions: (a) we construct, for the first time, a very strong baseline by combining a state-of-the-art architecture for landmark localization with a state-of-the-art residual block, train it on a very large yet synthetically expanded 2D facial landmark dataset and finally evaluate it on all other 2D facial landmark datasets. (b)We create a guided by 2D landmarks network which converts 2D landmark annotations to 3D and unifies all existing datasets, leading to the creation of LS3D-W, the largest and most challenging 3D facial landmark dataset to date (~230,000 images). (c) Following that, we train a neural network for 3D face alignment and evaluate it on the newly introduced LS3D-W. (d) We further look into the effect of all “traditional” factors affecting face alignment performance like large pose, initialization and resolution, and introduce a “new” one, namely the size of the network. (e) We show that both 2D and 3D face alignment networks achieve performance of remarkable accuracy which is probably close to saturating the datasets used. Training and testing code as well as the dataset can be downloaded from https: //www.adrianbulat.com/face-alignment/
first_indexed 2025-11-14T19:56:45Z
format Conference or Workshop Item
id nottingham-44749
institution University of Nottingham Malaysia Campus
institution_category Local University
last_indexed 2025-11-14T19:56:45Z
publishDate 2017
recordtype eprints
repository_type Digital Repository
spelling nottingham-447492020-05-04T19:13:58Z https://eprints.nottingham.ac.uk/44749/ How far are we from solving the 2D & 3D face alignment problem? (and a dataset of 230,000 3D facial landmarks) Bulat, Adrian Tzimiropoulos, Georgios This paper investigates how far a very deep neural network is from attaining close to saturating performance on existing 2D and 3D face alignment datasets. To this end, we make the following 5 contributions: (a) we construct, for the first time, a very strong baseline by combining a state-of-the-art architecture for landmark localization with a state-of-the-art residual block, train it on a very large yet synthetically expanded 2D facial landmark dataset and finally evaluate it on all other 2D facial landmark datasets. (b)We create a guided by 2D landmarks network which converts 2D landmark annotations to 3D and unifies all existing datasets, leading to the creation of LS3D-W, the largest and most challenging 3D facial landmark dataset to date (~230,000 images). (c) Following that, we train a neural network for 3D face alignment and evaluate it on the newly introduced LS3D-W. (d) We further look into the effect of all “traditional” factors affecting face alignment performance like large pose, initialization and resolution, and introduce a “new” one, namely the size of the network. (e) We show that both 2D and 3D face alignment networks achieve performance of remarkable accuracy which is probably close to saturating the datasets used. Training and testing code as well as the dataset can be downloaded from https: //www.adrianbulat.com/face-alignment/ 2017-10-24 Conference or Workshop Item PeerReviewed Bulat, Adrian and Tzimiropoulos, Georgios (2017) How far are we from solving the 2D & 3D face alignment problem? (and a dataset of 230,000 3D facial landmarks). In: International Conference on Computer Vision (ICCV17), 22-29 Oct 2017, Venice, Italy. http://ieeexplore.ieee.org/document/8237378/
spellingShingle Bulat, Adrian
Tzimiropoulos, Georgios
How far are we from solving the 2D & 3D face alignment problem? (and a dataset of 230,000 3D facial landmarks)
title How far are we from solving the 2D & 3D face alignment problem? (and a dataset of 230,000 3D facial landmarks)
title_full How far are we from solving the 2D & 3D face alignment problem? (and a dataset of 230,000 3D facial landmarks)
title_fullStr How far are we from solving the 2D & 3D face alignment problem? (and a dataset of 230,000 3D facial landmarks)
title_full_unstemmed How far are we from solving the 2D & 3D face alignment problem? (and a dataset of 230,000 3D facial landmarks)
title_short How far are we from solving the 2D & 3D face alignment problem? (and a dataset of 230,000 3D facial landmarks)
title_sort how far are we from solving the 2d & 3d face alignment problem? (and a dataset of 230,000 3d facial landmarks)
url https://eprints.nottingham.ac.uk/44749/
https://eprints.nottingham.ac.uk/44749/