Towards uncertainty-aware and label-efficient machine learning of human expressive behaviour

The ability to recognise emotional expressions from non-verbal behaviour plays a key role in human-human interaction. Endowing machines with the same ability is critical to enriching human-computer interaction. Despite receiving widespread attention so far, human-level automatic recognition of affec...

Full description

Bibliographic Details
Main Author:	Tellamekala, Mani Kumar
Format:	Thesis (University of Nottingham only)
Language:	English
Published:	2022
Subjects:	Emotion Recognition Machine Learning Uncertainty Modelling Self-supervised Learning Probabilistic Machine Learning Models Stochastic Process Regression Uncertainty-Aware Learning Label Ambiguity
Online Access:	https://eprints.nottingham.ac.uk/71876/

_version_	1848800699149713408
author	Tellamekala, Mani Kumar
author_facet	Tellamekala, Mani Kumar
author_sort	Tellamekala, Mani Kumar
building	Nottingham Research Data Repository
collection	Online Access
description	The ability to recognise emotional expressions from non-verbal behaviour plays a key role in human-human interaction. Endowing machines with the same ability is critical to enriching human-computer interaction. Despite receiving widespread attention so far, human-level automatic recognition of affective expressions is still an elusive task for machines. Towards improving the current state of machine learning methods applied to affect recognition, this thesis identifies two challenges: label ambiguity and label scarcity. Firstly, this thesis notes that it is difficult to establish a clear one-to-one mapping between inputs (face images or speech segments) and their target emotion labels, considering that emotion perception is inherently subjective. As a result, the problem of label ambiguity naturally arises in the manual annotations of affect. Ignoring this fundamental problem, most existing affect recognition methods implicitly assume a one-to-one input-target mapping and use deterministic function learning. In contrast, this thesis proposes to learn non-deterministic functions based on uncertainty-aware probabilistic models, as they can naturally accommodate the one-to-many input-target mapping. Besides improving the affect recognition performance, the proposed uncertainty-aware models in this thesis demonstrate three important applications: adaptive multimodal affect fusion, human-in-the-loop learning of affect, and improved performance on downstream behavioural analysis tasks like personality traits estimation. Secondly, this thesis aims to address the challenge of scarcity of affect labelled datasets, caused by the cumbersome and time-consuming nature of the affect annotation process. To this end, this thesis notes that audio and visual feature encoders used in the existing models are label-inefficient i.e. learning them requires large amounts of labelled training data. As a solution, this thesis proposes to pre-train the feature encoders using unlabelled data to make them more label-efficient i.e. using as few labelled training examples as possible to achieve good emotion recognition performance. A novel self-supervised pre-training method is proposed in this thesis by posing hand-engineered emotion features as task-specific representation learning priors. By leveraging large amounts of unlabelled audiovisual data, the proposed self-supervised pre-training method demonstrates much better label efficiency compared to the commonly employed pre-training methods.
first_indexed	2025-11-14T20:55:42Z
format	Thesis (University of Nottingham only)
id	nottingham-71876
institution	University of Nottingham Malaysia Campus
institution_category	Local University
language	English
last_indexed	2025-11-14T20:55:42Z
publishDate	2022
recordtype	eprints
repository_type	Digital Repository
spelling	nottingham-718762022-12-15T09:56:33Z https://eprints.nottingham.ac.uk/71876/ Towards uncertainty-aware and label-efficient machine learning of human expressive behaviour Tellamekala, Mani Kumar The ability to recognise emotional expressions from non-verbal behaviour plays a key role in human-human interaction. Endowing machines with the same ability is critical to enriching human-computer interaction. Despite receiving widespread attention so far, human-level automatic recognition of affective expressions is still an elusive task for machines. Towards improving the current state of machine learning methods applied to affect recognition, this thesis identifies two challenges: label ambiguity and label scarcity. Firstly, this thesis notes that it is difficult to establish a clear one-to-one mapping between inputs (face images or speech segments) and their target emotion labels, considering that emotion perception is inherently subjective. As a result, the problem of label ambiguity naturally arises in the manual annotations of affect. Ignoring this fundamental problem, most existing affect recognition methods implicitly assume a one-to-one input-target mapping and use deterministic function learning. In contrast, this thesis proposes to learn non-deterministic functions based on uncertainty-aware probabilistic models, as they can naturally accommodate the one-to-many input-target mapping. Besides improving the affect recognition performance, the proposed uncertainty-aware models in this thesis demonstrate three important applications: adaptive multimodal affect fusion, human-in-the-loop learning of affect, and improved performance on downstream behavioural analysis tasks like personality traits estimation. Secondly, this thesis aims to address the challenge of scarcity of affect labelled datasets, caused by the cumbersome and time-consuming nature of the affect annotation process. To this end, this thesis notes that audio and visual feature encoders used in the existing models are label-inefficient i.e. learning them requires large amounts of labelled training data. As a solution, this thesis proposes to pre-train the feature encoders using unlabelled data to make them more label-efficient i.e. using as few labelled training examples as possible to achieve good emotion recognition performance. A novel self-supervised pre-training method is proposed in this thesis by posing hand-engineered emotion features as task-specific representation learning priors. By leveraging large amounts of unlabelled audiovisual data, the proposed self-supervised pre-training method demonstrates much better label efficiency compared to the commonly employed pre-training methods. 2022-12-14 Thesis (University of Nottingham only) NonPeerReviewed application/pdf en cc_by https://eprints.nottingham.ac.uk/71876/1/Mani_PhD_Thesis_with_Final_Corrections.pdf Tellamekala, Mani Kumar (2022) Towards uncertainty-aware and label-efficient machine learning of human expressive behaviour. PhD thesis, University of Nottingham. Emotion Recognition Machine Learning Uncertainty Modelling Self-supervised Learning Probabilistic Machine Learning Models Stochastic Process Regression Uncertainty-Aware Learning Label Ambiguity
spellingShingle	Emotion Recognition Machine Learning Uncertainty Modelling Self-supervised Learning Probabilistic Machine Learning Models Stochastic Process Regression Uncertainty-Aware Learning Label Ambiguity Tellamekala, Mani Kumar Towards uncertainty-aware and label-efficient machine learning of human expressive behaviour
title	Towards uncertainty-aware and label-efficient machine learning of human expressive behaviour
title_full	Towards uncertainty-aware and label-efficient machine learning of human expressive behaviour
title_fullStr	Towards uncertainty-aware and label-efficient machine learning of human expressive behaviour
title_full_unstemmed	Towards uncertainty-aware and label-efficient machine learning of human expressive behaviour
title_short	Towards uncertainty-aware and label-efficient machine learning of human expressive behaviour
title_sort	towards uncertainty-aware and label-efficient machine learning of human expressive behaviour
topic	Emotion Recognition Machine Learning Uncertainty Modelling Self-supervised Learning Probabilistic Machine Learning Models Stochastic Process Regression Uncertainty-Aware Learning Label Ambiguity
url	https://eprints.nottingham.ac.uk/71876/

Towards uncertainty-aware and label-efficient machine learning of human expressive behaviour

Similar Items