Unsupervised landmark discovery via self-training correspondence

Object parts, also known as landmarks, convey information about an object’s shape and spatial configuration in 3D space, especially for deformable objects. The goal of landmark detection is to have a model that, for a particular object instance, can estimate the locations of its parts. Research in t...

Full description

Bibliographic Details
Main Author: Mallis, Dimitrios
Format: Thesis (University of Nottingham only)
Language:English
Published: 2023
Subjects:
Online Access:https://eprints.nottingham.ac.uk/72953/
_version_ 1848800754833293312
author Mallis, Dimitrios
author_facet Mallis, Dimitrios
author_sort Mallis, Dimitrios
building Nottingham Research Data Repository
collection Online Access
description Object parts, also known as landmarks, convey information about an object’s shape and spatial configuration in 3D space, especially for deformable objects. The goal of landmark detection is to have a model that, for a particular object instance, can estimate the locations of its parts. Research in this field is mainly driven by supervised approaches, where a sufficient amount of human-annotated data is available. As annotating landmarks for all objects is impractical, this thesis focuses on learning landmark detectors without supervision. Despite good performance on limited scenarios (objects showcasing minor rigid deformation), unsupervised landmark discovery mostly remains an open problem. Existing work fails to capture semantic landmarks, i.e. points similar to the ones assigned by human annotators and may not generalise well to highly articulated objects like the human body, complicated backgrounds or large viewpoint variations. In this thesis, we propose a novel self-training framework for the discovery of unsupervised landmarks. Contrary to existing methods that build on auxiliary tasks such as image generation or equivariance, we depart from generic keypoints and train a landmark detector and descriptor to improve itself, tuning the keypoints into distinctive landmarks. We propose an iterative algorithm that alternates between producing new pseudo-labels through feature clustering and learning distinctive features for each pseudo-class through contrastive learning. Our detector can discover highly semantic landmarks, that are more flexible in terms of capturing large viewpoint changes and out-of-plane rotations (3D rotations). New state-of-the-art performance is achieved in multiple challenging datasets.
first_indexed 2025-11-14T20:56:35Z
format Thesis (University of Nottingham only)
id nottingham-72953
institution University of Nottingham Malaysia Campus
institution_category Local University
language English
last_indexed 2025-11-14T20:56:35Z
publishDate 2023
recordtype eprints
repository_type Digital Repository
spelling nottingham-729532023-07-26T04:40:18Z https://eprints.nottingham.ac.uk/72953/ Unsupervised landmark discovery via self-training correspondence Mallis, Dimitrios Object parts, also known as landmarks, convey information about an object’s shape and spatial configuration in 3D space, especially for deformable objects. The goal of landmark detection is to have a model that, for a particular object instance, can estimate the locations of its parts. Research in this field is mainly driven by supervised approaches, where a sufficient amount of human-annotated data is available. As annotating landmarks for all objects is impractical, this thesis focuses on learning landmark detectors without supervision. Despite good performance on limited scenarios (objects showcasing minor rigid deformation), unsupervised landmark discovery mostly remains an open problem. Existing work fails to capture semantic landmarks, i.e. points similar to the ones assigned by human annotators and may not generalise well to highly articulated objects like the human body, complicated backgrounds or large viewpoint variations. In this thesis, we propose a novel self-training framework for the discovery of unsupervised landmarks. Contrary to existing methods that build on auxiliary tasks such as image generation or equivariance, we depart from generic keypoints and train a landmark detector and descriptor to improve itself, tuning the keypoints into distinctive landmarks. We propose an iterative algorithm that alternates between producing new pseudo-labels through feature clustering and learning distinctive features for each pseudo-class through contrastive learning. Our detector can discover highly semantic landmarks, that are more flexible in terms of capturing large viewpoint changes and out-of-plane rotations (3D rotations). New state-of-the-art performance is achieved in multiple challenging datasets. 2023-07-26 Thesis (University of Nottingham only) NonPeerReviewed application/pdf en cc_by https://eprints.nottingham.ac.uk/72953/1/Thesis_Mallis_revised_version.pdf Mallis, Dimitrios (2023) Unsupervised landmark discovery via self-training correspondence. PhD thesis, University of Nottingham. unsupervised Landmark discovery self-training clustering correspondence keypoints
spellingShingle unsupervised Landmark discovery
self-training
clustering correspondence
keypoints
Mallis, Dimitrios
Unsupervised landmark discovery via self-training correspondence
title Unsupervised landmark discovery via self-training correspondence
title_full Unsupervised landmark discovery via self-training correspondence
title_fullStr Unsupervised landmark discovery via self-training correspondence
title_full_unstemmed Unsupervised landmark discovery via self-training correspondence
title_short Unsupervised landmark discovery via self-training correspondence
title_sort unsupervised landmark discovery via self-training correspondence
topic unsupervised Landmark discovery
self-training
clustering correspondence
keypoints
url https://eprints.nottingham.ac.uk/72953/