Some Topics in Topological Data Analysis

In recent years there has been growing interest within Statistics in topological aspects of random objects, one important direction being Topological Data Analysis (TDA) and the associated concept of Persistent Homology. This research aims to investigate both theoretical and computational aspects of...

Full description

Bibliographic Details
Main Author: Di, Yang
Format: Thesis (University of Nottingham only)
Language:English
Published: 2021
Subjects:
Online Access:https://eprints.nottingham.ac.uk/66766/
_version_ 1848800356707860480
author Di, Yang
author_facet Di, Yang
author_sort Di, Yang
building Nottingham Research Data Repository
collection Online Access
description In recent years there has been growing interest within Statistics in topological aspects of random objects, one important direction being Topological Data Analysis (TDA) and the associated concept of Persistent Homology. This research aims to investigate both theoretical and computational aspects of TDA. In the first strand of this research the aim is to generalize the central limit theorem (CLT) given by Kahle and Meckes (2013, 2015) for Betti numbers in Erdős–Rényi random graphs, to a CLT for Betti numbers in the stochastic block model. In addressing this problem, we have provided results on the spectral structure of the adjacency matrix and the normalized graph Laplacian in stochastic block models which appear to be new. The second strand of the research is to investigate numerically the relationship between the topological summaries computed under the full sample and under subsamples. Subsampling often needs to be considered because existing computational algorithms for TDA tend to break down for larger sample sizes as computational demands grow rapidly with sample size. One important finding is that subsampling which exploits existing structure in the data is likely to do much better than purely random subsampling. In this PhD thesis, numerical results are given for various types of simulated data through to real datasets.
first_indexed 2025-11-14T20:50:16Z
format Thesis (University of Nottingham only)
id nottingham-66766
institution University of Nottingham Malaysia Campus
institution_category Local University
language English
last_indexed 2025-11-14T20:50:16Z
publishDate 2021
recordtype eprints
repository_type Digital Repository
spelling nottingham-667662024-01-16T16:08:32Z https://eprints.nottingham.ac.uk/66766/ Some Topics in Topological Data Analysis Di, Yang In recent years there has been growing interest within Statistics in topological aspects of random objects, one important direction being Topological Data Analysis (TDA) and the associated concept of Persistent Homology. This research aims to investigate both theoretical and computational aspects of TDA. In the first strand of this research the aim is to generalize the central limit theorem (CLT) given by Kahle and Meckes (2013, 2015) for Betti numbers in Erdős–Rényi random graphs, to a CLT for Betti numbers in the stochastic block model. In addressing this problem, we have provided results on the spectral structure of the adjacency matrix and the normalized graph Laplacian in stochastic block models which appear to be new. The second strand of the research is to investigate numerically the relationship between the topological summaries computed under the full sample and under subsamples. Subsampling often needs to be considered because existing computational algorithms for TDA tend to break down for larger sample sizes as computational demands grow rapidly with sample size. One important finding is that subsampling which exploits existing structure in the data is likely to do much better than purely random subsampling. In this PhD thesis, numerical results are given for various types of simulated data through to real datasets. 2021-12-08 Thesis (University of Nottingham only) NonPeerReviewed application/pdf en cc_by https://eprints.nottingham.ac.uk/66766/1/Thesis%204267423%20Yang%20DI%2020210930.pdf Di, Yang (2021) Some Topics in Topological Data Analysis. PhD thesis, University of Nottingham. Topology Topological Data Analysis Persistent Homology
spellingShingle Topology
Topological Data Analysis
Persistent Homology
Di, Yang
Some Topics in Topological Data Analysis
title Some Topics in Topological Data Analysis
title_full Some Topics in Topological Data Analysis
title_fullStr Some Topics in Topological Data Analysis
title_full_unstemmed Some Topics in Topological Data Analysis
title_short Some Topics in Topological Data Analysis
title_sort some topics in topological data analysis
topic Topology
Topological Data Analysis
Persistent Homology
url https://eprints.nottingham.ac.uk/66766/