Some Topics in Topological Data Analysis
In recent years there has been growing interest within Statistics in topological aspects of random objects, one important direction being Topological Data Analysis (TDA) and the associated concept of Persistent Homology. This research aims to investigate both theoretical and computational aspects of...
| Main Author: | |
|---|---|
| Format: | Thesis (University of Nottingham only) |
| Language: | English |
| Published: |
2021
|
| Subjects: | |
| Online Access: | https://eprints.nottingham.ac.uk/66766/ |
| _version_ | 1848800356707860480 |
|---|---|
| author | Di, Yang |
| author_facet | Di, Yang |
| author_sort | Di, Yang |
| building | Nottingham Research Data Repository |
| collection | Online Access |
| description | In recent years there has been growing interest within Statistics in topological aspects of random objects, one important direction being Topological Data Analysis (TDA) and the associated concept of Persistent Homology. This research aims to investigate both theoretical and computational aspects of TDA. In the first strand of this research the aim is to generalize the central limit theorem (CLT) given by Kahle and Meckes (2013, 2015) for Betti numbers in Erdős–Rényi random graphs, to a CLT for Betti numbers in the stochastic block model. In addressing this problem, we have provided results on the spectral structure of the adjacency matrix and the normalized graph Laplacian in stochastic block models which appear to be new. The second strand of the research is to investigate numerically the relationship between the topological summaries computed under the full sample and under subsamples. Subsampling often needs to be considered because existing computational algorithms for TDA tend to break down for larger sample sizes as computational demands grow rapidly with sample size. One important finding is that subsampling which exploits existing structure in the data is likely to do much better than purely random subsampling. In this PhD thesis, numerical results are given for various types of simulated data through to real datasets. |
| first_indexed | 2025-11-14T20:50:16Z |
| format | Thesis (University of Nottingham only) |
| id | nottingham-66766 |
| institution | University of Nottingham Malaysia Campus |
| institution_category | Local University |
| language | English |
| last_indexed | 2025-11-14T20:50:16Z |
| publishDate | 2021 |
| recordtype | eprints |
| repository_type | Digital Repository |
| spelling | nottingham-667662024-01-16T16:08:32Z https://eprints.nottingham.ac.uk/66766/ Some Topics in Topological Data Analysis Di, Yang In recent years there has been growing interest within Statistics in topological aspects of random objects, one important direction being Topological Data Analysis (TDA) and the associated concept of Persistent Homology. This research aims to investigate both theoretical and computational aspects of TDA. In the first strand of this research the aim is to generalize the central limit theorem (CLT) given by Kahle and Meckes (2013, 2015) for Betti numbers in Erdős–Rényi random graphs, to a CLT for Betti numbers in the stochastic block model. In addressing this problem, we have provided results on the spectral structure of the adjacency matrix and the normalized graph Laplacian in stochastic block models which appear to be new. The second strand of the research is to investigate numerically the relationship between the topological summaries computed under the full sample and under subsamples. Subsampling often needs to be considered because existing computational algorithms for TDA tend to break down for larger sample sizes as computational demands grow rapidly with sample size. One important finding is that subsampling which exploits existing structure in the data is likely to do much better than purely random subsampling. In this PhD thesis, numerical results are given for various types of simulated data through to real datasets. 2021-12-08 Thesis (University of Nottingham only) NonPeerReviewed application/pdf en cc_by https://eprints.nottingham.ac.uk/66766/1/Thesis%204267423%20Yang%20DI%2020210930.pdf Di, Yang (2021) Some Topics in Topological Data Analysis. PhD thesis, University of Nottingham. Topology Topological Data Analysis Persistent Homology |
| spellingShingle | Topology Topological Data Analysis Persistent Homology Di, Yang Some Topics in Topological Data Analysis |
| title | Some Topics in Topological Data Analysis |
| title_full | Some Topics in Topological Data Analysis |
| title_fullStr | Some Topics in Topological Data Analysis |
| title_full_unstemmed | Some Topics in Topological Data Analysis |
| title_short | Some Topics in Topological Data Analysis |
| title_sort | some topics in topological data analysis |
| topic | Topology Topological Data Analysis Persistent Homology |
| url | https://eprints.nottingham.ac.uk/66766/ |