Generating vague geographic information through data mining of passive web data

Vagueness is an inherent property of geographic data. This thesis develops a geocomputational method that demonstrates that vague information has the potential to be incorporated within GIS in a straightforward manner. This method applies vagueness to the elements of place: types, names and spatial bo...

Bibliographic Details
Main Author: Brindley, Paul
Format: Thesis (University of Nottingham only)
Language: English
Published: 2016
Subjects: data mining; geographic information systems; GIS; passive web data; vagueness
Online Access: https://eprints.nottingham.ac.uk/33722/
_version_ 1848794689603371008
author Brindley, Paul
author_facet Brindley, Paul
author_sort Brindley, Paul
building Nottingham Research Data Repository
collection Online Access
description Vagueness is an inherent property of geographic data. This thesis develops a geocomputational method that demonstrates that vague information has the potential to be incorporated within GIS in a straightforward manner. This method applies vagueness to the elements of place: types, names and spatial boundaries, generating vague geographic objects by extracting and filtering the differing opinions and perceptions held within web-derived data. The aim of the research is threefold: (1) to investigate an approach to automatically generate vague, probabilistic geographical information concerning place by mining differing perspectives from passive web data; (2) to assure the quality of the vague information produced and test the hypothesis that its results are indistinguishable from directly surveying public opinion; and (3) to demonstrate the value of integrating vague information into geospatial applications via examples of its use. To achieve the first aim, the thesis develops methods to extract differing perspectives of place from web data - constructing (i) vague place type settlement classification and (ii) vague place names and boundaries for ‘neighbourhood’ level units. The methods developed are automated, suitable for generating output at a national scale, and use a wide range of source data to collect the differing opinions. The second aim assesses the quality of the data produced, determining whether output extracted from the web was representative of that obtained from asking people directly. Statistical analysis of regression models demonstrates that the data were representative of those collected through asking people directly, both for vague settlement classifications and for vague urban locale boundaries. Importantly, the validation data, drawn from public opinion, also supported the notion that vagueness was omnipresent within geographic information concerning place. The third aim was addressed through case studies that demonstrate the added value of such data and the subsequent integration of vague geographic objects with other socio-economic data. Critically, the incorporation of vagueness within place models not only adds value to geographic data but also improves the accuracy of real-world representations within GIS.
first_indexed 2025-11-14T19:20:11Z
format Thesis (University of Nottingham only)
id nottingham-33722
institution University of Nottingham Malaysia Campus
institution_category Local University
language English
last_indexed 2025-11-14T19:20:11Z
publishDate 2016
recordtype eprints
repository_type Digital Repository
spelling nottingham-33722 2025-02-28T13:29:14Z https://eprints.nottingham.ac.uk/33722/ Generating vague geographic information through data mining of passive web data Brindley, Paul 2016-07-15 Thesis (University of Nottingham only) NonPeerReviewed application/pdf en arr https://eprints.nottingham.ac.uk/33722/1/brindley_submitted_thesis.pdf Brindley, Paul (2016) Generating vague geographic information through data mining of passive web data. PhD thesis, University of Nottingham. data mining geographic information systems gis passive web data vagueness
spellingShingle data mining
geographic information systems
gis
passive web data
vagueness
Brindley, Paul
Generating vague geographic information through data mining of passive web data
title Generating vague geographic information through data mining of passive web data
topic data mining
geographic information systems
gis
passive web data
vagueness
url https://eprints.nottingham.ac.uk/33722/