A probabilistic model for the evaluation of module extraction algorithms in complex biological networks

This thesis presents CiGRAM, a model of complex networks ith known modular structure that is capable of generating realistic graph topology. Much of the recent focus on module detection has been geared towards developing new algorithms capable of detecting biologically significant clusters. However...

Full description

Bibliographic Details
Main Author: Gilbert, J.P.
Format: Thesis (University of Nottingham only)
Language:English
Published: 2015
Subjects:
Online Access:https://eprints.nottingham.ac.uk/30524/
_version_ 1848794001903190016
author Gilbert, J.P.
author_facet Gilbert, J.P.
author_sort Gilbert, J.P.
building Nottingham Research Data Repository
collection Online Access
description This thesis presents CiGRAM, a model of complex networks ith known modular structure that is capable of generating realistic graph topology. Much of the recent focus on module detection has been geared towards developing new algorithms capable of detecting biologically significant clusters. However, evaluating clusterings detected by different methods shows that there is little topological agreement or consensus in terms of meta-data despite most methods discovering modules with significant ontology. In this thesis an approach to modelling complex networks with ground-truth modular structure is presented. This approach is capable of generating graphs with heterogeneous degree distributions, high clustering coefficients and assortative degree correlations observed in real data but often ignored in existing benchmarks. Moreover, the model for modular structure concludes that non-modular random graphs are indistinguishable from modules. This model can be tuned to fit many empirical biological and non-biological datasets through fitting target graph summary statistics. The ground-truth structure allows the evaluation of module extraction algorithms in a domain specific context. Furthermore, it was found that degree assortativity appears to negatively impact several module extraction methods such as the popular infomap and modularity maximisation methods. Results presented disagree with other benchmark models highlighting the potential for future research into improving existing methods in ways that challenge assumptions about the detectability of modules.
first_indexed 2025-11-14T19:09:15Z
format Thesis (University of Nottingham only)
id nottingham-30524
institution University of Nottingham Malaysia Campus
institution_category Local University
language English
last_indexed 2025-11-14T19:09:15Z
publishDate 2015
recordtype eprints
repository_type Digital Repository
spelling nottingham-305242017-10-13T07:15:35Z https://eprints.nottingham.ac.uk/30524/ A probabilistic model for the evaluation of module extraction algorithms in complex biological networks Gilbert, J.P. This thesis presents CiGRAM, a model of complex networks ith known modular structure that is capable of generating realistic graph topology. Much of the recent focus on module detection has been geared towards developing new algorithms capable of detecting biologically significant clusters. However, evaluating clusterings detected by different methods shows that there is little topological agreement or consensus in terms of meta-data despite most methods discovering modules with significant ontology. In this thesis an approach to modelling complex networks with ground-truth modular structure is presented. This approach is capable of generating graphs with heterogeneous degree distributions, high clustering coefficients and assortative degree correlations observed in real data but often ignored in existing benchmarks. Moreover, the model for modular structure concludes that non-modular random graphs are indistinguishable from modules. This model can be tuned to fit many empirical biological and non-biological datasets through fitting target graph summary statistics. The ground-truth structure allows the evaluation of module extraction algorithms in a domain specific context. Furthermore, it was found that degree assortativity appears to negatively impact several module extraction methods such as the popular infomap and modularity maximisation methods. Results presented disagree with other benchmark models highlighting the potential for future research into improving existing methods in ways that challenge assumptions about the detectability of modules. 2015-12-10 Thesis (University of Nottingham only) NonPeerReviewed application/pdf en cc_by_nc_sa https://eprints.nottingham.ac.uk/30524/1/thesis.pdf Gilbert, J.P. (2015) A probabilistic model for the evaluation of module extraction algorithms in complex biological networks. PhD thesis, University of Nottingham. Complex networks community detection module detection biological networks topological models graph theory
spellingShingle Complex networks
community detection
module detection
biological networks
topological models
graph theory
Gilbert, J.P.
A probabilistic model for the evaluation of module extraction algorithms in complex biological networks
title A probabilistic model for the evaluation of module extraction algorithms in complex biological networks
title_full A probabilistic model for the evaluation of module extraction algorithms in complex biological networks
title_fullStr A probabilistic model for the evaluation of module extraction algorithms in complex biological networks
title_full_unstemmed A probabilistic model for the evaluation of module extraction algorithms in complex biological networks
title_short A probabilistic model for the evaluation of module extraction algorithms in complex biological networks
title_sort probabilistic model for the evaluation of module extraction algorithms in complex biological networks
topic Complex networks
community detection
module detection
biological networks
topological models
graph theory
url https://eprints.nottingham.ac.uk/30524/