Novel guidelines for the analysis of single nucleotide polymorphisms in disease association studies
How genetic mutations such as Single Nucleotide Polymorphisms (SNPs) affect the risk of contracting a specific disease is still an open question for numerous different medical conditions. Two problems related to SNPs analysis are (i) the selection of computational techniques to discover possible sin...
| Main Author: | |
|---|---|
| Format: | Thesis (University of Nottingham only) |
| Language: | English |
| Published: |
2011
|
| Online Access: | https://eprints.nottingham.ac.uk/11808/ |
| _version_ | 1848791364087578624 |
|---|---|
| author | Fiaschi, Linda |
| author_facet | Fiaschi, Linda |
| author_sort | Fiaschi, Linda |
| building | Nottingham Research Data Repository |
| collection | Online Access |
| description | How genetic mutations such as Single Nucleotide Polymorphisms (SNPs) affect the risk of contracting a specific disease is still an open question for numerous different medical conditions. Two problems related to SNPs analysis are (i) the selection of computational techniques to discover possible single and multiple SNP associations; and (ii) the size of the latest datasets, which may contain millions of SNPs.
In order to find associations between SNPs and diseases, two popular techniques are investigated and enhanced. Firstly, the ‘Transmission Disequilibrium Test’ for familybased analysis is considered. The fixed length of haplotypes provided by this approach represents a possible limit to the quality of the obtained results. For this reason, an adaptation is proposed to select the minimum number of SNPs that are responsible for disease predisposition. Secondly, decision tree algorithms for case-control analysis in situations of unrelated individuals are considered. The application of a single tool may lead to limited analysis of the genetic association to a specific condition. Thus, a novel consensus approach is proposed exploiting the strengths of three different algorithms, ADTree, C4.5 and Id3. Results obtained suggest the new approach achieves improved performance.
The recent explosive growth in size of current SNPs databases has highlighted limitations in current techniques. An example is ‘Linkage Disequilibrium’ which identifies redundancy in multiple SNPs. Despite the high accuracies obtained by this method, it exhibits poor scalability for large datasets, which severely impacts on its performance. Therefore, a new fast scalable tool based on ‘Linkage Disequilibrium’ is developed to reduce the size through the measurement and elimination of redundancy between SNPs included in the initial dataset. Experimental evidence validates the potentially improved performance of the new method. |
| first_indexed | 2025-11-14T18:27:20Z |
| format | Thesis (University of Nottingham only) |
| id | nottingham-11808 |
| institution | University of Nottingham Malaysia Campus |
| institution_category | Local University |
| language | English |
| last_indexed | 2025-11-14T18:27:20Z |
| publishDate | 2011 |
| recordtype | eprints |
| repository_type | Digital Repository |
| spelling | nottingham-118082025-02-28T11:15:43Z https://eprints.nottingham.ac.uk/11808/ Novel guidelines for the analysis of single nucleotide polymorphisms in disease association studies Fiaschi, Linda How genetic mutations such as Single Nucleotide Polymorphisms (SNPs) affect the risk of contracting a specific disease is still an open question for numerous different medical conditions. Two problems related to SNPs analysis are (i) the selection of computational techniques to discover possible single and multiple SNP associations; and (ii) the size of the latest datasets, which may contain millions of SNPs. In order to find associations between SNPs and diseases, two popular techniques are investigated and enhanced. Firstly, the ‘Transmission Disequilibrium Test’ for familybased analysis is considered. The fixed length of haplotypes provided by this approach represents a possible limit to the quality of the obtained results. For this reason, an adaptation is proposed to select the minimum number of SNPs that are responsible for disease predisposition. Secondly, decision tree algorithms for case-control analysis in situations of unrelated individuals are considered. The application of a single tool may lead to limited analysis of the genetic association to a specific condition. Thus, a novel consensus approach is proposed exploiting the strengths of three different algorithms, ADTree, C4.5 and Id3. Results obtained suggest the new approach achieves improved performance. The recent explosive growth in size of current SNPs databases has highlighted limitations in current techniques. An example is ‘Linkage Disequilibrium’ which identifies redundancy in multiple SNPs. Despite the high accuracies obtained by this method, it exhibits poor scalability for large datasets, which severely impacts on its performance. Therefore, a new fast scalable tool based on ‘Linkage Disequilibrium’ is developed to reduce the size through the measurement and elimination of redundancy between SNPs included in the initial dataset. Experimental evidence validates the potentially improved performance of the new method. 2011-07-13 Thesis (University of Nottingham only) NonPeerReviewed application/pdf en arr https://eprints.nottingham.ac.uk/11808/1/ethesis.pdf Fiaschi, Linda (2011) Novel guidelines for the analysis of single nucleotide polymorphisms in disease association studies. PhD thesis, University of Nottingham. |
| spellingShingle | Fiaschi, Linda Novel guidelines for the analysis of single nucleotide polymorphisms in disease association studies |
| title | Novel guidelines for the analysis of single nucleotide polymorphisms in disease association studies |
| title_full | Novel guidelines for the analysis of single nucleotide polymorphisms in disease association studies |
| title_fullStr | Novel guidelines for the analysis of single nucleotide polymorphisms in disease association studies |
| title_full_unstemmed | Novel guidelines for the analysis of single nucleotide polymorphisms in disease association studies |
| title_short | Novel guidelines for the analysis of single nucleotide polymorphisms in disease association studies |
| title_sort | novel guidelines for the analysis of single nucleotide polymorphisms in disease association studies |
| url | https://eprints.nottingham.ac.uk/11808/ |