Evaluation of methods and marker systems in genomic selection of oil palm (Elaeis guineensis Jacq.)

Background Genomic selection (GS) uses genome-wide markers as an attempt to accelerate genetic gain in breeding programs of both animals and plants. This approach is particularly useful for perennial crops such as oil palm, which have long breeding cycles, and for which the optimal method for GS...

Full description

Bibliographic Details
Main Authors: Kwong, Qi Bin, Teh, Chee Keng, Ong, Ai Ling, Chew, Fook Tim, Mayes, Sean, Kulaveerasingam, Harikrishna, Tammi, Martti, Yeoh, Suat Hui, Appleton, David Ross, Harikrishna, Jennifer Ann
Format: Article
Published: BioMed Central 2017
Subjects:
Online Access:https://eprints.nottingham.ac.uk/48972/
_version_ 1848797892015292416
author Kwong, Qi Bin
Teh, Chee Keng
Ong, Ai Ling
Chew, Fook Tim
Mayes, Sean
Kulaveerasingam, Harikrishna
Tammi, Martti
Yeoh, Suat Hui
Appleton, David Ross
Harikrishna, Jennifer Ann
author_facet Kwong, Qi Bin
Teh, Chee Keng
Ong, Ai Ling
Chew, Fook Tim
Mayes, Sean
Kulaveerasingam, Harikrishna
Tammi, Martti
Yeoh, Suat Hui
Appleton, David Ross
Harikrishna, Jennifer Ann
author_sort Kwong, Qi Bin
building Nottingham Research Data Repository
collection Online Access
description Background Genomic selection (GS) uses genome-wide markers as an attempt to accelerate genetic gain in breeding programs of both animals and plants. This approach is particularly useful for perennial crops such as oil palm, which have long breeding cycles, and for which the optimal method for GS is still under debate. In this study, we evaluated the effect of different marker systems and modeling methods for implementing GS in an introgressed dura family derived from a Deli dura x Nigerian dura (Deli x Nigerian) with 112 individuals. This family is an important breeding source for developing new mother palms for superior oil yield and bunch characters. The traits of interest selected for this study were fruit-to-bunch (F/B), shell-to-fruit (S/F), kernel-to-fruit (K/F), mesocarp-to-fruit (M/F), oil per palm (O/P) and oil-to-dry mesocarp (O/DM). The marker systems evaluated were simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs). RR-BLUP, Bayesian A, B, Cπ, LASSO, Ridge Regression and two machine learning methods (SVM and Random Forest) were used to evaluate GS accuracy of the traits. Results The kinship coefficient between individuals in this family ranged from 0.35 to 0.62. S/F and O/DM had the highest genomic heritability, whereas F/B and O/P had the lowest. The accuracies using 135 SSRs were low, with accuracies of the traits around 0.20. The average accuracy of machine learning methods was 0.24, as compared to 0.20 achieved by other methods. The trait with the highest mean accuracy was F/B (0.28), while the lowest were both M/F and O/P (0.18). By using whole genomic SNPs, the accuracies for all traits, especially for O/DM (0.43), S/F (0.39) and M/F (0.30) were improved. The average accuracy of machine learning methods was 0.32, compared to 0.31 achieved by other methods. Conclusion Due to high genomic resolution, the use of whole-genome SNPs improved the efficiency of GS dramatically for oil palm and is recommended for dura breeding programs. Machine learning slightly outperformed other methods, but required parameters optimization for GS implementation.
first_indexed 2025-11-14T20:11:05Z
format Article
id nottingham-48972
institution University of Nottingham Malaysia Campus
institution_category Local University
last_indexed 2025-11-14T20:11:05Z
publishDate 2017
publisher BioMed Central
recordtype eprints
repository_type Digital Repository
spelling nottingham-489722020-05-04T19:22:16Z https://eprints.nottingham.ac.uk/48972/ Evaluation of methods and marker systems in genomic selection of oil palm (Elaeis guineensis Jacq.) Kwong, Qi Bin Teh, Chee Keng Ong, Ai Ling Chew, Fook Tim Mayes, Sean Kulaveerasingam, Harikrishna Tammi, Martti Yeoh, Suat Hui Appleton, David Ross Harikrishna, Jennifer Ann Background Genomic selection (GS) uses genome-wide markers as an attempt to accelerate genetic gain in breeding programs of both animals and plants. This approach is particularly useful for perennial crops such as oil palm, which have long breeding cycles, and for which the optimal method for GS is still under debate. In this study, we evaluated the effect of different marker systems and modeling methods for implementing GS in an introgressed dura family derived from a Deli dura x Nigerian dura (Deli x Nigerian) with 112 individuals. This family is an important breeding source for developing new mother palms for superior oil yield and bunch characters. The traits of interest selected for this study were fruit-to-bunch (F/B), shell-to-fruit (S/F), kernel-to-fruit (K/F), mesocarp-to-fruit (M/F), oil per palm (O/P) and oil-to-dry mesocarp (O/DM). The marker systems evaluated were simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs). RR-BLUP, Bayesian A, B, Cπ, LASSO, Ridge Regression and two machine learning methods (SVM and Random Forest) were used to evaluate GS accuracy of the traits. Results The kinship coefficient between individuals in this family ranged from 0.35 to 0.62. S/F and O/DM had the highest genomic heritability, whereas F/B and O/P had the lowest. The accuracies using 135 SSRs were low, with accuracies of the traits around 0.20. The average accuracy of machine learning methods was 0.24, as compared to 0.20 achieved by other methods. The trait with the highest mean accuracy was F/B (0.28), while the lowest were both M/F and O/P (0.18). By using whole genomic SNPs, the accuracies for all traits, especially for O/DM (0.43), S/F (0.39) and M/F (0.30) were improved. The average accuracy of machine learning methods was 0.32, compared to 0.31 achieved by other methods. Conclusion Due to high genomic resolution, the use of whole-genome SNPs improved the efficiency of GS dramatically for oil palm and is recommended for dura breeding programs. Machine learning slightly outperformed other methods, but required parameters optimization for GS implementation. BioMed Central 2017-12-11 Article PeerReviewed Kwong, Qi Bin, Teh, Chee Keng, Ong, Ai Ling, Chew, Fook Tim, Mayes, Sean, Kulaveerasingam, Harikrishna, Tammi, Martti, Yeoh, Suat Hui, Appleton, David Ross and Harikrishna, Jennifer Ann (2017) Evaluation of methods and marker systems in genomic selection of oil palm (Elaeis guineensis Jacq.). BMC Genetics, 18 (1). ISSN 1471-2156 Genomic prediction Complex traits Machine learning Predictive modeling Marker-assisted selection SSR SNP Perennial crop https://bmcgenet.biomedcentral.com/articles/10.1186/s12863-017-0576-5 doi:10.1186/s12863-017-0576-5 doi:10.1186/s12863-017-0576-5
spellingShingle Genomic prediction
Complex traits
Machine learning
Predictive modeling
Marker-assisted selection
SSR
SNP
Perennial crop
Kwong, Qi Bin
Teh, Chee Keng
Ong, Ai Ling
Chew, Fook Tim
Mayes, Sean
Kulaveerasingam, Harikrishna
Tammi, Martti
Yeoh, Suat Hui
Appleton, David Ross
Harikrishna, Jennifer Ann
Evaluation of methods and marker systems in genomic selection of oil palm (Elaeis guineensis Jacq.)
title Evaluation of methods and marker systems in genomic selection of oil palm (Elaeis guineensis Jacq.)
title_full Evaluation of methods and marker systems in genomic selection of oil palm (Elaeis guineensis Jacq.)
title_fullStr Evaluation of methods and marker systems in genomic selection of oil palm (Elaeis guineensis Jacq.)
title_full_unstemmed Evaluation of methods and marker systems in genomic selection of oil palm (Elaeis guineensis Jacq.)
title_short Evaluation of methods and marker systems in genomic selection of oil palm (Elaeis guineensis Jacq.)
title_sort evaluation of methods and marker systems in genomic selection of oil palm (elaeis guineensis jacq.)
topic Genomic prediction
Complex traits
Machine learning
Predictive modeling
Marker-assisted selection
SSR
SNP
Perennial crop
url https://eprints.nottingham.ac.uk/48972/
https://eprints.nottingham.ac.uk/48972/
https://eprints.nottingham.ac.uk/48972/