Combining semi-automated image analysis techniques with machine learning algorithms to accelerate large scale genetic studies

Background: Genetic analyses of plant root system development require large datasets of extracted architectural traits. To quantify such traits from images of root systems, researchers often have to choose between automated tools (that are prone to error and extract only a limited number of architec...

Full description

Bibliographic Details
Main Authors: Atkinson, Jonathan A., Lobet, Guillaume, Noll, Manuel, Meyer, Patrick E., Griffiths, Marcus, Wells, Darren M.
Format: Other
Published: bioRxiv 2017
Subjects:
Online Access:https://eprints.nottingham.ac.uk/44900/
_version_ 1848797024025051136
author Atkinson, Jonathan A.
Lobet, Guillaume
Noll, Manuel
Meyer, Patrick E.
Griffiths, Marcus
Wells, Darren M.
author_facet Atkinson, Jonathan A.
Lobet, Guillaume
Noll, Manuel
Meyer, Patrick E.
Griffiths, Marcus
Wells, Darren M.
author_sort Atkinson, Jonathan A.
building Nottingham Research Data Repository
collection Online Access
description Background: Genetic analyses of plant root system development require large datasets of extracted architectural traits. To quantify such traits from images of root systems, researchers often have to choose between automated tools (that are prone to error and extract only a limited number of architectural traits) or semi-automated ones (that are highly time consuming). Findings: We trained a Random Forest algorithm to infer architectural traits from automatically-extracted image descriptors. The training was performed on a subset of the dataset, then applied to its entirety. This strategy allowed us to (i) decrease the image analysis time by 73% and (ii) extract meaningful architectural traits based on image descriptors. We also show that these traits are sufficient to identify Quantitative Trait Loci that had previously been discovered using a semi-automated method. Conclusions: We have shown that combining semi-automated image analysis with machine learning algorithms has the power to increase the throughput in large scale root studies. We expect that such an approach will enable the quantification of more complex root systems for genetic studies. We also believe that our approach could be extended to other area of plant phenotyping.
first_indexed 2025-11-14T19:57:18Z
format Other
id nottingham-44900
institution University of Nottingham Malaysia Campus
institution_category Local University
last_indexed 2025-11-14T19:57:18Z
publishDate 2017
publisher bioRxiv
recordtype eprints
repository_type Digital Repository
spelling nottingham-449002020-05-04T18:50:59Z https://eprints.nottingham.ac.uk/44900/ Combining semi-automated image analysis techniques with machine learning algorithms to accelerate large scale genetic studies Atkinson, Jonathan A. Lobet, Guillaume Noll, Manuel Meyer, Patrick E. Griffiths, Marcus Wells, Darren M. Background: Genetic analyses of plant root system development require large datasets of extracted architectural traits. To quantify such traits from images of root systems, researchers often have to choose between automated tools (that are prone to error and extract only a limited number of architectural traits) or semi-automated ones (that are highly time consuming). Findings: We trained a Random Forest algorithm to infer architectural traits from automatically-extracted image descriptors. The training was performed on a subset of the dataset, then applied to its entirety. This strategy allowed us to (i) decrease the image analysis time by 73% and (ii) extract meaningful architectural traits based on image descriptors. We also show that these traits are sufficient to identify Quantitative Trait Loci that had previously been discovered using a semi-automated method. Conclusions: We have shown that combining semi-automated image analysis with machine learning algorithms has the power to increase the throughput in large scale root studies. We expect that such an approach will enable the quantification of more complex root systems for genetic studies. We also believe that our approach could be extended to other area of plant phenotyping. bioRxiv 2017-06-20 Other NonPeerReviewed Atkinson, Jonathan A., Lobet, Guillaume, Noll, Manuel, Meyer, Patrick E., Griffiths, Marcus and Wells, Darren M. (2017) Combining semi-automated image analysis techniques with machine learning algorithms to accelerate large scale genetic studies. bioRxiv, Cold Spring Harbor, N.Y., USA. Root plant phenotyping machine learning qtl analysis http://www.biorxiv.org/content/early/2017/06/20/152702 doi:10.1101/152702 doi:10.1101/152702
spellingShingle Root
plant phenotyping
machine learning
qtl analysis
Atkinson, Jonathan A.
Lobet, Guillaume
Noll, Manuel
Meyer, Patrick E.
Griffiths, Marcus
Wells, Darren M.
Combining semi-automated image analysis techniques with machine learning algorithms to accelerate large scale genetic studies
title Combining semi-automated image analysis techniques with machine learning algorithms to accelerate large scale genetic studies
title_full Combining semi-automated image analysis techniques with machine learning algorithms to accelerate large scale genetic studies
title_fullStr Combining semi-automated image analysis techniques with machine learning algorithms to accelerate large scale genetic studies
title_full_unstemmed Combining semi-automated image analysis techniques with machine learning algorithms to accelerate large scale genetic studies
title_short Combining semi-automated image analysis techniques with machine learning algorithms to accelerate large scale genetic studies
title_sort combining semi-automated image analysis techniques with machine learning algorithms to accelerate large scale genetic studies
topic Root
plant phenotyping
machine learning
qtl analysis
url https://eprints.nottingham.ac.uk/44900/
https://eprints.nottingham.ac.uk/44900/
https://eprints.nottingham.ac.uk/44900/