Crop classification by a support vector machine with intelligently selected training data for an operational application

The accuracy of supervised classification is dependent to a large extent on the training data used. The aim is often to capture a large training set to fully describe the classes spectrally, commonly with the requirements of a conventional statistical classifier in-mind. However, it is not always ne...

Full description

Bibliographic Details
Main Authors: Mathur, Ajay, Foody, Giles M.
Format: Article
Published: Taylor & Francis 2008
Online Access:https://eprints.nottingham.ac.uk/1996/
_version_ 1848790700856967168
author Mathur, Ajay
Foody, Giles M.
author_facet Mathur, Ajay
Foody, Giles M.
author_sort Mathur, Ajay
building Nottingham Research Data Repository
collection Online Access
description The accuracy of supervised classification is dependent to a large extent on the training data used. The aim is often to capture a large training set to fully describe the classes spectrally, commonly with the requirements of a conventional statistical classifier in-mind. However, it is not always necessary to provide a complete description of the classes, especially if using a support vector machine (SVM) as the classifier. A SVM seeks to fit an optimal hyperplane between the classes and uses only some of the training samples that lie at the edge of the class distributions in feature space (support vectors). This should allow the definition of the most informative training samples prior to the analysis. An approach to identify informative training samples was demonstrated for the classification of agricultural classes in south-western part of Punjab state, India. A small, intelligently selected, training data set was acquired in the field with the aid of ancillary information. This data set contained the data from training sites that were predicted before the classification to be amongst the most informative for a SVM classification. The intelligent training collection scheme yielded a classification of comparable accuracy, ~91%, to one derived using a larger training set acquired by a conventional approach. Moreover, from inspection of the training sets it was apparent that the intelligently defined training set contained a greater proportion of support vectors (0.70), useful training sites, than that acquired by the conventional approach (0.41). By focusing on the most informative training samples, the intelligent scheme required less investment in training than the conventional approach and its adoption would have reduced total financial outlay in classification production and evaluation by ~26%. Additionally, the analysis highlighted the possibility to further reduce the training set size without any significant negative impact on classification accuracy.
first_indexed 2025-11-14T18:16:47Z
format Article
id nottingham-1996
institution University of Nottingham Malaysia Campus
institution_category Local University
last_indexed 2025-11-14T18:16:47Z
publishDate 2008
publisher Taylor & Francis
recordtype eprints
repository_type Digital Repository
spelling nottingham-19962020-05-04T20:27:50Z https://eprints.nottingham.ac.uk/1996/ Crop classification by a support vector machine with intelligently selected training data for an operational application Mathur, Ajay Foody, Giles M. The accuracy of supervised classification is dependent to a large extent on the training data used. The aim is often to capture a large training set to fully describe the classes spectrally, commonly with the requirements of a conventional statistical classifier in-mind. However, it is not always necessary to provide a complete description of the classes, especially if using a support vector machine (SVM) as the classifier. A SVM seeks to fit an optimal hyperplane between the classes and uses only some of the training samples that lie at the edge of the class distributions in feature space (support vectors). This should allow the definition of the most informative training samples prior to the analysis. An approach to identify informative training samples was demonstrated for the classification of agricultural classes in south-western part of Punjab state, India. A small, intelligently selected, training data set was acquired in the field with the aid of ancillary information. This data set contained the data from training sites that were predicted before the classification to be amongst the most informative for a SVM classification. The intelligent training collection scheme yielded a classification of comparable accuracy, ~91%, to one derived using a larger training set acquired by a conventional approach. Moreover, from inspection of the training sets it was apparent that the intelligently defined training set contained a greater proportion of support vectors (0.70), useful training sites, than that acquired by the conventional approach (0.41). By focusing on the most informative training samples, the intelligent scheme required less investment in training than the conventional approach and its adoption would have reduced total financial outlay in classification production and evaluation by ~26%. Additionally, the analysis highlighted the possibility to further reduce the training set size without any significant negative impact on classification accuracy. Taylor & Francis 2008 Article PeerReviewed Mathur, Ajay and Foody, Giles M. (2008) Crop classification by a support vector machine with intelligently selected training data for an operational application. International Journal of Remote Sensing, 29 (8). pp. 2227-2240. ISSN 0143-1161 http://www.tandfonline.com/doi/full/10.1080/01431160701395203 doi:10.1080/01431160701395203 doi:10.1080/01431160701395203
spellingShingle Mathur, Ajay
Foody, Giles M.
Crop classification by a support vector machine with intelligently selected training data for an operational application
title Crop classification by a support vector machine with intelligently selected training data for an operational application
title_full Crop classification by a support vector machine with intelligently selected training data for an operational application
title_fullStr Crop classification by a support vector machine with intelligently selected training data for an operational application
title_full_unstemmed Crop classification by a support vector machine with intelligently selected training data for an operational application
title_short Crop classification by a support vector machine with intelligently selected training data for an operational application
title_sort crop classification by a support vector machine with intelligently selected training data for an operational application
url https://eprints.nottingham.ac.uk/1996/
https://eprints.nottingham.ac.uk/1996/
https://eprints.nottingham.ac.uk/1996/