Improving specific class mapping from remotely sensed data by cost-sensitive learning

In many remote-sensing projects, one is usually interested in a small number of land-cover classes present in a study area and not in all the land-cover classes that make-up the landscape. Previous studies in supervised classification of satellite images have tackled specific class mapping problem b...

Full description

Bibliographic Details
Main Authors: Silva, Joel, Bacao, Fernando, Dieng, Maguette, Foody, Giles M., Caetano, Mario
Format: Article
Published: Taylor & Francis 2017
Subjects:
Online Access:https://eprints.nottingham.ac.uk/41520/
_version_ 1848796293411897344
author Silva, Joel
Bacao, Fernando
Dieng, Maguette
Foody, Giles M.
Caetano, Mario
author_facet Silva, Joel
Bacao, Fernando
Dieng, Maguette
Foody, Giles M.
Caetano, Mario
author_sort Silva, Joel
building Nottingham Research Data Repository
collection Online Access
description In many remote-sensing projects, one is usually interested in a small number of land-cover classes present in a study area and not in all the land-cover classes that make-up the landscape. Previous studies in supervised classification of satellite images have tackled specific class mapping problem by isolating the classes of interest and combining all other classes into one large class, usually called others, and by developing a binary classifier to discriminate the class of interest from the others. Here, this approach is called focused approach. The strength of the focused approach is to decompose the original multi-class supervised classification problem into a binary classification problem, focusing the process on the discrimination of the class of interest. Previous studies have shown that this method is able to discriminate more accurately the classes of interest when compared with the standard multi-class supervised approach. However, it may be susceptible to data imbalance problems present in the training data set, since the classes of interest are often a small part of the training set. A result the classification may be biased towards the largest classes and, thus, be sub-optimal for the discrimination of the classes of interest. This study presents a way to minimize the effects of data imbalance problems in specific class mapping using cost-sensitive learning. In this approach errors committed in the minority class are treated as being costlier than errors committed in the majority class. Cost-sensitive approaches are typically implemented by weighting training data points accordingly to their importance to the analysis. By changing the weight of individual data points, it is possible to shift the weight from the larger classes to the smaller ones, balancing the data set. To illustrate the use of the cost-sensitive approach to map specific classes of interest, a series of experiments with weighted support vector machines classifier and Landsat Thematic Mapper data were conducted to discriminate two types of mangrove forest (high-mangrove and low-mangrove) in Saloum estuary, Senegal, a United Nations Educational, Scientific and Cultural Organisation World Heritage site. Results suggest an increase in overall classification accuracy with the use of cost-sensitive method (97.3%) over the standard multi-class (94.3%) and the focused approach (91.0%). In particular, cost-sensitive method yielded higher sensitivity and specificity values on the discrimination of the classes of interest when compared with the standard multi-class and focused approaches.
first_indexed 2025-11-14T19:45:41Z
format Article
id nottingham-41520
institution University of Nottingham Malaysia Campus
institution_category Local University
last_indexed 2025-11-14T19:45:41Z
publishDate 2017
publisher Taylor & Francis
recordtype eprints
repository_type Digital Repository
spelling nottingham-415202020-05-04T18:38:24Z https://eprints.nottingham.ac.uk/41520/ Improving specific class mapping from remotely sensed data by cost-sensitive learning Silva, Joel Bacao, Fernando Dieng, Maguette Foody, Giles M. Caetano, Mario In many remote-sensing projects, one is usually interested in a small number of land-cover classes present in a study area and not in all the land-cover classes that make-up the landscape. Previous studies in supervised classification of satellite images have tackled specific class mapping problem by isolating the classes of interest and combining all other classes into one large class, usually called others, and by developing a binary classifier to discriminate the class of interest from the others. Here, this approach is called focused approach. The strength of the focused approach is to decompose the original multi-class supervised classification problem into a binary classification problem, focusing the process on the discrimination of the class of interest. Previous studies have shown that this method is able to discriminate more accurately the classes of interest when compared with the standard multi-class supervised approach. However, it may be susceptible to data imbalance problems present in the training data set, since the classes of interest are often a small part of the training set. A result the classification may be biased towards the largest classes and, thus, be sub-optimal for the discrimination of the classes of interest. This study presents a way to minimize the effects of data imbalance problems in specific class mapping using cost-sensitive learning. In this approach errors committed in the minority class are treated as being costlier than errors committed in the majority class. Cost-sensitive approaches are typically implemented by weighting training data points accordingly to their importance to the analysis. By changing the weight of individual data points, it is possible to shift the weight from the larger classes to the smaller ones, balancing the data set. To illustrate the use of the cost-sensitive approach to map specific classes of interest, a series of experiments with weighted support vector machines classifier and Landsat Thematic Mapper data were conducted to discriminate two types of mangrove forest (high-mangrove and low-mangrove) in Saloum estuary, Senegal, a United Nations Educational, Scientific and Cultural Organisation World Heritage site. Results suggest an increase in overall classification accuracy with the use of cost-sensitive method (97.3%) over the standard multi-class (94.3%) and the focused approach (91.0%). In particular, cost-sensitive method yielded higher sensitivity and specificity values on the discrimination of the classes of interest when compared with the standard multi-class and focused approaches. Taylor & Francis 2017-03-21 Article PeerReviewed Silva, Joel, Bacao, Fernando, Dieng, Maguette, Foody, Giles M. and Caetano, Mario (2017) Improving specific class mapping from remotely sensed data by cost-sensitive learning. International Journal of Remote Sensing, 38 (11). pp. 3294-3316. ISSN 0143-1161 Support vector machines; land cover mapping; specific class mapping; remote sensing; Landsat; cost-sensitive learning http://www.tandfonline.com/doi/full/10.1080/01431161.2017.1292073 doi:10.1080/01431161.2017.1292073 doi:10.1080/01431161.2017.1292073
spellingShingle Support vector machines; land cover mapping; specific class mapping; remote sensing; Landsat; cost-sensitive learning
Silva, Joel
Bacao, Fernando
Dieng, Maguette
Foody, Giles M.
Caetano, Mario
Improving specific class mapping from remotely sensed data by cost-sensitive learning
title Improving specific class mapping from remotely sensed data by cost-sensitive learning
title_full Improving specific class mapping from remotely sensed data by cost-sensitive learning
title_fullStr Improving specific class mapping from remotely sensed data by cost-sensitive learning
title_full_unstemmed Improving specific class mapping from remotely sensed data by cost-sensitive learning
title_short Improving specific class mapping from remotely sensed data by cost-sensitive learning
title_sort improving specific class mapping from remotely sensed data by cost-sensitive learning
topic Support vector machines; land cover mapping; specific class mapping; remote sensing; Landsat; cost-sensitive learning
url https://eprints.nottingham.ac.uk/41520/
https://eprints.nottingham.ac.uk/41520/
https://eprints.nottingham.ac.uk/41520/