Large-scale date palm tree segmentation from multiscale UAV-based and aerial images using deep vision transformers

The reliable and efficient large-scale mapping of date palm trees from remotely sensed data is crucial for developing palm tree inventories, continuous monitoring, vulnerability assessments, environmental control, and long-term management. Given the increasing availability of UAV images with limited...

Full description

Bibliographic Details
Main Authors: Gibril, Mohamed Barakat A., Mohd Shafri, Helmi Zulhaidi, Al-Ruzouq, Rami, Shanableh, Abdallah, Nahas, Faten, Al Mansoori, Saeed
Format: Article
Language:English
Published: MDPI 2023
Online Access:http://psasir.upm.edu.my/id/eprint/110312/
http://psasir.upm.edu.my/id/eprint/110312/1/drones-07-00093-v2.pdf
_version_ 1848865492113031168
author Gibril, Mohamed Barakat A.
Mohd Shafri, Helmi Zulhaidi
Al-Ruzouq, Rami
Shanableh, Abdallah
Nahas, Faten
Al Mansoori, Saeed
author_facet Gibril, Mohamed Barakat A.
Mohd Shafri, Helmi Zulhaidi
Al-Ruzouq, Rami
Shanableh, Abdallah
Nahas, Faten
Al Mansoori, Saeed
author_sort Gibril, Mohamed Barakat A.
building UPM Institutional Repository
collection Online Access
description The reliable and efficient large-scale mapping of date palm trees from remotely sensed data is crucial for developing palm tree inventories, continuous monitoring, vulnerability assessments, environmental control, and long-term management. Given the increasing availability of UAV images with limited spectral information, the high intra-class variance of date palm trees, the variations in the spatial resolutions of the data, and the differences in image contexts and backgrounds, accurate mapping of date palm trees from very-high spatial resolution (VHSR) images can be challenging. This study aimed to investigate the reliability and the efficiency of various deep vision transformers in extracting date palm trees from multiscale and multisource VHSR images. Numerous vision transformers, including the Segformer, the Segmenter, the UperNet-Swin transformer, and the dense prediction transformer, with various levels of model complexity, were evaluated. The models were developed and evaluated using a set of comprehensive UAV-based and aerial images. The generalizability and the transferability of the deep vision transformers were evaluated and compared with various convolutional neural network-based (CNN) semantic segmentation models (including DeepLabV3+, PSPNet, FCN-ResNet-50, and DANet). The results of the examined deep vision transformers were generally comparable to several CNN-based models. The investigated deep vision transformers achieved satisfactory results in mapping date palm trees from the UAV images, with an mIoU ranging from 85% to 86.3% and an mF-score ranging from 91.62% to 92.44%. Among the evaluated models, the Segformer generated the highest segmentation results on the UAV-based and the multiscale testing datasets. The Segformer model, followed by the UperNet-Swin transformer, outperformed all of the evaluated CNN-based models in the multiscale testing dataset and in the additional unseen UAV testing dataset. In addition to delivering remarkable results in mapping date palm trees from versatile VHSR images, the Segformer model was among those with a small number of parameters and relatively low computing costs. Collectively, deep vision transformers could be used efficiently in developing and updating inventories of date palms and other tree species.
first_indexed 2025-11-15T14:05:34Z
format Article
id upm-110312
institution Universiti Putra Malaysia
institution_category Local University
language English
last_indexed 2025-11-15T14:05:34Z
publishDate 2023
publisher MDPI
recordtype eprints
repository_type Digital Repository
spelling upm-1103122024-09-05T07:05:05Z http://psasir.upm.edu.my/id/eprint/110312/ Large-scale date palm tree segmentation from multiscale UAV-based and aerial images using deep vision transformers Gibril, Mohamed Barakat A. Mohd Shafri, Helmi Zulhaidi Al-Ruzouq, Rami Shanableh, Abdallah Nahas, Faten Al Mansoori, Saeed The reliable and efficient large-scale mapping of date palm trees from remotely sensed data is crucial for developing palm tree inventories, continuous monitoring, vulnerability assessments, environmental control, and long-term management. Given the increasing availability of UAV images with limited spectral information, the high intra-class variance of date palm trees, the variations in the spatial resolutions of the data, and the differences in image contexts and backgrounds, accurate mapping of date palm trees from very-high spatial resolution (VHSR) images can be challenging. This study aimed to investigate the reliability and the efficiency of various deep vision transformers in extracting date palm trees from multiscale and multisource VHSR images. Numerous vision transformers, including the Segformer, the Segmenter, the UperNet-Swin transformer, and the dense prediction transformer, with various levels of model complexity, were evaluated. The models were developed and evaluated using a set of comprehensive UAV-based and aerial images. The generalizability and the transferability of the deep vision transformers were evaluated and compared with various convolutional neural network-based (CNN) semantic segmentation models (including DeepLabV3+, PSPNet, FCN-ResNet-50, and DANet). The results of the examined deep vision transformers were generally comparable to several CNN-based models. The investigated deep vision transformers achieved satisfactory results in mapping date palm trees from the UAV images, with an mIoU ranging from 85% to 86.3% and an mF-score ranging from 91.62% to 92.44%. Among the evaluated models, the Segformer generated the highest segmentation results on the UAV-based and the multiscale testing datasets. The Segformer model, followed by the UperNet-Swin transformer, outperformed all of the evaluated CNN-based models in the multiscale testing dataset and in the additional unseen UAV testing dataset. In addition to delivering remarkable results in mapping date palm trees from versatile VHSR images, the Segformer model was among those with a small number of parameters and relatively low computing costs. Collectively, deep vision transformers could be used efficiently in developing and updating inventories of date palms and other tree species. MDPI 2023-01 Article PeerReviewed text en http://psasir.upm.edu.my/id/eprint/110312/1/drones-07-00093-v2.pdf Gibril, Mohamed Barakat A. and Mohd Shafri, Helmi Zulhaidi and Al-Ruzouq, Rami and Shanableh, Abdallah and Nahas, Faten and Al Mansoori, Saeed (2023) Large-scale date palm tree segmentation from multiscale UAV-based and aerial images using deep vision transformers. Drones, 7 (2). pp. 93-25. ISSN 2504-446X https://www.mdpi.com/2504-446X/7/2/93 10.3390/drones7020093
spellingShingle Gibril, Mohamed Barakat A.
Mohd Shafri, Helmi Zulhaidi
Al-Ruzouq, Rami
Shanableh, Abdallah
Nahas, Faten
Al Mansoori, Saeed
Large-scale date palm tree segmentation from multiscale UAV-based and aerial images using deep vision transformers
title Large-scale date palm tree segmentation from multiscale UAV-based and aerial images using deep vision transformers
title_full Large-scale date palm tree segmentation from multiscale UAV-based and aerial images using deep vision transformers
title_fullStr Large-scale date palm tree segmentation from multiscale UAV-based and aerial images using deep vision transformers
title_full_unstemmed Large-scale date palm tree segmentation from multiscale UAV-based and aerial images using deep vision transformers
title_short Large-scale date palm tree segmentation from multiscale UAV-based and aerial images using deep vision transformers
title_sort large-scale date palm tree segmentation from multiscale uav-based and aerial images using deep vision transformers
url http://psasir.upm.edu.my/id/eprint/110312/
http://psasir.upm.edu.my/id/eprint/110312/
http://psasir.upm.edu.my/id/eprint/110312/
http://psasir.upm.edu.my/id/eprint/110312/1/drones-07-00093-v2.pdf