Prediction Of Oil Palm Yield For Smallholders Estates In Tropical Region Using Extra Trees Method

Global food security and sustainable use of natural resources heavily relies on the timely prediction of crop yields. The oil palm, being the most profitable crop for oil production worldwide, requires accurate yield predictions to maintain a balance between its global demand and supply. This thesis...

Full description

Bibliographic Details
Main Author: Khan, Nuzhat
Format: Thesis
Language:English
Published: 2023
Subjects:
Online Access:http://eprints.usm.my/60016/
http://eprints.usm.my/60016/1/NUZHAT%20KHAN%20-%20TESIS%20cut.pdf
_version_ 1848884327311474688
author Khan, Nuzhat
author_facet Khan, Nuzhat
author_sort Khan, Nuzhat
building USM Institutional Repository
collection Online Access
description Global food security and sustainable use of natural resources heavily relies on the timely prediction of crop yields. The oil palm, being the most profitable crop for oil production worldwide, requires accurate yield predictions to maintain a balance between its global demand and supply. This thesis proposes machine learning regression approach to predict oil palm fresh fruit bunches yield. The main objective of this research is to develop a machine learning model trained on actual data to predict long-term oil palm yield with high precision. The study utilizes data obtained from multiple sources including Malaysia Palm Oil Board (MPOB), Meteorological Department Malaysia (MET) and NASA. The proposed methodology is implemented on site specific data recorded from an entire state Pahang Malaysia. The data is comprised of 18 variables including historical yield, soil, and weather variables, to accurately predict future yield. The statistical analysis facilitated to assess data quality and to extract the agricultural information. The outcomes of the correlation analysis reveal the complex interdependencies of yield influencing factors. The data exploration is followed by a preprocessing pipeline to convert raw data into meaningful information. The data preprocessing pipeline includes treating outliers, normalization, features selection and data splitting into training, testing, and validation sets. Based on the prepared data, the automated model selection is used to identify the most appropriate prediction model.
first_indexed 2025-11-15T19:04:56Z
format Thesis
id usm-60016
institution Universiti Sains Malaysia
institution_category Local University
language English
last_indexed 2025-11-15T19:04:56Z
publishDate 2023
recordtype eprints
repository_type Digital Repository
spelling usm-600162024-02-27T08:24:44Z http://eprints.usm.my/60016/ Prediction Of Oil Palm Yield For Smallholders Estates In Tropical Region Using Extra Trees Method Khan, Nuzhat TS1-2301 Manufactures Global food security and sustainable use of natural resources heavily relies on the timely prediction of crop yields. The oil palm, being the most profitable crop for oil production worldwide, requires accurate yield predictions to maintain a balance between its global demand and supply. This thesis proposes machine learning regression approach to predict oil palm fresh fruit bunches yield. The main objective of this research is to develop a machine learning model trained on actual data to predict long-term oil palm yield with high precision. The study utilizes data obtained from multiple sources including Malaysia Palm Oil Board (MPOB), Meteorological Department Malaysia (MET) and NASA. The proposed methodology is implemented on site specific data recorded from an entire state Pahang Malaysia. The data is comprised of 18 variables including historical yield, soil, and weather variables, to accurately predict future yield. The statistical analysis facilitated to assess data quality and to extract the agricultural information. The outcomes of the correlation analysis reveal the complex interdependencies of yield influencing factors. The data exploration is followed by a preprocessing pipeline to convert raw data into meaningful information. The data preprocessing pipeline includes treating outliers, normalization, features selection and data splitting into training, testing, and validation sets. Based on the prepared data, the automated model selection is used to identify the most appropriate prediction model. 2023-09 Thesis NonPeerReviewed application/pdf en http://eprints.usm.my/60016/1/NUZHAT%20KHAN%20-%20TESIS%20cut.pdf Khan, Nuzhat (2023) Prediction Of Oil Palm Yield For Smallholders Estates In Tropical Region Using Extra Trees Method. PhD thesis, Universiti Sains Malaysia.
spellingShingle TS1-2301 Manufactures
Khan, Nuzhat
Prediction Of Oil Palm Yield For Smallholders Estates In Tropical Region Using Extra Trees Method
title Prediction Of Oil Palm Yield For Smallholders Estates In Tropical Region Using Extra Trees Method
title_full Prediction Of Oil Palm Yield For Smallholders Estates In Tropical Region Using Extra Trees Method
title_fullStr Prediction Of Oil Palm Yield For Smallholders Estates In Tropical Region Using Extra Trees Method
title_full_unstemmed Prediction Of Oil Palm Yield For Smallholders Estates In Tropical Region Using Extra Trees Method
title_short Prediction Of Oil Palm Yield For Smallholders Estates In Tropical Region Using Extra Trees Method
title_sort prediction of oil palm yield for smallholders estates in tropical region using extra trees method
topic TS1-2301 Manufactures
url http://eprints.usm.my/60016/
http://eprints.usm.my/60016/1/NUZHAT%20KHAN%20-%20TESIS%20cut.pdf