Crop classification and yield prediction using robust machine learning models for agricultural sustainability

Agriculture is pivotal for the economy of a country as it is a major source of food, employment and raw materials. However, challenges such as diseases, soil degradation, and water scarcity persist. Technology adoption can address these issues, improving production and quality. Machine learning, a s...

Full description

Bibliographic Details
Main Authors: Badshah, Abid, Alkazemi, Basem Yousef, Fakhrud Din, ., Kamal Z., Zamli, Muhammad Haris, .
Format: Article
Language:English
Published: IEEE 2024
Subjects:
Online Access:https://umpir.ump.edu.my/id/eprint/44215/
_version_ 1848827315639812096
author Badshah, Abid
Alkazemi, Basem Yousef
Fakhrud Din, .
Kamal Z., Zamli
Muhammad Haris, .
author_facet Badshah, Abid
Alkazemi, Basem Yousef
Fakhrud Din, .
Kamal Z., Zamli
Muhammad Haris, .
author_sort Badshah, Abid
building UMP Institutional Repository
collection Online Access
description Agriculture is pivotal for the economy of a country as it is a major source of food, employment and raw materials. However, challenges such as diseases, soil degradation, and water scarcity persist. Technology adoption can address these issues, improving production and quality. Machine learning, a subset of Artificial Intelligence (AI), enables prediction, classification, and automation in agriculture. It optimizes irrigation, fertilization, and crop selection, aiding decision-making for food security and crop management. This study proposes two robust machine learning architectures for classification and regression based on distinct datasets. Firstly, we delve into a crop recommendation dataset obtained from Kaggle, consisting of various input attributes such as the pH of the soil, temperature, humidity, and nutrient levels. Leveraging machine learning classification techniques such as Extra Tree Classifier (ETC), Logistic Regression (LR), Decision Tree (DT), Random Forest (RF), K-Nearest Neighbour (KNN), Gaussian Naive Bayes (GNB), and Support Vector Machine (SVM), we suggest twenty-two different crops founded on these inputs. Through the use of K-fold cross-validation, Explainable AI (XAI) and feature engineering, we identify the bestperforming model, with Random Forest coming out on top scoring an accuracy of 99.7% with precision, recall, F1 score, and confusion matrix. Secondly, we investigate wheat yield prediction data snagged from the World Bank and Food and Agriculture Organization (FAO), covering the years 1992-2013 for Pakistan. Using Multivariate Imputation by Chained Equations (MICE) to tackle data restrictions, we gauge wheat production for 2014-2024 and forecast the 2025 yield using machine learning regression models. Once again, using hyper parameter tuning with K-fold cross-validation, Support Vector Regressor (SVR) stands out as the top-performing model, achieving an accuracy of 99.9% with R2 Score. Transparency and confidence in agricultural decision-making are increased when machine learning decisions are made comprehensible using Explainable AI (XAI) approaches. Two widely used XAI approaches, namely Feature Importance and Local Interpretable Model-Agnostic Explanations (LIME) are used to interpret and explain outcomes of the proposed models. The study can increase agricultural productivity, minimize risks, enhance food security, and promote more environmentally friendly farming approaches
first_indexed 2025-11-15T03:58:46Z
format Article
id ump-44215
institution Universiti Malaysia Pahang
institution_category Local University
language English
last_indexed 2025-11-15T03:58:46Z
publishDate 2024
publisher IEEE
recordtype eprints
repository_type Digital Repository
spelling ump-442152025-10-06T03:47:51Z https://umpir.ump.edu.my/id/eprint/44215/ Crop classification and yield prediction using robust machine learning models for agricultural sustainability Badshah, Abid Alkazemi, Basem Yousef Fakhrud Din, . Kamal Z., Zamli Muhammad Haris, . QA75 Electronic computers. Computer science S Agriculture (General) Agriculture is pivotal for the economy of a country as it is a major source of food, employment and raw materials. However, challenges such as diseases, soil degradation, and water scarcity persist. Technology adoption can address these issues, improving production and quality. Machine learning, a subset of Artificial Intelligence (AI), enables prediction, classification, and automation in agriculture. It optimizes irrigation, fertilization, and crop selection, aiding decision-making for food security and crop management. This study proposes two robust machine learning architectures for classification and regression based on distinct datasets. Firstly, we delve into a crop recommendation dataset obtained from Kaggle, consisting of various input attributes such as the pH of the soil, temperature, humidity, and nutrient levels. Leveraging machine learning classification techniques such as Extra Tree Classifier (ETC), Logistic Regression (LR), Decision Tree (DT), Random Forest (RF), K-Nearest Neighbour (KNN), Gaussian Naive Bayes (GNB), and Support Vector Machine (SVM), we suggest twenty-two different crops founded on these inputs. Through the use of K-fold cross-validation, Explainable AI (XAI) and feature engineering, we identify the bestperforming model, with Random Forest coming out on top scoring an accuracy of 99.7% with precision, recall, F1 score, and confusion matrix. Secondly, we investigate wheat yield prediction data snagged from the World Bank and Food and Agriculture Organization (FAO), covering the years 1992-2013 for Pakistan. Using Multivariate Imputation by Chained Equations (MICE) to tackle data restrictions, we gauge wheat production for 2014-2024 and forecast the 2025 yield using machine learning regression models. Once again, using hyper parameter tuning with K-fold cross-validation, Support Vector Regressor (SVR) stands out as the top-performing model, achieving an accuracy of 99.9% with R2 Score. Transparency and confidence in agricultural decision-making are increased when machine learning decisions are made comprehensible using Explainable AI (XAI) approaches. Two widely used XAI approaches, namely Feature Importance and Local Interpretable Model-Agnostic Explanations (LIME) are used to interpret and explain outcomes of the proposed models. The study can increase agricultural productivity, minimize risks, enhance food security, and promote more environmentally friendly farming approaches IEEE 2024-11-13 Article PeerReviewed pdf en cc_by_nc_nd_4 https://umpir.ump.edu.my/id/eprint/44215/1/Crop%20classification%20and%20yield%20prediction%20using%20robust%20machine.pdf Badshah, Abid and Alkazemi, Basem Yousef and Fakhrud Din, . and Kamal Z., Zamli and Muhammad Haris, . (2024) Crop classification and yield prediction using robust machine learning models for agricultural sustainability. IEEE Access, 12. pp. 162799 -162813. ISSN 2169-3536. (Published) https://doi.org/10.1109/ACCESS.2024.3486653 https://doi.org/10.1109/ACCESS.2024.3486653 https://doi.org/10.1109/ACCESS.2024.3486653
spellingShingle QA75 Electronic computers. Computer science
S Agriculture (General)
Badshah, Abid
Alkazemi, Basem Yousef
Fakhrud Din, .
Kamal Z., Zamli
Muhammad Haris, .
Crop classification and yield prediction using robust machine learning models for agricultural sustainability
title Crop classification and yield prediction using robust machine learning models for agricultural sustainability
title_full Crop classification and yield prediction using robust machine learning models for agricultural sustainability
title_fullStr Crop classification and yield prediction using robust machine learning models for agricultural sustainability
title_full_unstemmed Crop classification and yield prediction using robust machine learning models for agricultural sustainability
title_short Crop classification and yield prediction using robust machine learning models for agricultural sustainability
title_sort crop classification and yield prediction using robust machine learning models for agricultural sustainability
topic QA75 Electronic computers. Computer science
S Agriculture (General)
url https://umpir.ump.edu.my/id/eprint/44215/
https://umpir.ump.edu.my/id/eprint/44215/
https://umpir.ump.edu.my/id/eprint/44215/