A comparative study of different imputation methods for daily rainfall data in east-coast Peninsular Malaysia

Rainfall data are the most significant values in hydrology and climatology modelling. However, the datasets are prone to missing values due to various issues. This study aspires to impute the rainfall missing values by using various imputation method such as Replace by Mean, Nearest Neighbor, Random...

Full description

Bibliographic Details
Main Authors: Che Mat Nor, Siti Mariana, Shaharudin, Shazlyn Milleana, Ismail, Shuhaida, Zainuddin, Nurul Hila, Tan, Mou Leong
Format: Article
Published: Universitas Ahmad Dahlan 2020
Subjects:
Online Access:http://eprints.uthm.edu.my/6100/
_version_ 1848888714625810432
author Che Mat Nor, Siti Mariana
Shaharudin, Shazlyn Milleana
Ismail, Shuhaida
Zainuddin, Nurul Hila
Tan, Mou Leong
author_facet Che Mat Nor, Siti Mariana
Shaharudin, Shazlyn Milleana
Ismail, Shuhaida
Zainuddin, Nurul Hila
Tan, Mou Leong
author_sort Che Mat Nor, Siti Mariana
building UTHM Institutional Repository
collection Online Access
description Rainfall data are the most significant values in hydrology and climatology modelling. However, the datasets are prone to missing values due to various issues. This study aspires to impute the rainfall missing values by using various imputation method such as Replace by Mean, Nearest Neighbor, Random Forest, Non-linear Interactive Partial Least-Square (NIPALS) and Markov Chain Monte Carlo (MCMC). Daily rainfall datasets from 48 rainfall stations across east-coast Peninsular Malaysia were used in this study. The dataset were then fed into Multiple Linear Regression (MLR) model. The performance of abovementioned methods were evaluated using Root Mean Square Method (RMSE), Mean Absolute Error (MAE) and Nash-Sutcliffe Efficiency Coefficient (CE). The experimental results showed that RF coupled with MLR (RF-MLR) approach was attained as more fitting for satisfying the missing data in east-coast Peninsular Malaysia.
first_indexed 2025-11-15T20:14:40Z
format Article
id uthm-6100
institution Universiti Tun Hussein Onn Malaysia
institution_category Local University
last_indexed 2025-11-15T20:14:40Z
publishDate 2020
publisher Universitas Ahmad Dahlan
recordtype eprints
repository_type Digital Repository
spelling uthm-61002022-01-26T07:00:30Z http://eprints.uthm.edu.my/6100/ A comparative study of different imputation methods for daily rainfall data in east-coast Peninsular Malaysia Che Mat Nor, Siti Mariana Shaharudin, Shazlyn Milleana Ismail, Shuhaida Zainuddin, Nurul Hila Tan, Mou Leong QC980-999 Climatology and weather Rainfall data are the most significant values in hydrology and climatology modelling. However, the datasets are prone to missing values due to various issues. This study aspires to impute the rainfall missing values by using various imputation method such as Replace by Mean, Nearest Neighbor, Random Forest, Non-linear Interactive Partial Least-Square (NIPALS) and Markov Chain Monte Carlo (MCMC). Daily rainfall datasets from 48 rainfall stations across east-coast Peninsular Malaysia were used in this study. The dataset were then fed into Multiple Linear Regression (MLR) model. The performance of abovementioned methods were evaluated using Root Mean Square Method (RMSE), Mean Absolute Error (MAE) and Nash-Sutcliffe Efficiency Coefficient (CE). The experimental results showed that RF coupled with MLR (RF-MLR) approach was attained as more fitting for satisfying the missing data in east-coast Peninsular Malaysia. Universitas Ahmad Dahlan 2020 Article PeerReviewed Che Mat Nor, Siti Mariana and Shaharudin, Shazlyn Milleana and Ismail, Shuhaida and Zainuddin, Nurul Hila and Tan, Mou Leong (2020) A comparative study of different imputation methods for daily rainfall data in east-coast Peninsular Malaysia. Bulletin of Electrical Engineering and Informatics, 9 (2). pp. 635-643. ISSN 2089-3191 https://dx.doi.org/10.11591/eei.v9i2.2090
spellingShingle QC980-999 Climatology and weather
Che Mat Nor, Siti Mariana
Shaharudin, Shazlyn Milleana
Ismail, Shuhaida
Zainuddin, Nurul Hila
Tan, Mou Leong
A comparative study of different imputation methods for daily rainfall data in east-coast Peninsular Malaysia
title A comparative study of different imputation methods for daily rainfall data in east-coast Peninsular Malaysia
title_full A comparative study of different imputation methods for daily rainfall data in east-coast Peninsular Malaysia
title_fullStr A comparative study of different imputation methods for daily rainfall data in east-coast Peninsular Malaysia
title_full_unstemmed A comparative study of different imputation methods for daily rainfall data in east-coast Peninsular Malaysia
title_short A comparative study of different imputation methods for daily rainfall data in east-coast Peninsular Malaysia
title_sort comparative study of different imputation methods for daily rainfall data in east-coast peninsular malaysia
topic QC980-999 Climatology and weather
url http://eprints.uthm.edu.my/6100/
http://eprints.uthm.edu.my/6100/