Imputation techniques for incomplete load data based on seasonality and orientation of the missing values

In load data, the missing problem always occurs in a set of data. Since it has a seasonal pattern according to days, most of the time, the load usage for the next day is predictable. For this reason, a new model has been developed based on these characteristics. Data containing missing values bein...

Full description

Bibliographic Details
Main Authors: Nur Arina Bazilah Kamisan, Muhammad Hisyam Lee, Abdul Ghapor Hussin, Yong Zulina Zubairi
Format: Article
Language:English
Published: Penerbit Universiti Kebangsaan Malaysia 2020
Online Access:http://journalarticle.ukm.my/15409/
http://journalarticle.ukm.my/15409/1/22.pdf
_version_ 1848813793700741120
author Nur Arina Bazilah Kamisan,
Muhammad Hisyam Lee,
Abdul Ghapor Hussin,
Yong Zulina Zubairi,
author_facet Nur Arina Bazilah Kamisan,
Muhammad Hisyam Lee,
Abdul Ghapor Hussin,
Yong Zulina Zubairi,
author_sort Nur Arina Bazilah Kamisan,
building UKM Institutional Repository
collection Online Access
description In load data, the missing problem always occurs in a set of data. Since it has a seasonal pattern according to days, most of the time, the load usage for the next day is predictable. For this reason, a new model has been developed based on these characteristics. Data containing missing values being divided to its seasonality pattern and for each subdivision, the values from mean, the mean with standard deviation and third quartile are calculated before being rearrange to form a new set of values that will replace the missing values. These three values will be used as imputations for the missing values. To examine the effects of the orientation of the missing values with the choices of imputation, the missing values from the data are divided into three parts: at the front, in the middle and at the end of the data with 5%, 15%, and 25% of missing values. The results from root mean square error and mean absolute error show that the proposed techniques, particularly the mean and the third quartile value, are superior to the other complex methods when dealing with the missing values. The mean imputation is ample when the missing values is presence at the front and in the middle of the data while the third quartile value is superior when the missing values is at the end of the data.
first_indexed 2025-11-15T00:23:50Z
format Article
id oai:generic.eprints.org:15409
institution Universiti Kebangasaan Malaysia
institution_category Local University
language English
last_indexed 2025-11-15T00:23:50Z
publishDate 2020
publisher Penerbit Universiti Kebangsaan Malaysia
recordtype eprints
repository_type Digital Repository
spelling oai:generic.eprints.org:154092020-10-23T01:33:13Z http://journalarticle.ukm.my/15409/ Imputation techniques for incomplete load data based on seasonality and orientation of the missing values Nur Arina Bazilah Kamisan, Muhammad Hisyam Lee, Abdul Ghapor Hussin, Yong Zulina Zubairi, In load data, the missing problem always occurs in a set of data. Since it has a seasonal pattern according to days, most of the time, the load usage for the next day is predictable. For this reason, a new model has been developed based on these characteristics. Data containing missing values being divided to its seasonality pattern and for each subdivision, the values from mean, the mean with standard deviation and third quartile are calculated before being rearrange to form a new set of values that will replace the missing values. These three values will be used as imputations for the missing values. To examine the effects of the orientation of the missing values with the choices of imputation, the missing values from the data are divided into three parts: at the front, in the middle and at the end of the data with 5%, 15%, and 25% of missing values. The results from root mean square error and mean absolute error show that the proposed techniques, particularly the mean and the third quartile value, are superior to the other complex methods when dealing with the missing values. The mean imputation is ample when the missing values is presence at the front and in the middle of the data while the third quartile value is superior when the missing values is at the end of the data. Penerbit Universiti Kebangsaan Malaysia 2020-05 Article PeerReviewed application/pdf en http://journalarticle.ukm.my/15409/1/22.pdf Nur Arina Bazilah Kamisan, and Muhammad Hisyam Lee, and Abdul Ghapor Hussin, and Yong Zulina Zubairi, (2020) Imputation techniques for incomplete load data based on seasonality and orientation of the missing values. Sains Malaysiana, 49 (5). pp. 1165-1174. ISSN 0126-6039 http://www.ukm.my/jsm/malay_journals/jilid49bil5_2020/KandunganJilid49Bil5_2020.html
spellingShingle Nur Arina Bazilah Kamisan,
Muhammad Hisyam Lee,
Abdul Ghapor Hussin,
Yong Zulina Zubairi,
Imputation techniques for incomplete load data based on seasonality and orientation of the missing values
title Imputation techniques for incomplete load data based on seasonality and orientation of the missing values
title_full Imputation techniques for incomplete load data based on seasonality and orientation of the missing values
title_fullStr Imputation techniques for incomplete load data based on seasonality and orientation of the missing values
title_full_unstemmed Imputation techniques for incomplete load data based on seasonality and orientation of the missing values
title_short Imputation techniques for incomplete load data based on seasonality and orientation of the missing values
title_sort imputation techniques for incomplete load data based on seasonality and orientation of the missing values
url http://journalarticle.ukm.my/15409/
http://journalarticle.ukm.my/15409/
http://journalarticle.ukm.my/15409/1/22.pdf