Development of an imputation technique - INI for software metric database with incomplete data

Software metrics are numerical data that provide a quantitative basis for the development and validation of models, and for effective measurement of the software development process. Gathering software engineering data can be expensive. Such precious and costly data cannot afford to be missing. However...


Bibliographic Details
Main Authors: Wasito, Ito, Olanrewaju, Rashidah F.
Format: Book Section
Language: English
Published: Institute of Electrical and Electronics Engineers 2007
Subjects: QA75 Electronic computers. Computer science
Online Access: http://eprints.utm.my/9634/
http://eprints.utm.my/9634/1/ItoWasito2007_DevelopmentOfAnImputationTechnique.pdf
_version_ 1848891899025293312
author Wasito, Ito
Olanrewaju, Rashidah F.
building UTeM Institutional Repository
collection Online Access
description Software metrics are numerical data that provide a quantitative basis for the development and validation of models, and for effective measurement of the software development process. Gathering software engineering data can be expensive, and such precious and costly data cannot afford to be missing. However, missing data is a common problem, and software engineering databases are no exception. Although many algorithms address the problem of incomplete data, few have been developed in the field of software engineering. Missing data causes significant problems: with inaccurate or missing data, it is very difficult to know how much a project will cost or be worth. Missing data leads to loss of information, biases data analysis, and hence results in inaccurate decision-making for project management and implementation. In this paper, INI, an imputation technique for missing data based on a global-local Modified Singular Value Decomposition (MSVD) algorithm, is proposed. The technique was used to estimate missing data in a software engineering database (PROMISE). Its performance was evaluated and compared with two existing imputation techniques, Expectation Maximization (EM) and Mean Imputation (MI). Varying percentages of missing values (1%, 10%, 15%, 20%, and 25%) were introduced into the original dataset to produce incomplete datasets for imputation. Simulations were carried out for comparative purposes. Imputation Error (IE) was used as the evaluation criterion. The results showed that the global-local MSVD method, INI, was the only method that consistently outperformed the others (EM and MI), delivering higher accuracy of imputed data, prompt results, and less bias at all levels of missingness. It remained consistent across all levels of missingness compared with EM and MI. EM was found to be unsuitable for data with a missing proportion greater than 20%, while MI performed worse than both EM and INI in every case.
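The evaluation procedure outlined in the description (hide a known fraction of values, impute them, then score the result) can be sketched in Python for the Mean Imputation baseline only. This is an illustrative sketch, not the authors' code: the function names, the toy data, and the IE formula (squared error over the hidden cells relative to the squared true values, as a percentage) are all assumptions, since this record neither defines IE nor describes INI in enough detail to implement it.

```python
import random

def mask_entries(data, fraction, rng):
    """Return a copy of data with about `fraction` of the cells set to None."""
    cells = [(i, j) for i, row in enumerate(data) for j in range(len(row))]
    n_missing = round(fraction * len(cells))
    hidden = set(rng.sample(cells, n_missing))
    masked = [[None if (i, j) in hidden else v for j, v in enumerate(row)]
              for i, row in enumerate(data)]
    return masked, hidden

def mean_impute(masked):
    """Fill each missing cell with the mean of the observed values in its column."""
    n_cols = len(masked[0])
    col_means = []
    for j in range(n_cols):
        observed = [row[j] for row in masked if row[j] is not None]
        # Assumption: a column with no observed values falls back to 0.0.
        col_means.append(sum(observed) / len(observed) if observed else 0.0)
    return [[col_means[j] if v is None else v for j, v in enumerate(row)]
            for row in masked]

def imputation_error(original, imputed, hidden):
    """Assumed IE: relative squared error over the hidden cells, in percent."""
    num = sum((original[i][j] - imputed[i][j]) ** 2 for i, j in hidden)
    den = sum(original[i][j] ** 2 for i, j in hidden)
    return 100.0 * num / den if den else 0.0

# Toy data standing in for a software metrics table (hypothetical values).
rng = random.Random(0)
data = [[float(10 * i + j + 1) for j in range(4)] for i in range(25)]

# Missing-data levels taken from the description above.
for fraction in (0.01, 0.10, 0.15, 0.20, 0.25):
    masked, hidden = mask_entries(data, fraction, rng)
    filled = mean_impute(masked)
    print(f"{fraction:.0%} missing: IE = {imputation_error(data, filled, hidden):.2f}%")
```

Scoring only the hidden cells keeps the measure focused on imputed values rather than on entries that were never missing. Any other imputer with the same input and output shape could be swapped in for `mean_impute` to compare methods at each level.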
first_indexed 2025-11-15T21:05:17Z
format Book Section
id utm-9634
institution Universiti Teknologi Malaysia
institution_category Local University
language English
last_indexed 2025-11-15T21:05:17Z
publishDate 2007
publisher Institute of Electrical and Electronics Engineers
recordtype eprints
repository_type Digital Repository
spelling utm-9634 2017-08-15T01:32:54Z http://eprints.utm.my/9634/ Institute of Electrical and Electronics Engineers 2007-10 Book Section PeerReviewed application/pdf en http://eprints.utm.my/9634/1/ItoWasito2007_DevelopmentOfAnImputationTechnique.pdf Wasito, Ito and Olanrewaju, Rashidah F. (2007) Development of an imputation technique - INI for software metric database with incomplete data. In: 4th Student Conference on Research and Development SCOReD 2006. Institute of Electrical and Electronics Engineers, pp. 76-80. ISBN 978-1-4244-0526-8 http://dx.doi.org/10.1109/SCORED.2006.4339312 doi: 10.1109/SCORED.2006.4339312
title Development of an imputation technique - INI for software metric database with incomplete data
topic QA75 Electronic computers. Computer science
url http://eprints.utm.my/9634/
http://eprints.utm.my/9634/1/ItoWasito2007_DevelopmentOfAnImputationTechnique.pdf