Evaluating A New Adaptive Group Lasso Imputation Technique For Handling Missing Values In Compositional Data

Pie chart is a widely used statistical chart to represent the proportions of various components in a certain entity. The shares of data in a pie chart, also known as compositional data, consist of non-negative values, containing only relative information. However, in many real-life domains, a sub...

Full description

Bibliographic Details
Main Author: Tian, Ying
Format: Thesis
Language:English
Published: 2024
Subjects:
Online Access:http://eprints.usm.my/62560/
http://eprints.usm.my/62560/1/TIAN%20YING%20-%20TESIS%20cut.pdf
_version_ 1848885021584130048
author Tian, Ying
author_facet Tian, Ying
author_sort Tian, Ying
building USM Institutional Repository
collection Online Access
description Pie chart is a widely used statistical chart to represent the proportions of various components in a certain entity. The shares of data in a pie chart, also known as compositional data, consist of non-negative values, containing only relative information. However, in many real-life domains, a substantial amount of missing values is often collected. The complexity of compositional data with missing values renders traditional estimation methods inadequate. In this thesis, a compositional data imputation method designed based on LASSO is proposed combining group LASSO and adaptive LASSO analysis methods. The estimation effects of highdimensional and low-dimensional compositional data with missing values are compared through simulation studies and case analyses under different missing rates, dimensions, and correlation coefficients. Considering the impact of outliers on the accuracy of estimation, both simulation and case analysis are conducted to compare the proposed algorithm against four existing methods. The experimental results demonstrate that the proposed adaptive group LASSO method produces a better imputation performance, MSE, MADE, RMSE and NRMSE increased by up to 26.6% at selected missing rates. Future work analyses the effect of imputation under continuous missing rates, MAR missing mechanism and more model evaluation criteria.
first_indexed 2025-11-15T19:15:59Z
format Thesis
id usm-62560
institution Universiti Sains Malaysia
institution_category Local University
language English
last_indexed 2025-11-15T19:15:59Z
publishDate 2024
recordtype eprints
repository_type Digital Repository
spelling usm-625602025-06-24T07:02:17Z http://eprints.usm.my/62560/ Evaluating A New Adaptive Group Lasso Imputation Technique For Handling Missing Values In Compositional Data Tian, Ying QA1 Mathematics (General) Pie chart is a widely used statistical chart to represent the proportions of various components in a certain entity. The shares of data in a pie chart, also known as compositional data, consist of non-negative values, containing only relative information. However, in many real-life domains, a substantial amount of missing values is often collected. The complexity of compositional data with missing values renders traditional estimation methods inadequate. In this thesis, a compositional data imputation method designed based on LASSO is proposed combining group LASSO and adaptive LASSO analysis methods. The estimation effects of highdimensional and low-dimensional compositional data with missing values are compared through simulation studies and case analyses under different missing rates, dimensions, and correlation coefficients. Considering the impact of outliers on the accuracy of estimation, both simulation and case analysis are conducted to compare the proposed algorithm against four existing methods. The experimental results demonstrate that the proposed adaptive group LASSO method produces a better imputation performance, MSE, MADE, RMSE and NRMSE increased by up to 26.6% at selected missing rates. Future work analyses the effect of imputation under continuous missing rates, MAR missing mechanism and more model evaluation criteria. 2024-08 Thesis NonPeerReviewed application/pdf en http://eprints.usm.my/62560/1/TIAN%20YING%20-%20TESIS%20cut.pdf Tian, Ying (2024) Evaluating A New Adaptive Group Lasso Imputation Technique For Handling Missing Values In Compositional Data. PhD thesis, Universiti Sains Malaysia.
spellingShingle QA1 Mathematics (General)
Tian, Ying
Evaluating A New Adaptive Group Lasso Imputation Technique For Handling Missing Values In Compositional Data
title Evaluating A New Adaptive Group Lasso Imputation Technique For Handling Missing Values In Compositional Data
title_full Evaluating A New Adaptive Group Lasso Imputation Technique For Handling Missing Values In Compositional Data
title_fullStr Evaluating A New Adaptive Group Lasso Imputation Technique For Handling Missing Values In Compositional Data
title_full_unstemmed Evaluating A New Adaptive Group Lasso Imputation Technique For Handling Missing Values In Compositional Data
title_short Evaluating A New Adaptive Group Lasso Imputation Technique For Handling Missing Values In Compositional Data
title_sort evaluating a new adaptive group lasso imputation technique for handling missing values in compositional data
topic QA1 Mathematics (General)
url http://eprints.usm.my/62560/
http://eprints.usm.my/62560/1/TIAN%20YING%20-%20TESIS%20cut.pdf