Robust diagnostic and parameter estimation for multiple linear and panel data regression models

The Influential Distance (ID) is proposed to identify multiple influential observations (IOs) in linear regression. However, the method not only considered good leverage observations (GLOs) as IOs, but also takes long computational running time with high rate of swamping and masking effects. Fast Im...

Full description

Bibliographic Details
Main Author: Sani, Muhammad
Format: Thesis
Language:English
Published: 2018
Subjects:
Online Access:http://psasir.upm.edu.my/id/eprint/79201/
http://psasir.upm.edu.my/id/eprint/79201/1/IPM%202019%203%20IR.pdf
_version_ 1848858617299599360
author Sani, Muhammad
author_facet Sani, Muhammad
author_sort Sani, Muhammad
building UPM Institutional Repository
collection Online Access
description The Influential Distance (ID) is proposed to identify multiple influential observations (IOs) in linear regression. However, the method not only considered good leverage observations (GLOs) as IOs, but also takes long computational running time with high rate of swamping and masking effects. Fast Improvised Influential Distance (FIID) is proposed to overcome these shortcomings. The results indicate that FIID successfully identified and classified GLOs and IOs with less computational running time, no masking effect and smaller rate of swamping. The presence of high leverage points (HLPs) and violation of the assumption of homoscedasticity are very common in analyzing data in linear and panel data regression models. To remedy this problems weighted least squares (WLS) based on FIID weighting method for Heteroscedasticity Consistent Covariance Matrix (HCCM) estimator is developed. The results obtained from simulation study and real data sets indicate that the proposed method is superior compared to the existing methods. The presence of outlying observations in a data set causes heteroscedasticity in a homoscedastic data set and vice versa. To know the type of outliers that are responsible for these irregularities is very important so that appropriate measure will be taken. To bridge the gap in the literature, we have successfully proposed robust White test to detect heteroscedasticity and identifies the types of outliers that causes and hide heteroscedasticity termed heteroscedasticity-enhancing and heteroscedasticityreducing observations (HEO and HRO), respectively. Furthermore, we proposed appropriate remedial measures for both HEO and HRO denoted by GM-FIID and ITSRWLS, respectively. The results of the simulation study show that the proposed methods are efficient and consistent than the existing methods. The panel data estimators for both fixed and random effect models becomes bias and causes inconsistency in variance-covariance matrix when there exist heteroscedasticity of unknown form and high leverage points in a data set. To date no research has been done to address this problem. To fill-in the gap in the literature we proposed a WLS estimation technique for both fixed and random effect model based on RHCCM estimator with FIID weighting method. The MM-Centering technique is employed instead of mean centering to reduce the effect of HLPs. The results of simulation study and real data sets indicate that weighted least squares based on FIID (WLSFIID) was found to be the best method. The classical Hausman pretest is used to choose between random and fixed effect panel data models. In the presence of heteroscedastic error variances and high leverage points (HLPs) or IOs in a data set, the right model may not be correctly identified. To the best of our knowledge no research has been done to address this issue. We proposed a robust Hausman pretest denoted as RHTFIID based on FIID and Robust Heteroscedasticity Consistent Covariance Matrix (RHCCM) estimator to remedy the problem. The results of simulation and real data set indicate that the proposed method was found to perform better than the conventional Hausman pretest.
first_indexed 2025-11-15T12:16:17Z
format Thesis
id upm-79201
institution Universiti Putra Malaysia
institution_category Local University
language English
last_indexed 2025-11-15T12:16:17Z
publishDate 2018
recordtype eprints
repository_type Digital Repository
spelling upm-792012020-06-30T03:40:07Z http://psasir.upm.edu.my/id/eprint/79201/ Robust diagnostic and parameter estimation for multiple linear and panel data regression models Sani, Muhammad The Influential Distance (ID) is proposed to identify multiple influential observations (IOs) in linear regression. However, the method not only considered good leverage observations (GLOs) as IOs, but also takes long computational running time with high rate of swamping and masking effects. Fast Improvised Influential Distance (FIID) is proposed to overcome these shortcomings. The results indicate that FIID successfully identified and classified GLOs and IOs with less computational running time, no masking effect and smaller rate of swamping. The presence of high leverage points (HLPs) and violation of the assumption of homoscedasticity are very common in analyzing data in linear and panel data regression models. To remedy this problems weighted least squares (WLS) based on FIID weighting method for Heteroscedasticity Consistent Covariance Matrix (HCCM) estimator is developed. The results obtained from simulation study and real data sets indicate that the proposed method is superior compared to the existing methods. The presence of outlying observations in a data set causes heteroscedasticity in a homoscedastic data set and vice versa. To know the type of outliers that are responsible for these irregularities is very important so that appropriate measure will be taken. To bridge the gap in the literature, we have successfully proposed robust White test to detect heteroscedasticity and identifies the types of outliers that causes and hide heteroscedasticity termed heteroscedasticity-enhancing and heteroscedasticityreducing observations (HEO and HRO), respectively. Furthermore, we proposed appropriate remedial measures for both HEO and HRO denoted by GM-FIID and ITSRWLS, respectively. The results of the simulation study show that the proposed methods are efficient and consistent than the existing methods. The panel data estimators for both fixed and random effect models becomes bias and causes inconsistency in variance-covariance matrix when there exist heteroscedasticity of unknown form and high leverage points in a data set. To date no research has been done to address this problem. To fill-in the gap in the literature we proposed a WLS estimation technique for both fixed and random effect model based on RHCCM estimator with FIID weighting method. The MM-Centering technique is employed instead of mean centering to reduce the effect of HLPs. The results of simulation study and real data sets indicate that weighted least squares based on FIID (WLSFIID) was found to be the best method. The classical Hausman pretest is used to choose between random and fixed effect panel data models. In the presence of heteroscedastic error variances and high leverage points (HLPs) or IOs in a data set, the right model may not be correctly identified. To the best of our knowledge no research has been done to address this issue. We proposed a robust Hausman pretest denoted as RHTFIID based on FIID and Robust Heteroscedasticity Consistent Covariance Matrix (RHCCM) estimator to remedy the problem. The results of simulation and real data set indicate that the proposed method was found to perform better than the conventional Hausman pretest. 2018-11 Thesis NonPeerReviewed text en http://psasir.upm.edu.my/id/eprint/79201/1/IPM%202019%203%20IR.pdf Sani, Muhammad (2018) Robust diagnostic and parameter estimation for multiple linear and panel data regression models. Doctoral thesis, Universiti Putra Malaysia. Robust statistics Regression analysis Linear models (Statistics)
spellingShingle Robust statistics
Regression analysis
Linear models (Statistics)
Sani, Muhammad
Robust diagnostic and parameter estimation for multiple linear and panel data regression models
title Robust diagnostic and parameter estimation for multiple linear and panel data regression models
title_full Robust diagnostic and parameter estimation for multiple linear and panel data regression models
title_fullStr Robust diagnostic and parameter estimation for multiple linear and panel data regression models
title_full_unstemmed Robust diagnostic and parameter estimation for multiple linear and panel data regression models
title_short Robust diagnostic and parameter estimation for multiple linear and panel data regression models
title_sort robust diagnostic and parameter estimation for multiple linear and panel data regression models
topic Robust statistics
Regression analysis
Linear models (Statistics)
url http://psasir.upm.edu.my/id/eprint/79201/
http://psasir.upm.edu.my/id/eprint/79201/1/IPM%202019%203%20IR.pdf