An efficient method of identification of influential observaions in multiple linear regression and its application to real data

Influential observations (IOs) are those observations which either alone or together with several other observations have detrimental effect on the computed values of various estimates. As such, it is very important to detect their presence. Several methods have been proposed to identify IOs that in...

Full description

Bibliographic Details
Main Authors: Midi, Habshah, Hendi, Hasan Talib, Uraibi, Hassan, Arasan, Jayanthi, Ismaeel, Shelan Saied
Format: Article
Published: Universiti Kebangsaan Malaysia 2024
Online Access:http://psasir.upm.edu.my/id/eprint/108928/
_version_ 1848865239068573696
author Midi, Habshah
Hendi, Hasan Talib
Uraibi, Hassan
Arasan, Jayanthi
Ismaeel, Shelan Saied
author_facet Midi, Habshah
Hendi, Hasan Talib
Uraibi, Hassan
Arasan, Jayanthi
Ismaeel, Shelan Saied
author_sort Midi, Habshah
building UPM Institutional Repository
collection Online Access
description Influential observations (IOs) are those observations which either alone or together with several other observations have detrimental effect on the computed values of various estimates. As such, it is very important to detect their presence. Several methods have been proposed to identify IOs that include the fast improvised influential distance (FIID). The FIID method has been shown to be more efficient than some existing methods. Nonetheless, the shortcoming of the FIID method is that, it is computationally not stable, still suffers from masking and swamping effects, time consuming issues and not using proper cut-off point. As a solution to this problem, a new robust version of influential distance method (RFIID) which is based on Reweighted Fast Consistent and High Breakdown (RFCH) estimator is proposed. The results of real data and Monte Carlo simulation study indicate that the RFIID able to correctly separate the IOs from the rest of data with the least computational running times, least swamping effect and no masking effect compared to the other methods in this study.
first_indexed 2025-11-15T14:01:32Z
format Article
id upm-108928
institution Universiti Putra Malaysia
institution_category Local University
last_indexed 2025-11-15T14:01:32Z
publishDate 2024
publisher Universiti Kebangsaan Malaysia
recordtype eprints
repository_type Digital Repository
spelling upm-1089282024-05-16T14:04:39Z http://psasir.upm.edu.my/id/eprint/108928/ An efficient method of identification of influential observaions in multiple linear regression and its application to real data Midi, Habshah Hendi, Hasan Talib Uraibi, Hassan Arasan, Jayanthi Ismaeel, Shelan Saied Influential observations (IOs) are those observations which either alone or together with several other observations have detrimental effect on the computed values of various estimates. As such, it is very important to detect their presence. Several methods have been proposed to identify IOs that include the fast improvised influential distance (FIID). The FIID method has been shown to be more efficient than some existing methods. Nonetheless, the shortcoming of the FIID method is that, it is computationally not stable, still suffers from masking and swamping effects, time consuming issues and not using proper cut-off point. As a solution to this problem, a new robust version of influential distance method (RFIID) which is based on Reweighted Fast Consistent and High Breakdown (RFCH) estimator is proposed. The results of real data and Monte Carlo simulation study indicate that the RFIID able to correctly separate the IOs from the rest of data with the least computational running times, least swamping effect and no masking effect compared to the other methods in this study. Universiti Kebangsaan Malaysia 2024-06-20 Article PeerReviewed Midi, Habshah and Hendi, Hasan Talib and Uraibi, Hassan and Arasan, Jayanthi and Ismaeel, Shelan Saied (2024) An efficient method of identification of influential observaions in multiple linear regression and its application to real data. Sains Malaysiana, 52 (12). pp. 3589-3602. ISSN 0126-6039; ESSN: 2735-0118 https://www.ukm.my/jsm/pdf_files/SM-PDF-52-12-2023/19.pdf 10.17576/jsm-2023-5212-19
spellingShingle Midi, Habshah
Hendi, Hasan Talib
Uraibi, Hassan
Arasan, Jayanthi
Ismaeel, Shelan Saied
An efficient method of identification of influential observaions in multiple linear regression and its application to real data
title An efficient method of identification of influential observaions in multiple linear regression and its application to real data
title_full An efficient method of identification of influential observaions in multiple linear regression and its application to real data
title_fullStr An efficient method of identification of influential observaions in multiple linear regression and its application to real data
title_full_unstemmed An efficient method of identification of influential observaions in multiple linear regression and its application to real data
title_short An efficient method of identification of influential observaions in multiple linear regression and its application to real data
title_sort efficient method of identification of influential observaions in multiple linear regression and its application to real data
url http://psasir.upm.edu.my/id/eprint/108928/
http://psasir.upm.edu.my/id/eprint/108928/
http://psasir.upm.edu.my/id/eprint/108928/