Detection of outliers in high-dimensional data using nu-support vector regression
Support Vector Regression (SVR) is gaining in popularity in the detection of outliers and classification problems in high-dimensional data (HDD) as this technique does not require the data to be of full rank. In real application, most of the data are of high dimensional. Classification of high-dimen...
| Main Authors: | , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Taylor and Francis
2021
|
| Online Access: | http://psasir.upm.edu.my/id/eprint/96639/ http://psasir.upm.edu.my/id/eprint/96639/1/ABSTRACT.pdf |
| _version_ | 1848862413905985536 |
|---|---|
| author | Mohammed Rashid, Abdullah Midi, Habshah Dhhan, Waleed Arasan, Jayanthi |
| author_facet | Mohammed Rashid, Abdullah Midi, Habshah Dhhan, Waleed Arasan, Jayanthi |
| author_sort | Mohammed Rashid, Abdullah |
| building | UPM Institutional Repository |
| collection | Online Access |
| description | Support Vector Regression (SVR) is gaining in popularity in the detection of outliers and classification problems in high-dimensional data (HDD) as this technique does not require the data to be of full rank. In real application, most of the data are of high dimensional. Classification of high-dimensional data is needed in applied sciences, in particular, as it is important to discriminate cancerous cells from non-cancerous cells. It is also imperative that outliers are identified before constructing a model on the relationship between the dependent and independent variables to avoid misleading interpretations about the fitting of a model. The standard SVR and the μ-ε-SVR are able to detect outliers; however, they are computationally expensive. The fixed parameters support vector regression (FP-ε-SVR) was put forward to remedy this issue. However, the FP-ε-SVR using ε-SVR is not very successful in identifying outliers. In this article, we propose an alternative method to detect outliers i.e. by employing nu-SVR. The merit of our proposed method is confirmed by three real examples and the Monte Carlo simulation. The results show that our proposed nu-SVR method is very successful in identifying outliers under a variety of situations, and with less computational running time. |
| first_indexed | 2025-11-15T13:16:38Z |
| format | Article |
| id | upm-96639 |
| institution | Universiti Putra Malaysia |
| institution_category | Local University |
| language | English |
| last_indexed | 2025-11-15T13:16:38Z |
| publishDate | 2021 |
| publisher | Taylor and Francis |
| recordtype | eprints |
| repository_type | Digital Repository |
| spelling | upm-966392023-01-11T07:07:42Z http://psasir.upm.edu.my/id/eprint/96639/ Detection of outliers in high-dimensional data using nu-support vector regression Mohammed Rashid, Abdullah Midi, Habshah Dhhan, Waleed Arasan, Jayanthi Support Vector Regression (SVR) is gaining in popularity in the detection of outliers and classification problems in high-dimensional data (HDD) as this technique does not require the data to be of full rank. In real application, most of the data are of high dimensional. Classification of high-dimensional data is needed in applied sciences, in particular, as it is important to discriminate cancerous cells from non-cancerous cells. It is also imperative that outliers are identified before constructing a model on the relationship between the dependent and independent variables to avoid misleading interpretations about the fitting of a model. The standard SVR and the μ-ε-SVR are able to detect outliers; however, they are computationally expensive. The fixed parameters support vector regression (FP-ε-SVR) was put forward to remedy this issue. However, the FP-ε-SVR using ε-SVR is not very successful in identifying outliers. In this article, we propose an alternative method to detect outliers i.e. by employing nu-SVR. The merit of our proposed method is confirmed by three real examples and the Monte Carlo simulation. The results show that our proposed nu-SVR method is very successful in identifying outliers under a variety of situations, and with less computational running time. Taylor and Francis 2021 Article PeerReviewed text en http://psasir.upm.edu.my/id/eprint/96639/1/ABSTRACT.pdf Mohammed Rashid, Abdullah and Midi, Habshah and Dhhan, Waleed and Arasan, Jayanthi (2021) Detection of outliers in high-dimensional data using nu-support vector regression. Journal of Applied Statistics, 49 (10). pp. 1-20. ISSN 0266-4763; ESSN: 1360-0532 https://www.tandfonline.com/doi/abs/10.1080/02664763.2021.1911965?journalCode=cjas20 10.1080/02664763.2021.1911965 |
| spellingShingle | Mohammed Rashid, Abdullah Midi, Habshah Dhhan, Waleed Arasan, Jayanthi Detection of outliers in high-dimensional data using nu-support vector regression |
| title | Detection of outliers in high-dimensional data using nu-support vector regression |
| title_full | Detection of outliers in high-dimensional data using nu-support vector regression |
| title_fullStr | Detection of outliers in high-dimensional data using nu-support vector regression |
| title_full_unstemmed | Detection of outliers in high-dimensional data using nu-support vector regression |
| title_short | Detection of outliers in high-dimensional data using nu-support vector regression |
| title_sort | detection of outliers in high-dimensional data using nu-support vector regression |
| url | http://psasir.upm.edu.my/id/eprint/96639/ http://psasir.upm.edu.my/id/eprint/96639/ http://psasir.upm.edu.my/id/eprint/96639/ http://psasir.upm.edu.my/id/eprint/96639/1/ABSTRACT.pdf |