A combinatory algorithm of univariate and multivariate gene selection

Microarray technology makes it possible to monitor the expression levels of a large number of genes simultaneously. Constructing a classifier from microarray data has emerged as an important problem for diseases such as cancer. The difficulty is that the number of samples is...

Bibliographic Details
Main Authors: Mahmoodian, Sayed Hamid, Marhaban, Mohammad Hamiruce, Abdul Rahim, Raha, Rosli, Rozita, Saripan, M. Iqbal
Format: Article
Language: English
Published: Asian Research Publication Network 2009
Online Access: http://psasir.upm.edu.my/id/eprint/12660/
http://psasir.upm.edu.my/id/eprint/12660/1/A%20combinatory%20algorithm%20of%20univariate%20and%20multivariate%20gene%20selection.pdf
_version_ 1848841895089799168
author Mahmoodian, Sayed Hamid
Marhaban, Mohammad Hamiruce
Abdul Rahim, Raha
Rosli, Rozita
Saripan, M. Iqbal
author_facet Mahmoodian, Sayed Hamid
Marhaban, Mohammad Hamiruce
Abdul Rahim, Raha
Rosli, Rozita
Saripan, M. Iqbal
author_sort Mahmoodian, Sayed Hamid
building UPM Institutional Repository
collection Online Access
description Microarray technology makes it possible to monitor the expression levels of a large number of genes simultaneously. Constructing a classifier from microarray data has emerged as an important problem for diseases such as cancer. The difficulty is that the number of samples is usually smaller than the number of genes, and the genes may interact with one another. Selecting a small number of significant genes is fundamental to analyzing the samples correctly. Gene selection is usually based on univariate or multivariate methods. Univariate methods cannot address interactions among multiple genes, a situation that calls for multivariate methods [1], [2]. In this paper, we consider new parameters derived from singular value decomposition and present a combined gene selection algorithm that integrates the univariate and multivariate approaches. To analyze the effect of the new parameters, we compare it with gene selection based on the correlation coefficient with binary output classes. The repeatability of the selected genes is evaluated by external 10-fold cross-validation, and SVM and PLR classifiers are used to classify two well-known cancer datasets (breast cancer and leukemia). We calculated the misclassification error on the training samples and on independent samples of both datasets. The results show that the mean misclassification error on the training samples over 100 iterations is almost equal for the two algorithms, but our algorithm classifies independent samples better.
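The comparison baseline and the evaluation protocol named in the description can be illustrated with a minimal sketch. It ranks genes by the absolute Pearson correlation between expression and 0/1 class labels, and repeats that selection inside every fold of a 10-fold cross-validation so the held-out samples never influence which genes are chosen. Everything here is an assumption made for illustration: the function names, the synthetic data from `toy_data`, and the nearest-centroid classifier, which stands in for the SVM and PLR classifiers used in the paper. The SVD-derived parameters are not specified in this record, so they are not sketched.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return 0.0  # a constant gene or a single-class label set carries no signal
    return sxy / math.sqrt(sxx * syy)


def rank_genes(X, y, k):
    """Indices of the k genes with the largest |correlation| to the 0/1 labels."""
    n_genes = len(X[0])
    scores = [abs(pearson([row[g] for row in X], y)) for g in range(n_genes)]
    return sorted(range(n_genes), key=lambda g: scores[g], reverse=True)[:k]


def nearest_centroid_predict(X_train, y_train, X_test, genes):
    """Stand-in classifier (not SVM or PLR): pick the class with the closer mean."""
    centroids = {}
    for label in set(y_train):
        rows = [x for x, t in zip(X_train, y_train) if t == label]
        centroids[label] = [sum(r[g] for r in rows) / len(rows) for g in genes]
    preds = []
    for x in X_test:
        dist = {lab: sum((x[g] - c) ** 2 for g, c in zip(genes, cen))
                for lab, cen in centroids.items()}
        preds.append(min(dist, key=dist.get))
    return preds


def external_cv_error(X, y, k_genes, n_folds=10, seed=0):
    """Misclassification rate with gene selection repeated inside every fold."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::n_folds] for i in range(n_folds)]
    errors = 0
    for test_idx in folds:
        held_out = set(test_idx)
        train_idx = [i for i in idx if i not in held_out]
        X_tr = [X[i] for i in train_idx]
        y_tr = [y[i] for i in train_idx]
        genes = rank_genes(X_tr, y_tr, k_genes)  # uses training samples only
        preds = nearest_centroid_predict(X_tr, y_tr, [X[i] for i in test_idx], genes)
        errors += sum(p != y[i] for p, i in zip(preds, test_idx))
    return errors / len(X)


def toy_data(n_samples=40, n_genes=200, seed=1):
    """Synthetic data (hypothetical): only gene 0 differs between the two classes."""
    rng = random.Random(seed)
    y = [i % 2 for i in range(n_samples)]
    X = [[rng.gauss(4.0 * label if g == 0 else 0.0, 1.0) for g in range(n_genes)]
         for label in y]
    return X, y
```

Calling `external_cv_error(*toy_data(), k_genes=5)` returns the fraction of held-out samples that were misclassified. Selecting genes once on the full dataset before splitting would let the test samples leak into the selection step and understate the error, which is why the ranking is recomputed per fold.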
first_indexed 2025-11-15T07:50:30Z
format Article
id upm-12660
institution Universiti Putra Malaysia
institution_category Local University
language English
last_indexed 2025-11-15T07:50:30Z
publishDate 2009
publisher Asian Research Publication Network
recordtype eprints
repository_type Digital Repository
spelling upm-12660 2017-01-03T09:54:41Z http://psasir.upm.edu.my/id/eprint/12660/ A combinatory algorithm of univariate and multivariate gene selection Mahmoodian, Sayed Hamid Marhaban, Mohammad Hamiruce Abdul Rahim, Raha Rosli, Rozita Saripan, M. Iqbal
Asian Research Publication Network 2009 Article PeerReviewed application/pdf en http://psasir.upm.edu.my/id/eprint/12660/1/A%20combinatory%20algorithm%20of%20univariate%20and%20multivariate%20gene%20selection.pdf Mahmoodian, Sayed Hamid and Marhaban, Mohammad Hamiruce and Abdul Rahim, Raha and Rosli, Rozita and Saripan, M. Iqbal (2009) A combinatory algorithm of univariate and multivariate gene selection. Journal of Theoretical and Applied Information Technology, 5 (2). pp. 113-118. ISSN 1992-8645; ESSN: 1817-3195 http://www.jatit.org/volumes/fifth_volume_2_2009.php
spellingShingle Mahmoodian, Sayed Hamid
Marhaban, Mohammad Hamiruce
Abdul Rahim, Raha
Rosli, Rozita
Saripan, M. Iqbal
A combinatory algorithm of univariate and multivariate gene selection
title A combinatory algorithm of univariate and multivariate gene selection
title_full A combinatory algorithm of univariate and multivariate gene selection
title_fullStr A combinatory algorithm of univariate and multivariate gene selection
title_full_unstemmed A combinatory algorithm of univariate and multivariate gene selection
title_short A combinatory algorithm of univariate and multivariate gene selection
title_sort combinatory algorithm of univariate and multivariate gene selection
url http://psasir.upm.edu.my/id/eprint/12660/
http://psasir.upm.edu.my/id/eprint/12660/1/A%20combinatory%20algorithm%20of%20univariate%20and%20multivariate%20gene%20selection.pdf