Mohammadi, A and Saraee, M 2008, Dealing with missing values in microarray data , in: 4th IEEE International Conference on Emerging Technologies, 2008. ICET 2008, 18-19 Oct. 2008, Rawalpindi, Pakistan,.
- Published Version
Restricted to Repository staff only
Download (3MB) | Request a copy
Gene expression profiling plays an important role in a broad range of areas in biology. The raw gene expression data, may contain missing values. It is an important preprocessing step to accurately estimate missing values in microarray data, because complete datasets are required in numerous expression profile analysis. Numerous methods have been developed to deal with missing values. In this paper, a new and robust method based on fuzzy clustering and gene ontology is proposed to estimate missing values in microarray data. In the proposed method, missing values are imputed with values generated from cluster centers. To determine the similar genes in clustering process, we have utilized the biological knowledge obtained from gene ontology as well as gene expression values. We have applied the proposed method on yeast cell cycle data and yeast environmental stress data, with different percentage of missing entries. We compared the estimation accuracy of our method with some other methods. The experimental results indicate that the proposed method outperforms other methods in terms of accuracy.
|Item Type:||Conference or Workshop Item (Paper)|
|Themes:||Health and Wellbeing|
|Schools:||Schools > School of Computing, Science and Engineering > Salford Innovation Research Centre (SIRC)|
|Journal or Publication Title:||proceedings of 4th IEEE International Conference on Emerging Technologies, 2008. ICET 2008|
|Depositing User:||Dr Mo Saraee|
|Date Deposited:||03 Nov 2011 15:57|
|Last Modified:||29 Oct 2015 00:11|
Actions (login required)
|Edit record (repository staff only)|