Department of Orthopaedic Surgery Publications (UMKC)The items in this collection are the scholarly output of the faculty, staff, and students of the Department of Orthopaedic Surgery..https://hdl.handle.net/10355/151412024-03-29T12:30:04Z2024-03-29T12:30:04ZGene selection for classification of microarray data based on the Bayes errorZhang, Ji-GangDeng, Hong-Wenhttps://hdl.handle.net/10355/150392019-05-16T14:38:19Z2007-10-03T00:00:00ZGene selection for classification of microarray data based on the Bayes error
Zhang, Ji-Gang; Deng, Hong-Wen
Abstract
Background
With DNA microarray data, selecting a compact subset of discriminative genes from thousands of genes is a critical step for accurate classification of phenotypes for, e.g., disease diagnosis. Several widely used gene selection methods often select top-ranked genes according to their individual discriminative power in classifying samples into distinct categories, without considering correlations among genes. A limitation of these gene selection methods is that they may result in gene sets with some redundancy and yield an unnecessary large number of candidate genes for classification analyses. Some latest studies show that incorporating gene to gene correlations into gene selection can remove redundant genes and improve classification accuracy.
Results
In this study, we propose a new method, Based Bayes error Filter (BBF), to select relevant genes and remove redundant genes in classification analyses of microarray data. The effectiveness and accuracy of this method is demonstrated through analyses of five publicly available microarray datasets. The results show that our gene selection method is capable of achieving better accuracies than previous studies, while being able to effectively select relevant genes, remove redundant genes and obtain efficient and small gene sets for sample classification purposes.
Conclusion
The proposed method can effectively identify a compact set of genes with high classification accuracy. This study also indicates that application of the Bayes error is a feasible and effective wayfor removing redundant genes in gene selection.
2007-10-03T00:00:00ZHAPSIMU: a genetic simulation platform for population-based association studiesZhang, FengLiu, JianfengChen, JieDeng, Hong-Wenhttps://hdl.handle.net/10355/150302019-05-16T14:38:28Z2008-08-05T00:00:00ZHAPSIMU: a genetic simulation platform for population-based association studies
Zhang, Feng; Liu, Jianfeng; Chen, Jie; Deng, Hong-Wen
Abstract
Background
Population structure is an important cause leading to inconsistent results in population-based association studies (PBAS) of human diseases. Various statistical methods have been proposed to reduce the negative impact of population structure on PBAS. Due to lack of structural information in real populations, it is difficult to evaluate the impact of population structure on PBAS in real populations.
Results
We developed a genetic simulation platform, HAPSIMU, based on real haplotype data from the HapMap ENCODE project. This platform can simulate heterogeneous populations with various known and controllable structures under the continuous migration model or the discrete model. Moreover, both qualitative and quantitative traits can be simulated using additive genetic model with various genetic parameters designated by users.
Conclusion
HAPSIMU provides a common genetic simulation platform to evaluate the impact of population structure on PBAS, and compare the relative performance of various population structure identification and PBAS methods.
2008-08-05T00:00:00Z