[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1570256.1570281acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
short-paper

Cancer classification using microarray and layered architecture genetic programming

Published: 08 July 2009 Publication History

Abstract

An important problem of cancer diagnosis and treatment is to distinguish tumors from malignant or benign. Classifying tumors correctly leads us to target specific therapies properly to maximizing efficiency and reducing toxicity. Through the microarray technology, it is possible that monitoring expression in cells for numerous of genes simultaneously. Therefore we are allowed to use potential information hidden in the gene expression data to build a more accurate and more reliable classification model on tumor samples. In this paper we intend to investigate a new approach for cancer classification using genetic programming and microarray gene expression profiles. The layered architecture genetic programming (LAGEP) is applied to build the classification model. Some typical cancer gene expression datasets are validated to demonstrate the classification accuracy of the proposed model.

References

[1]
T.R. Golub, D.K. Slonim, P. Tamayo, C. Huard, M. Gaasenbeek, J.P. Mesirov, H. Coller, M.L. Loh, J.R. Downing, and M.A. Caligiuri, et al., Molecular classification of cancer: class discovery and class prediction by gene expression monitoring, Science, 286(1999), 531--537
[2]
A. Dupuy and R.M. Simon, Critical review of published microarray studies for cancer outcome and guidelines on statistical analysis and reporting, Journal of National Cancer Institute, 99, 2 (2007), 147--57
[3]
J.W. Lee, J.B. Lee, M. Park and S.H. Song, An extensive comparison of recent classification tools applied to microarray data, Computational Statistics & Data Analysis, 48 (2005), 869--885
[4]
M.H. Asyali, D. Colak, O. Demirkaya and M.S. Inan, Gene expression profile classification: a review, Current Bioinformatics, 1 (2006), 55--73
[5]
Y. Lu and J. Han, Cancer classification using gene expression data, Information Systems, 28 (2003), 243--268
[6]
S. Dudoit, J. Fridlyand and P. Speed, Comparison of discrimination methods for classification of tumors using gene expression patterns, Journal of American Statistical Association, 97, 457 (2002), 77--87
[7]
L. Li, T.A. Darden, C.R. Weingberg, A.J. Levine and L.G. Pedersen, Gene assessment and sample classification for gene expression data using a genetic algorithm/k-nearest neighbor method, Combinational Chemistry and High Throughput Screening, 4, 8 (2001), 727--739
[8]
A. Statnikov, C.F. Aliferis, I. Tsamardinos, D. Hardin and S. Levy, A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis, Bioinformatics, 21, 5 (2005), 631--643
[9]
H. Zhang, C.Y. Yu, B. Singer and M. Xiong, Recursive partitioning for tumor classification with gene expression microarray data, in Proceedings of the National Academy of Sciences, 98, 12 (2001), 6730--6735
[10]
L. Breiman, J.H. Friedman, R. Olshen, and C.J. Stone, Classification and regression trees, Belmont, CA, Wadsworth, 1984
[11]
L. Breiman, Random forests, Machine Learning, 45 (2001), 5--32
[12]
A. Statnikov, L. Wang and C.F. Aliferis, A comprehensive comparison of random forests and support vector machines for microarray-based cancer classification, BMC Informatics, 9, 319, 2008.
[13]
B. Wu, T. Abbott, D. Fishman, W. McMurray, G. Mor, K. Stone, D. Ward, K. Williams and H. Zhao, Comparison of statistical methods for classification of ovarian cancer using mass spectrometry data, Bioinformatics, 19 (2003), 1636--1643
[14]
R. Diaz-Uriarte and S. Alvarez de Andres, Gene selection and classification of microarray data using random forest, BMC Bioinformatics, 7 (2006), 3
[15]
J. Gehrke, G. Ganti, R. Ramakrishnan, and W. Loh, Boat-optimistic decision tree construction, in Proceedings of the 1999 ACM SIGMOD, Philadephia, PA, 1999, 169--180
[16]
G. Natsoulis, L.E. Ghaoui, G.R.G. Lanckriet, A.M. Tolley, F. Leroy, S. Dunlea, B.P. Eynon, C. Pearson, S. Tugendreich, and K. Jarnagin, Classification of a large microarray data set: algorithm comparison and analysis of drug signatures, Genome Research, 15 (2005), 724--736
[17]
L. Zhu, B. Han, L. Li, S. Xu, H. Mou, and Z. Zheng, A novel two-stage cancer classification method for microarray data based on supervised manifold learning, in Proceedings on the 2nd international conference on Bioinformatics and biomedical Engineering (ICBBE), (16-18, May, 2008), 1908--1911
[18]
Y. Lee, and C. Lee, Classification of multiple cancer types by multicategory support vector machines using gene expression data, Bioinformatics, 19, 9 (2003), 1132--1139
[19]
J.J. Liu, G. Cutler, W. Li, Z. Pan, S. Peng, T. Hoey, L. Chen, and X.B. Ling, Multiclass cancer classification and biomarker discovery using GA-based algorithms, Bioinformatics, 21, 11 (2005), 2691--2697
[20]
J. Khan, J.S. Wei, M. Ringner, L.H. Saal, M. Lananyi, F. Westermann, F. erthold, M. Schwab and C.R. Antonescu, C. Peterson, and P.S. Meltzer, Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks, Nature Medicine, 7, 6 (June, 2001), 673--679
[21]
S.B. Cho and H.H. Won, Cancer classification using ensemble of neural networks with multiple significant gene subsets, Applied Intelligence, 26, 3 (2007), 243--250
[22]
X. Ruan, J. Wang, H. Li, and X. Li, A method for cancer classification using ensemble neural networks with gene expression profile, in Proceedings of the 2nd international conference on Bioinformatics and Biomedical Engineering (ICBBE), (May 16-18, 2008), 342--346,
[23]
A.C. Tan, D.Q. Naiman, L. Xu, R.L. Winshlow, and D. Geman, Simple decision rules for classifying human cancer from gene expression profiles, Bioinformatics, 21, 20 (2005), 3896--3904
[24]
Zne-Jung Lee, An integrated algorithm for gene selection and classification applied to microarray data of ovarian cancer, Artificial Intelligence in Medicine, 42 (2008), 81--93
[25]
E.G. Talbi, L. Jourdan, J. García-Nieto, and E. Alba, Comparison of population based metaheuristics for feature selection: Application to microarray data classification, in Proceedings of IEEE/ACS International Conference on Computer Systems and Applications (AICCSA 2008) (March 31-April 4, 2008), 45--52,
[26]
W.B. Langdon and B.F. Buxton, Genetic programming for mining DNA Chip data from cancer patients, Genetic Programming and Evolvable Machines, 5, 3 (2004), 251--257
[27]
L. Li, T.A. Darden, C.R. Weingberg, A.J. Levine and L.G. Pedersen, Gene assessment and sample classification for gene expression data using a genetic algorithm/k-nearest neighbor method, Combinational Chemistry and High Throughput Screening, 4, 8 (2001), 727--739
[28]
C.H. Ooi, and P. Tan, Genetic algorithms applied to multi-class prediction for the analysis of gene expression data, Bioinformatics, 19, 1 (2003), 37--44
[29]
S. Peng, Q. Xu, L.X. Bruce, X. Peng, W. Du, and L. Chen, Molecular classification of cancer types from microarray data using the combination of genetic algorithms and support vector machines, FEBS Letters, 555, 2 (2003), 358--362
[30]
J.R. Koza, Genetic Programming: On the Programming of Computers by Means of Natural Selection, Cambridge, MA: MIT Press, 1992
[31]
W. Banzhaf, P. Nordin, R.E. Keller, and F.D. Framcone, Genetic Programming: An Introduction on the Automatic Evolution of Computer Programs and Its Application, San Francisco, CA: Morgan Kaufmann, 1998
[32]
M. Brameier and W. Banzhaf, A comparison of linear genetic programming and neural networks in medical data mining, IEEE Trans. on Evolutionary Computation, 5, 1 (Feb., 2001), 17--26
[33]
F. Fernández, M. Tomassini and L. Vanneschi, An empirical study of multipopulation genetic programming, Genetic Programming and Evolvable Machines, 4 (May, 2003), 21--51
[34]
J.Y. Lin, H.R. Ke, B.C. Chien and W.P. Yang, Designing a Classifier by a Layered Multi-Population Genetic Programming Approach, Pattern Recognition, 40, 8 (Aug. 2007), 2211--2225
[35]
J.Y. Lin, H.R. Ke, B.C. Chien and W.P. Yang, Classifier Design with Feature Selection and Feature Extraction Using Layered Genetic Programming, Expert Systems with Applications, 34, 2 (2008), 1384--1393
[36]
U. Alon, N. Barkai, D.A. Notterman, and K. Gish, Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oli-gonucleotide arrays, Proc Natl Acad Sci USA, 96 (1999), 6745--6750

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
GECCO '09: Proceedings of the 11th Annual Conference Companion on Genetic and Evolutionary Computation Conference: Late Breaking Papers
July 2009
1760 pages
ISBN:9781605585055
DOI:10.1145/1570256

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 July 2009

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. cancer classification
  2. gene expression data
  3. genetic programming
  4. layered architecture genetic programming

Qualifiers

  • Short-paper

Conference

GECCO09
Sponsor:
GECCO09: Genetic and Evolutionary Computation Conference
July 8 - 12, 2009
Québec, Montreal, Canada

Acceptance Rates

Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 770
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 14 Dec 2024

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media