Research Article | Open Access | Download PDF
Volume 4 | Issue 5 | Year 2013 | Article Id. IJCTT-V4I5P44 | DOI : https://doi.org/10.14445/22312803/IJCTT-V4I5P44
Cat Swarm based Optimization of Gene Expression Data Classification
Amit Kumar, Debahuti Mishra
Citation :
Amit Kumar, Debahuti Mishra, "Cat Swarm based Optimization of Gene Expression Data Classification," International Journal of Computer Trends and Technology (IJCTT), vol. 4, no. 5, pp. 1185-1190, 2013. Crossref, https://doi.org/10.14445/22312803/IJCTT-V4I5P44
Abstract
An Artificial Neural Network (ANN) does have the capability to provide solutions of various complex problems. The generalization ability of ANN due to the massively parallel processing capability can be utilized to learn the patterns discovered in the data set which can be represented in terms of a set of rules. This rule can be used to find the solution to a classification problem. The learning ability of the ANN is degraded due to the high dimensionality of the datasets. Hence, to minimize this risk we have used Principal Component Analysis (PCA) and Factor Analysis (FA) which provides a feature reduced dataset to the Multi Layer Perceptron (MLP), the classifier used. Again, since the weight matrices are randomly initialized, hence, in this paper we have used Cat Swarm Optimization (CSO) method to update the weight values of the weight matrix. From the experimental evaluation, it was found that using CSO with the MLP classifier provides better classification accuracy as compared to when the classifier is solely used.
Keywords
Classification, Artificial neural network, Multi-layer perceptron, Principal component analysis, Factor analysis, Cat swarm optimization.
References
[1] Shelly Gupta, Dharminder Kumar, Anand Sharma, Performance analysis of various data mining classification techniques on healthcare data, International Journal of Computer Science & Information Technology (IJCSIT 2011), Vol 3, pp .155-169.
[2] Peiman Mamani Barnaghi,Vahid Alizadeh Sahzabi,Azuraliza Abu Bakar, A Comparative Study for Various Methods of Classification, International Conference on Information and Computer Networks (ICICN 2012), vol. 27, pp. 62-66.
[3] Zuyi Wang1, Yue Wang, Jianhua Xuan, Yibin Dong, Marina Bakay, Yuanjian Feng3 Robert Clarke, Eric P. Hoffman, Optimized multilayer perceptrons for molecular classification and diagnosis using genomic data, Oxford University Press 2006, vol. 22, pp.755–761.
[4] John Paul T. Yusiong , Optimizing Artificial Neural Networks Using Cat Swarm Optimization Algorithm, I. J. Intelligent Systems and Applications, 2012; pp. 69-80.
[5] Jianwei Niu, Yiling He, Muyuan Li, Xin Zhang, Linghua Ran, Chuzhi Chao, Baoqin Zhang, A Comparative Study on Application of Data Mining Technique in Human Shape Clustering: Principal Component Analysis vs. Factor Analysis, 5th IEEE Conference on Industrial Electronics and Applications, 2010, pp 2014-2018.
[6] S. C. Chu, and P. W. Tsai, Computational intelligence based on the behaviour of cat, International journal of Innovative Computing, Information and Control, 3, vol. 1, pp.163-173, 2007.
[7] Ajay Kumar Tanwani, Jamal Afridi, M. Zubair Shafiq,Muddassar Farooq, Guidelines to Select Machine Learning Scheme for Classification of Biomedical Dataset, EvoBIO: Springer LNCS 2009, 5483, pp. 128–139.
[8] Ng Ee Ling, Yahya Abu Hasan, Classification On Microarray Data, Regional Conference on Mathematics, Statistics and Applications, 2006; pp.1-8.
[9] Guo-zheng Li , Hua-Long Bu, Mary Qu Yang, Xue-Qiang Zeng, Jack Y Yang, Selecting subsets of newly extracted features from PCA and PLS in microarray data analysis, IEEE 7th international conference on Bioinformatics and Bioengineering, 2007,pp.1-15.
[10]L. Ladha, T. Deepa, Feature selection methods and algorithms, International Journal on Computer Science and Engineering (IJCSE) 2011, vol. 3 no. 5, pp.1787-1797.
[11] Helyane Bronoski Borges, Júlio Cesar Nievola, Feature Selection as a Pre-processing Step for Classification in Gene Expression Data, Seventh International Conference on Intelligent Systems Design and Applications, 2007, pp.157-162.
[12] Onur Inan, Mustafa Serter Uzer, Nihat Yılmaz, A new hybrid feature selection method based on association rules and PCA for detection for breast cancer, International Journal of Innovative Computing, Information and Control, February 2013, vol. 9, N0. 2, pp.727-739. [13]B. Albert, A. Johnson, J. Lewis, M. Raff, K. Roberts, P.Watter, Molecular Biology of Cell, Garland Science”,2004.
[14] Luai Al Shalabi, Zyad Shaaban, Basel Kasasbeh, Data Mining: A Preprocessing Engine, Journal of Computer Science, 2006, vol. 2, No.9, pp. 735-739.
[15] Qi Shen, Zhen Mei, Bao-Xian Ye, Simultaneous genes and training samples selection by modified particle swarm optimization for gene expression data classification, Computers in Biology and Medicine, 2009, vol. 39, pp. 646—649.
[16] Pournamashi Parhi,Debahuti Mishra, Sashikala Mishra,Kailash Shaw, A novel PSO-FLANN framework for feature selection and classification of microarray data, International conference on modelling, optimization and computing (ICMOC-2012),vol.38, pp.1644-1649.
[17] Pushpalata Pujari, Guru Ghasi Das, Classification and comparative study of data mining classifiers with feature selection on biomedical dataset, Journal of Global Research in Computer Science, 2012 vol. 3, No. 5, pp. 39-45.
[18] http://archive.ics.uci.edu/ml/
[19] Rossi, A. L. D Carv, Bio-inspired parameter tuning of MLP network for gene expression data, Eighth international conference on Hybrid Intelligent systems, 2008. [20] Gilhan Kim, Yeonjoo Kim, Heuiseok Lim, Hyeoncheol Kim, An MLPbased feature subset selection for HIV-1 protease cleavage site analysis, Artificial Intelligence in Medicine, 2001, vol. 48, pp.83–89