International Journal of Computer
Trends and Technology

Research Article | Open Access | Download PDF

Volume 4 | Issue 3 | Year 2013 | Article Id. IJCTT-V4I3P144 | DOI : https://doi.org/10.14445/22312803/IJCTT-V4I3P144

Neural Techniques for Improving the Classification Accuracy of Microarray Data Set using Rough Set Feature Selection Method


Bichitrananda Patra, Sujata Dash, B. K. Tripathy

Citation :

Bichitrananda Patra, Sujata Dash, B. K. Tripathy, "Neural Techniques for Improving the Classification Accuracy of Microarray Data Set using Rough Set Feature Selection Method," International Journal of Computer Trends and Technology (IJCTT), vol. 4, no. 3, pp. 424-429, 2013. Crossref, https://doi.org/10.14445/22312803/IJCTT-V4I3P144

Abstract

Classification, a data mining task is an effective method to classify the data in the process of Knowledge Data Discovery. Classification method algorithms are widely used in medical field to classify the medical data for diagnosis. Feature Selection increases the accuracy of the Classifier because it eliminates irrelevant attributes. This paper analyzes the performance of neural network classifiers with and without feature selection in terms of accuracy and efficiency to build a model on four different datasets. This paper provides rough feature selection scheme, and evaluates the relative performance of four different neural network classification procedures such as Learning Vector Quantisation (LVQ) - LVQ1, LVQ3, optimizedlearning-rate LVQ1 (OLVQ1), and The Self-Organizing Map (SOM) incorporating those methods. Experimental results show that the LVQ3 neural classification is an appropriate classification method makes it possible to construct high performance classification models for microarray data.  

Keywords

Data Mining, Rough, Feature Selection, Learning Vector Quantisation, Self-Organizing Map, Classification. 

References

[1] Nguyen, H., Nguyen, S., Some e_cient algorithms for rough set methods, IPMU, 1451-1456, 1996
[2] Lin, T.Y and Cercone, N., Applications of Rough Sets Theory and Data Mining, Kluwer Academic Publishers, 1997.
[3] Pawlak, Z., Rough Sets: Theoretical Aspects of Reasoning About Data, Kluwer Academic Publishers, 1991.
[4] Han, J., Hu, X., and Lin T. Y., Feature Subset Selection Based on Relative Dependency Between Attributes, 4th International Conference on Rough Sets and Current Trends in Com-puting, Lecture Notes in Computer Science 3066, pp. 176-185, Springer, 2004.
[5] Modrzejewski, M., Feature Selection Using Rough Sets Theory, European Conference on Machine Learning, pp.213-226, 1993.
[6] Quafafou, M. and Boussouf, M., Generalized Rough Sets Based Feature Selection, Intelligent Data Analysis, Vol. 4, pp. 3-17, 2000.
[7] Sever, H., Raghavan, V., and Johnsten, D. T., The Status of Research on Rough Sets for Knowledge Discovery in Databases, 2nd International Conference on Nonlinear Problems in Aviation and Aerospace, Vol. 2, pp. 673-680, 1998.
[8] 
Han, J., Hu, X., and Lin T. Y., A New Computation Model for Rough Set Theory Based on Database Systems, 5th International Conference on Data Warehousing and Knowledge Dis-covery, Lecture Notes in Computer Science 2737, pp. 381 C 390, 2003.
[9] Z. Pawlak, Rough Sets, Theoretical Aspects of Reasoning About Data. Dordrecht, The Netherlands: Kluwer, 1991. 
[10] Chouchoulas and Q. Shen. Rough set-aided keyword reduction for text categorization. Applied Artificial Intelligence, Vol. 15, No. 9, pp. 843{873. 2001. 
[11] R. Jensen and Q. Shen. Semantics-Preserving Dimensionality Reduction: Rough and Fuzzy-Rough Based Approaches. IEEE Transactions on Knowledge and Data Engineering, Vol. 16, No. 12, pp. 1457{1471. 2004. 
[12] The Learning Vector Quantization Program Package Version 3.1 (April 7, 1995) Prepared by the LVQ Programming Team of the Helsinki University of Technology Laboratory of Computer and Information Science Rakentajanaukio 2 C, SF-02150 Espoo FINLAND 
[13] Self Organizing Maps, Tom Germano, March 23, 1999. 
[14] Face Recognition Using Self-Organizing Map and Principal Component Analysis, Dinesh Kumar, C.S.Rai and Shakti Kumar, 2007. 
[15] Ian H. Witten and Eibe Frank. Data Mining: Practical Machine Learning Tools and Techniques. Morgan Kaufmann, San Francisco, 2 edition, 2005. 
[16] Ding, H.C. Peng, “Minimum Redundancy Feature Selection from Microarray Gene Expression Data”, Journal of Bioinformatics and Computational Biology, Vol.3, No.2 , Apr. 2003, pp.185-205. 
[17] Statnikov, C.F. Aliferis, I. Tsamardinos, D. Hardin, S. Levy, “A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis,” Bioinformatics, Vol. 21, 2005, No. 5, pp 631–643. 
[18] T. Kohonen, Self-Oganizing Maps, Springer, Berlin, 1995.
[19] Tan, A.C., Gilbert, D., Ensemble machine learning on gene expression data for cancer classification. Applied Bioinformatics, 2003,  2, S75S83 
[20] S. Dash, B. N. Patra, B. K.Tripathy, “A Hybrid Data Mining Technique for Improving the Classification Accuracy of Microarray Data Set”, International Journal of Information Engineering and Electronic Business(IJIEEB), Vol.4, No. April 2012, Page No.43-50. 
[21] T.V. Prasad, Maitrei Kohli, “Vector Quantization of Microarray Gene Expression Data”, Proceedings of the World Congress on Engineering 2010 Vol I WCE 2010, June 30 - July 2, 2010, London, U.K