Quad-Tree Based Multiple Kernel Fuzzy C-Means Clustering for Gene Expression Data

International Journal of Computer Trends and Technology (IJCTT)          
© 2015 by IJCTT Journal
Volume-27 Number-3
Year of Publication : 2015
Authors : E. Monica Sushil Cynthia, S. Kannan


E. Monica Sushil Cynthia, S. Kannan "Quad-Tree Based Multiple Kernel Fuzzy C-Means Clustering for Gene Expression Data". International Journal of Computer Trends and Technology (IJCTT) V27(3):121-125, September 2015. ISSN:2231-2803. www.ijcttjournal.org. Published by Seventh Sense Research Group.

Abstract -
Minute variations in genes can have a major impact on how humans respond to disease; to environmental factors such as bacteria, viruses, toxins and chemicals; and to drugs and other therapies. Cluster analysis seeks to partition a given data set into groups based on specified features so that the data points within a group are more similar to each other than to the points in different groups. Clustering algorithms have proven useful for identifying biologically relevant groups of genes and samples. In this paper we propose a new clustering algorithm for gene expression data associated with three different types of cancer, and we compare it with existing approaches to show that the proposed approach offers better performance and reliability and yields more biologically meaningful results.
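
To make the partitioning idea concrete, the following is a minimal sketch of plain fuzzy C-means, the baseline that the proposed method extends. It is not the quad-tree or multiple-kernel algorithm of the paper: it assumes Euclidean distance, random initial memberships, and a fuzzifier m = 2. The function name `fuzzy_c_means`, its parameters, and the toy `profiles` values are all hypothetical and chosen only for illustration.

```python
import math
import random


def fuzzy_c_means(points, c, m=2.0, iters=100, tol=1e-6, seed=0):
    """Plain fuzzy C-means (Euclidean distance; no kernels, no quad tree)."""
    rng = random.Random(seed)
    n, dim = len(points), len(points[0])
    # Random initial memberships; each row is normalised to sum to 1.
    u = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(c)]
        total = sum(row)
        u.append([v / total for v in row])
    centers = []
    for _ in range(iters):
        # Centres: means of the points weighted by membership ** m.
        centers = []
        for j in range(c):
            w = [u[i][j] ** m for i in range(n)]
            sw = sum(w)
            centers.append(
                [sum(w[i] * points[i][d] for i in range(n)) / sw for d in range(dim)]
            )
        # Memberships: u_ij is proportional to d_ij ** (-2 / (m - 1)).
        shift = 0.0
        for i in range(n):
            dists = [max(math.dist(points[i], centers[j]), 1e-12) for j in range(c)]
            inv = [d ** (-2.0 / (m - 1.0)) for d in dists]
            total = sum(inv)
            new_row = [v / total for v in inv]
            shift = max(shift, max(abs(a - b) for a, b in zip(new_row, u[i])))
            u[i] = new_row
        if shift < tol:  # stop once memberships have stabilised
            break
    return centers, u


# Toy two-dimensional "expression profiles" (made-up values, two obvious groups).
profiles = [[0.1, 0.2], [0.0, 0.1], [0.2, 0.0], [5.0, 5.1], [5.2, 4.9], [4.9, 5.0]]
centers, memberships = fuzzy_c_means(profiles, c=2)
# Harden the fuzzy memberships by taking the cluster with the largest degree.
labels = [row.index(max(row)) for row in memberships]
```

Each row of `memberships` holds one profile's degree of belonging to every cluster and sums to 1, which is what distinguishes fuzzy clustering from a hard partition such as k-means.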


Keywords -
Clustering, Clustering Algorithms, Gene Expression analysis, Fuzzy C-Means, Hierarchical Clustering, Gene Clustering, Gene Expression data, Quad Tree, Kernel fuzzy C-means.