International Journal of Computer
Trends and Technology

Research Article | Open Access | Download PDF

Volume 27 | Number 1 | Year 2015 | Article Id. IJCTT-V27P121 | DOI : https://doi.org/10.14445/22312803/IJCTT-V27P121

Quad-Tree Based Multiple Kernel Fuzzy C-Means Clustering for Gene Expression Data


E. Monica Sushil Cynthia, S. Kannan

Citation :

E. Monica Sushil Cynthia, S. Kannan, "Quad-Tree Based Multiple Kernel Fuzzy C-Means Clustering for Gene Expression Data," International Journal of Computer Trends and Technology (IJCTT), vol. 27, no. 1, pp. 121-125, 2015. Crossref, https://doi.org/10.14445/22312803/IJCTT-V27P121

Abstract

Minute variations in genes can have a major impact on how humans respond to disease, environmental factors such as bacteria, viruses, toxins, chemicals and drugs and other therapies.. Cluster analysis seeks to partition a given data set into groups based on specified features so that the data points within a group are more similar to each other than the points in different groups. The clustering algorithms have been proven useful for identifying biologically relevant groups of genes and samples. Hence in this paper we propose a new clustering algorithm for gene expression data associated to three different types of cancer and also compare with the existing approaches to prove the novel approach proposed here, has a better performance, reliability and provide more meaningful biological significance.

Keywords

Clustering, Clustering Algorithms, Gene Expression analysis, Fuzzy C-Means, Hierarchical Clustering, Gene Clustering, Gene Expression data, Quad Tree, Kernel fuzzy C-means.

References

[1]. Duda , R. O. & Hart, P. E.(1973), Pattern classification and scene analysis, Wiley, New York.
[2]. Everitt, B.S. Cluster Analysis. 1993. Third Edition. (New York and Toronto: Halsted Press, of John Wiley & Sons Inc.).
[3]. M Telgarsky, A Vattani Hartigan`s Method: k-means Clustering without Voronoi.
[4]. Mirkin CA, Letsinger RL, Mucic RC, Storhoff JJ. A DNAbased method for rationally assembling nanoparticles into macroscopic materials.
[5]. Jiang, D., Tang, C. and Zhang, A.(2004) „Cluster analysis for gene expression data: a survey?, IEEE Transaction on Knowledge and Data Engineering, Vol. 16, No. 11, pp.1370- 1386.
[6]. Selva Kumar, S. and Hannah Inbarani, H. (2013) „Analysis of mixed C-means clustering approach for brain tumor gene expression data?, Int, J. Data Analysis Techniques and Strategies, Vol. 5, No. 2, pp.214-228.
[7]. J. C. Dunn A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters.
[8]. Dezdek, J. C., 1981, Pattern Recognition with fuzzy objective function algorithms, Plenum press, Newyork, NY.
[9]. Tomida S, et al. (2002) Analysis of expression profile using fuzzy adaptive resonance theory. Bioinformatics 18(8):1073- 83
[10]. Bishnu PS, Bhattacherjee V (2012) Software fault prediction using quad tree-based K-means clustering algorithm. IEEE Trans Knowl Data Eng 24(6):1146–1150
[11]. H.-C. Huang, Y.-Y. Chuang, and C.-S. Chen, “Multiple kernel fuzzy clustering,” Fuzzy Systems, IEEE Transactions on, vol. 20, no. 1, pp. 120 –134, feb. 2012.
[12]. X.L. Xie and G. Beni. A validity measure for fuzzy clustering. IEEE transactions on Pattern Analysis and Machine Intelligence, 13(4):841-846, 1991.
[13]. D. L. Davies and D. W. Bouldin. A cluster separation measures IEEE Transactions and Pattern Analysis and Machine Intelligence, PAMI-1, no. 2:222-227,1979.
[14]. Sitansu Mohanty, Kaberi Das, Debahuti Mishra, Ruchi Ranjan"Cluster Validity Indices for Gene Expression Data"International Journal of Computer Trends and Technology (IJCTT),V4(5):1465-1470 May 2013.ISSN 2231-2803.
[15]. D.Vanisri, Dr.C.Loganathan An Efficient Fuzzy Possibilistic C-Means with Penalized and Compensated Constraints