Web Content Classification: A Survey

  IJCTT-book-cover
 
International Journal of Computer Trends and Technology (IJCTT)          
 
© 2014 by IJCTT Journal
Volume-10 Number-2                          
Year of Publication : 2014
Authors : Prabhjot Kaur
DOI :  10.14445/22312803/IJCTT-V10P117

MLA

Prabhjot Kaur."Web Content Classification: A Survey". International Journal of Computer Trends and Technology (IJCTT) V10(2):97-101, Apr 2014. ISSN:2231-2803. www.ijcttjournal.org. Published by Seventh Sense Research Group.

Abstract -
As the information contained within the web is increasing day by day, organizing this information could be a necessary requirement.The data mining process is to extract information from a data set and transform it into an understandable structure for further use. Classification of web page content is essential to many tasks in web information retrieval such as maintaining web directories and focused crawling.The uncontrolled type of nature of web content presents additional challenges to web page classification as compared to the traditional text classification ,but the interconnected nature of hypertext also provides features that can assist the process. In this paper the web classification is discussed in detail and its importance in field of data mining is explored.

References
[1]Esra Saraç, Selma Ay?e Özel, “Web Page Classification Using Firefly Optimization” IEEE International Symposium on Innovations in Intelligent Systems and Applications (INISTA), PP 1-5, 19-21 June 2013.
[2]Abdelhakim Herrouz, Chabane Khentout, Mahieddine Djoudi, “Overview of Web Content Mining Tools” The International Journal of Engineering And Science (IJES), Volume 2, Issue 6, 2013.
[3] Kira Radinsky, Eric Horvitz, “Mining the Web to Predict Future Events”, WSDM’13, February 4–8, 2012, Rome, Italy
[4] Monika Yadav, Mr. Pradeep Mittal, “Web Mining: An Introduction” International Journal of Advanced Research in Computer Science and Software Engineering, Volume 3, Issue 3, March 2013 .
[5] Xin-She Yang, Xingshi He, “Firefly algorithm: recent advances and applications” Int. J. Swarm Intelligence, Vol. 1, No. 1, 2013.
[6] Pikakshi Manchanda, Sonali Gupta, Komal Kumar Bhatia, “On The Automated Classification of Web Pages Using Artificial Neural Network“ IOSR Journal of Computer Engineering (IOSRJCE), Volume 4, Issue 1 (Sep-Oct. 2012), PP 20-25”.
[7] Basem O. Alijla, Lim Chee Peng, Ahamad Tajudin Khader, and Mohammed Azmi Al- Betar,” Intelligent Water Drops Algorithm for Rough Set Feature Selection”pp. 356–365, 2013. Springer-Verlag Berlin Heidelberg 2013
[8] Selma Ay?e Özel,” A Genetic Algorithm Based Optimal Feature Selection for Web Page Classification” Department of Computer Engineering, Çukurova University, 01330 Balcal?, Sar?çam, Adana, Türkiye This email address is being protected from spambots. You need JavaScript enabled to view it.
[8] Xiaoguang Qi, Brian D. Davison, “Web Page Classification: Features and Algorithms” http://www.cse.lehigh.edu/~xiq204/pubs/classification-survey/LU-CSE-07-010.pdf.
[9] Lim Wern Han and Saadat M. Alhashmi, “Joint Web-Feature (JFEAT): A Novel Web Page Classification Framework” IBIMA Publishing, Vol. 2010 (2010), Article ID 73408, 8 pages.
[10] K S Chandwani, “Clustering of Web Page Search Result using Web Content Mining Approaches” International Journal of Computer, Information Technology & Bioinformatics (IJCITB), Volume-1, Issue-2. [11] Daniele Riboni, “Feature Selection for Web Page Classification”.
[12] S. Sumathi and S.N. Sivanandam,” Introduction to Data Mining and its Applications” Studies in Computational Intelligence, Volume 29.
[13] Sunita Beniwal and Jitender Arora,” classification and Feature Selection Techniques in Data Mining”, International Journal of Engineering Research & Technology (IJERT), Vol. 1,Issue6.

Keywords
Data mining, Web page Classification, Feature Selection, Classification.