An Overview of Pre-processing Techniques in Web Usage Mining

Authors : J. Soonu Aravindan, Dr. K. Vivekanandan
Abstract -
The World Wide Web (WWW) is a huge repository of web pages and links, and an enormous amount of web data is generated every day. The websites that users access are recorded in web log files, which may contain noisy and ambiguous data that can affect the results of the mining process. It is therefore necessary to pre-process web data before extracting knowledge from web log files. Web Usage Mining is the area of data mining that deals with the discovery and analysis of usage patterns from web data in order to improve web-based applications. This paper focuses on the major steps of the data pre-processing stage in Web Usage Mining.

Keywords - Web Usage Mining, Data Pre-processing, Web Logs.