Dynamic Grouping of Semantically Similar User Search Histories

International Journal of Computer Trends and Technology (IJCTT)          
© - December Issue 2013 by IJCTT Journal
Volume-6 Issue-2                           
Year of Publication : 2013
Authors :A.S.Saleem Basha , V.Trilik Kumar


A.S.Saleem Basha , V.Trilik Kumar"Dynamic Grouping of Semantically Similar User Search Histories"International Journal of Computer Trends and Technology (IJCTT),V6(2):72-78 December Issue 2013 .ISSN 2231-2803.www.ijcttjournal.org. Published by Seventh Sense Research Group.

Abstract:- -Users over Internet make queries continuously for various kinds of information. Such information might be about various tasks and that is done through existing search engines. When queries are made by users continuously, over a period of time, the queries are plenty. The existing search engines organize such queries only in chronological order. However, when the quires are grouped together based on the relevancy that might be very useful to users as they can reuse queries with ease. Hwang et al. studied this problem recently and proposed mechanisms that help in grouping or organizing user search histories in useful fashion. This organization of user search histories can have various real time utilities such as result ranking, query alternations, query suggestions, sessionization and collaborative search. In this paper we implement algorithms that are used to group user search histories. We built a web based prototype that demonstrates the proof of concept. The empirical results are encouraging.


[1] A. Broder, “A taxonomy of web search,” SIGIR Forum, vol. 36, no. 2, pp. 3–10, 2002.
[2] J. Teevan, E. Adar, R. Jones, and M. A. S. Potts, “Information reretrieval: repeat queries in yahoo’s logs,” in SIGIR. New York, NY, USA: ACM, 2007, pp. 151–158.
[3] Heasoo Hwang, Hady W. Lauw, LiseGetoor and AlexandrosNtoulas, “Organizing User Search Histories”, IEEE 2012 Transactions on Knowledge and Data Engineering, Volume: 24 , Issue: 5.
[4] R. Jones and K. L. Klinkner, “Beyond the session timeout: Automatic hierarchical segmentation of search topics in query logs,” in CIKM, 2008.
[5] P. Boldi, F. Bonchi, C. Castillo, D. Donato, A. Gionis, and S. Vigna, “The query-flow graph: Model and applications,” in CIKM, 2008.
[6] D. Beeferman and A. Berger, “Agglomerative clustering of a search engine query log,” in KDD, 2000.
[7] R. Baeza-Yates and A. Tiberi, “Extracting semantic relations from query logs,” in KDD, 2007.
[8] P. Anick, “Using terminological feedback for web search refinement: A log-based study,” in SIGIR, 2003.
[9] B. J. Jansen, A. Spink, C. Blakely, and S. Koshman, “Defining a session on Web search engines: Research articles,” Journal of the American Society for Information Science and Technology, vol. 58, no. 6, pp. 862–871, 2007.
[10] L. D. Catledge and J. E. Pitkow, “Characterizing browsing strategies in the World-Wide Web,” Computer Networks and ISDN Systems, vol. 27, no. 6, pp. 1065–1073, 1995.
[11] D. He, A. Goker, and D. J. Harper, “Combining evidence for automatic Web session identification,” Information Processing and Management, vol. 38, no. 5, pp. 727–742, 2002.
[12] R. Jones and F. Diaz, “Temporal profiles of queries,” ACM Trans- actions on Information Systems, vol. 25, no. 3, p. 14, 2007.
[13] A. L. Montgomery and C. Faloutsos, “Identifying Web browsing trends and patterns,” Computer, vol. 34, no. 7, pp. 94–95, 2001.
[14] C. Silverstein, H. Marais, M. Henzinger, and M. Moricz, “Analysis of a very large Web search engine query log,” SIGIR Forum, vol. 33, no. 1, pp. 6–12, 1999.
[15] H. C. Ozmutlu and F. C¸ avdur, “Application of automatic topic identification on Excite Web search engine data logs,” Information Processing and Management, vol. 41, no. 5, pp. 1243– 1262, 2005.
[16] T. Lau and E. Horvitz, “Patterns of search: Analyzing and modeling Web query refinement,” in UM, 1999.
[17] F. Radlinski and T. Joachims, “Query chains: Learning to rank from implicit feedback,” in KDD, 2005.
[18] J.-R. Wen, J.-Y.Nie, and H.-J. Zhang, “Query clustering using user logs,” ACM Transactions in Information Systems, vol. 20, no. 1, pp. 59–81, 2002.
[19] J. Yi and F. Maghoul, “Query clustering using click-through graph,” in WWW, 2009.
[20] E. Sadikov, J. Madhavan, L. Wang, and A. Halevy, “Clustering query refinements by user intent,” in WWW, 2010.
[21] N. Craswell and M. Szummer, “Random walks on the click graph,” in SIGIR, 2007.

Keywords:-Search engine, search history, click graph, query grouping.