International Journal of Computer
Trends and Technology

Research Article | Open Access | Download PDF

Volume 4 | Issue 2 | Year 2013 | Article Id. IJCTT-V4I2P122 | DOI : https://doi.org/10.14445/22312803/IJCTT-V4I2P122

Profile Based Search Engine


Amruta Mantri, Priyanka Nawale, Trupti Pardeshi, Rajeshwary Shisode, Reena Pagare

Citation :

Amruta Mantri, Priyanka Nawale, Trupti Pardeshi, Rajeshwary Shisode, Reena Pagare, "Profile Based Search Engine," International Journal of Computer Trends and Technology (IJCTT), vol. 4, no. 2, pp. 164-168, 2013. Crossref, https://doi.org/10.14445/22312803/IJCTT-V4I2P122

Abstract

Huge amount of information available on internet makes it difficult for the user to get the exact search results according to his preferences. In this paper, we attempt to solve this problem to certain extent by extending the NUTCH open source search engine using personalized information of user. The user’s information will be extracted from the social networking sites like Facebook. The search keywords given by user will be input to the NUTCH search engine. The results returned by NUTCH search engine will be further refined using our own Profile Biasing Algorithm.

Keywords

Search engine, NUTCH, Crawling, Indexing, Profile Biasing, Profile Aggregator.

References

[1] Tom White, January 10, 2006, “Introduction to NUTCH”, http://www.java.net/pub/a/today/2006/01/10/introduc tion-to-NUTCH-2.html
[2] Sebastian Nagel, November 1, 2012,” NutchTutorial”, http://wiki.apache.org/nutch/NutchTutorial
[3] Alessio Signorini, “A Survey of Ranking Algorithms”, University of Iowa, pages 1-19, September 11, 2005.
[4] Rebecca S. Wills, “Google’s PageRank: The Math Behind the Search Engine”, North Carolina State University, Raleigh, NC 27695, 
, May 1, 2006
[5] Alessio Signorini, “A Survey of Ranking Algorithms”, University of Iowa, pages 1-19, September 11, 2005. 
[6] john-battelle , September 10, 2003, “An Open Source Search Engine”, http://searchenginewatch.com/author/1765/johnbattelle. 
[7] Rebecca S. Wills, “Google’s PageRank: The Math Behind the Search Engine”, North Carolina State University, Raleigh, NC 27695,  rmwills@ncsu.edu, May 1, 2006. 
[8] Andrzej Białecki , "Nutch as a Web mining  platform: the present and future", https://wiki.apache.org/ nutch/Presentations 
[9] Ashutosh Kumar Singh, Ravi Kumar P, ”A Comparative Study of Page Ranking Algorithms for Information Retrieval”, International Journal of Electrical and Computer Engineering 4:7 2009