An Overview of Techniques Used for Extracting Keywords from Documents

International Journal of Computer Trends and Technology (IJCTT)
© July Issue 2013 by IJCTT Journal
Volume-4 Issue-7
Year of Publication: 2013
Authors: Menaka S, Radha N


Menaka S, Radha N, "An Overview of Techniques Used for Extracting Keywords from Documents", International Journal of Computer Trends and Technology (IJCTT), V4(7):2321-2325, July Issue 2013. ISSN Published by Seventh Sense Research Group.

Abstract: Keywords are a small set of important words in a document that give readers a high-level description of its content. They are useful for scanning large documents in a short time. Extracting keywords manually is a difficult and time-consuming process, so there is a need for a process that extracts keywords from documents automatically. Keyword extraction is the process of selecting a set of words that conveys the meaning of the whole document. This paper presents an overview of techniques used for keyword extraction.



Keywords: TF-IDF, Classification, Lexical chain, WordNet.
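
To make the first keyword above concrete, the following is a minimal sketch of TF-IDF-based keyword extraction. It is not taken from the paper. It assumes a plain regex tokenizer, raw term frequency normalized by document length, and the unsmoothed weighting idf = log(N / df). The function names `tokenize` and `tfidf_keywords` are hypothetical and chosen only for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def tfidf_keywords(documents, doc_index, top_k=3):
    """Return the top_k highest-scoring TF-IDF terms of one document.

    Assumed weighting (illustrative, not the paper's definition):
    tf = count / document length, idf = log(N / document frequency).
    """
    tokenized = [tokenize(doc) for doc in documents]
    n_docs = len(tokenized)

    # Document frequency: the number of documents that contain each term.
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    target = tokenized[doc_index]
    scores = {}
    for term, count in Counter(target).items():
        tf = count / len(target)
        idf = math.log(n_docs / doc_freq[term])
        scores[term] = tf * idf

    # Highest score first; ties are broken alphabetically for determinism.
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return [term for term, _ in ranked[:top_k]]
```

A term that occurs in every document of the collection gets idf = log(1) = 0, so it can never outrank a term that is specific to the target document. No stop-word list, stemming, or phrase detection is applied here; a practical system would typically add such preprocessing.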