Movie Review Classification and Feature based Summarization of Movie Reviews

International Journal of Computer Trends and Technology (IJCTT)          
© - July Issue 2013 by IJCTT Journal
Volume-4 Issue-7                           
Year of Publication : 2013
Authors :Sabeeha Mohammed Basheer, Syed Farook


Sabeeha Mohammed Basheer, Syed Farook"Movie Review Classification and Feature based Summarization of Movie Reviews"International Journal of Computer Trends and Technology (IJCTT),V4(7):2335-2339 July Issue 2013 .ISSN Published by Seventh Sense Research Group.

Abstract:- Sentiment classification and feature based summarization are essential steps involved with the classification and summarization of movie reviews. The movie review classification is based on sentiment classification and condensed descriptions of movie reviews are generated from the feature based summarization. Experiments are conducted to identify the best machine learning based sentiment classification approach. Latent Semantic Analysis and Latent Dirichlet Allocation were compared to identify features which in turn affects the summary size. The focus of the system design is on classification accuracy and system response time.


1] David M. Blei, Andrew Y. Ng, and Michael I. Jordan. Latent dirichlet allocation. J. Mach. Learn.Res., 3: pp. 993–1022, March 2003.
[2] Sangno Lee, J. Baker, Jaeki Song, and J.C.Wetherbe. An empirical comparison of four text mining methods. In System Sciences (HICSS),2010 43rd Hawaii International Conference on, pp. 1 –10, jan. 2010.
[3] Chien-Liang Liu, Wen-Hoar Hsaio, Chia-Hoang Lee, Gen-Chi Lu, and E. Jou. Movie rating and review summarization in mobile environment. Systems, Man, and Cybernetics, Part C:Applications and Reviews, IEEE Transactions on,42(3): pp.397–407, May.
[4] M.A. Hearst, S.T. Dumais, E. Osman, J. Platt,and B. Scholkopf. Support vector machines. Intelligent Systems and their Applications, IEEE,13(4): pp. 18–28, Jul/Aug.
[5] Bo Pang, Lillian Lee, and Shivakumar Vaithyanathan.Thumbs up?:Sentiment classification using machine learning techniques.In Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10, EMNLP ’02, pp. 79–86, Stroudsburg, PA, USA, 2002. Association for Computational Linguistics.
[6] Susan T. Dumais. Latent semantic analysis. Annual Review of Information Science and Technology, 38(1):1pp. 88–230, 2004.
[7] Thomas Hofmann. Probabilistic latent semantic indexing. In Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR ’99,pp. 50–57, New York, NY, USA, 1999. ACM.
[8] Minqing Hu and Bing Liu. Mining and summarizing customer reviews. In Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, KDD ’04, pp. 168–177, New York, NY, USA, 2004. ACM.
[9] Mark Hall, Eibe Frank, Geoffrey Holmes, Bernhard Pfahringer, Peter Reutemann, and Ian H.Witten. The weka data mining software: an update. SIGKDD Explor. Newsl., 11(1): pp.10–18,November 2009.
[10] Andrew L. Maas and Raymond E. Daly and Peter T. Pham and Dan Huang and Andrew Y. and Christopher Potts. Learning Word Vectors for Sentiment Analysis. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies., pages 142–150, Portland, Oregon, USA, June 2011. ACL.

Keywords : — LSA, PLSA, LDA, Naive Bayes, Maximum Entropy, SVM