Analysis of Academic Performance in Massive Open Online Courses (MOOCs) Using Process Mining

© 2020 by IJCTT Journal
Volume-68 Issue-12
Year of Publication : 2020
Authors : Mahesh T R, Dr.B Mohan Kumar Naik
DOI : 10.14445/22312803/IJCTT-V68I12P105

How to Cite?

Mahesh T R, Dr.B Mohan Kumar Naik, "Analysis of Academic Performance in Massive Open Online Courses (MOOCs) Using Process Mining," International Journal of Computer Trends and Technology, vol. 68, no. 12, pp. 21-25, 2020. Crossref, 10.14445/22312803/IJCTT-V68I12P105

This paper surveys academic performance in massive open online courses (MOOCs) with the aim of improving students' learning experience. Educational databases hold a large volume of student data: weekly evaluation grades or points; demographic variables such as age, ethnicity, and sex; and weekly interaction data drawn from event logs, such as video lecture interaction, assignment submission times, and the amount of time spent each week. This volume of data makes automated prediction of student performance an important task. The study compares four machine learning classification techniques, Naïve Bayes (NB), logistic regression (LR), random forest (RF), and K-nearest neighbor (KNN), to track students' performance week by week and to predict their overall performance. While MOOCs provide a versatile learning platform, they are prone to early dropouts and low completion rates. This research focuses on a data-driven approach to enhancing students' learning experience and reducing the dropout rate. Early forecasts based on individual students' engagement can help educators give proper support to students who are struggling in the course.
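The idea of turning weekly event-log data into features and predicting an outcome from them can be illustrated with a minimal sketch. The code below is not the authors' implementation: the event-log layout, the student IDs, the pass/fail labels, and the function names `weekly_features` and `knn_predict` are all hypothetical, and only one of the four techniques named above (K-nearest neighbor) is shown, written in plain Python so that it runs without external libraries.

```python
from collections import defaultdict
from math import sqrt

# Hypothetical event log: (student_id, week, event_type, minutes_spent).
EVENTS = [
    ("s1", 1, "video", 40), ("s1", 1, "assignment", 30),
    ("s2", 1, "video", 5),
    ("s3", 1, "video", 35), ("s3", 1, "assignment", 25),
    ("s4", 1, "video", 10),
    ("s5", 1, "video", 30), ("s5", 1, "assignment", 20),
]

# Hypothetical outcomes for students whose results are already known.
LABELS = {"s1": "pass", "s2": "fail", "s3": "pass", "s4": "fail"}

# Position of each event type in the feature vector (an assumed layout).
FEATURE_INDEX = {"video": 0, "assignment": 1}


def weekly_features(events, week):
    """Return {student: (video_minutes, assignment_minutes)} for one week."""
    totals = defaultdict(lambda: [0.0, 0.0])
    for student, event_week, kind, minutes in events:
        if event_week != week or kind not in FEATURE_INDEX:
            continue  # ignore other weeks and unknown event types
        totals[student][FEATURE_INDEX[kind]] += minutes
    return {student: tuple(values) for student, values in totals.items()}


def knn_predict(train, labels, query, k=3):
    """Majority vote among the k labelled students closest to `query`."""
    def distance(a, b):
        return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

    nearest = sorted(train, key=lambda s: distance(train[s], query))[:k]
    votes = defaultdict(int)
    for student in nearest:
        votes[labels[student]] += 1
    return max(votes, key=votes.get)


if __name__ == "__main__":
    features = weekly_features(EVENTS, week=1)
    train = {s: features[s] for s in LABELS}
    print(knn_predict(train, LABELS, features["s5"], k=3))
```

Here student `s5` has no known outcome, so the prediction comes from the three labelled students with the most similar week-1 activity. A real study would use many more features, normalise them, and evaluate each classifier on held-out data.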

Prediction, MOOCs, Machine learning, Learning analytics, Process mining, Education data mining
