Learning Data Analytics: Providing Actionable Insights to Increase College Student Success


The six-year higher-education graduation rate has been around 59% for over 15 years; less than half of college graduates finish within 4 years. This has high human, economic and societal costs. The National Research Council has identified a critical need to develop innovative approaches to improve student retention, graduation, and workforce-preparedness. The objective of this project is to develop new computational methods to analyze large and diverse types of education and learning data to help (a) discover successful academic pathways for students; (b) improve pedagogy for instructors; and (c) enhance student persistence and retention for institutions. The project outcomes are designed to help students select courses that fit their needs, capabilities, and learning styles, and are likely to lead to (faster) graduation; help instructors to better meet student needs; and give advisors and institutions the analytics needed to improve retention and persistence.

The proposed research will produce new dynamical system modeling, collaborative filtering, and multi-task learning methods. Modeling the evolution of a student's knowledge using a dynamical state-space system is a key innovation; the proposed research will develop novel collaborative system identification and collaborative Kalman filtering techniques for grade prediction. Technical innovations include supervised learning approaches for evolving datasets, such as linear and non-linear multi-task learning and collaborative multi-regression models with controlled grouping of the latent variables. These innovations will coalesce into three pilot applications: DegreePlanner for students, CourseInsights for instructors, and StudentWatch for academic advisors.

Funded by NSF Big Data Project. See Abstract for Award here


Investigators:

Ph.D. Students
Zhiyun Ren (CS)
Carrie Klein (Higher Ed)
Qian Hu (CS)
Li Zhang (CS)
Yujing Chen (CS)
Omaima Almatrafi (IST)
M.S. Students
Mackenzie Sweeney (CS)
B.S. Students
Thi Nguyen (CS)
Abigail Justen (CS)
Ameer Takeddin (CS)
Jason Ko (CS)
Michel Rouly (CS)

Products:

Tutorials:
  • Opportunities, Challenges and Methods for Higher Education Data Mining presented at SIAM International Conference in Data Mining (SDM) [Slides]
Software:
  • Next-Term Student Grade Predictor (NTSGP) [ Github ]
  • Data Mining Grader [ Github ]
Selected Publications:
  • Carrie Klein, Jaime Lester, Huzefa Rangwala, and Aditya Johri. Adoption of educational technology tools in higher education at the intersection of institutional commitment and individual trust. The Review of Higher Education, 2017. (In Press)
  • Zhiyun Ren, Xia Ning, and Huzefa Rangwala. Grade prediction with temporal course-wise influence. In Proceedings of the 2017 Educational Data Mining Conference, 2017 (In Press). Acceptance Rate: 25%.
  • Asmaa Elbadrawy, Agoritsa Polyzou, Zhiyun Ren, Mackenzie Sweeney, George Karypis, and Huzefa Rangwala. Predicting student performance using personalized analytics. Computer, 49(4):61-69, 2016
  • Mack Sweeney, Huzefa Rangwala, Jaime Lester, and Aditya Johri. Next-term student performance prediction: A recommender systems approach. Journal of Educational Data Mining, 2016
  • Omaima Almatrafi, Aditya Johri, Huzefa Rangwala, and Jaime Lester. Identifying course trajectories of high achieving engineering students through data analytics. Proceedings of the 2016 American Society for Engineering Education Conference, 2016
  • Zhiyun Ren, Huzefa Rangwala, and Aditya Johri. Predicting performance on mooc assessments using multi-regression models. Proceedings of the 2016 Educational Data Mining Conference, 2016
  • Mack Sweeney, Jaime Lester, and Huzefa Rangwala. Next-term student grade prediction. In Proceedings of the IEEE Conference on Big Data, pages 1-6. IEEE, 2015
  • Jean Michel Rouly, Huzefa Rangwala, and Aditya Johri. What are we teaching?: Automated evaluation of cs curricula content using topic modeling. In Proceedings of the eleventh annual International Conference on Computing Education Research, pages 189-197. ACM, 2015