REPOSITORY.ENU

Leveraging Data Lake Architecture for Predicting Academic Student Performance

Показать сокращенную информацию

dc.contributor.author Abdul Rahim, Shameen Aina
dc.contributor.author Sidi, Fatimah
dc.contributor.author Affendey, Lilly Suriani
dc.contributor.author Ishak, Iskandar
dc.contributor.author Yessirkep Nurlankyzy, Appak
dc.date.accessioned 2026-01-19T09:59:22Z
dc.date.available 2026-01-19T09:59:22Z
dc.date.issued 2024
dc.identifier.citation S. A. Abdul Rahim, F. Sidi, L. S. Affendey, I. Ishak, and A. Y. Nurlankyzy, “Leveraging Data Lake Architecture for Predicting Academic Student Performance”, Int. J. Adv. Sci. Eng. Inf. Technol., vol. 14, no. 6, pp. 2121–2129 ru
dc.identifier.issn 2088-5334
dc.identifier.other doi.org/10.18517/ijaseit.14.6.12408
dc.identifier.uri http://repository.enu.kz/handle/enu/29205
dc.description.abstract In today's rapidly evolving landscape of higher education, the effective management and analysis of academic data have become increasingly challenging, particularly in the context of the 3Vs of Big Data: volume, variety, and velocity. The amount of data produced by educational institutions has increased dramatically, including student records. This flood of data originates from various sources and takes several forms, such as learning management systems and student information systems. Hence, in education, data analytics and predictive modeling have become increasingly significant in acquiring insights into student performance, such as identifying at-risk students who are most likely to fail their courses. This study proposes a novel approach for predicting student academic performance, particularly identifying at-risk students, by leveraging a data lake architecture. The proposed methodology comprises the ingestion, transformation, and quality assessment of a combined data source from Universiti Putra Malaysia's Student Information System and learning management system within the data lake environment. With its parallel processing capabilities, this centralized data repository facilitates the training and evaluation of various machine learning models for prediction. In addition to forecasting the student performance, appropriate machine learning algorithms such as Support Vector Classifier, Naive Bayes, and Decision Trees are used to build prediction models by using the data lake's scalability and parallel processing capabilities. This study has laid a solid groundwork for using data architecture to improve students' performance. ru
dc.language.iso en ru
dc.publisher International Journal on Advanced Science, Engineering and Information Technology (IJASEIT) ru
dc.relation.ispartofseries Vol.14 (2024) No. 6;
dc.subject Data analytics ru
dc.subject predictive modeling ru
dc.subject student performance ru
dc.subject data lake ru
dc.subject machine learning algorithms ru
dc.title Leveraging Data Lake Architecture for Predicting Academic Student Performance ru
dc.type Article ru


Файлы в этом документе

Данный элемент включен в следующие коллекции

Показать сокращенную информацию

Поиск в DSpace


Просмотр

Моя учетная запись