REPOSITORY.ENU

Evaluating the effectiveness of machine learning methods for keyword coverage using semantic data analysis

Показать сокращенную информацию

dc.contributor.author Shaushenova, Anargul
dc.contributor.author Bayegizova, Aigulim
dc.contributor.author Baidrakhmanova, Gulnaz
dc.contributor.author Abuova, Zhanargul
dc.contributor.author Kassymova, Akmaral
dc.contributor.author Bakirova, Dana
dc.contributor.author Golenko, Yekaterina
dc.date.accessioned 2026-03-11T07:32:35Z
dc.date.available 2026-03-11T07:32:35Z
dc.date.issued 2025
dc.identifier.issn 2088-8708
dc.identifier.other DOI: 10.11591/ijece.v15i1.pp559-568
dc.identifier.uri http://repository.enu.kz/handle/enu/30097
dc.description.abstract This article presents a comprehensive comparative analysis of two advanced hybrid machine learning approaches for keyword extraction: bidirectional encoder representations from transformers (BERT) combined with autoencoder (AE) and term frequency-inverse document frequency (TF-IDF) combined with autoencoder. The research targets the task of semantic analysis in text data to evaluate the effectiveness of these methods in ensuring adequate keyword coverage across diverse text corpora. The study delves into the architecture and operational principles of each method, with a particular focus on the integration with autoencoders to enhance the semantic integrity and relevance of the extracted keywords. The experimental section provides a detailed performance analysis of both methods on various text datasets, highlighting how the structure and semantic richness of the source data influence the outcomes. The evaluation methodology includes precision, recall, and F1-score metrics. The paper discusses the advantages and disadvantages of each approach and their suitability for specific keyword extraction tasks. The findings offer valuable insights for the scientific community, aiding in the selection of the most appropriate text processing method for applications requiring deep semantic understanding and high accuracy in information extraction. ru
dc.language.iso en ru
dc.publisher International Journal of Electrical and Computer Engineering (IJECE) ru
dc.relation.ispartofseries Vol. 15, No. 1,;pp. 559~568
dc.subject Bidirectional encoder representations from transformers ru
dc.subject Hybrid methods ru
dc.subject Inverse document frequency ru
dc.subject Keyword extraction ru
dc.subject Semantic data analysis ru
dc.subject Term frequency ru
dc.title Evaluating the effectiveness of machine learning methods for keyword coverage using semantic data analysis ru
dc.type Article ru


Файлы в этом документе

Данный элемент включен в следующие коллекции

Показать сокращенную информацию

Поиск в DSpace


Просмотр

Моя учетная запись