DSpace Repository

Development of a Geographical Question-Answering System in the Kazakh Language

dc.contributor.author Mukanova, Assel
dc.contributor.author Barlybayev, Alibek
dc.contributor.author Nazyrova, Aizhan
dc.contributor.author Kussepova, Lyazzat
dc.contributor.author Matkarimov, Bakhyt
dc.contributor.author Abdikalyk, Gulnazym
dc.date.accessioned 2024-12-10T05:04:45Z
dc.date.available 2024-12-10T05:04:45Z
dc.date.issued 2017
dc.identifier.issn 2169-3536
dc.identifier.other doi:10.1109/ACCESS.2022.
dc.identifier.uri http://rep.enu.kz/handle/enu/19984
dc.description.abstract The study presents a detailed framework designed to develop a Question-Answering System (QA System) for the Kazakh language, highlighting its importance in the field of Low Resource Languages (LRL) Text Processing. This effort aims to fill the gap in resources for languages that lack substantial digital tools. Specifically, the project focuses on geographical questions about Kazakhstan, aiming to enhance accessibility and understanding of the nation's geography. The challenges associated with LRL text processing are addressed through the creation of a question-answer corpus, training a Bidirectional Encoder Representations from Transformers (BERT)-based model, and evaluating the system using Bilingual Evaluation Understudy (BLEU) metrics. The endeavor begins with the careful compilation of a corpus containing 50,000 questions, which supports the subsequent development phases and ensures the creation of a robust QA System. In the second phase, a BERT model equipped with 91,821,056 parameters is trained, enhancing the model’s ability to understand the complex linguistic nuances of the Kazakh language. The final phase involves a rigorous evaluation using BLEU metrics, where the system achieves an impressive average score of 0.9576. This score indicates a high level of agreement between the system-generated answers and the reference answers, demonstrating the system’s effectiveness at interpreting and responding to queries about Kazakh geography. This study significantly contributes to the field by providing a systematic and nuanced approach to QA System development and underscores the model’s effectiveness through thorough evaluation and comparative analysis. ru
dc.language.iso en ru
dc.publisher IEEE Access ru
dc.relation.ispartofseries VOLUME XX;
dc.subject Question Answering System ru
dc.subject Turkic languages ru
dc.subject Kazakh language ru
dc.subject Transformers ru
dc.subject BERT ru
dc.subject BLEU score ru
dc.title Development of a Geographical Question-Answering System in the Kazakh Language ru
dc.type Article ru
