IDENTIFYING THE INFLUENCE OF TRANSFER LEARNING METHOD IN DEVELOPING AN END-TOEND AUTOMATIC SPEECH RECOGNITION SYSTEM WITH A LOW DATA LEVEL

Mamyrbayev, Orken; Alimhan, Keylan; Oralbekova, Dina; Bekarystankyzy, Akbayan; Zhumazhanov, Bagashar

DSpace Home
→
Научные статьи
→
01. Публикации в изданиях зарубежных стран
→
Business, Management and Accounting
→
View Item

dc.contributor.author	Mamyrbayev, Orken
dc.contributor.author	Alimhan, Keylan
dc.contributor.author	Oralbekova, Dina
dc.contributor.author	Bekarystankyzy, Akbayan
dc.contributor.author	Zhumazhanov, Bagashar
dc.date.accessioned	2024-09-20T12:21:12Z
dc.date.available	2024-09-20T12:21:12Z
dc.date.issued	2022
dc.identifier.citation	Mamyrbayev, O., Alimhan, K., Oralbekova, D., Bekarystankyzy, A., Zhumazhanov, B. (2022). Identifying the influence of transfer learning method in developing an end-to-end automatic speech recognition system with a low data level. Eastern-European Journal of Enterprise Technologies, 1 (9 (115)), 84–92. doi: https://doi.org/10.15587/ 1729-4061.2022.252801	ru
dc.identifier.issn	1729-3774
dc.identifier.uri	http://rep.enu.kz/handle/enu/16796
dc.description.abstract	Ensuring the best quality and performance of modern speech technologies, today, is possible based on the widespread use of machine learning methods. The idea of this project is to study and implement an end-to-end system of automatic speech recognition using machine learning methods, as well as to develop new mathematical models and algorithms for solving the problem of automatic speech recognition for agglutinative (Turkic) languages. Many research papers have shown that deep learning methods make it easier to train automatic speech recognition systems that use an end to end approach. This method can also train an automatic speech recognition system directly, that is, without manual work with raw signals. Despite the good recognition quality, this model has some drawbacks. These disadvantages are based on the need for a large amount of data for training. This is a serious problem for low-data languages, especially Turkic languages such as Kazakh and Azerbaijani. To solve this problem, various methods are needed to apply. Some methods are used for end-to-end speech recognition of languages belonging to the group of languages of the same family (agglutinative languages). Method for low-resource languages is transfer learning, and for large resources – multi-task learning. To increase efficiency and quickly solve the problem associated with a limited resource, transfer learning was used for the end-to-end model. The transfer learning method helped to fit a model trained on the Kazakh dataset to the Azerbaijani dataset. Thereby, two language corpora were trained simultaneously. Conducted experiments with two corpora show that transfer learning can reduce the symbol error rate, phoneme error rate (PER), by 14.23 % compared to baseline models (DNN+HMM, WaveNet, and CNC+LM). Therefore, the realized model with the transfer method can be used to recognize other lowresource languages.	ru
dc.description.sponsorship	This research has been funded by the Science Committee of the Ministry of Education and Science of the Republic of Kazakhstan (Grant No. AP09259309).	ru
dc.language.iso	en	ru
dc.publisher	Eastern-European Journal of Enterprise Technologies	ru
dc.relation.ispartofseries	Volume 1, Issue 9-115;Pages 84 - 92
dc.subject	Asr	ru
dc.subject	Attention	ru
dc.subject	Connectionist temporal classification	ru
dc.subject	End-toend	ru
dc.subject	Low-resource language	ru
dc.subject	Transfer learning	ru
dc.title	IDENTIFYING THE INFLUENCE OF TRANSFER LEARNING METHOD IN DEVELOPING AN END-TOEND AUTOMATIC SPEECH RECOGNITION SYSTEM WITH A LOW DATA LEVEL	ru
dc.type	Article	ru