<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xml:lang="ru"><front><journal-meta><journal-id journal-id-type="publisher-id">tuzsut</journal-id><journal-title-group><journal-title xml:lang="ru">Труды учебных заведений связи</journal-title><trans-title-group xml:lang="en"><trans-title>Proceedings of Telecommunication Universities</trans-title></trans-title-group></journal-title-group><issn pub-type="ppub">1813-324X</issn><issn pub-type="epub">2712-8830</issn><publisher><publisher-name>СПбГУТ</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.31854/1813-324X-2023-9-5-35-42</article-id><article-id custom-type="elpub" pub-id-type="custom">tuzsut-513</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research Article</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="ru"><subject>ЭЛЕКТРОНИКА, ФОТОНИКА, ПРИБОРОСТРОЕНИЕ И СВЯЗЬ</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="en"><subject>ELECTRONICS, PHOTONICS, INSTRUMENTATION AND COMMUNICATIONS</subject></subj-group></article-categories><title-group><article-title>Разработка и исследование системы автоматического распознавания цифр йеменского диалекта арабской речи с использованием нейронных сетей</article-title><trans-title-group xml:lang="en"><trans-title>Development and Research of a System for Automatic Recognition of the Digits Yemeni Dialect of Arabic Speech Using Neural Networks</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author" corresp="yes"><contrib-id contrib-id-type="orcid">https://orcid.org/0009-0006-1723-2782</contrib-id><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Радан</surname><given-names>Н.Х.А.</given-names></name><name name-style="western" xml:lang="en"><surname>Radan</surname><given-names>N.H.</given-names></name></name-alternatives><bio xml:lang="ru"><p>аспирант кафедры информационных систем Тверского государственного технического университета</p></bio><email xlink:type="simple">naeem.radan@gmail.com</email><xref ref-type="aff" rid="aff-1"/></contrib><contrib contrib-type="author" corresp="yes"><contrib-id contrib-id-type="orcid">https://orcid.org/0000-0003-1119-2610</contrib-id><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Сидоров</surname><given-names>К. В.</given-names></name><name name-style="western" xml:lang="en"><surname>Sidorov</surname><given-names>K.</given-names></name></name-alternatives><bio xml:lang="ru"><p>кандидат технических наук, доцент, доцент кафедры автоматизации технологических процессов Тверского государственного технического университета</p></bio><email xlink:type="simple">bmisidorov@mail.ru</email><xref ref-type="aff" rid="aff-1"/></contrib></contrib-group><aff-alternatives id="aff-1"><aff xml:lang="ru">Тверской государственный технический университет<country>Россия</country></aff><aff xml:lang="en">Tver State Technical University<country>Russian Federation</country></aff></aff-alternatives><pub-date pub-type="collection"><year>2023</year></pub-date><pub-date pub-type="epub"><day>14</day><month>11</month><year>2023</year></pub-date><volume>9</volume><issue>5</issue><fpage>35</fpage><lpage>42</lpage><permissions><copyright-statement>Copyright &amp;#x00A9; Радан Н., Сидоров К.В., 2023</copyright-statement><copyright-year>2023</copyright-year><copyright-holder xml:lang="ru">Радан Н., Сидоров К.В.</copyright-holder><copyright-holder xml:lang="en">Radan N., Sidorov K.</copyright-holder><license license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>This work is licensed under a Creative Commons Attribution 4.0 License.</license-p></license></permissions><self-uri xlink:href="https://tuzs.sut.ru/jour/article/view/513">https://tuzs.sut.ru/jour/article/view/513</self-uri><abstract><p>В статье описаны результаты исследований по разработке и тестированию системы автоматического распознавания речи (САРР) на арабских цифрах с помощью искусственных нейронных сетей. Для проведения исследований использовались звукозаписи (речевые сигналы) арабского йеменского диалекта, записанные в Республике Йемен. САРР представляет собой изолированную систему распознавания целых слов, она реализована в двух режимах: «дикторозависимая система» (дикторы при обучении и тестировании системы используются одни и те же) и «дикторонезависимая система» (дикторы, используемые для обучения системы, отличаются от тех, которые применяются для ее тестирования). В процессе распознавания речевой сигнал очищается от шумов с помощью фильтров, далее сигнал предварительно локализуется, обрабатывается и анализируется окном Хэмминга (применяется алгоритм временного выравнивания для компенсации различий в произношении). Информативные признаки извлекаются из речевого сигнала с использованием мел-частотных кепстральных коэффициентов. Разработанная САРР обеспечивает высокую точность распознавания арабских цифр йеменского диалекта – 96,2 % (для дикторозависимой системы) и 98,8 % (для дикторонезависимой системы).</p></abstract><trans-abstract xml:lang="en"><p>The article describes the results of research on the development and testing of an automatic speech recognition system (SAR) in Arabic numerals using artificial neural networks. Sound recordings (speech signals) of the Arabic Yemeni dialect recorded in the Republic of Yemen were used for the research. SAR is an isolated system of recognition of whole words, it is implemented in two modes: "speaker-dependent system" (the same speakers are used for training and testing the system) and "speaker-independent system" (the speakers used for training the system differ from those used for testing it). In the process of speech recognition, the speech signal is cleared of noise using filters, then the signal is pre-localized, processed and analyzed by the Hamming window (a time alignment algorithm is used to compensate for differences in pronunciation). Informative features are extracted from the speech signal using mel-frequency cepstral coefficients. The developed SAR provides high accuracy of the recognition of Arabic numerals of the Yemeni dialect – 96.2 % (for a speaker-dependent system) and 98.8 % (for a speaker-independent system).</p></trans-abstract><kwd-group xml:lang="ru"><kwd>нейронные сети</kwd><kwd>распознавание речи</kwd><kwd>йеменский диалект</kwd></kwd-group><kwd-group xml:lang="en"><kwd>neural networks</kwd><kwd>speech recognition</kwd><kwd>Yemeni dialect</kwd></kwd-group></article-meta></front><back><ref-list><title>References</title><ref id="cit1"><label>1</label><citation-alternatives><mixed-citation xml:lang="ru">Al-Zabibi M. An acoustic-phonetic approach in automatic Arabic speech recognition. Loughborough University. Doctoral Thesis. 1990. URL: https://hdl.handle.net/2134/6949 (Accessed 02.10.2023)</mixed-citation><mixed-citation xml:lang="en">Al-Zabibi M. An acoustic-phonetic approach in automatic Arabic speech recognition. Loughborough University. Doctoral Thesis. 1990. URL: https://hdl.handle.net/2134/6949 [Accessed 02.10.2023]</mixed-citation></citation-alternatives></ref><ref id="cit2"><label>2</label><citation-alternatives><mixed-citation xml:lang="ru">Alkhouli M. Alaswaat Alaghawaiyah // Daar Alfalah, Jordan. 1990 (in Arabic)</mixed-citation><mixed-citation xml:lang="en">Alkhouli M. Alaswaat Alaghawaiyah. Daar Alfalah, Jordan. 1990 (in Arabic)</mixed-citation></citation-alternatives></ref><ref id="cit3"><label>3</label><citation-alternatives><mixed-citation xml:lang="ru">Deller J., Hansen J., Proakis J. Discrete-Time Processing of Speech Signal. 1993. DOI:10.1109/9780470544402</mixed-citation><mixed-citation xml:lang="en">Deller J., Hansen J., Proakis J. Discrete-Time Processing of Speech Signal. 1993. DOI:10.1109/9780470544402</mixed-citation></citation-alternatives></ref><ref id="cit4"><label>4</label><citation-alternatives><mixed-citation xml:lang="ru">Elshafei M. Toward an Arabic Text-to-Speech System // The Arabian Journal for Scince and Engineering. 1991. Vol. 16. Iss. 4B. PP. 565‒583.</mixed-citation><mixed-citation xml:lang="en">Elshafei M. Toward an Arabic Text-to-Speech System. The Arabian Journal for Science and Engineering. 1991;16(4B):565‒583.</mixed-citation></citation-alternatives></ref><ref id="cit5"><label>5</label><citation-alternatives><mixed-citation xml:lang="ru">Hagos E. Implementation of an Isolated Word Recognition System. M.Sc. Thesis. King Fahd University of Petroleum &amp; Minerals Dhahran, Saudi Arabia. 1985.</mixed-citation><mixed-citation xml:lang="en">Hagos E. Implementation of an Isolated Word Recognition System. M.Sc. Thesis. King Fahd University of Petroleum &amp; Minerals Dhahran, Saudi Arabia. 1985.</mixed-citation></citation-alternatives></ref><ref id="cit6"><label>6</label><citation-alternatives><mixed-citation xml:lang="ru">Abdulla W.H., Abdul-Karim M.A.H. Real-time spoken Arabic digit recognizer // International Journal of Electronics. 1985. Vol. 59. Iss. 5. PP. 645–648. DOI:10.1080/00207218508920741</mixed-citation><mixed-citation xml:lang="en">Abdulla W.H., Abdul-Karim M.A.H. Real-time spoken Arabic digit recognizer. International Journal of Electronics. 1985; 59(5):645–648. DOI:10.1080/00207218508920741</mixed-citation></citation-alternatives></ref><ref id="cit7"><label>7</label><citation-alternatives><mixed-citation xml:lang="ru">Alotaibi Y.A. Investigating spoken Arabic digits in speech recognition setting. Information Sciences. 2005. Vol. 173. Iss. 1-3. PP. 115–139. DOI:10.1016/j.ins.2004.07.008</mixed-citation><mixed-citation xml:lang="en">Alotaibi Y.A. Investigating spoken Arabic digits in speech recognition setting. Information Sciences. 2005;173(1-3):115–139. DOI:10.1016/j.ins.2004.07.008</mixed-citation></citation-alternatives></ref><ref id="cit8"><label>8</label><citation-alternatives><mixed-citation xml:lang="ru">Alotaibi Y.A. High performance Arabic digits recognizer using neural networks // Proceedings of the International Joint Conference on Neural Networks (Portland, USA, 20‒24 July 2003). IEEE, 2003. DOI:10.1109/ijcnn.2003.1223444</mixed-citation><mixed-citation xml:lang="en">Alotaibi Y.A. High performance Arabic digits recognizer using neural networks. Proceedings of the International Joint Conference on Neural Networks, 20‒24 July 2003, Portland, USA. IEEE; 2003. DOI:10.1109/ijcnn.2003.1223444</mixed-citation></citation-alternatives></ref><ref id="cit9"><label>9</label><citation-alternatives><mixed-citation xml:lang="ru">Alotaibi Y.A. Analyzing Arabic digit recognizer errors using spectrograms // Proceedings 7th International Conference on Signal Processing (ICSP, Beijing, China, 31 August 2004 ‒ 04 September 2004). IEEE, 2004. DOI:10.1109/icosp.2004.1452746</mixed-citation><mixed-citation xml:lang="en">Alotaibi Y.A. Analyzing Arabic digit recognizer errors using spectrograms // Proceedings 7th International Conference on Signal Processing, ICSP, 31 August 2004 ‒ 04 September 2004, Beijing, China. IEEE; 2004. DOI:10.1109/icosp.2004.1452746</mixed-citation></citation-alternatives></ref><ref id="cit10"><label>10</label><citation-alternatives><mixed-citation xml:lang="ru">Hassine M., Boussaid L., Massaoud H. Tunisian Dialect Recognition Based on Hybrid Techniques // International Arab Journal of Information Technology. 2018. Vol. 15. No. 1. PP. 58–65.</mixed-citation><mixed-citation xml:lang="en">Hassine M., Boussaid L., Massaoud H. Tunisian Dialect Recognition Based on Hybrid Techniques. International Arab Journal of Information Technology. 2018;15(1):58–65.</mixed-citation></citation-alternatives></ref><ref id="cit11"><label>11</label><citation-alternatives><mixed-citation xml:lang="ru">Аль-Дайбани А.М.С. Исследование методов и разработка алгоритмов обработки сигналов для систем автоматического распознавания телефонной речи в республике Йемен. Дис. … канд. техн. наук. Владимир: Владимирский государственный университет имени Александра Григорьевича и Николая Григорьевича Столетовых, 2019. 150 с.</mixed-citation><mixed-citation xml:lang="en">Al-Daibani A.M.S. Research of methods and development of algorithms of signal processing for systems of automatic recognition of telephone speech in the Republic of Yemen. PhD Thesis. Vladimir: Vladimir State University named after Alex-ander Grigorievich and Nikolai Grigorievich Stoletov Publ.; 2019. 150 p.</mixed-citation></citation-alternatives></ref><ref id="cit12"><label>12</label><citation-alternatives><mixed-citation xml:lang="ru">Радан Н.Х. Системы автоматического распознавания арабской речи и йеменского диалекта // Научно-аналитический журнал «Вестник Санкт-Петербургского университета государственной противопожарной службы МЧС России». 2023. № 2. С. 194–212.</mixed-citation><mixed-citation xml:lang="en">Radan N. Automatic speech recognition systems for Arabic speech and Yemeni dialect. Bulletin of St. Petersburg Uni-versity of the State Fire Service of the Ministry of Emergency Situations of Russia. 2023;2:194–212.</mixed-citation></citation-alternatives></ref></ref-list><fn-group><fn fn-type="conflict"><p>The authors declare that there are no conflicts of interest present.</p></fn></fn-group></back></article>
