Пожалуйста, используйте этот идентификатор, чтобы цитировать или ссылаться на этот ресурс:
http://hdl.handle.net/11701/44821
Полная запись метаданных
Поле DC | Значение | Язык |
---|---|---|
dc.contributor.author | Bernikova, Olga A. | - |
dc.contributor.author | Kizhaeva, Natalia A. | - |
dc.date.accessioned | 2024-02-05T14:11:39Z | - |
dc.date.available | 2024-02-05T14:11:39Z | - |
dc.date.issued | 2023-09 | - |
dc.identifier.citation | Bernikova O. A., Kizhaeva N. A. Peculiarities of the Arabic Language Processing: Morphological Modeling. Vestnik of Saint Petersburg University. Asian and African Studies, 2023, vol. 15, issue 3, pp. 459–484. https://doi.org/10.21638/spbu13.2023.302 (In Russian) | en_GB |
dc.identifier.other | https://doi.org/10.21638/spbu13.2023.302 | - |
dc.identifier.uri | http://hdl.handle.net/11701/44821 | - |
dc.description.abstract | The paper deals with the features of morphological modeling of the Arabic language based on the definition of the specifics of its formalization. Morphological modeling is one of the key stages of automatic text analysis and includes tools for building a word form to a stem, root, definition of a part of speech, automatic construction (generation) of a given word form, etc. The objectives of the study are interdisciplinary in nature and include both the theoretical aspects of studying the features of the Arabic language, which are most relevant for its automatic processing, and the study of existing morphological analyzers and determining the specifics of their work. The practical part is based on testing the CAMeL Tools, one of the advantages of which is its comprehensive nature, which allows both preprocessing of text and solving applied problems, including sentiment analysis. The criteria for selecting examples for testing took into account the features of the Arabic language, which are difficult for its formalization (segmentation of functional words with continuous spelling, morphological and lexical homonymy, etc.). The variability of the generalized concept of “the Arabic language” is taken into account, which combines classical Arabic, Modern Standard Arabic and modern Arabic dialects. Testing tools for morphological modeling allows us to draw conclusions about the need to improve the terminological apparatus, the variability of which is noted in the description of word forms. Such kind of variation (divergence from the concepts accepted in general linguistics) potentially leads to a distortion of the results of lexico-semantic analysis. During the analysis, some gaps were noted related to the definition of part-of-speech belonging, the description of word forms, etc. The results of the study are relevant both for linguistic research and for improving the development of software applications aimed at processing the Arabic text. | en_GB |
dc.description.sponsorship | The research was carried out at the expense of the grant of the Russian Science Foundation no. 22-28-01046, https://rscf.ru/project/22-28-01046/. | en_GB |
dc.language.iso | ru | en_GB |
dc.publisher | St Petersburg State University | en_GB |
dc.relation.ispartofseries | Vestnik of St Petersburg University. Asian and African Studies;Volume 15; Issue 3 | - |
dc.subject | Arabic language | en_GB |
dc.subject | morphological modeling | en_GB |
dc.subject | analyzer | en_GB |
dc.subject | processing | en_GB |
dc.title | Peculiarities of the Arabic Language Processing: Morphological Modeling | en_GB |
dc.type | Article | en_GB |
Располагается в коллекциях: | Issue 3 |
Файлы этого ресурса:
Файл | Описание | Размер | Формат | |
---|---|---|---|---|
02.pdf | 1,26 MB | Adobe PDF | Просмотреть/Открыть |
Все ресурсы в архиве электронных ресурсов защищены авторским правом, все права сохранены.