Theoretical Basis of the Formation of Uzbek-Turkish Parallel Corpus

Central Asian Journal of Literature, Philosophy and Culture, 4 (12): 163-170 (December 2023)

Abstract

This article explains what a corpus is and surveys the formation of linguistic corpora in world linguistics, machine translation and the work carried out in that field, parallel corpora, and the types of corpora. It argues that bilingual and multilingual texts, common corpora shared among the Turkic languages, and parallel corpora support the development of these languages and their refinement as technical languages. The purpose of a parallel corpus of idioms and the ways in which it can be created are explained. The stages of building the database for a parallel corpus of phrases in Uzbek and Turkish are described, along with the processes performed at each stage. The stages of text processing, scanning, editing, and formatting are analyzed. The problems that stable compounds and idioms pose for automatic translation are highlighted. The requirements for creating the linguistic support for the Uzbek-Turkish parallel corpus are set out. The article shows the role of the Uzbek-Turkish parallel corpus in scientific research, automatic translation, and the formation of national and educational corpora, as well as its practical importance for teaching Uzbek as a foreign language, learning Turkish, and translating literary sources in Uzbek and Turkish. The corpus created is said to serve as material and linguistic support for the formation of parallel, educational, and national corpora. The article emphasizes that the Uzbek-Turkish parallel corpus is a much-needed resource for overcoming difficulties in translation, and that parallel corpora occupy a central place in translation studies and comparative linguistics.
