|
This article is cited in 1 scientific paper (total in 1 paper)
Method for extracting single-word translation correspondences from parallel texts using distributional semantics models
Yu. I. Morozova, E. B. Kozerenko, M. M. Sharnin Institute of Informatics Problems, Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation
Abstract:
The paper deals with problems of corpus research of linguistic units. The task of extracting translation correspondences from a parallel corpus is defined. An overview of existing approaches to this task is provided. The paper focuses on the approach to extracting translation correspondences based on distributional semantics models. The paper describes the theoretical model developed by the authors as well as its software implementation. A test parallel corpus of patent texts in French and English was compiled for the purpose of this research. The paper provides results of an experiment aimed at extracting single-word translation correspondences from the test parallel corpus.
Keywords:
extracting translation correspondences; alignment; parallel texts; parallel corpus; distributional semantics; vector space model.
Received: 27.03.2014
Citation:
Yu. I. Morozova, E. B. Kozerenko, M. M. Sharnin, “Method for extracting single-word translation correspondences from parallel texts using distributional semantics models”, Sistemy i Sredstva Inform., 24:2 (2014), 131–142
Linking options:
https://www.mathnet.ru/eng/ssi349 https://www.mathnet.ru/eng/ssi/v24/i2/p131
|
Statistics & downloads: |
Abstract page: | 389 | Full-text PDF : | 160 | References: | 71 |
|