On Monday, Antoni Oliver González, from the Universitat Oberta de Catalunya (UOC) in Barcelona, arrived at RGCL for a two-week stay to form research collaborations with members of the group. On Thursday, Antoni gave the following talk to the group:
Title: Automatic detection of translation equivalents of terms in large parallel and comparable corpora
Abstract: In this talk, some methodologies for finding the translation equivalents of a term in large parallel and comparable corpora will be presented. For parallel corpora, we are using translation tables from Statistical Machine Translation systems (Moses). For comparable corpora, we are experimenting with vecmap, a tool to create cross-lingual word embedding mappings. The experiments will be carried out using the English IATE database for two subject areas: International Relations and International Organizations. The goal is to enlarge the Spanish IATE database and to create this database for Catalan.
These experiments are being performed during a short research stay, so we will only be able to present preliminary results.