Bootstrapping Machine Translation for the Language Pair English – Kiswahili
Date
2008Author
De Pauw, Guy
Waiganjo, Peter
Gilles, Wagacha
De Schryver, Maurice
Metadata
Show full item recordAbstract
In recent years, research in Machine Translation has greatly benefited from the increasing availability of parallel corpora. Processing the same text in two different languages yields useful information on how words and phrases are translated from a source language into a target language. To investigate this, a parallel corpus is typically aligned by linking linguistic tokens in the source language to the corresponding units in the target language. An aligned parallel corpus therefore facilitates the automatic development of a machine translation system. In this paper, we describe data collection and annotation efforts and preliminary experiments with a parallel corpus English - Kiswahili.