IIIT Hyderabad Publications |
|||||||||
|
Translation Quality Estimation for Indian LanguagesAuthors: Nisarg Jhaveri,Manish Gupta,Vasudeva Varma Conference: The 21st Annual Conference of the European Association for Machine Translation (EAMT-2018 2018) Location ALACANT/ALICANTE, SPAIN Date: 2018-05-28 Report no: IIIT/TR/2018/111 AbstractTranslation Quality Estimation (QE) aims to estimate the quality of an automated machine translation (MT) output without any human intervention or reference translation. With the increasing use of MT systems in various cross-lingual applications, the need and applicability of QE systems is increasing. We study existing approaches and propose multiple neural network approaches for sentence-level QE, with a focus on MT outputs in Indian languages. For this, we also introduce five new datasets for four language pairs: two for English–Gujarati, and one each for English–Hindi, English–Telugu and English–Bengali, which includes one manually post-edited dataset for English-Gujarati. These Indian languages are spoken by around 689M speakers world-wide. We compare results obtained using our proposed models with multiple state-of-the-art systems including the winning system in the WMT17 shared task on QE and show that our proposed neural model which combines the discriminative power of carefully chosen features with Siamese Convolutional Neural Networks (CNNs) works best for all Indian language datasets. Full paper: pdf Centre for Search and Information Extraction Lab |
||||||||
Copyright © 2009 - IIIT Hyderabad. All Rights Reserved. |