IIIT Hyderabad Publications |
|||||||||
|
Disambiguating Tense, Aspect and Modality Markers for Correcting Machine Translation ErrorsAuthors: Anil Kumar Singh,Samar Husain,Harshit Surana,Jagadeesh Gorla,Chinnappa Guggilla,Dipti Misra Sharma Conference: In Proceedings of the Conference on Recent Advances in Natural Language Processing (RANLP). Borovets, Bulgaria. 2007 Date: 2007-10-26 Report no: IIIT/TR/2007/75 AbstractAll languages mark tense, aspect and modality (TAM) in some way, but the markers dont have a one-to-one mapping across languages. Many errors in machine translation (MT) are due to wrong translation of TAM markers. Reducing them can improve the performance of an MT system. We used about 9000 sentence pairs from an English-Hindi parallel corpus. These were manually annotated with TAM markers and their mappings. Based on this corpus, we identify the factors responsible for ambiguity in translation. We present the results for learning TAM marker translation using CRF. We achieved an improvement of 17.88% over the baseline. Full paper: pdf Centre for Language Technologies Research Centre |
||||||||
Copyright © 2009 - IIIT Hyderabad. All Rights Reserved. |