SMT - CAT Integration in a Technical Domain. Handling XML mark-up using pre and post-editing processing methods
KU Leuven
The increasing use of eXtensible Markup Language (XML) poses additional challenges for integrating statistical machine translation (SMT) into computer-assisted translation (CAT) workflows in the translation industry. This paper analyzes the need to handle XML markup as part of the translation material in a technical domain. It explores different ways of handling such markup by applying transducers in pre- and post-processing steps. A series of ...