A Corpus of Machine Translation Errors Extracted from Translation Students Exercises

Abstract : In this paper, we present a freely available corpus of automatic translations accompanied with post-edited versions, annotated with labels identifying the different kinds of errors made by the MT system. These data have been extracted from translation students exercises that have been corrected by a senior professor. This corpus can be useful for training quality estimation tools and for analyzing the types of errors made MT system.
Document type :
Directions of work or proceedings
Complete list of metadatas

https://hal-univ-diderot.archives-ouvertes.fr/hal-01134899
Contributor : Natalie Kübler <>
Submitted on : Tuesday, March 24, 2015 - 2:24:32 PM
Last modification on : Thursday, June 20, 2019 - 2:14:06 PM

Identifiers

  • HAL Id : hal-01134899, version 1

Citation

Guillaume Wisniewski, Natalie Kübler, François Yvon. A Corpus of Machine Translation Errors Extracted from Translation Students Exercises. Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis. LREC 2014, May 2014, Reykjavik, Iceland. pp.26-31, 2014, Proceedings of the Ninth Language Resources and Evaluation Conference (LREC 2014), 26-31 May, Reykjavik, Iceland, ⟨http://www.lrec-conf.org/proceedings/lrec2014/index.html⟩. ⟨hal-01134899⟩

Share

Metrics

Record views

179