A Corpus of Machine Translation Errors Extracted from Translation Students Exercises

Abstract : In this paper, we present a freely available corpus of automatic translations accompanied with post-edited versions, annotated with labels identifying the different kinds of errors made by the MT system. These data have been extracted from translation students exercises that have been corrected by a senior professor. This corpus can be useful for training quality estimation tools and for analyzing the types of errors made MT system.
Type de document :
Direction d'ouvrage, Proceedings, Dossier
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis. LREC 2014, May 2014, Reykjavik, Iceland. pp.26-31, 2014, Proceedings of the Ninth Language Resources and Evaluation Conference (LREC 2014), 26-31 May, Reykjavik, Iceland, 〈http://www.lrec-conf.org/proceedings/lrec2014/index.html〉
Liste complète des métadonnées

https://hal-univ-diderot.archives-ouvertes.fr/hal-01134899
Contributeur : Natalie Kübler <>
Soumis le : mardi 24 mars 2015 - 14:24:32
Dernière modification le : jeudi 11 janvier 2018 - 06:25:47

Identifiants

  • HAL Id : hal-01134899, version 1

Collections

Citation

Guillaume Wisniewski, Natalie Kübler, François Yvon. A Corpus of Machine Translation Errors Extracted from Translation Students Exercises. Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis. LREC 2014, May 2014, Reykjavik, Iceland. pp.26-31, 2014, Proceedings of the Ninth Language Resources and Evaluation Conference (LREC 2014), 26-31 May, Reykjavik, Iceland, 〈http://www.lrec-conf.org/proceedings/lrec2014/index.html〉. 〈hal-01134899〉

Partager

Métriques

Consultations de la notice

135