Developing corpus interoperability for phonetic investigation of learner corpora

Abstract : Although automatic analysis and computer-aided annotation tools are being developed, spoken learner corpora are still smaller and less numerous than written learner corpora. This chapter gives a critical overview of some of the phonetic research questions addressed by spoken learner corpora in relation to their annotation schemes and software. Some of their annotation schemes and guidelines are presented and assessed. Corpus design and tools are discussed in relation to two of the challenges of spoken learner corpora: comparability of data and the potential contribution to prosodic modeling. It is argued that reusability of annotated spoken data and critical statistics should be the real order of the day.
Document type :
Book sections

https://hal-univ-diderot.archives-ouvertes.fr/hal-01239062
Contributor : Nicolas Ballier
Submitted on : Monday, December 7, 2015 - 1:26:29 PM
Last modification on : Friday, January 4, 2019 - 5:33:30 PM

Identifiers

  • HAL Id : hal-01239062, version 1

Citation

Nicolas Ballier, Philippe Martin. Developing corpus interoperability for phonetic investigation of learner corpora. Ana Díaz-Negrillo; Nicolas Ballier; Paul Thompson. Automatic Treatment and Analysis of Learner Corpus Data, ⟨Benjamins⟩, pp.33-64, 2013, 9789027203663. ⟨hal-01239062⟩
