Automatic dialect identification: a study of British English

Abstract : This contribution deals with the automatic identification of the dialects of the British Isles. Several methods based on the linguistic study of dialect-specific vowel systems are proposed and compared using the Accents of the British Isles (ABI) corpus. The first method examines differences in diphthongization for the face lexical set. Discrimination scores in a two-dialect discrimination task range from chance to ca. 98 % of correct decision depending on the pair of dialects under test. Thanks to the ACCDIST method (developed in [1,2]), the second and third ex- periments take dialectal differences in the structure of vowel systems into consideration; evaluation is performed on a 13-dialect closed set identi- fication task. Correct identification reaches up to 90 % with two subsets of the ABI corpus (/hVd/ set and read passages). All these experiments rely on a front-end automatic phonetic alignment and are therefore text-dependent. Results and possible improvements are discussed in the light of British dialectology.
Liste complète des métadonnées
Contributeur : Emmanuel Ferragne <>
Soumis le : mardi 8 décembre 2015 - 16:17:02
Dernière modification le : vendredi 31 août 2018 - 10:41:38


  • HAL Id : hal-01240069, version 1




Consultations de la notice