Back to Espeak Ng

English

docs/languages/gmw/en.md

1.52.05.5 KB
Original Source

English


The following English accents are supported by eSpeak NG and are referenced in this document:

BCP47AbbreviationAccent Name
enBrEBritish English
en-029JaECaribbean
en-GB-scotlandScEScottish English
en-GB-x-gbclanLancastrian
en-GB-x-gbcwmdWest Midlands
en-GB-x-rpRPReceived Pronunciation
en-USGenAmGeneral American

The BCP47 name is the standard language identifier for the accent, used as the espeak language name. The Abbreviation is used in the tables below for the IPA transcriptions of that accent, and the BCP47 names are used for the eSpeak NG phoneme names.

Vowels

The English language support uses a vowel system based on John Wells' Lexical Sets<sup>[<a href="#ref1">1</a>]</sup>. These were created by Wells in 1982 by comparing the Received Pronunciation British (RP) and General American (GenAm) accents in use at that time.

Short Vowels

Lexical SetenRPGenAm
KITIɪɪ
DRESSEeɛ
TRAPaææ
LOT0ɒɑ
STRUTVʌʌ
FOOTUʊʊ

Additionally, Wells defines the following lexical sets to describe vowels that are different in both RP and GenAm:

Lexical SetenRPGenAm
BATHaaɑːæ
CLOTHO2ɒɔ

Long Vowels

Lexical SetenRPGenAm
FLEECEi:i
PALMA:ɑːɑ
THOUGHTO:ɔːɔ
GOOSEu:u

Rhotic Vowels

These are vowels that are followed by an r that is not part of the next syllable when considering the root form of the word containing that vowel.

Lexical Setenen-GB-scotlandRPGenAmScE
NURSE3:VRɜːɝʌɾ
STARTA@A@ɑːɑɹɐ̟ɾ
NORTHO@O@ɔːɔɹɔɾ
FORCEo@o@ɔː
CUREU@U@ʊə̯ʊɹʉɾ
NEARi@3i@3ɪə̯ɪɹ
SQUAREe@e@eə̯ɛɹ

NOTE: /i@3/ is used for the NEAR lexical set to differentiate it from /i@/ used in words like million.

Additionally, espeak-ng has the following phonemes for different accents:

Lexical Setenen-GB-scotlandRPGenAmScE
TERM3:3:ɜːɝɛɾ
BIRD3:IRɜːɝɪɾ

Reduced Vowels

These are unstressed vowels that differ from the vowels in the main lexical sets.

Lexical SetenRPGenAm
HAPPYiɪi
COMMA@əə
LETTER3əɚ

Additionally, espeak-ng has the following phonemes for unstressed vowels.

Lexical SetenBrERPGenAmJaE
EXPLOREe#ɛɪɛɛ
ROSESI#ɪɪɪ
BLESSEDI2#ɪɪɛ
RABBITI2ɪɪɪɪ

The EXPLORE lexical set is used to support unstressed KIT vowels that have split from the KIT vowel and merged with the DRESS vowel in some accents. This includes ex- words.

The ROSES lexical set is used for words that are KIT in some accents and COMMA in others. The degree to which this occurs varies between accents and speakers.

The BLESSED lexical set is used for -ed based adjectives. These tend to preserve the KIT vowel in accents.

The RABBIT lexical set is used for unstressed KIT vowels. Some American accents have merged this with the COMMA lexical set, such that rabbit and abbot rhyme.

Diphthongs

Lexical SetenRPGenAm
FACEeIeɪ̯eɪ̯
PRICEaIaɪ̯aɪ̯
CHOICEOIɔɪ̯ɔɪ̯
GOAToUəʊ̯oʊ̯
MOUTHaUaʊ̯aʊ̯

References

  1. <a name="ref1"></a> Wikipedia. Lexical set. 2017. Creative Commons Attribution-Sharealike 3.0 Unported License (CC-BY-SA).

  2. <a name="ref2"></a> Wikipedia. IPA chart for English dialects. 2018. Creative Commons Attribution-Sharealike 3.0 Unported License (CC-BY-SA).