|
|
@@ -1,24 +1,41 @@ |
|
|
|
# English |
|
|
|
|
|
|
|
- [Short Vowels](#short-vowels) |
|
|
|
- [Long Vowels](#long-vowels) |
|
|
|
- [Rhotic Vowels](#rhotic-vowels) |
|
|
|
- [Reduced Vowels](#reduced-vowels) |
|
|
|
- [Diphthongs](#diphthongs) |
|
|
|
- [Split Vowels](#split-vowels) |
|
|
|
- [Vowels](#vowels) |
|
|
|
- [Short Vowels](#short-vowels) |
|
|
|
- [Long Vowels](#long-vowels) |
|
|
|
- [Rhotic Vowels](#rhotic-vowels) |
|
|
|
- [Reduced Vowels](#reduced-vowels) |
|
|
|
- [Diphthongs](#diphthongs) |
|
|
|
- [References](#references) |
|
|
|
|
|
|
|
---------- |
|
|
|
|
|
|
|
The following English accents are supported by eSpeak NG and are referenced in |
|
|
|
this document: |
|
|
|
|
|
|
|
| BCP47 | Abbreviation | Accent Name | |
|
|
|
|----------------|--------------|------------------------| |
|
|
|
| en | | British English | |
|
|
|
| en-029 | | Caribbean | |
|
|
|
| en-GB-scotland | ScE | Scottish English | |
|
|
|
| en-GB-x-gbclan | | Lancastrian | |
|
|
|
| en-GB-x-gbcwmd | | West Midlands | |
|
|
|
| en-GB-x-rp | RP | Received Pronunciation | |
|
|
|
| en-US | GenAm | General American | |
|
|
|
|
|
|
|
The BCP47 name is the standard language identifier for the accent, used as the |
|
|
|
espeak language name. The Abbreviation is used in the tables below for the IPA |
|
|
|
transcriptions of that accent, and the BCP47 names are used for the eSpeak NG |
|
|
|
phoneme names. |
|
|
|
|
|
|
|
## Vowels |
|
|
|
|
|
|
|
The English language support uses a vowel system based on John Wells' Lexical |
|
|
|
Sets<sup>\[<a href="#ref1">1</a>\]</sup>. These were created by Wells in 1982 |
|
|
|
by comparing the Received Pronunciation British (RP) and General American |
|
|
|
(GenAm) accents in use at that time. |
|
|
|
|
|
|
|
The `en` transcriptions listed below are the phonemes used by eSpeak NG to |
|
|
|
transcribe the different lexical sets. |
|
|
|
|
|
|
|
## Short Vowels |
|
|
|
### Short Vowels |
|
|
|
|
|
|
|
| Lexical Set | en | RP | GenAm | |
|
|
|
|-------------|-------|-------|-------| |
|
|
@@ -29,7 +46,15 @@ transcribe the different lexical sets. |
|
|
|
| STRUT | `V` | ʌ | ʌ | |
|
|
|
| FOOT | `U` | ʊ | ʊ | |
|
|
|
|
|
|
|
## Long Vowels |
|
|
|
Additionally, Wells defines the following lexical sets to describe vowels that |
|
|
|
are different in both RP and GenAm: |
|
|
|
|
|
|
|
| Lexical Set | en | RP | GenAm | |
|
|
|
|-------------|-------|-------|-------| |
|
|
|
| BATH | `aa` | ɑː | æ | |
|
|
|
| CLOTH | `O2` | ɒ | ɔ | |
|
|
|
|
|
|
|
### Long Vowels |
|
|
|
|
|
|
|
| Lexical Set | en | RP | GenAm | |
|
|
|
|-------------|-------|-------|-------| |
|
|
@@ -38,25 +63,32 @@ transcribe the different lexical sets. |
|
|
|
| THOUGHT | `O:` | ɔː | ɔ | |
|
|
|
| GOOSE | `u:` | uː | u | |
|
|
|
|
|
|
|
## Rhotic Vowels |
|
|
|
### Rhotic Vowels |
|
|
|
|
|
|
|
These are vowels that are followed by an `r` that is not part of the next syllable |
|
|
|
when considering the root form of the word containing that vowel. |
|
|
|
|
|
|
|
| Lexical Set | en | RP | GenAm | |
|
|
|
|-------------|-------|-------|-------| |
|
|
|
| NURSE | `3:` | ɜː | ɝ | |
|
|
|
| START | `A@` | ɑː | ɑɹ | |
|
|
|
| NORTH | `O@` | ɔː | ɔɹ | |
|
|
|
| FORCE | `o@` | ɔː | oɹ | |
|
|
|
| CURE | `U@` | ʊə̯ | ʊɹ | |
|
|
|
| NEAR | `i@3` | ɪə̯ | ɪɹ | |
|
|
|
| SQUARE | `e@` | eə̯ | ɛɹ | |
|
|
|
| Lexical Set | en | en-GB-scotland | RP | GenAm | ScE | |
|
|
|
|-------------|-------|----------------|-------|-------|-------| |
|
|
|
| NURSE | `3:` | `VR` | ɜː | ɝ | ʌɾ | |
|
|
|
| START | `A@` | `A@` | ɑː | ɑɹ | ɐ̟ɾ | |
|
|
|
| NORTH | `O@` | `O@` | ɔː | ɔɹ | ɔɾ | |
|
|
|
| FORCE | `o@` | `o@` | ɔː | oɹ | oɾ | |
|
|
|
| CURE | `U@` | `U@` | ʊə̯ | ʊɹ | ʉɾ | |
|
|
|
| NEAR | `i@3` | `i@3` | ɪə̯ | ɪɹ | iɾ | |
|
|
|
| SQUARE | `e@` | `e@` | eə̯ | ɛɹ | eɾ | |
|
|
|
|
|
|
|
__NOTE:__ `/i@3/` is used for the NEAR lexical set to differentiate it from |
|
|
|
`/i@/` used in words like `million`. |
|
|
|
|
|
|
|
## Reduced Vowels |
|
|
|
Additionally, espeak-ng has the following phonemes for different accents: |
|
|
|
|
|
|
|
| Lexical Set | en | en-GB-scotland | RP | GenAm | ScE | |
|
|
|
|-------------|-------|----------------|-------|-------|-------| |
|
|
|
| TERM | `3:` | `3:` | ɜː | ɝ | ɛɾ | |
|
|
|
| BIRD | `3:` | `IR` | ɜː | ɝ | ɪɾ | |
|
|
|
|
|
|
|
### Reduced Vowels |
|
|
|
|
|
|
|
These are unstressed vowels that differ from the vowels in the main lexical sets. |
|
|
|
|
|
|
@@ -85,29 +117,22 @@ The RABBIT lexical set is used for unstressed KIT vowels. Some American accents |
|
|
|
have merged this with the COMMA lexical set, such that `rabbit` and `abbot` |
|
|
|
rhyme. |
|
|
|
|
|
|
|
## Diphthongs |
|
|
|
### Diphthongs |
|
|
|
|
|
|
|
| Lexical Set | en | RP | GenAm | |
|
|
|
|-------------|-------|-------|-------| |
|
|
|
| FACE | `eI` | eɪ̯ | eɪ̯ | |
|
|
|
| PRICE | `aI` | aɪ̯ | aɪ̯ | |
|
|
|
| CHOICE | `OI` | ɔɪ̯ | ɔɪ̯ | |
|
|
|
| GOAT | `oU` | əʊ̯ | oʊ̯ | |
|
|
|
| MOUTH | `aU` | aʊ̯ | aʊ̯ | |
|
|
|
|
|
|
|
## Split Vowels |
|
|
|
|
|
|
|
These are lexical sets defined by John Wells that are merged with other lexical |
|
|
|
sets in both RP and GenAm, so have split from one of those lexical sets and |
|
|
|
merged with the other. |
|
|
|
|
|
|
|
| Lexical Set | en | RP | GenAm | |
|
|
|
|-------------|-------|-------|-------| |
|
|
|
| BATH | `aa` | ɑː | æ | |
|
|
|
| CLOTH | `O2` | ɒ | ɔ | |
|
|
|
| FACE | `eI` | eɪ̯ | eɪ̯ | |
|
|
|
| PRICE | `aI` | aɪ̯ | aɪ̯ | |
|
|
|
| CHOICE | `OI` | ɔɪ̯ | ɔɪ̯ | |
|
|
|
| GOAT | `oU` | əʊ̯ | oʊ̯ | |
|
|
|
| MOUTH | `aU` | aʊ̯ | aʊ̯ | |
|
|
|
|
|
|
|
## References |
|
|
|
|
|
|
|
1. <a name="ref1"></a> Wikipedia. |
|
|
|
[Lexical set](https://en.wikipedia.org/wiki/Lexical_set). 2017. |
|
|
|
Creative Commons Attribution-Sharealike 3.0 Unported License (CC-BY-SA). |
|
|
|
|
|
|
|
2. <a name="ref2"></a> Wikipedia. |
|
|
|
[IPA chart for English dialects](https://en.wikipedia.org/wiki/International_Phonetic_Alphabet_chart_for_English_dialects). |
|
|
|
2018. Creative Commons Attribution-Sharealike 3.0 Unported License (CC-BY-SA). |