Browse Source

Merge branch 'docs-english'

master
Reece H. Dunn 7 years ago
parent
commit
6873c45d1a
1 changed files with 64 additions and 39 deletions
  1. 64
    39
      docs/languages/gmw/en.md

+ 64
- 39
docs/languages/gmw/en.md View File

@@ -1,24 +1,41 @@
# English

- [Short Vowels](#short-vowels)
- [Long Vowels](#long-vowels)
- [Rhotic Vowels](#rhotic-vowels)
- [Reduced Vowels](#reduced-vowels)
- [Diphthongs](#diphthongs)
- [Split Vowels](#split-vowels)
- [Vowels](#vowels)
- [Short Vowels](#short-vowels)
- [Long Vowels](#long-vowels)
- [Rhotic Vowels](#rhotic-vowels)
- [Reduced Vowels](#reduced-vowels)
- [Diphthongs](#diphthongs)
- [References](#references)

----------

The following English accents are supported by eSpeak NG and are referenced in
this document:

| BCP47 | Abbreviation | Accent Name |
|----------------|--------------|------------------------|
| en | | British English |
| en-029 | | Caribbean |
| en-GB-scotland | ScE | Scottish English |
| en-GB-x-gbclan | | Lancastrian |
| en-GB-x-gbcwmd | | West Midlands |
| en-GB-x-rp | RP | Received Pronunciation |
| en-US | GenAm | General American |

The BCP47 name is the standard language identifier for the accent, used as the
espeak language name. The Abbreviation is used in the tables below for the IPA
transcriptions of that accent, and the BCP47 names are used for the eSpeak NG
phoneme names.

## Vowels

The English language support uses a vowel system based on John Wells' Lexical
Sets<sup>\[<a href="#ref1">1</a>\]</sup>. These were created by Wells in 1982
by comparing the Received Pronunciation British (RP) and General American
(GenAm) accents in use at that time.

The `en` transcriptions listed below are the phonemes used by eSpeak NG to
transcribe the different lexical sets.

## Short Vowels
### Short Vowels

| Lexical Set | en | RP | GenAm |
|-------------|-------|-------|-------|
@@ -29,7 +46,15 @@ transcribe the different lexical sets.
| STRUT | `V` | ʌ | ʌ |
| FOOT | `U` | ʊ | ʊ |

## Long Vowels
Additionally, Wells defines the following lexical sets to describe vowels that
are different in both RP and GenAm:

| Lexical Set | en | RP | GenAm |
|-------------|-------|-------|-------|
| BATH | `aa` | ɑː | æ |
| CLOTH | `O2` | ɒ | ɔ |

### Long Vowels

| Lexical Set | en | RP | GenAm |
|-------------|-------|-------|-------|
@@ -38,25 +63,32 @@ transcribe the different lexical sets.
| THOUGHT | `O:` | ɔː | ɔ |
| GOOSE | `u:` | uː | u |

## Rhotic Vowels
### Rhotic Vowels

These are vowels that are followed by an `r` that is not part of the next syllable
when considering the root form of the word containing that vowel.

| Lexical Set | en | RP | GenAm |
|-------------|-------|-------|-------|
| NURSE | `3:` | ɜː | ɝ |
| START | `A@` | ɑː | ɑɹ |
| NORTH | `O@` | ɔː | ɔɹ |
| FORCE | `o@` | ɔː | oɹ |
| CURE | `U@` | ʊə̯ | ʊɹ |
| NEAR | `i@3` | ɪə̯ | ɪɹ |
| SQUARE | `e@` | eə̯ | ɛɹ |
| Lexical Set | en | en-GB-scotland | RP | GenAm | ScE |
|-------------|-------|----------------|-------|-------|-------|
| NURSE | `3:` | `VR` | ɜː | ɝ | ʌɾ |
| START | `A@` | `A@` | ɑː | ɑɹ | ɐ̟ɾ |
| NORTH | `O@` | `O@` | ɔː | ɔɹ | ɔɾ |
| FORCE | `o@` | `o@` | ɔː | oɹ | oɾ |
| CURE | `U@` | `U@` | ʊə̯ | ʊɹ | ʉɾ |
| NEAR | `i@3` | `i@3` | ɪə̯ | ɪɹ | iɾ |
| SQUARE | `e@` | `e@` | eə̯ | ɛɹ | eɾ |

__NOTE:__ `/i@3/` is used for the NEAR lexical set to differentiate it from
`/i@/` used in words like `million`.

## Reduced Vowels
Additionally, espeak-ng has the following phonemes for different accents:

| Lexical Set | en | en-GB-scotland | RP | GenAm | ScE |
|-------------|-------|----------------|-------|-------|-------|
| TERM | `3:` | `3:` | ɜː | ɝ | ɛɾ |
| BIRD | `3:` | `IR` | ɜː | ɝ | ɪɾ |

### Reduced Vowels

These are unstressed vowels that differ from the vowels in the main lexical sets.

@@ -85,29 +117,22 @@ The RABBIT lexical set is used for unstressed KIT vowels. Some American accents
have merged this with the COMMA lexical set, such that `rabbit` and `abbot`
rhyme.

## Diphthongs
### Diphthongs

| Lexical Set | en | RP | GenAm |
|-------------|-------|-------|-------|
| FACE | `eI` | eɪ̯ | eɪ̯ |
| PRICE | `aI` | aɪ̯ | aɪ̯ |
| CHOICE | `OI` | ɔɪ̯ | ɔɪ̯ |
| GOAT | `oU` | əʊ̯ | oʊ̯ |
| MOUTH | `aU` | aʊ̯ | aʊ̯ |

## Split Vowels

These are lexical sets defined by John Wells that are merged with other lexical
sets in both RP and GenAm, so have split from one of those lexical sets and
merged with the other.

| Lexical Set | en | RP | GenAm |
|-------------|-------|-------|-------|
| BATH | `aa` | ɑː | æ |
| CLOTH | `O2` | ɒ | ɔ |
| FACE | `eI` | eɪ̯ | eɪ̯ |
| PRICE | `aI` | aɪ̯ | aɪ̯ |
| CHOICE | `OI` | ɔɪ̯ | ɔɪ̯ |
| GOAT | `oU` | əʊ̯ | oʊ̯ |
| MOUTH | `aU` | aʊ̯ | aʊ̯ |

## References

1. <a name="ref1"></a> Wikipedia.
[Lexical set](https://en.wikipedia.org/wiki/Lexical_set). 2017.
Creative Commons Attribution-Sharealike 3.0 Unported License (CC-BY-SA).

2. <a name="ref2"></a> Wikipedia.
[IPA chart for English dialects](https://en.wikipedia.org/wiki/International_Phonetic_Alphabet_chart_for_English_dialects).
2018. Creative Commons Attribution-Sharealike 3.0 Unported License (CC-BY-SA).

Loading…
Cancel
Save