Evan Kirshenbaum created an ASCII transcription of the International Phonetic Alphabet (IPA)[1], [2]. As well as using ASCII characters for specific IPA phonemes, this transcription provides a set of 3-letter feature abbreviations allowing a phoneme to be described as a sequence of features.
This document describes the IPA phonemes using the features used by Kirshenbaum. Where Kirshenbaum does not specify a feature name, the feature name from Cainteoir Text-to-Speech[5] is used. This is to provide a consistent naming scheme for the extended feature set. Where there is still no feature available, a custom 3-letter feature name is chosen.
The aim of the feature set described in this document is to specify the underlying phonetics and phonemics of the sounds being produced in a way that is consistent between languages and voices. While this feature set is modelled on the IPA, it is not meant to be able to preserve phoneme transcriptions when using a transcription as both the input and output phoneme sets. This document provides commentary on the intended usage of these features where there is ambiguity from the associated IPA usage between authors.
This document is grouped into two sections. The first section displays the IPA charts using the feature names instead of their names, showing the IPA phoneme at that position in the chart. This makes it easier to look up the features for a given IPA phoneme.
The second section lists the features and their associated name. This section does not describe what these mean. Their meaning is described in phonetics articles, books and Wikipedia. The Wikipedia IPA[4] article can be used as a starting point, as it links to topics and descriptions of the various phonemes.
The diacritics and suprasegmental feature lists also show their corresponding IPA symbol. This is to avoid duplicating the lists in the IPA chart and Feature sections.
blb |
lbd |
dnt |
alv |
pla |
rfx |
alp |
pal |
vel |
uvl |
phr |
glt |
|||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
nas |
m | ɱ | n | ɳ | ɲ | ŋ | ɴ | |||||||||||||||||
stp |
p | b | t | d | ʈ | ɖ | c | ɟ | k | ɡ | q | ɢ | ʡ | ʔ | ||||||||||
sib frc |
s | z | ʃ | ʒ | ʂ | ʐ | ɕ | ʑ | ||||||||||||||||
frc |
ɸ | β | f | v | θ | ð | ç | ʝ | x | ɣ | χ | ʁ | ħ | ʕ | h | ɦ | ||||||||
lat frc |
ɬ | ɮ | ||||||||||||||||||||||
apr |
ʋ | ɹ | ɻ | j | ɰ | |||||||||||||||||||
lat apr |
l | ɭ | ʎ | ʟ | ||||||||||||||||||||
flp |
ⱱ | ɾ | ɽ | |||||||||||||||||||||
lat flp |
ɺ | |||||||||||||||||||||||
trl |
ʙ | r | ʀ | ʜ | ʢ |
Symbols to the left have a vls
phonation, and to the right have mdv
phonation.
blb |
lbd |
dnt |
alv |
pla |
rfx |
pal |
vel |
uvl |
phr |
glt |
|
---|---|---|---|---|---|---|---|---|---|---|---|
clk |
ʘ | ǀ | ǃ | ǂ | |||||||
lat clk |
ǁ | ||||||||||
mdv imp |
ɓ | ɗ | ʄ | ɠ | ʛ | ||||||
ejc |
pʼ | tʼ | ʈʼ | cʼ | kʼ | qʼ | ʡʼ | ||||
ejc frc |
fʼ | θʼ | sʼ | ʃʼ | ʂʼ | xʼ | χʼ | ||||
lat ejc frc |
ɬʼ |
Symbol | Features |
---|---|
ʍ | vls lbv apr |
w | mdv lbv apr |
ɥ | mdv lbp apr |
ɧ | vls vzd pla frc |
fnt |
cnt |
bck |
||||
---|---|---|---|---|---|---|
hgh |
i | y | ɨ | ʉ | ɯ | u |
smh |
ɪ | ʏ | ʊ | |||
umd |
e | ø | ɘ | ɵ | ɤ | o |
mid |
ə | |||||
lmd |
ɛ | œ | ɜ | ɞ | ʌ | ɔ |
sml |
æ | ɐ | ||||
low |
a | ɶ | ɑ | ɒ |
Symbols to the left are unr
, and to the right are rnd
.
NOTE: The smh
vowels are more cnt
than the other vowels. However, this
distinction is not needed to classify these vowels, so is not included in the
above table.
Feature | Symbol | Name | Description |
---|---|---|---|
egs |
↑ | egressive | The air flow is moving outwards from the initiator. |
igs |
↓ | ingressive | The air flow is moving inwards towards the initiator. |
The ↑ and ↓ symbols are from the extended IPA[7]. They only need to be used when the air flow is different to the base IPA phoneme (e.g. using ↓ on pulmonic consonants).
Feature | Name | Description |
---|---|---|
pmc |
pulmonic | The diaphragm and lungs are used to generate the airstream. |
gtc |
glottalic | The glottis is used to generate the airstream. |
vlc |
velaric | The velum is closed and the tongue is used to generate the airstream. |
pcv |
percussive | There is no airstream used to produce this sound. |
The phonation features describe the degree to which the glottis (vocal chords) are open or closed.
Feature | Symbol | Name | Description |
---|---|---|---|
vls |
voiceless | The glottis is fully open, such that the vocal chords do not vibrate. | |
brv |
◌̤ | breathy voice | The glottis is closed slightly, to produce a whispered or murmured sound. |
slv |
◌̥ | slack voice | The glottis is opened wider than mdv , but not enough to be brv . |
mdv |
modal voice | The glottis is opened to provide the optimal vibration of the vocal chords. | |
stv |
◌̬ | stiff voice | The glottis is closed narrower than mdv , but not enough to be crv . |
crv |
◌̰ | creaky voice | The glottis is closed to produce a vocal or glottal fry. |
glc |
ʔ͡◌ | glottal closure | The glottis is fully closed. |
The IPA ◌̥ diacritic is also used to fill the vls
spaces in the IPA consonant
charts. Thus, when ◌̥ is used with a mdv
consonant that does not have an
equivalent vls
consonant, the resulting consonant is vls
, not slv
.
The place of articulation is described in terms of an active articulator and one or more passive articulators[9]. The possible places of articulation are:
Place of Articulation | Feature | Symbol | Active | Passive |
---|---|---|---|---|
bilabial | blb |
lbl |
ulp |
|
labiodental | lbd |
◌̪ | lbl |
utt |
dental | dnt |
◌̪ | apc |
utt |
alveolar | alv |
apc |
alf |
|
palato-alveolar | pla |
lmn |
alb |
|
retroflex | rfx |
sac |
hpl |
|
alveolo-palatal | alp |
dsl |
alb |
|
palatal | pal |
dsl |
hpl |
|
velar | vel |
dsl |
spl |
|
uvular | uvl |
dsl |
uvu |
|
pharyngeal | phr |
rdl |
prx |
|
glottal | glt |
lyx |
gts |
The features for these places of articulation are provided for convenience, and to make it easier to describe the IPA consonants. Internally, the active and passive articulators are used.
The ◌̪ diacritic is lbd
when used on blb
consonants, and dnt
when used on
alv
consonants.
NOTE: The IPA charts make a distinction between pharyngeal and epiglottal consonants, but Wikipedia does not. This model uses the Wikipedia descriptions.
Feature | Name | Articulator |
---|---|---|
lbl |
labial | lower lip |
lmn |
laminal | tongue blade |
apc |
apical | tongue tip |
sac |
subapical | underside of the tongue |
dsl |
dorsal | tongue body |
rdl |
radical | tongue root |
lyx |
laryngeal | larynx |
Feature | Articulator |
---|---|
ulp |
upper lip |
utt |
upper teeth |
alf |
alveolar ridge (front) |
alb |
alveolar ridge (back) |
hpl |
hard palate |
spl |
soft palate |
uvu |
uvular |
prx |
pharynx |
egs |
epiglottis |
gts |
glottis |
Feature | Kirshenbaum | Name |
---|---|---|
lbv |
lbv |
labial-velar |
lbp |
labial-palatal |
Feature | Kirshenbaum | Name |
---|---|---|
stp |
stp |
plosive (stop) |
nas |
nas |
nasal |
trl |
trl |
trill |
flp |
flp |
tap/flap |
frc |
frc |
fricative |
apr |
apr |
approximant |
clk |
clk |
click |
ejc |
ejc |
ejective |
imp |
imp |
implosive |
The following features can be combined with other manners of articulation:
Feature | Kirshenbaum | Name |
---|---|---|
lat |
lat |
lateral |
sib |
sibilant |
NOTE: sib
is not used in the IPA charts.
Feature | Kirshenbaum | Name |
---|---|---|
vwl |
vwl |
vowel |
Feature | Kirshenbaum | Name |
---|---|---|
hgh |
hgh |
close (high) |
smh |
smh |
near-close (semi-high) |
umd |
umd |
close-mid (upper-mid) |
mid |
mid |
mid |
lmd |
lmd |
open-mid (lower-mid) |
sml |
near-open (semi-low) | |
low |
low |
open (low) |
Feature | Kirshenbaum | Name |
---|---|---|
fnt |
fnt |
front |
cnt |
cnt |
center |
bck |
bck |
back |
Feature | Kirshenbaum | Name |
---|---|---|
unr |
unr |
unrounded |
rnd |
rnd |
rounded |
Feature | Kirshenbaum | IPA | Name |
---|---|---|---|
syl |
syl |
◌̩ | syllabic |
nsy |
◌̯ | non-syllabic |
Feature | Kirshenbaum | IPA | Name |
---|---|---|---|
asp |
asp |
◌ʰ | aspirated |
nrs |
◌ⁿ | nasal release | |
lrs |
◌ˡ | lateral release | |
unx |
unx |
◌̚ | no audible release (unexploded) |
NOTE: The nrs
and lrs
features are not defined in Cainteoir Text-to-Speech.
Gemination is found in several languages including Italian and Japanese. It is also present in the suprasegmental phonology between words such as “lamppost” and “evenness”.
Some linguists use the long suprasegmental for geminate consonants. The eSpeak NG convention is to use consonant length for phonation when consonant length is distinct without gemination occurring.
The way gemination is represented in eSpeak NG is to duplicate the phonemes,
with the first phoneme using the unx
feature. For example, n̚.n for a
geminated n. This describes how with the stp
and nas
consonants, the
mouth remains closed (unx
) for the first of the geminated consonants.
Feature | Kirshenbaum | IPA | Name |
---|---|---|---|
fts |
◌͈ | fortis | |
lns |
◌͉ | lenis |
The extended IPA[7] ◌͈ and ◌͉ diacritics
are used to specify lesser (lns
) and greater (fts
) oral pressure than
the unmodified voiced or voiceless phoneme. This distinction is made by
the Ewe, Tabasaran, Archi, and other languages[8].
Where fortis and lenis are used to contrast consonant durations (e.g. in the Jawoyn, Ojibwe, and Zurich German languages[8]), the length suprasegmentals are used instead.
Feature | Kirshenbaum | IPA | Name |
---|---|---|---|
apc |
◌̺ | apical | |
lmn |
◌̻ | laminal | |
lgl |
◌̼ | linguolabial |
The following IPA diacritics are only used by eSpeak NG to fill out positions in the IPA consonant and vowel charts. As such those phonemes are transcribed according to the features at that position, not using the features at the location of the base phoneme with a feature for each of the positioning diacritics.
IPA | Name |
---|---|
◌̟ | advanced |
◌̠ | retracted |
◌̈ | centralized |
◌̽ | mid-centralized |
◌̝ | raised |
◌̞ | lowered |
Feature | Kirshenbaum | IPA | Name |
---|---|---|---|
mrd |
◌̹ | more rounded | |
lrd |
◌̜ | less rounded | |
lzd |
lzd |
◌ʷ | labialized or labio-velarized |
pzd |
pzd |
◌ʲ | palatalized |
vzd |
vzd |
◌ˠ | velarized |
fzd |
fzd |
◌ˤ | pharyngealized |
atr |
◌̘ | advanced tongue root | |
rtr |
◌̙ | retracted tongue root | |
nzd |
nzd |
◌̃ | nasalized |
rzd |
rzd |
◌˞ | rhoticized |
NOTE: The IPA supports ◌̴ for velarized or pharynealized consonants. Unicode
has deprecated this combining character, while keeping the combined forms. As
such, only the combined forms are supported, using the fzd
feature. Cainteoir
Text-to-Speech uses vfz
for this combining character, but eSpeak NG does not
preserve the distinction between ◌ˤ and ◌̴.
These characters are represented as a single Unicode character. They can be used in place of the base character followed by a co-articulation combining character.
Symbol | Features |
---|---|
ɫ | mdv fzd alv lat apr |
ɚ | unr mid cnt rzd vwl |
ɝ | unr lmd cnt rzd vwl |
Feature | Kirshenbaum | IPA | Name |
---|---|---|---|
st1 |
ˈ◌ | primary stress | |
st2 |
ˌ◌ | secondary stress | |
st3 |
ˈˈ◌ | extra stress |
Feature | Kirshenbaum | IPA | Name |
---|---|---|---|
est |
◌̆ | extra short | |
hlg |
◌ˑ | half-long | |
lng |
lng |
◌ː | long |
Feature | Kirshenbaum | IPA | Name |
---|---|---|---|
sbr |
◌.◌ | syllable break | |
lnk |
◌‿◌ | linked (no break) |
Feature | Kirshenbaum | IPA | Name |
---|---|---|---|
fbr |
| | minor (foot) break | |
ibr |
‖ | major (intonation) break | |
glr |
↗ | global rise | |
glf |
↘ | global fall |
Tone | IPA | Start | Middle | End |
---|---|---|---|---|
extra high (top) | ◌˥ | ts5 |
tm5 |
te5 |
high | ◌˦ | ts4 |
tm4 |
te4 |
mid | ◌˧ | ts3 |
tm3 |
te3 |
low | ◌˨ | ts2 |
tm2 |
te2 |
extra low (bottom) | ◌˩ | ts1 |
tm1 |
te1 |
Feature | Kirshenbaum | IPA | Name |
---|---|---|---|
dst |
ꜛ◌ | downstep | |
ust |
ꜜ◌ | upstep |
Feature | Kirshenbaum | Name |
---|---|---|
ctl |
central | |
mrm |
murmured | |
orl |
oral |
NOTE: The ctl
and orl
features are specified by Evan Kirshenbaum in
Appendix A of his ASCII/IPA transcription, but are not used in the rest of
the specification.
NOTE: Evan Kirshenbaum transcribes ɦ as h<?>
({mrm,glt,frc}
), while
Wikipedia also lists this as a {mdv,glt,frc}
[6].
This is the only phoneme that Kirshenbaum uses the mrm
feature for. As such,
the mrm
feature is not used within this document.
Kirshenbaum, Evan, Representing IPA phonetics in ASCII (HTML). 1993.
Kirshenbaum, Evan, Representing IPA phonetics in ASCII (PDF). 2001.
International Phonetic Association, The International Phonetic Alphabet and the IPA Chart. 2015. Creative Commons Attribution-Sharealike 3.0 Unported License (CC-BY-SA).
Wikipedia. International Phonetic Alphabet. 2017. Creative Commons Attribution-Sharealike 3.0 Unported License (CC-BY-SA).
Dunn, R. H., Cainteoir Text-to-Speech Phoneme Features. 2013-2015.
Wikipedia. Voiced glottal fricative. 2017, Creative Commons Attribution-Sharealike 3.0 Unported License (CC-BY-SA).
Wikipedia. Extensions to the International Phonetic Alphabet. 2017, Creative Commons Attribution-Sharealike 3.0 Unported License (CC-BY-SA).
Wikipedia. Fortis and lenis. 2017, Creative Commons Attribution-Sharealike 3.0 Unported License (CC-BY-SA).
Wikipedia. Place of articulation. 2017, Creative Commons Attribution-Sharealike 3.0 Unported License (CC-BY-SA).