Browse Source

docs: describe the Kirshenbaum (ASCII-IPA) transcription scheme.

master
Reece H. Dunn 7 years ago
parent
commit
dc67d294c2
2 changed files with 216 additions and 4 deletions
  1. 20
    4
      docs/phonemes.md
  2. 196
    0
      docs/phonemes/kirshenbaum.md

+ 20
- 4
docs/phonemes.md View File

# Phonemes # Phonemes


- [Phoneme Transcription Schemes](#phoneme-transcription-schemes)
- [Consonants](#consonants) - [Consonants](#consonants)
- [Other Symbols](#other-symbols) - [Other Symbols](#other-symbols)
- [Gemination](#gemination) - [Gemination](#gemination)
languages and voices on how to specify phonemes so that the IPA and feature languages and voices on how to specify phonemes so that the IPA and feature
transcriptions are consistent. transcriptions are consistent.


## Phoneme Transcription Schemes

| BCP47 Variant | Abbreviation | Transcription Scheme | Encoding |
|---------------------------------------|--------------|------------------------------------------------------|----------|
| `foncxs` | CXS | Conlang X-SAMPA | ASCII |
| `fonipa` | IPA | International Phonetic Alphabet | Unicode |
| [`fonkirsh`](phonemes/kirshenbaum.md) | | Kirshenbaum (ASCII-IPA) | ASCII |
| `fonxsampa` | X-SAMPA | Extended Speech Assessment Methods Phonetic Alphabet | ASCII |
| `fonzsampa` | Z-SAMPA | Zompist Bulletin Board (ZBB) SAMPA | ASCII |

1. `foncxs`, `fonkirsh`, and `fonzsampa` are private use extensions defined in
the [bcp47-extensions](https://raw.githubusercontent.com/espeak-ng/bcp47-data/master/bcp47-extensions)
file.

## Consonants ## Consonants


| | `blb` | | `lbd` | | `dnt` | | `alv` | | `pla` | | `rfx` | | `alp` | | `pal` | | `vel` | | `uvl` | | `phr` | | `glt` | | | | `blb` | | `lbd` | | `dnt` | | `alv` | | `pla` | | `rfx` | | `alp` | | `pal` | | `vel` | | `uvl` | | `phr` | | `glt` | |
| `sml` | æ | | ɐ | | | | | `sml` | æ | | ɐ | | | |
| `low` | a | ɶ | | | ɑ | ɒ | | `low` | a | ɶ | | | ɑ | ɒ |


__NOTE:__ The `smh` vowels are more `cnt` than the other vowels. However, this
distinction is not needed to classify these vowels, so is not included in the
above table.

### Other Symbols ### Other Symbols


| Symbol | Features | | Symbol | Features |
| `cnt` | center | `kirshenbaum` | | `cnt` | center | `kirshenbaum` |
| `bck` | back | `kirshenbaum` | | `bck` | back | `kirshenbaum` |


__NOTE:__ The `smh` vowels are more `cnt` than the other vowels. However, this
distinction is not needed to classify these vowels so there are no features
for front-central and back-central. The `fnt` and `bck` features are used
instead.

### Rounding ### Rounding


| Feature | Name | Origin | | Feature | Name | Origin |

+ 196
- 0
docs/phonemes/kirshenbaum.md View File

# Kirshenbaum (ASCII-IPA) Transcription Scheme

- [Phoneme Transcription Schemes](#phoneme-transcription-schemes)
- [Consonants](#consonants)
- [Other Symbols](#other-symbols)
- [Vowels](#vowels)
- [Other Symbols](#other-symbols-1)
- [Diacritics](#diacritics)
- [References](#references)

----------

The Kirshenbaum (ASCII-IPA) transcription scheme was created by Evan
Kirshenbaum and others between 1991 and 1993 for use in Usenet and e-mail.
Since then, the support for Unicode has greatly improved. An ASCII encoding
of IPA is still useful as it is often easier to type.

The Kirshenbaum transcription scheme defines a set of phoneme features that are
described in eSpeak NG in the [phonemes](phonemes.md) document. The `cainteoir`
and `espeak-ng` phoneme features defined in that document are not useable in
transcriptions using the Kirshenbaum transcription scheme. They are used here
where relevant to define the Kirshenbaum phonemes in a consistent way across
the different transcription schemes.

Broad (phonemic) transcriptions are enclosed in `/` characters (`/p/`), while
narrow (phonetic) transcriptions are wrapped in `[` and `]` characters (`[p]`).

A phoneme can be transcribed as a named phoneme (`p`), or as a set of features
wrapped in `{` and `}` characters (`{vls,blb,stp}`). A feature (typically a
diacritic) of a phoneme can be added or modified by wrapping the feature in `<`
and `>` characters after the base phoneme (`p<dnt>`).

## Phoneme Transcription Schemes

| BCP47 Variant | Abbreviation | Transcription Scheme | Encoding |
|----------------------------|--------------|------------------------------------------------------|----------|
| `foncxs` | CXS | Conlang X-SAMPA | ASCII |
| [`fonipa`](../phonemes.md) | IPA | International Phonetic Alphabet | Unicode |
| `fonkirsh` | | Kirshenbaum (ASCII-IPA) | ASCII |
| `fonxsampa` | X-SAMPA | Extended Speech Assessment Methods Phonetic Alphabet | ASCII |
| `fonzsampa` | Z-SAMPA | Zompist Bulletin Board (ZBB) SAMPA | ASCII |

1. `foncxs`, `fonkirsh`, and `fonzsampa` are private use extensions defined in
the [bcp47-extensions](https://raw.githubusercontent.com/espeak-ng/bcp47-data/master/bcp47-extensions)
file.

## Consonants

| | `blb` | | `lbd` | | `dnt` | | `alv` | | `pla` | | `rfx` | | `alp` | | `pal` | | `vel` | | `uvl` | | `phr` | | `glt` | |
|-----------------------------|----------|----------|-------|----------|-----------|-----------|----------|----------|-------|-------|-------|-------|-------|-------|----------|----------|----------|----------|----------|----------|-------|----------|-------|--------|
| | `vls` | `vcd` | `vls` | `vcd` | `vls` | `vcd` | `vls` | `vcd` | `vls` | `vcd` | `vls` | `vcd` | `vls` | `vcd` | `vls` | `vcd` | `vls` | `vcd` | `vls` | `vcd` | `vls` | `vcd` | `vls` | `vcd` |
| `nas` | | `m` | | `M` | | `n[` | | `n` | | | | `n.` | | | | `n^` | | `N` | | `n"` | | | | |
| `stp` | `p` | `b` | | | `t[` | `d[` | `t` | `d` | | | `t.` | `d.` | | | `c` | `J` | `k` | `g` | `q` | `G` | | | `?` | |
| `sib`&#xA0;`afr` | | | | | | | | | | | | | | | | | | | | | | | | |
| `afr` | | | | | | | | | | | | | | | | | | | | | | | | |
| `lat`&#xA0;`afr` | | | | | | | | | | | | | | | | | | | | | | | | |
| `sib`&#xA0;`frc` | | | | | | | `s` | `z` | `S` | `Z` | `s.` | `z.` | | | | | | | | | | | | |
| `frc` | `P` | `B` | `f` | `v` | `T` | `D` | | | | | | | | | `C` | `C<vcd>` | `x` | `Q` | `X` | `g"` | `H` | `H<vcd>` | `h` | `h<?>` |
| `lat`&#xA0;`frc` | | | | | | | `s<lat>` | `z<lat>` | | | | | | | | | | | | | | | | |
| `apr` | | | | `r<lbd>` | | `r[` | | `r` | | | | `r.` | | | | `j` | | `j<vel>` | | `g"` | | | | |
| `lat`&#xA0;`apr` | | | | | | `l[` | | `l` | | | | `l.` | | | | `l^` | | `L` | | | | | | |
| `flp` | | | | | | | | `*` | | | | `*.` | | | | | | | | | | | | |
| `lat`&#xA0;`flp` | | | | | | | | `*<lat>` | | | | | | | | | | | | | | | | |
| `trl` | | `b<trl>` | | | | | | `r<trl>` | | | | | | | | | | | | `r"` | | | | |
| `clk` | `p!` | | | | `t!` | | `c!` | | | | | | | | `c!` | | `k!` | | | | | | | |
| `lat`&#xA0;`clk` | | | | | | | `l!` | | | | | | | | | | | | | | | | | |
| `imp` | | `` b` `` | | | | `` d` `` | | `` d` `` | | | | | | | | `` J` `` | | `` g` `` | | `` G` `` | | | | |
| `ejc` | `` p` `` | | | | `` t[` `` | | `` t` `` | | | | | | | | `` c` `` | | `` k` `` | | `` q` `` | | | | | |
| `ejc`&#xA0;`frc` | | | | | | | | | | | | | | | | | | | | | | | | |
| `lat`&#xA0;`ejc`&#xA0;`frc` | | | | | | | | | | | | | | | | | | | | | | | | |

1. `h` is transcribed as `{glt,apr}` in ASCII-IPA. It is transcribed as
`{glt,frc}` in this document to match the assignment used in the
Wikipedia IPA table, and the placement in other documents in eSpeak NG.
The IPA chart places them as being `{frc}` or `{apr}`, as does the main
Wikipedia page for the `[h]` phoneme.

1. The `` q` `` phoneme is incorrectly placed in the segment table (Appendix D)
of the ASCII-IPA document. According to the diacritics list (Appendix C),
the `` ` `` character denotes an `{ejc}` phoneme for a `{vls}` base
phoneme, not `{imp}` which is used for `{vcd}` base phonemes. The table
above uses the correct assignment.

1. The `L` phoneme has an alternate meaning in ASCII-IPA. This document only
describes the variant preferred by Evan Kirshenbaum.

### Other Symbols

| | `bld` | | `alv` | | `pla` | | `pal` | | `lbv` | | `vel` | |
|-----------------------------|-------|-------|-------|-------|-------|-------|-------|-------|----------|----------|----------|-------|
| | `vls` | `vcd` | `vls` | `vcd` | `vls` | `vcd` | `vls` | `vcd` | `vls` | `vcd` | `vls` | `vcd` |
| `nas` | | | | | | | | | | `n<lbv>` | | |
| `stp` | | | | | | | | | `t<lbv>` | `d<lbv>` | | |
| `afr` | | | | | | | | | | | | |
| `vzd`&#xA0;`frc` | | | | | | | | | | | | |
| `ptr`&#xA0;`apr` | | | | | | | | | | | `w<vls>` | `w` |
| `fzd`&#xA0;`lat`&#xA0;`apr` | | | | | | | | | | | | |

1. The `w` phoneme is transcribed as being `{vcd,lbv,frc}` or `{lbv,apr}`, and
`w<vls>` as `{vls,lbv,frc}` in ASCII-IPA. They are transcribed as
`{ptr,vel,apr}` in this document to match their definition as labialized
velar approximants in the IPA chart.

## Vowels

| | `fnt` | | `cnt` | | `bck` | |
|--------|-------|-------|----------|-------|-------|-------|
| | `unr` | `rnd` | `unr` | `rnd` | `unr` | `rnd` |
| `hgh` | `i` | `y` | `i"` | `u"` | `u-` | `u` |
| `smh` | `I` | `I.` | | | | `U` |
| `umd` | `e` | `Y` | `@<umd>` | | `o-` | `o` |
| `mid` | | | `@` | `@.` | | |
| `lmd` | `E` | `W` | `V"` | `O"` | `V` | `O` |
| `sml` | `&` | | | | | |
| `low` | `a` | `a.` | | | `A` | `A.` |

1. The `&` phoneme is transcribed as being `{low,fnt,unr,vwl}`, and `a` as
`{low,cnt,unr,vwl}` in ASCII-IPA. This does not match their placement in
the IPA chart according to the Unicode characters those ASCII-IPA
phonemes map to. As such, the table above reflects their IPA chart
positions.

1. The `&.` phoneme is defined in the segment table (Appendix D) of the
ASCII-IPA definition. According to the IPA charts, this is not a valid
phoneme. As such, it is not listed in the table above.

1. The `W` and `Y` phonemes have an alternate meaning in ASCII-IPA. This
document only describes the variant preferred by Evan Kirshenbaum.

### Other Symbols

| Symbol | Features |
|----------|-------------------------------|
| `R` | `unr` `mid` `cnt` `rzd` `vwl` |
| `R<umd>` | `unr` `lmd` `cnt` `rzd` `vwl` |

1. The `R<umd>` phoneme is transcribed as being `{umd,cnt,unr,rzd,vwl}` in
ASCII-IPA. However, the IPA phoneme it is representing is based on the
`{lmd}` vowel. The table above keeps the `<umd>` modifying feature to be
compatible with ASCII-IPA, but assigns it to `{lmd}` in this case to be
compatible with the IPA chart.

1. The `R` phoneme has an alternate meaning in ASCII-IPA. This document only
describes the variant preferred by Evan Kirshenbaum.

## Diacritics

| Symbol | Feature |
|--------|---------|
| `:` | `{lng}` |
| `!` | `{clk}` |
| `[` | `{dnt}` |
| `;` | `{pzd}` |
| `^` | `{pal}` |
| `<H>` | `{fzd}` |
| `<h>` | `{asp}` |
| `<o>` | `{unx}` |
| `<r>` | `{rzd}` |
| `<w>` | `{ptr}` |
| `<?>` | `{brv}` |

1. The `<o>` diacritic has an alternate meaning in ASCII-IPA. This document
only describes the variant preferred by Evan Kirshenbaum.

1. The `{ptr}` (protruded) feature is specified as `{lzd}` (labialized) in
ASCII-IPA.

1. The `{brv}` (breathy voice) feature is specified as `{mrm}` (murmured)
in ASCII-IPA.

The ASCII-IPA transcription scheme defines several diacritics that vary in
meaning depending on whether the base phoneme is a vowel or consonant.

| Symbol | Vowel | Consonant |
|--------|---------|-----------|
| `~` | `{nzd}` | `{vzd}` |
| `-` | `{unr}` | `{syl}` |
| `.` | `{rnd}` | `{rfx}` |
| `"` | `{cnt}` | `{uvl}` |

The ASCII-IPA transcription scheme defines several diacritics that vary in
meaning depending on the voicing of the base phoneme.

| Symbol | Voiceless | Voiced |
|---------|-----------|---------|
| `` ` `` | `{ejc}` | `{imp}` |

## References

1. <a name="ref1"></a> Kirshenbaum, Evan,
[Representing IPA phonetics in ASCII](https://web.archive.org/web/20160419125856/http://www.kirshenbaum.net/IPA/faq.html) (HTML). 04-01-1993.
Updated 30-04-2003.

2. <a name="ref2"></a> Kirshenbaum, Evan,
[Representing IPA phonetics in ASCII](https://web.archive.org/web/20160419125856/http://www.kirshenbaum.net/IPA/ascii-ipa.pdf) (PDF). 22-02-2001.
Updated 06-09-2011.

Loading…
Cancel
Save