Browse Source

cmn: handle latin characters as English text.

cmn now handles all latin characters as English.

The old functionality of assuming latin characters are pinyin can be
achieved with new language cmn-Latn-pinyin.
master
Juho Hiltunen 4 years ago
parent
commit
4825905835
4 changed files with 48 additions and 2 deletions
  1. 4
    0
      CHANGELOG.md
  2. 30
    1
      dictsource/cmn_rules
  3. 3
    1
      espeak-ng-data/lang/sit/cmn
  4. 11
    0
      espeak-ng-data/lang/sit/cmn-Latn-pinyin

+ 4
- 0
CHANGELOG.md View File

* Added voice variants * Added voice variants
* Renamed zh to cmn (Mandarin) * Renamed zh to cmn (Mandarin)
* Renamed zhy to yue (Cantonese) * Renamed zhy to yue (Cantonese)
* cmn (Mandarin) now assumes all latin characters all English text.
Use cmn-latn-pinyin for interpreting latin characters as pinyin.




bug fixes: bug fixes:
* Fix reading malformed SSML (Christopher Brannon) * Fix reading malformed SSML (Christopher Brannon)

+ 30
- 1
dictsource/cmn_rules View File

// This file is UTF8 encoded // This file is UTF8 encoded


// Default is to handle latin characters as pinyin
// ?1: speak latin characters as English words

.replace .replace
//replace tone mark with tone number //replace tone mark with tone number
ˉ 1 ˉ 1
// 儿 兒 erhua // 儿 兒 erhua


.group a .group a
?1 a _^_EN
a A a A
ai ai ai ai
a1i ai55 a1i ai55
y) a (DngK iA y) a (DngK iA


.group b .group b
?1 b _^_EN
b p b p
@) b (K _^_EN @) b (K _^_EN


.group c .group c
?1 c _^_EN
c tsh c tsh
ch (+ ts.h ch (+ ts.h
@) c (K _^_EN @) c (K _^_EN


.group d .group d
?1 d _^_EN
d t d t
@) d (K _^_EN @) d (K _^_EN


.group e .group e
?1 e _^_EN
e o- e o-
d) e @ d) e @
t) e @ t) e @
e4r (K @r51 e4r (K @r51


.group f .group f
?1 f _^_EN
f f f f
@) f (K _^_EN @) f (K _^_EN


.group g .group g
?1 g _^_EN
g k g k
@) g (K _^_EN @) g (K _^_EN


.group h .group h
?1 h _^_EN
h X h X
@) h (K _^_EN @) h (K _^_EN


.group i .group i
?1 i _^_EN
i i //i in ing i i //i in ing
z) i i[ z) i i[
c) i i[ c) i i[
iu iou iu iou


.group j .group j
?1 j _^_EN
j tS; j tS;
@) j (K _^_EN @) j (K _^_EN


.group k .group k
?1 k _^_EN
k kh k kh
@) k (K _^_EN @) k (K _^_EN


.group l .group l
?1 l _^_EN
l l l l
@) l (K _^_EN @) l (K _^_EN
@) l (v l @) l (v l


.group m .group m
?1 m _^_EN
m m m m
@) m (K _^_EN @) m (K _^_EN


.group n .group n
?1 n _^_EN
n n n n
ng (K N // consider (ng+vowel) as (n g+vowel) ?? ng (K N // consider (ng+vowel) as (n g+vowel) ??
_) ng (K N- // syllablic [N] _) ng (K N- // syllablic [N]


.group o .group o
?1 o _^_EN
o o o o


ou ou ou ou
y) o4u iou51 y) o4u iou51


.group p .group p
?1 p _^_EN
p ph p ph
@) p (K _^_EN @) p (K _^_EN


.group q .group q
?1 q _^_EN
q tS;h q tS;h
@) q (K _^_EN @) q (K _^_EN


.group r .group r
?1 r _^_EN
r z. r z.
r (K @r11 r (K @r11


.group s .group s
?1 s _^_EN
s s s s
sh (+ s. sh (+ s.
@) s (K _^_EN @) s (K _^_EN


.group t .group t
?1 t _^_EN
t th t th
@) t (K _^_EN @) t (K _^_EN


.group u .group u
?1 u _^_EN
u u u u
ua wA ua wA
ua (DnK ua ua (DnK ua




.group ü .group ü
?1 ü _^_EN
ü y ü y
üe yE üe yE


.group v //variant of ü .group v //variant of ü
?1 v _^_EN
v v // foreign words v v // foreign words
l) v y //ü l) v y //ü
n) v y //ü n) v y //ü
n) ve yE //üe n) ve yE //üe


.group w .group w
?1 w _^_EN
@) w (K _^_EN @) w (K _^_EN
wa wA //wa wan wang wa wA //wa wan wang
wai wai wai wai
wu wu wu wu


.group x .group x
?1 x _^_EN
x S; x S;
@) x (K _^_EN @) x (K _^_EN


.group y .group y
?1 y _^_EN
y j //before a o e i y j //before a o e i
y (u //NULL before u y (u //NULL before u
y (uK ; y (uK ;
@) y (K _^_EN @) y (K _^_EN


.group z .group z
?1 z _^_EN
z ts z ts
zh (+ ts. zh (+ ts.
@) z (K _^_EN @) z (K _^_EN
5 11 5 11


| _| | _|


+ 3
- 1
espeak-ng-data/lang/sit/cmn View File

name Chinese (Mandarin)
name Chinese (Mandarin, latin as English)
language cmn language cmn
language zh-cmn language zh-cmn
language zh language zh


dict_min 100000 dict_min 100000


dictrules 1 // interpret latin characters as English text

//for some dialects //for some dialects


//[en]: replace ng with n //[en]: replace ng with n

+ 11
- 0
espeak-ng-data/lang/sit/cmn-Latn-pinyin View File

name Chinese (Mandarin, latin as Pinyin)
language cmn-latn-pinyin
language zh-cmn
language zh

phonemes cmn
dictionary cmn
words 1
pitch 80 118

dict_min 100000

Loading…
Cancel
Save