Reece H. Dunn
69c976f859
jp: use 'a' instead of 'a_"'.
8 years ago
Reece H. Dunn
d192fd9c55
Remove the IPA phoneme table and associated und-fonipa voice.
Supporting phoneme-based voices in espeak-ng is currently complex,
and has issues that need to be fixed in the core code before
adding support for them.
8 years ago
Reece H. Dunn
822732dfbf
Revert the ipa phoneme changes.
These are causing problems with several of the voices.
Revert "ipa: n (vcd alv nas)." commit 8869c72d97.
Revert "ipa: m (vcd blb nas)." commit 581b366bb0.
Revert "ipa: ɳ (vcd rfx nas)." commit 7cc3a6c0c2.
Revert "ipa: ɲ (vcd pal nas)." commit eda7bc03d6.
Revert "ipa: ŋ (vcd vel nas)." commit efeca3b275.
Revert "ipa: document the supported pulmonic consonants in ph_ipa as an IPA chart." commit 675cec2fac.
Revert "ipa: ɹ (vcd alv apr)." commit 0f41dae8e9.
Revert "ipa: t (vls alv stp)." commit a49729cd70.
Revert "ipa: ensure that each phoneme has an ipa annotation." commit 3cdb952ea3.
Revert "ipa: t̪ (vls dnt stp)." commit dd30a61186.
Revert "ipa: d (vcd alv stp)" commit a96f748470.
Revert "ipa: d̪ (vcd dnt stp)" commit 39f7e7b2f6.
Revert "ipa: d͡ʒ (vcd pla sib afr)" commit 16be8f19bf.
Revert "ipa: d͡ʑ (vcd alp sib afr)." commit 883187cfb5.
Revert "ipa: ɟ (vcd pal stp)." commit 2e682bcdc5.
Revert "ipa: g (vcd vel stp)." commit 679b939474.
Revert "ipa: b (vcd blb stp)." commit fb6bd384ee.
Revert "ipa: j (vcd pal apr)." commit 1d36966b9f.
Revert "ipa: w (vcd ptr vel apr)." commit efade0fc71.
Revert "ipa: β (vcd blb frc)." commit 063ee44a4f.
Revert "ipa: v (vcd lbd frc)." commit 3374b13ad6.
Revert "ipa: ʋ (vcd lbd apr)." commit 0a55fe6ad4.
Revert "ipa: fix the 'Other Symbols' table." commit 440fb6d4bb.
Revert "ipa: ð (vcd dnt frc)." commit 6eb518ee0f.
Revert "ipa: d (vcd alv frc)." commit 0574f488c3.
Revert "ipa: ʒ (vcd pla frc)." commit 7312b1b2a2.
Revert "ipa: ʐ (vcd rfx frc)." commit dc937cf1de.
Revert "ipa: ʑ (vcd alp sib frc)." commit 8b55428ece.
Revert "ipa: ʝ (vcd pal frc)." commit 736fa69471.
Revert "ipa: ɣ (vcd vel frc)." commit 237af08312.
Revert "ipa: ʁ (vcd uvl frc)." commit 09601af6f5.
Revert "ipa: t͡ʃ (vls pla sib afr)" commit d170b0c024.
Revert "ipa: t͡ɕ (vls alp sib afr)" commit 88bdbe9256.
Revert "ipa: c (vls pal stp)" commit e836376922.
Revert "ipa: q (vls uvl stp)" commit 695a9007aa.
Revert "ipa: l (vcd alv lat apr)." commit a5e5202ca5.
Revert "ipa: ɫ (vcd alv fzd lat apr)." commit 217eceeb34.
Revert "ipa: ɭ (vcd rfx lat apr)." commit b7428eb443.
Revert "ipa: ʎ (vcd pal lat apr)." commit 74c3ff5c97.
Revert "ipa: ʟ (vcd vel lat apr)." commit 9796200f26.
Revert "ipa: p (vls blb stp)." commit 302decc882.
Revert "ipa: k (vls vel stp)." commit d4a2846837.
Revert "ipa: f (vls lbd frc)." commit 175d3c0299.
Revert "ipa: θ (vls dnt frc)." commit d2a04674e8.
Revert "ipa: s (vls alv sib frc)." commit ce810f3cd8.
Revert "ipa: ʃ (vls pla sib frc)." commit dce61376d8.
Revert "ipa: ʂ (vls rfx sib frc)." commit 1135ff1013.
Revert "ipa: zʲ (vls alv sib frc pzd)." commit d8aa055697.
Revert "ipa: sʲ (vls alv sib frc pzd)." commit e51dc6e61a.
Revert "ipa: ɕ (vls alp sib frc)." commit cc7127d26e.
Revert "ipa: ɬ (vls alv lat frc)." commit e338295e19.
Revert "ipa: ç (vls pal frc)." commit f05a32bf0f.
Revert "ipa: x (vls vel frc)." commit c88c43eecc.
Revert "ipa: χ (vls uvl frc)." commit 5cb8ce12bd.
Revert "ipa: h (vls glt frc)." commit 98355a0f47.
Revert "ipa: t͡s (vls alv sib afr)." commit dd66fe41e3.
Revert "ipa: t͡s (vls alv sib afr)." commit 2c45f60588.
Revert "ipa: d͡z (vcd alv sib afr)" commit 1cd9bf80b3.
Revert "ipa: ʍ (vls ptr vel frc)" commit 923bfdb82c.
Revert "base1: Restore the original eSpeak definitions for /l/. The ipa table versions are causing issues." commit 299c91aca1.
8 years ago
Alberto Pettarin
0241fb47d2
emscripten/Makefile support for both Linux, macOS
8 years ago
Reece H. Dunn
7b8fa3660d
Install the encoding.h and tokenizer.h header files.
8 years ago
Reece H. Dunn
dd90d3812d
tokenizer.c: Support general symbol tokens.
8 years ago
Reece H. Dunn
786575c6ed
tokenizer.c: Support general punctuation tokens.
8 years ago
Reece H. Dunn
0705844bf8
tokenizer.c: Move general category classification that does not override property behaviour to the end, for generic classification.
8 years ago
Reece H. Dunn
683579f403
Make the tokenizer.h API public.
8 years ago
Reece H. Dunn
9af96da469
Make the encoding.h API public.
8 years ago
Reece H. Dunn
55bfbb4754
tokenizer.c: Support ellipsis tokens.
8 years ago
Reece H. Dunn
706e780ff4
Merge remote-tracking branch 'pettarin/master'
8 years ago
Alberto Pettarin
3b4487e8a7
Updated directions to compile JS with emscripten
8 years ago
Alberto Pettarin
123309a07b
Added git ignore for emscripten in UCD tools
8 years ago
Reece H. Dunn
b847df63b5
tokenizer.c: Support semicolon tokens.
8 years ago
Alberto Pettarin
6ce74efeca
Fixed selection of default voice in JS demo
8 years ago
Reece H. Dunn
af7e8fc5a3
tokenizer.c: Support colon tokens.
8 years ago
Reece H. Dunn
7560070dcd
tokenizer.c: Support comma tokens.
8 years ago
Reece H. Dunn
c9199cfacb
tokenizer.c: Support exclamation mark tokens.
8 years ago
Reece H. Dunn
128ceaff6a
tokenizer.c: Support question mark tokens.
8 years ago
Reece H. Dunn
8f62e18324
tokenizer.c: Support full stop tokens.
8 years ago
Reece H. Dunn
0bbc9e9730
Merge remote-tracking branch 'Christianlm/master'
8 years ago
chrislm
5d8bb74169
IT: new improvements tested in April 2017
Reduced length to 160 for unstressed syllables
Added some exceptions to the Italian dictionaries
8 years ago
Reece H. Dunn
d50f3f2fa5
tokenizer.c: Support word tokens.
8 years ago
Reece H. Dunn
a902f451d8
tests/tokenizer.test: Support printing the tokens from a provided file, making it easy to investigate tokenizer issues.
8 years ago
Reece H. Dunn
d093513b65
tokenizer.c: Add an options parameter to the tokenizer_reset API.
8 years ago
Reece H. Dunn
c41ac642fa
tokenizer.c: Tokenise Zp codepoints as paragraphs.
8 years ago
Reece H. Dunn
f3ea6f68f3
tokenizer.c: Tokenise U+000B [VERTICAL TAB (VT)] as whitespace, not as newlines.
8 years ago
Reece H. Dunn
fc7a4e6701
tokenizer.c: Recognise U+000C [FORM FEED (FF)] as a newline codepoint.
8 years ago
Reece H. Dunn
d2d718d700
tokenizer.c: Tokenize line separator codepoints as newline tokens.
8 years ago
Reece H. Dunn
bf45e7ce36
tokenizer.c: Recognise U+0085 [NEW LINE (NEL)] as a newline codepoint.
8 years ago
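The newline-related tokenizer commits in this log (LF, CR, CR LF, FF, NEL, line separator, paragraph separator, and VT as whitespace) can be read together as a classification along the following lines. This is only a sketch of the behaviour the commit messages describe: `cp_class`, `classify_codepoint` and `newline_length` are hypothetical names, not the identifiers used in tokenizer.c, and the whitespace case covers only three codepoints for illustration.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical classification, not the eSpeak NG codepoint_type helper. */
typedef enum { CP_OTHER, CP_WHITESPACE, CP_NEWLINE, CP_PARAGRAPH } cp_class;

cp_class classify_codepoint(uint32_t c)
{
    switch (c) {
    case 0x000A: /* LINE FEED (Linux newline) */
    case 0x000C: /* FORM FEED */
    case 0x000D: /* CARRIAGE RETURN (classic Mac newline) */
    case 0x0085: /* NEXT LINE (NEL) */
    case 0x2028: /* LINE SEPARATOR (general category Zl) */
        return CP_NEWLINE;
    case 0x2029: /* PARAGRAPH SEPARATOR (general category Zp) */
        return CP_PARAGRAPH;
    case 0x0009: /* CHARACTER TABULATION */
    case 0x000B: /* VERTICAL TAB: whitespace, not a newline */
    case 0x0020: /* SPACE */
        return CP_WHITESPACE;
    default:
        return CP_OTHER;
    }
}

/* Number of codepoints in the newline starting at s[0], or 0 if s does not
 * start with a newline. CR LF (Windows) is consumed as a single newline. */
size_t newline_length(const uint32_t *s, size_t n)
{
    if (n == 0 || classify_codepoint(s[0]) != CP_NEWLINE)
        return 0;
    if (s[0] == 0x000D && n > 1 && s[1] == 0x000A)
        return 2;
    return 1;
}
```

Treating CR LF as one unit is what makes a one-codepoint lookahead useful in a tokenizer of this kind.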
Reece H. Dunn
df6ca7a22c
tokenizer.c: Support whitespace tokens.
8 years ago
Reece H. Dunn
539edac795
tokenizer.c: Create a codepoint_type helper function to classify codepoints for the tokenizer.
8 years ago
Reece H. Dunn
7b1243679f
Update the document for the new fr-CH French accent.
8 years ago
Reece H. Dunn
4bc3f15e79
Generalize the exclusion of Windows batch files.
8 years ago
claude beazley
c05e3898a4
Adding Swiss French Variant
Creates the Swiss French language variant, primarily for counting:
Swiss French uses huitante for 80 and, like Belgian French, septante
and nonante for 70 and 90.
8 years ago
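To make the counting difference in this commit concrete, here is a small sketch of the tens words per variant. It is an illustration only: `french_tens_word` is a hypothetical helper, the `fr-BE` tag for Belgian French is an assumption, and eSpeak NG itself handles numbers through its language data rather than through code like this.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical helper, not part of eSpeak NG. Returns the word for 70, 80
 * or 90 in the given French variant, or NULL for any other value. */
const char *french_tens_word(const char *variant, int tens)
{
    int swiss = strcmp(variant, "fr-CH") == 0;
    int belgian = strcmp(variant, "fr-BE") == 0; /* assumed tag */

    switch (tens) {
    case 70: return (swiss || belgian) ? "septante" : "soixante-dix";
    case 80: return swiss ? "huitante" : "quatre-vingts";
    case 90: return (swiss || belgian) ? "nonante" : "quatre-vingt-dix";
    default: return NULL;
    }
}
```

Only 80 separates the Swiss variant from the Belgian one in this sketch; 70 and 90 are shared.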
Reece H. Dunn
8f0dae6a38
tokenizer.c: Support windows newlines.
8 years ago
Reece H. Dunn
b897ff5aa8
encoding.c: Support calling peekc past the end of the buffer. This makes calling peekc easier.
8 years ago
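The point of allowing peekc past the end of the buffer is that callers no longer need a bounds check before every lookahead. A minimal sketch of that idea over plain bytes follows. The `byte_reader` type, both function names, and the choice of 0 as the end-of-buffer value are all assumptions made for illustration; they are not the encoding.h API.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical reader over a byte buffer, not the encoding.h decoder. */
typedef struct {
    const uint8_t *current; /* next byte to read */
    const uint8_t *end;     /* one past the last byte */
} byte_reader;

/* Return the next byte without consuming it. At or past the end this
 * returns 0 (an assumed sentinel) instead of reading out of bounds. */
uint32_t reader_peekc(const byte_reader *r)
{
    if (r == NULL || r->current >= r->end)
        return 0;
    return *r->current;
}

/* Return the next byte and advance; 0 once the buffer is exhausted. */
uint32_t reader_getc(byte_reader *r)
{
    if (r == NULL || r->current >= r->end)
        return 0;
    return *r->current++;
}
```

With this shape, a CR LF check can peek after reading the CR even when the CR is the last byte of the input.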
Reece H. Dunn
3f692f498b
encoding.c: Implement a peekc API.
8 years ago
Reece H. Dunn
1c8ed9c190
tokenizer.c: Support mac newlines.
8 years ago
Reece H. Dunn
7602c9ac18
tokenizer.c: Support linux newlines.
8 years ago
Reece H. Dunn
bce44316bb
Create a basic tokenizer API using a structure that mirrors the TtsTokenizer interface in the tts-dev-studio project.
8 years ago
Reece H. Dunn
3cc53d98f4
Add ucd.h to tokenizer.c to provide the definition of the ucd_category identifier for the emscripten build.
8 years ago
Reece H. Dunn
ee61cc4358
Fix running 'make clean' when gradle is not present. Gradle is used for the Android build and is not needed when just building eSpeak NG on Linux/BSD systems.
8 years ago
Reece H. Dunn
a72199f714
Run the tests as part of the Travis build.
8 years ago
Reece H. Dunn
61d668c0cb
ucd-tools: Inverted_Terminal_Punctuation eSpeakNG extended property support; use in clause_type_from_codepoint.
8 years ago
Reece H. Dunn
5c6bc0e556
Armenian emphasis mark (U+055B) is used for interjections, so treat it as an exclamation mark.
8 years ago
Reece H. Dunn
bc13173ac4
ucd-tools: Punctuation_In_Word eSpeakNG extended property support; use in clause_type_from_codepoint.
8 years ago
Reece H. Dunn
1131d0924b
ucd-tools: Optional_Space_After eSpeakNG extended property support; use in clause_type_from_codepoint.
8 years ago
Reece H. Dunn
b932f3c493
ucd-tools: Extended_Dash eSpeakNG extended property support; use in clause_type_from_codepoint.
8 years ago