79 Commits (ef25dae11a1dba6c4b897c19d00fd81b471bbf77)

Author SHA1 Message Date
  Reece H. Dunn ea0dcc9473 Testing UTS 51 (5.0-12) ED-14c. emoji keycap sequence 8 years ago
  Reece H. Dunn 37819f0781 Testing UTS 51 (5.0-12) ED-14b. emoji combining sequence 8 years ago
  Reece H. Dunn 6f39beb154 Testing UTS 51 (5.0-12) ED-14. emoji tag sequence (tag_base = emoji character) 8 years ago
  Reece H. Dunn 421518e8cc Testing UTS 51 (5.0-12) ED-14. emoji flag sequence 8 years ago
  Reece H. Dunn 40eed16247 Testing UTS 51 (5.0-12) ED-13. emoji modifier sequence 8 years ago
  Reece H. Dunn 281cb5513e Testing UTS 51 (5.0-12) ED-9a. emoji presentation sequence 8 years ago
  Reece H. Dunn e02ab7d0c2 Testing UTS 51 (5.0-12) ED-8a. text presentation sequence 8 years ago
  Reece H. Dunn f94f7effcd Test ReadClause with a Latin-character sentence. 8 years ago
  Reece H. Dunn fbd2e679fc Don't crash if loading the default voice fails. 8 years ago
  Reece H. Dunn dced9f58fd Clean up the decoder memory in espeak_Terminate. 8 years ago
  Reece H. Dunn 63e2826753 Ensure that the old translator is not used after calling espeak_Terminate. 8 years ago
  Reece H. Dunn ae6116bf97 Fix calling espeak_Synth without first setting a voice, with tests. 8 years ago
  Reece H. Dunn 3189d194f1 Testing UTS 51 (5.0-12) ED-3. emoji character 8 years ago
  Reece H. Dunn 27a08c5fea Test ReadClause on single character emoji sequences. 8 years ago
  Reece H. Dunn 5aa47e01ea Simplify the readclause test names. 8 years ago
  Reece H. Dunn 37bae427c4 Remove the current voice tests. 8 years ago
  Reece H. Dunn 48ca2239bb Fix non-Latin character languages falling back to English when reading Latin characters. 8 years ago
  Reece H. Dunn d674529b9c tests/languages.test: Rename the test function to test_lang, to avoid a conflict with the test command. 8 years ago
  Reece H. Dunn 5030ff95cc automake: don't make the tests print 'done', have the make rule print 'PASSED' instead. 8 years ago
  Reece H. Dunn 8821c9e361 Split out the readclause tests so the tokenizer tests can use public-only APIs. 8 years ago
  Reece H. Dunn d72557aed2 Add simple tests for testing voice selection by name. 8 years ago
  Reece H. Dunn 6ec3e85007 Add language.tests for de, en and jp to test the phoneme generation. 8 years ago
  Reece H. Dunn dd90d3812d tokenizer.c: Support general symbol tokens. 8 years ago
  Reece H. Dunn 786575c6ed tokenizer.c: Support general punctuation tokens. 8 years ago
  Reece H. Dunn 683579f403 Make the tokenizer.h API public. 8 years ago
  Reece H. Dunn 9af96da469 Make the encoding.h API public. 8 years ago
  Reece H. Dunn 55bfbb4754 tokenizer.c: Support ellipsis tokens. 8 years ago
  Reece H. Dunn b847df63b5 tokenizer.c: Support semicolon tokens. 8 years ago
  Reece H. Dunn af7e8fc5a3 tokenizer.c: Support colon tokens. 8 years ago
  Reece H. Dunn 7560070dcd tokenizer.c: Support comma tokens. 8 years ago
  Reece H. Dunn c9199cfacb tokenizer.c: Support exclamation mark tokens. 8 years ago
  Reece H. Dunn 128ceaff6a tokenizer.c: Support question mark tokens. 8 years ago
  Reece H. Dunn 8f62e18324 tokenizer.c: Support full stop tokens. 8 years ago
  Reece H. Dunn d50f3f2fa5 tokenizer.c: Support word tokens. 8 years ago
  Reece H. Dunn a902f451d8 tests/tokenizer.test: Support printing the tokens from a provided file, making it easy to investigate tokenizer issues. 8 years ago
  Reece H. Dunn d093513b65 tokenizer.c: Add an options parameter to the tokenizer_reset API. 8 years ago
  Reece H. Dunn c41ac642fa tokenizer.c: Tokenise Zp codepoints as paragraphs. 8 years ago
  Reece H. Dunn f3ea6f68f3 tokenizer.c: Tokenise U+000B [VERTICAL TAB (VT)] as whitespace, not as newlines. 8 years ago
  Reece H. Dunn fc7a4e6701 tokenizer.c: Recognise U+000C [FORM FEED (FF)] as a newline codepoint. 8 years ago
  Reece H. Dunn d2d718d700 tokenizer.c: Tokenize line separator codepoints as newline tokens. 8 years ago
  Reece H. Dunn bf45e7ce36 tokenizer.c: Recognise U+0085 [NEW LINE (NEL)] as a newline codepoint. 8 years ago
  Reece H. Dunn df6ca7a22c tokenizer.c: Support whitespace tokens. 8 years ago
  Reece H. Dunn 8f0dae6a38 tokenizer.c: Support windows newlines. 8 years ago
  Reece H. Dunn b897ff5aa8 encoding.c: Support calling peekc past the end of the buffer. This makes calling peekc easier. 8 years ago
  Reece H. Dunn 3f692f498b encoding.c: Implement a peekc API. 8 years ago
  Reece H. Dunn 1c8ed9c190 tokenizer.c: Support mac newlines. 8 years ago
  Reece H. Dunn 7602c9ac18 tokenizer.c: Support linux newlines. 8 years ago
  Reece H. Dunn bce44316bb Create a basic tokenizer API using a structure that mirrors the TtsTokenizer interface in the tts-dev-studio project. 8 years ago
  Reece H. Dunn 5c6bc0e556 Armenian emphasis mark (U+055B) is used for interjections, so treat it as an exclamation mark. 8 years ago
  Reece H. Dunn 1c4ce3dcd3 tokenizer.c: create and use a clause_type_from_codepoint function, with tests. 8 years ago