Reece H. Dunn
|
3cc53d98f4
|
Add ucd.h to tokenizer.c to provide the definition of the ucd_category identifier for the emscripten build.
|
8 years ago |
Reece H. Dunn
|
61d668c0cb
|
ucd-tools: Inverted_Terminal_Punctuation eSpeakNG extended property support; use in clause_type_from_codepoint.
|
8 years ago |
Reece H. Dunn
|
5c6bc0e556
|
Armenian emphasis mark (U+055B) is used for interjections, so treat it as an exclamation mark.
|
8 years ago |
Reece H. Dunn
|
bc13173ac4
|
ucd-tools: Punctuation_In_Word eSpeakNG extended property support; use in clause_type_from_codepoint.
|
8 years ago |
Reece H. Dunn
|
1131d0924b
|
ucd-tools: Optional_Space_After eSpeakNG extended property support; use in clause_type_from_codepoint.
|
8 years ago |
Reece H. Dunn
|
b932f3c493
|
ucd-tools: Extended_Dash eSpeakNG extended property support; use in clause_type_from_codepoint.
|
8 years ago |
Reece H. Dunn
|
3100ca9d1b
|
Use ucd_properties to implement clause_type_from_codepoint for supported types.
|
8 years ago |
Reece H. Dunn
|
1c4ce3dcd3
|
tokenizer.c: create and use a clause_type_from_codepoint function, with tests.
|
8 years ago |