Ben
c98ed977f2
Adding IPA output to emscripten js library and demo
7 years ago
Reece H. Dunn
22270bd259
Fix reading hash table entries > 128.
This is related to ebfa320956169e3419234b72fee51bd596867661, but
when reading the hash chain entry length, not writing it.
If char is signed, then before this change the length would be
negative, causing problems loading the dictionary.
7 years ago
Reece H. Dunn
ebfa320956
Fix storing the line length in the hash chain.
The length is stored as the first byte in the output from
compile_line. As the data pointer is char, if char is signed then
length could be negative resulting in undefined behaviour. This
commit fixes the issue by reading and writing that byte as a
uint8_t.
This bug was caused by 2a00ca79f6.
Previously, the entries could only be a maximum of 128 bytes, and
would not be negative on platforms with signed chars. That
commit was made to support long emoji entries, especially for
non-Latin languages where the utf-8 representations could be
longer than 128 bytes.
This change also adds some documentation to make it clearer what
is going on. NOTE: The code should really be using actual struct
objects instead of writing to opaque char buffers.
Reported by Reef Turner <[email protected] >
7 years ago
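The signed-char problem described in the two commits above can be illustrated with a minimal sketch. The helper names below are hypothetical and are not the espeak-ng functions; the only thing taken from the commit messages is that the first byte of an entry holds its length, the buffer is addressed through a char pointer, and the fix is to read and write that byte as a uint8_t.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helpers for illustration only (not the espeak-ng code).
   The first byte of an entry holds the entry length. */

/* Write the length as an unsigned byte, whatever the signedness of char. */
static void store_entry_length(char *entry, size_t length)
{
	assert(length <= UINT8_MAX); /* one byte can only hold 0..255 */
	*(uint8_t *)entry = (uint8_t)length;
}

/* Naive read: where char is signed, a stored length above 127
   comes back as a negative number. */
static int read_entry_length_naive(const char *entry)
{
	return entry[0];
}

/* Read via uint8_t so the result is always in the range 0..255. */
static int read_entry_length(const char *entry)
{
	return (uint8_t)entry[0];
}
```

With a stored length of 200, read_entry_length returns 200 on every platform, while read_entry_length_naive returns 200 only where char is unsigned and a negative value where char is signed.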
Reece H. Dunn
e7ac4b819d
hash_counts is never used, so remove it.
Reported by Reef Turner <[email protected] >
7 years ago
Reece H. Dunn
0383e3525a
Merge commit '0994206f710a4defc1eecfb78ab70ff57c58fcda'
7 years ago
Reece H. Dunn
921229259d
Use int to store the value from GetFileLength.
This fixes the clang warning:
comparison of unsigned expression < 0 is always false
[-Wtautological-compare]
Reported by Reef Turner <[email protected] >
7 years ago
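The warning quoted in the commit above arises whenever a possibly negative result is stored in an unsigned variable before being tested against zero. The sketch below uses a hypothetical stand-in function, on the assumption that the real GetFileLength reports failure with a negative value; it is not the espeak-ng code.

```c
/* Hypothetical stand-in (an assumption, not the espeak-ng function):
   returns a length in bytes, or a negative value on failure. */
static int get_length_or_error(int simulate_failure)
{
	return simulate_failure ? -1 : 1024;
}

/* Storing the result in an int keeps the sign, so the error check works.
   Had the variable been declared as `unsigned int length`, the test
   `length < 0` could never be true, which is what clang reports with
   -Wtautological-compare. */
static int load_failed(int simulate_failure)
{
	int length = get_length_or_error(simulate_failure);
	return length < 0;
}
```

Here load_failed(1) reports the failure and load_failed(0) does not; with an unsigned variable both calls would report success.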
Valdis Vitolins
fb64332f66
Merge branch 'master' of https://github.com/espeak-ng/espeak-ng
7 years ago
Valdis Vitolins
669b9f16c2
Improvements for Sindhi and Urdu Languages by Ejaz Shah
7 years ago
Reef Turner
0994206f71
address buffer overrun when dict listx file has no trailing newline
See ar_listx as an example.
8 years ago
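The commit message above does not describe the fix itself, so the following is only a sketch of the general technique for this class of bug: a line scanner that stops at the terminating NUL as well as at a newline, so that a final line without a trailing newline cannot be read past. The function name is hypothetical.

```c
#include <stddef.h>

/* Hypothetical helper for illustration only (not the espeak-ng code).
   Returns the number of characters in the line starting at p, stopping
   at '\n' or at the terminating NUL, whichever comes first. */
static size_t line_length(const char *p)
{
	size_t n = 0;
	while (p[n] != '\n' && p[n] != '\0')
		n++;
	return n;
}
```

A scanner that tested only for the newline would keep reading beyond the end of the buffer when the last line of the file is not newline-terminated.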
Reece H. Dunn
d0169db792
af: Updates from Christo de Klerk.
7 years ago
Reece H. Dunn
46f935d9de
en: pronunciation fixes, including some Irish names/words
8 years ago
Reece H. Dunn
0b4d06dd9f
Merge commit '3278ddaa721c9a9d7de2415e64cba941e2e908f8'
8 years ago
Reece H. Dunn
3278ddaa72
Update the UCD version to 10.0.0.
8 years ago
Reece H. Dunn
8566f2ee35
Merge commit 'f26aca0aa1b9485634314d0d1710d2a0bb67776d'
8 years ago
Reece H. Dunn
d20422f192
en: various pronunciation fixes
8 years ago
Reece H. Dunn
58a0363762
Android: Fix the location of ucd-tools (it is now in src/ucd-tools).
8 years ago
Reece H. Dunn
fb97d1bd75
Merge remote-tracking branch 'shadyar/patch-4'
8 years ago
Reece H. Dunn
234d58a699
Merge remote-tracking branch 'shadyar/patch-3'
8 years ago
Valdis Vitolins
1f63acd93d
Documentation: links to sonic and pcaudiolib changed to forks of espeak-ng project
8 years ago
Reece H. Dunn
f26aca0aa1
ucd-tools 10.0.0
8 years ago
Reece H. Dunn
1cb4115eab
Add Emoji_Component support from emoji-data.txt.
8 years ago
Reece H. Dunn
18d3a0819a
Unicode Emoji 5.0
8 years ago
Reece H. Dunn
023ebace9a
Add Regional_Indicator support from PropList.txt.
8 years ago
Reece H. Dunn
4c1ff765f1
Unicode Character Data 10.0.0
8 years ago
Reece H. Dunn
bdc255835e
Provide a compatibility implementation of iswblank for C89 compilers.
8 years ago
Reece H. Dunn
223d14bec8
Move the type declaration out of the for loops to support building with a C89 compiler.
8 years ago
Reece H. Dunn
0b56f2e5bc
Use C-style comments to support using a C89 compiler.
8 years ago
Reece H. Dunn
6ff015f171
Use curl's -o to save the data files.
This will prevent writing the file if the command fails.
8 years ago
Reece H. Dunn
da3ed3aa9b
Update the CHANGELOG to include the emoji-data properties.
8 years ago
shadyar
042d1bf79d
Fixed the reading of Persian by editing exception words.
8 years ago
shadyar
06b702e94a
debugging some rules
8 years ago
Valdis Vitolins
65a1fa63e2
Turkish: issue #247 fixed pronunciation for w
8 years ago
yorkie
db79bd12eb
build, encoding: include stdint
stdint.h is not included when building the tests, so include it inside encoding.h.
8 years ago
Reece H. Dunn
b72754b439
Merge remote-tracking branch 'yorkie/fix/dup-symbol'
8 years ago
yorkie
949de3f80f
build: fix duplicated symbol when linking
8 years ago
Reece H. Dunn
93b3ffa74d
Merge remote-tracking branch 'Christianlm/master'
8 years ago
Valdis Vitolins
2724442f41
Improvements for Sindhi and Urdu languages by Ejaz Shah
8 years ago
chrislm
fae81de478
Added rules and exceptions for italian
Deleted symbols/characters in it_lists now available in it_emoji
8 years ago
Reece H. Dunn
f946c6e227
windows: fix building the bs, hr and sr languages
8 years ago
Reece H. Dunn
2a00ca79f6
Increase dict_line to support long emoji entries.
8 years ago
Reece H. Dunn
d5d980862e
Guard against overflowing the dict_line bounds when compiling dictionary files.
8 years ago
Reece H. Dunn
0bc26d93d2
windows: ignore .tmp files
8 years ago
Reece H. Dunn
4771cd4d5e
windows: fix building the Chinese (Cantonese) voice
8 years ago
Reece H. Dunn
8d1ff99847
windows: fix building the Japanese voice
8 years ago
Reece H. Dunn
ebe7ce827b
windows: the installer depends on the data (voice, language) files
8 years ago
Reece H. Dunn
b16ff0119e
windows: update the project files to Visual Studio 2017
8 years ago
Reece H. Dunn
ddee4d6386
languages.test: fix running the Japanese test
8 years ago
Reece H. Dunn
593949dec6
pa: remove the emoji support for now, as it fails to build
8 years ago
Reece H. Dunn
f8403414f2
zh: (emoji) -- CLDR 31.0.1 emoji translations
8 years ago
Reece H. Dunn
348de83de7
vi: (emoji) -- CLDR 31.0.1 emoji translations
8 years ago