Reece H. Dunn
ebfa320956
Fix storing the line length in the hash chain.
The length is stored as the first byte in the output from
compile_line. As the data pointer is char, if char is signed then
length could be negative resulting in undefined behaviour. This
commit fixes the issue by reading and writing that byte as a
uint8_t.
This bug was caused by 2a00ca79f6
.
Previously, the entries could only be a maximum of 128 bytes, and
would not be negative on platforms with signed chars. That
commit was made to support long emoji entries, especially for
non-Latin languages where the utf-8 representations could be
longer than 128 bytes.
This change also adds some documentation to make it clearer what
is going on. NOTE: The code should really be using actual struct
objects instead of writing to opaque char buffers.
Reported by Reef Turner <[email protected] >
8 years ago
Reece H. Dunn
e7ac4b819d
hash_counts is never used, so remove it.
Reported by Reef Turner <[email protected] >
8 years ago
Reece H. Dunn
0383e3525a
Merge commit '0994206f710a4defc1eecfb78ab70ff57c58fcda'
8 years ago
Reece H. Dunn
921229259d
Use int to store the value from GetFileLength.
This fixes the clang warning:
comparison of unsigned expression < 0 is always false
[-Wtautological-compare]
Reported by Reef Turner <[email protected] >
8 years ago
Valdis Vitolins
fb64332f66
Merge branch 'master' of https://github.com/espeak-ng/espeak-ng
8 years ago
Valdis Vitolins
669b9f16c2
Improvements for Sindhi and Urdu Languages by Ejaz Shah
8 years ago
Reef Turner
0994206f71
address buffer overrun when dict listx file has no trailling newline
See ar_listx as an example.
8 years ago
Reece H. Dunn
d0169db792
af: Updates from Christo de Klerk.
8 years ago
Reece H. Dunn
46f935d9de
en: pronunciation fixes, including some Irish names/words
8 years ago
Reece H. Dunn
0b4d06dd9f
Merge commit '3278ddaa721c9a9d7de2415e64cba941e2e908f8'
8 years ago
Reece H. Dunn
3278ddaa72
Update the UCD version to 10.0.0.
8 years ago
Reece H. Dunn
8566f2ee35
Merge commit 'f26aca0aa1b9485634314d0d1710d2a0bb67776d'
8 years ago
Reece H. Dunn
d20422f192
en: various pronunciation fixes
8 years ago
Reece H. Dunn
58a0363762
Android: Fix the location of ucd-tools (it is now in src/ucd-tools).
8 years ago
Reece H. Dunn
fb97d1bd75
Merge remote-tracking branch 'shadyar/patch-4'
8 years ago
Reece H. Dunn
234d58a699
Merge remote-tracking branch 'shadyar/patch-3'
8 years ago
Valdis Vitolins
1f63acd93d
Documentation: links to sonic and pcaudiolib changed to forks of espeak-ng project
8 years ago
Reece H. Dunn
f26aca0aa1
ucd-tools 10.0.0
8 years ago
Reece H. Dunn
1cb4115eab
Add Emoji_Component support from emoji-data.txt.
8 years ago
Reece H. Dunn
18d3a0819a
Unicode Emoji 5.0
8 years ago
Reece H. Dunn
023ebace9a
Add Regional_Indicator support from PropList.txt.
8 years ago
Reece H. Dunn
4c1ff765f1
Unicode Character Data 10.0.0
8 years ago
Reece H. Dunn
bdc255835e
Provide a compatibility implementation of iswblank for C89 compilers.
8 years ago
Reece H. Dunn
223d14bec8
Move the type declaration out of the for loops to support building with a C89 compiler.
8 years ago
Reece H. Dunn
0b56f2e5bc
Use C-style comments to support using a C89 compiler.
8 years ago
Reece H. Dunn
6ff015f171
Use curl's -o to save the data files.
This will prevent writing the file if the command fails.
8 years ago
Reece H. Dunn
da3ed3aa9b
Update the CHANGELOG to include the emoji-data properties.
8 years ago
shadyar
042d1bf79d
reading Persian was debugged through editing exceptional words.
8 years ago
shadyar
06b702e94a
debugging some rules
8 years ago
Valdis Vitolins
65a1fa63e2
Turkish: issue #247 fixed pronunciation for w
8 years ago
yorkie
db79bd12eb
build, encoding: include stdint
The stdint.h is not included when building test, then including this module inside encoding.h
8 years ago
Reece H. Dunn
b72754b439
Merge remote-tracking branch 'yorkie/fix/dup-symbol'
8 years ago
yorkie
949de3f80f
build: fix duplicated symbol when linking
8 years ago
Reece H. Dunn
93b3ffa74d
Merge remote-tracking branch 'Christianlm/master'
8 years ago
Valdis Vitolins
2724442f41
Iprovements for Sindhi and Urdu languages by Ejaz Shah
8 years ago
chrislm
fae81de478
Added rules and exceptions for italian
Deleted symbols/characters in it_lists now available in it_emoji
8 years ago
Reece H. Dunn
f946c6e227
windows: fix building the bs, hr and sr languages
8 years ago
Reece H. Dunn
2a00ca79f6
Increase dict_line to support long emoji entries.
8 years ago
Reece H. Dunn
d5d980862e
Guard against overflowing the dict_line bounds when compiling dictionary files.
8 years ago
Reece H. Dunn
0bc26d93d2
windows: ignore .tmp files
8 years ago
Reece H. Dunn
4771cd4d5e
windows: fix building the Chinese (Cantonese) voice
8 years ago
Reece H. Dunn
8d1ff99847
windows: fix building the Japanese voice
8 years ago
Reece H. Dunn
ebe7ce827b
windows: the installer depends on the data (voice, language) files
8 years ago
Reece H. Dunn
b16ff0119e
windows: update the project files to Visual Studio 2017
8 years ago
Reece H. Dunn
ddee4d6386
languages.test: fix running the Japanese test
8 years ago
Reece H. Dunn
593949dec6
pa: remove the emoji support for now, as it fails to build
8 years ago
Reece H. Dunn
f8403414f2
zh: (emoji) -- CLDR 31.0.1 emoji translations
8 years ago
Reece H. Dunn
348de83de7
vi: (emoji) -- CLDR 31.0.1 emoji translations
8 years ago
Reece H. Dunn
664dbf0673
ur: (emoji) -- CLDR 31.0.1 emoji translations
8 years ago
Reece H. Dunn
1014745e7e
tr: (emoji) -- CLDR 31.0.1 emoji translations
8 years ago