Reece H. Dunn
1154409393
Use PropertyValueAliases for the script mapping.
The mapping of the script labels in the UCD data to ISO 15924
script tags is now done using the sc property map in the
PropertyValueAliases data.
This has the following benefits:
1. It removes the dependency on the IANA subtag registry.
2. It ensures the scripts are correct as specified in the
UCD data files.
10 years ago
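The mapping described in the commit above can be illustrated with a minimal sketch. It assumes the standard `property ; short alias ; long alias` layout of PropertyValueAliases.txt, where the short alias of an `sc` entry is the ISO 15924 code and the long alias is the script label used in the UCD data. The function name `parse_script_aliases` and the sample lines are hypothetical and are not taken from the project's tools.

```python
def parse_script_aliases(lines):
    """Return {UCD script label: ISO 15924 code} from PropertyValueAliases-style lines."""
    scripts = {}
    for line in lines:
        # Drop trailing comments and surrounding whitespace.
        line = line.split('#', 1)[0].strip()
        if not line:
            continue
        fields = [field.strip() for field in line.split(';')]
        # Only Script (sc) entries are relevant; skip other properties.
        if len(fields) < 3 or fields[0] != 'sc':
            continue
        code, name = fields[1], fields[2]
        scripts[name] = code
    return scripts


# Hypothetical sample in the PropertyValueAliases.txt layout.
sample = [
    '# Script (sc)',
    '',
    'sc ; Arab ; Arabic',
    'sc ; Latn ; Latin',
    'sc ; Zinh ; Inherited ; Qaai',
    'gc ; Lu ; Uppercase_Letter',
]

script_codes = parse_script_aliases(sample)
print(script_codes['Latin'])
```

Because the lookup is keyed on the label that appears in the UCD data files, no separate registry (such as the IANA subtag registry) is needed to resolve a script to its tag.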
Reece H. Dunn
8a8f021a2c
ucd: support parsing PropertyValueAliases data
10 years ago
Reece H. Dunn
47af3ade04
autogen.sh: fix libtoolize detection on Mac OSX
10 years ago
Reece H. Dunn
d33c342e74
NEWS: remove an errant block of text
11 years ago
Reece H. Dunn
ea191f1371
ucd-tools 7.0.0.1
11 years ago
Reece H. Dunn
a370cc607a
src/tostring: fix up the Script name entries
11 years ago
Reece H. Dunn
fe8633b1bf
src/tostring: use an array instead of a switch for script names
11 years ago
Reece H. Dunn
8564c19582
autotools: update the default for --with-unicode-version to 7.0.0
11 years ago
Reece H. Dunn
c3a303b72f
Update the release notes.
11 years ago
Reece H. Dunn
24f6e97f05
tostring: map the new script values
11 years ago
Reece H. Dunn
9589e27f0f
tools/printdata.py: don't include CSUR data in the tests
11 years ago
Reece H. Dunn
892006d10d
Move the CSUR data to data/csur
11 years ago
Reece H. Dunn
ced06ed0f4
Do not include supplementary data in the UCD APIs.
This removes support for the CSUR (ConScript Unicode Registry) data
in the main Unicode APIs. This data should be accessed through a
different API.
11 years ago
Reece H. Dunn
8836af4942
README: update the UCD version in the docs
11 years ago
Reece H. Dunn
b757f60c63
Unicode Character Data 7.0.0
11 years ago
Reece H. Dunn
ec34ff5854
ucd.h: update script to match ISO 15924
11 years ago
Reece H. Dunn
4747999f57
tools/iana.py: read_data is not used, so remove it
11 years ago
Reece H. Dunn
2aadc3b4a3
README.md: use the HTTP location for the UCD 6.3.0 URL
11 years ago
Reece H. Dunn
a27f6c8fb8
README.md: provide detailed documentation
11 years ago
Reece H. Dunn
dfb4027d5f
README.md: tidy up the title and description
11 years ago
Reece H. Dunn
64e69c4e52
ucd.h: don't use named groups in the documentation
11 years ago
Reece H. Dunn
01c4669961
ucd.h: correct a parameter name in the documentation
11 years ago
Reece H. Dunn
014d4c3771
autotools: add a html target for generating documentation via doxygen
11 years ago
Reece H. Dunn
4691c3c6b3
Klingon: add a reference to the ConScript Unicode Registry specification
11 years ago
Reece H. Dunn
fb1b51adf1
autotools: fix the name of the --with-unicode-version configure option
11 years ago
Reece H. Dunn
e4a1167496
Unicode Character Data 6.3.0
11 years ago
Reece H. Dunn
cc2a1759bd
autotools: support specifying the Unicode/UCD version
11 years ago
Reece H. Dunn
fb0e77ff5d
autotools: create a ucd-update target to regenerate the sources from UCD data
11 years ago
Reece H. Dunn
77f9b5ece8
autotools: XDGDATADIR is not used/configured, so don't print its location at the end of a configure run
11 years ago
Reece H. Dunn
a0c601d953
ignore the autotools-generated 'compile' file
11 years ago
Reece H. Dunn
88e72aeb0a
tools/ucd.py: support printing out the data as CSV with specified columns from the command-line
12 years ago
Reece H. Dunn
be49d9cce6
ucd.h: fix a typo
12 years ago
Reece H. Dunn
1116d02aa9
Make the category/category_type/script to-string helper methods part of the public API.
12 years ago
Reece H. Dunn
65c3bc33b6
autotools: fix the dependencies on the UCD files.
12 years ago
Reece H. Dunn
4ded80bdb3
ctype: add default cases to switch statements to suppress -Wswitch warnings with clang
12 years ago
Reece H. Dunn
7e411b34e9
F8D0-F8FF: Klingon
12 years ago
Reece H. Dunn
c06f296d87
tools/scripts.py: merge some script set ranges
12 years ago
Reece H. Dunn
65f95033c8
Add support for querying the Script property.
12 years ago
Reece H. Dunn
349e225aae
Support mapping a General Category to a General Category Group.
12 years ago
Reece H. Dunn
6e15fd6d9b
Add tests for ucd::lookup_category_group.
12 years ago
Reece H. Dunn
3f9f6c0623
Add tests for ucd::isspace.
12 years ago
Reece H. Dunn
cc9abdff12
Fetch UnicodeData.txt from unicode.org if not present to make the build fully automated.
12 years ago
Reece H. Dunn
8799bc5e53
The ctype.cpp file is not generated, so remove the comment stating that it is.
12 years ago
Reece H. Dunn
2d982956a5
Store the category data in uint8_t arrays to minimize their compiled size.
12 years ago
Reece H. Dunn
2df0e6abdb
Factor out the remaining single category tables.
12 years ago
Reece H. Dunn
7f1dd9cc96
Avoid duplicating Lo only tables.
12 years ago
Reece H. Dunn
ea09eb5c45
Add tests for querying UCD properties; fix discovered issues.
12 years ago
Reece H. Dunn
9c3a87dbeb
Add toupper, tolower and totitle case-conversion APIs.
12 years ago
Reece H. Dunn
e3e85d33f2
Rename Ci to Ii and move it to an I/Invalid category group as it is not part of the UCD C/Other category group.
12 years ago
Reece H. Dunn
bc6a5c23cc
Remove the Zc class as it is not part of the UCD; special case Cc-based whitespace instead.
12 years ago