Reece H. Dunn
454038dbfa
Create a C-based API in addition to the C++-based API in <ucd/ucd.h>.
9 years ago
Reece H. Dunn
bddb25bdc7
Unicode Character Data 8.0.0
10 years ago
Reece H. Dunn
1154409393
Use PropertyValueAliases for the script mapping.
The mapping of the script labels in the UCD data to ISO 15924
script tags is now done using the sc property map in the
PropertyValueAliases data.
This has the following benefits:
1. It removes the dependency on the IANA subtag registry.
2. It ensures the scripts are correct as specified in the
UCD data files.
10 years ago
Reece H. Dunn
a370cc607a
src/tostring: fix up the Script name entries
11 years ago
Reece H. Dunn
fe8633b1bf
src/tostring: use an array instead of a switch for script names
11 years ago
Reece H. Dunn
24f6e97f05
tostring: map the new script values
11 years ago
Reece H. Dunn
ced06ed0f4
Do not include supplementary data in the UCD APIs.
This removes support for the CSUR (ConScript Unicode Registry) data
in the main Unicode APIs. This data should be accessed through a
different API.
11 years ago
Reece H. Dunn
b757f60c63
Unicode Character Data 7.0.0
11 years ago
Reece H. Dunn
ec34ff5854
ucd.h: update script to match ISO 15924
11 years ago
Reece H. Dunn
64e69c4e52
ucd.h: don't use named groups in the documentation
12 years ago
Reece H. Dunn
01c4669961
ucd.h: correct a parameter name in the documentation
12 years ago
Reece H. Dunn
e4a1167496
Unicode Character Data 6.3.0
12 years ago
Reece H. Dunn
be49d9cce6
ucd.h: fix a typo
12 years ago
Reece H. Dunn
1116d02aa9
Make the category/category_type/script to-string helper methods part of the public API.
12 years ago
Reece H. Dunn
4ded80bdb3
ctype: add default cases to switch statements to suppress -Wswitch warnings with clang
12 years ago
Reece H. Dunn
7e411b34e9
F8D0-D8FF: Klingon
12 years ago
Reece H. Dunn
c06f296d87
tools/scripts.py: merge some script set ranges
12 years ago
Reece H. Dunn
65f95033c8
Add support for querying the Script property.
12 years ago
Reece H. Dunn
349e225aae
Support mapping a General Category to a General Category Group.
12 years ago
Reece H. Dunn
cc9abdff12
Fetch UnicodeData.txt from unicode.org if not present to make the build fully automated.
12 years ago
Reece H. Dunn
8799bc5e53
The ctype.cpp file is not generated, so remove the comment stating that it is.
12 years ago
Reece H. Dunn
2d982956a5
Store the category data in uint8_t arrays to minimize their compiled size.
12 years ago
Reece H. Dunn
2df0e6abdb
Factor out the remaining single category tables.
12 years ago
Reece H. Dunn
7f1dd9cc96
Avoid duplicating Lo only tables.
12 years ago
Reece H. Dunn
ea09eb5c45
Add tests for querying UCD properties; fix discovered issues.
12 years ago
Reece H. Dunn
9c3a87dbeb
Add toupper, tolower and totitle case-conversion APIs.
12 years ago
Reece H. Dunn
e3e85d33f2
Rename Ci to Ii and move it to an I/Invalid category group as it is not part of the UCD C/Other category group.
12 years ago
Reece H. Dunn
bc6a5c23cc
Remove the Zc class as it is not part of the UCD; special case Cc-based whitespace instead.
12 years ago
Reece H. Dunn
ff7a5e0209
Add support for looking up the general category group for a codepoint.
12 years ago
Reece H. Dunn
2282735a4c
Add ctype-style APIs that are compatible with wctype.h.
12 years ago
Reece H. Dunn
a416b4090c
Display the Unicode Character Database version in the generated file.
12 years ago
Reece H. Dunn
7f4f3ee7d1
Add the generated file src/categories.cpp so you don't need the Unicode Character Database to build the project.
12 years ago
Reece H. Dunn
12bafa6b4d
tools/categories.py: generate category lookup tables for the full unicode range.
12 years ago
Reece H. Dunn
2813950acc
Infrastructure for building libucd.a.
12 years ago