Reece H. Dunn
6553494c5c
Remove noBreak from ucd_properties -- it is a DispositionType, not a PropList type.
8 years ago
Reece H. Dunn
9dc44bf0d4
Add Noncharacter_Code_Point support from PropList.txt.
8 years ago
Reece H. Dunn
a184262d9b
Add Other_Uppercase support from PropList.txt.
8 years ago
Reece H. Dunn
6a2a87e358
Add Other_Lowercase support from PropList.txt.
8 years ago
Reece H. Dunn
735a661232
Add Extender support from PropList.txt.
8 years ago
Reece H. Dunn
d067838684
Add Diacritic support from PropList.txt.
8 years ago
Reece H. Dunn
3ff6d8ae0f
Add Ideographic support from PropList.txt.
8 years ago
Reece H. Dunn
aa32a420af
Add Other_Alphabetic support from PropList.txt.
8 years ago
Reece H. Dunn
835dfc83a1
Add ASCII_Hex_Digit support from PropList.txt.
8 years ago
Reece H. Dunn
a69b99a81b
Add Hex_Digit support from PropList.txt.
8 years ago
Reece H. Dunn
844895d315
Add Other_Math support from PropList.txt.
8 years ago
Reece H. Dunn
4226457fb5
Add Terminal_Punctuation support from PropList.txt.
8 years ago
Reece H. Dunn
a9aabc6242
Add tests for the PropList API.
8 years ago
Reece H. Dunn
b716b95feb
mkencodingtable: support mapping ISO 8859 encoding tables
8 years ago
Reece H. Dunn
6346ede604
isalnum: use the Alphabetic derived core property to check that the character is in the alpha class.
8 years ago
Reece H. Dunn
cbddf092c9
isalpha: use the Alphabetic derived core property to check that the character is in the alpha class.
8 years ago
Reece H. Dunn
b38659582f
islower: include the Other_Lowercase characters.
8 years ago
Reece H. Dunn
9d1469b956
printdata: use the Uppercase property from DerivedCoreProperties.
8 years ago
Reece H. Dunn
1fc20f39f5
isupper: include the Other_Uppercase characters.
8 years ago
Reece H. Dunn
d956501863
printdata: load all the properties from the PropList data.
8 years ago
Reece H. Dunn
ac082c9400
Add tests for the remaining is* APIs.
8 years ago
Reece H. Dunn
eb08dc422d
Update the copyright dates.
8 years ago
Reece H. Dunn
c9f2940373
isblank: don't include <noBreak> characters, and add tests for this API.
8 years ago
Reece H. Dunn
f109bb918f
isspace: don't include <noBreak> characters.
8 years ago
Reece H. Dunn
5f9dc111cf
Add tests for the isdigit and isxdigit ctype APIs.
8 years ago
Reece H. Dunn
bd71fed013
ctype: return true in isupper/islower if there is a simple case mapping present
8 years ago
Reece H. Dunn
e77b7c7b49
printdata: create an isspace helper function
8 years ago
Reece H. Dunn
ceda811b12
printdata: add the properties to the primary data map
8 years ago
Reece H. Dunn
a2193799e4
printdata: use get to return a default value if the map key is not present
8 years ago
Reece H. Dunn
cd9cc8e6e2
Unicode Character Data 9.0.0
8 years ago
Reece H. Dunn
a83ce9ee8e
Python 3 compatibility fixes.
8 years ago
Reece H. Dunn
7201a1a150
Convert scripts.cpp from C++ to C.
9 years ago
Reece H. Dunn
707998940d
Convert categories.cpp from C++ to C.
9 years ago
Reece H. Dunn
0afcb3f89f
Convert case.cpp from C++ to C.
9 years ago
Reece H. Dunn
454038dbfa
Create a C-based API in addition to the C++-based API in <ucd/ucd.h>.
9 years ago
Reece H. Dunn
bcf8be59b3
Support enabling the CSUR data.
10 years ago
Reece H. Dunn
28baabf72a
Remove the IANA subtag registry parser
This is not needed now that PropertyValueAliases is used for script
mapping.
10 years ago
Reece H. Dunn
1154409393
Use PropertyValueAliases for the script mapping.
The mapping of the script labels in the UCD data to ISO 15924
script tags is now done using the sc property map in the
PropertyValueAliases data.
This has the following benefits:
1. It removes the dependency on the IANA subtag registry.
2. It ensures the scripts are correct as specified in the
UCD data files.
10 years ago
Reece H. Dunn
8a8f021a2c
ucd: support parsing PropertyValueAliases data
10 years ago
Reece H. Dunn
9589e27f0f
tools/printdata.py: don't include CSUR data in the tests
11 years ago
Reece H. Dunn
ced06ed0f4
Do not include supplementary data in the UCD APIs.
This removes support for the CSUR (ConScript Unicode Registry) data
in the main Unicode APIs. This data should be accessed through a
different API.
11 years ago
Reece H. Dunn
b757f60c63
Unicode Character Data 7.0.0
11 years ago
Reece H. Dunn
4747999f57
tools/iana.py: read_data is not used, so remove it
11 years ago
Reece H. Dunn
88e72aeb0a
tools/ucd.py: support printing out the data as CSV with specified columns from the command-line
12 years ago
Reece H. Dunn
7e411b34e9
F8D0-D8FF: Klingon
12 years ago
Reece H. Dunn
c06f296d87
tools/scripts.py: merge some script set ranges
12 years ago
Reece H. Dunn
65f95033c8
Add support for querying the Script property.
12 years ago
Reece H. Dunn
349e225aae
Support mapping a General Category to a General Category Group.
12 years ago
Reece H. Dunn
6e15fd6d9b
Add tests for ucd::lookup_category_group.
12 years ago
Reece H. Dunn
3f9f6c0623
Add tests for ucd::isspace.
12 years ago