eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Reece H. Dunn bcf8be59b3 Support enabling the CSUR data. 10 years ago
data/csur CSUR: Use the Zinh script code for symbols 10 years ago
docs NEWS: remove an errant block of text 11 years ago
src Use PropertyValueAliases for the script mapping. 10 years ago
tests Make the category/category_type/script to-string helper methods part of the public API. 12 years ago
tools Support enabling the CSUR data. 10 years ago
.gitignore Use PropertyValueAliases for the script mapping. 10 years ago
AUTHORS Parse the UCD data files. 12 years ago
COPYING Parse the UCD data files. 12 years ago
Makefile.am Support enabling the CSUR data. 10 years ago
README.md Support enabling the CSUR data. 10 years ago
autogen.sh autogen.sh: fix libtoolize detection on Mac OSX 11 years ago
configure.ac ucd-tools 7.0.0.1 11 years ago

README.md

Unicode Character Data Tools


The Unicode Character Data (UCD) Tools is a library for working with the Unicode Character Data from unicode.org.

It provides a compact replacement for various wide-character C APIs. These can be used in Android applications, as the Android C library does not have full wide-character support.

In addition to this it provides APIs for:

The following data sets are used for the data tables:

ConScript Unicode Registry

If enabled, the following data from the ConScript Unicode Registry (CSUR) is added:

Code Range Script
F8D0-F8FF Klingon

This data is located in the data/csur directory.

Build Dependencies

In order to build ucd-tools, you need:

  1. a functional autotools system (make, autoconf, automake and libtool);
  2. a functional c++ compiler.

To build the documentation, you need:

  1. the doxygen program to build the api documentation;
  2. the dot program from the graphviz library to generate graphs in the api documentation.

Debian

Core Dependencies:

Dependency Install
autotools sudo apt-get install make autoconf automake libtool
c++ compiler sudo apt-get install gcc g++

Documentation Dependencies:

Dependency Install
doxygen sudo apt-get install doxygen
graphviz sudo apt-get install graphviz

Building

UCD Tools supports the standard GNU autotools build system. The source code does not contain the generated configure files, so to build it you need to run:

./autogen.sh
./configure --prefix=/usr
make

The tests can be run by using:

make check

The program can be installed using:

sudo make install

The documentation can be built using:

make html

Updating the UCD Data

To re-generate the source files from the UCD data when a new version of unicode is released, you need to run:

./configure --prefix=/usr --with-unicode-version=VERSION
make ucd-update

where VERSION is the Unicode version (e.g. 6.3.0).

Additionally, you can use the UCD_FLAGS option to control how the data is generated. The following flags are supported:

Flag Description
--with-csur Add ConScript Unicode Registry data.

Bugs

Report bugs to the ucd-tools issues page on GitHub.

License Information

UCD Tools is released under the GPL version 3 or later license.