Add Pashto language support based on Urdu language files
This commit adds support for the Pashto language (ps) to espeak-ng. The implementation is based on Urdu language files and includes ps_rules, ps_list, ps_emoji, ps_extra files and updated Makefile.am to include Pashto in the dictionary targets and build rules.
fix: add Pashto data file.
refactor: use enhance rules for stress.
fix: add missing configs.
Add Pashto phoneme support and improve voice files
fix: add Pashto phonemes test.
fix: restore original phonemes.
fix: remove renduandant ps_dict from Make.am file.
fix: use correct phonemes with ipa and stress rules.
feat: translate all en_emoji to Pashto.
fix: add Pashto dict entry in Makefile.am
feat: enhance ps_rules with example pairs and words.
The implications of this are large for using eSpeak-ng as a phonemizer. It means that it's now possible to get accurate mappings between the phonemized word and the original word (up till now it was pretty much impossible). This is huge for many applications that use it as a side-tool.
When e.g. using mbrola voices, the sample rate may change. When the
application plays audio itself, it needs to know it, and
espeak_ng_GetSampleRate does not report that.
Fix the replacement character in the input string (#2024)
The correction uses substitute character (0x001A) as a workaround. This
control character is the "older brother" of the replacement character
OxFFFD.
The workaround is necessary because the replacement character (0xFFFD)
is used to flag to indicate an error or EOF. The substitute character
(0x001A) was included as an alternative flag to indicate that the input
character itself is a replacement character (0xFFFD).