6bdb2ee6cd
								
							 
						 
						
							
									ar: in MBROLA, qaf is not considered a 'thick' consonant 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								8a65eaaf46
								
							 
						 
						
							
									ar: add missing rules for lengthened thick consonants 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								1246abc311
								
							 
						 
						
							
									ar: sort phonemes by Latin rules 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								7e05cf4adb
								
							 
						 
						
							
									ar: decrease volume of mb-ar2 voice to avoid saturation 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								c037223bdb
								
							 
						 
						
							
									ar: make proclitics unstressed 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								26b067a68b
								
							 
						 
						
							
									ar: move default stress rule from code to config file 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								f86540308c
								
							 
						 
						
							
									ar: define Z phoneme explicitly 
							 
							
							
otherwise it can't be distinguished from D 
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								5c7dd911f1
								
							 
						 
						
							
									ar: move decision logic of dark vowels to ar_rules file 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								08ef8f9c8f
								
							 
						 
						
							
									ar: create separate dark vowels to be used after thick consonants 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								ed108d3b3b
								
							 
						 
						
							
									ar: create letter group for non-thick consonants 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								9d901e48be
								
							 
						 
						
							
									shn: support numbers for 100, 1000, and 10000 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								61672f5e24
								
							 
						 
						
							
									Use defines for the different number breaking systems to improve readability. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								ed5a378b9d
								
							 
						 
						
							
									docs: document the tr_languages langopts 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								273b8e2a1f
								
							 
						 
						
							
									en: support numbers up to a hundred nonillion 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								0b4896fd7c
								
							 
						 
						
							
									en: add tests for cardinal and ordinal numbers 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								f387fd804a
								
							 
						 
						
							
									Merge remote-tracking branch 'valdisvi/shan' 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								9638750579
								
							 
						 
						
							
									Always flush stdout when reading stdin line by line. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								fb06f35f3b
								
							 
						 
						
							
									Update the Unicode Data Files license. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								088de546e9
								
							 
						 
						
							
									Use the generic BSD-2-Clause license text in COPYING.BSD2. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								1c60fb7f62
								
							 
						 
						
							
									Don't use STRESSPOSN_1L for thousands_sep in the Slovak/Czech language setup. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								c6ac526847
								
							 
						 
						
							
									When printing phonemes, don't add a space at the start of a sentence or clause. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								65186c07df
								
							 
						 
						
							
									Preserve the sourceix property of a deleted phonSWITCH phoneme. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								cf6d14783c
								
							 
						 
						
							
									Preserve the sourceix property of a deleted phoneme for replaced phonemes. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								8e13f7147c
								
							 
						 
						
							
									Add constants for use with PHONEME_LIST.newword. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								d0e806b600
								
							 
						 
						
							
									ur: improvements for Urdu by Ejaz Shah 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								910f4c2a72
								
							 
						 
						
							
									Add ISO 15924 script codes to the remaining language pronunciation tests. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								475bfdcb66
								
							 
						 
						
							
									Use the ISO 15924 4-letter script names consistently in the tests. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								4a7118dba1
								
							 
						 
						
							
									Fix issue #530: broken replacement from Cyrillic to Latin for Lingua Franca Nova 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								5439b89db8
								
							 
						 
						
							
									Issue #521: add spelling tests for more languages 
							 
							
							
Sample sentences for languages are taken from:
- Afrikaans https://www.omniglot.com/writing/afrikaans.htm 
- Albanian https://www.omniglot.com/writing/albanian.htm 
- Amharic https://www.bbc.com/amharic 
- Ancient Greek: http://titus.uni-frankfurt.de/unicode/samples/grbeisp.htm 
- Aragonese https://www.omniglot.com/writing/aragonese.php 
- Armenian: https://elinux.org/UTF8_Sampler 
- Assamese https://www.omniglot.com/writing/assamese.htm 
- Azerbaijani https://www.omniglot.com/writing/azeri.htm 
- Basque https://www.omniglot.com/writing/basque.htm 
- Bengali https://www.bbc.com/bengali/news 
- Dutch https://www.omniglot.com/writing/dutch.htm 
- Greenlandic: https://www.omniglot.com/writing/greenlandic.htm 
- Guarani: https://www.omniglot.com/writing/guarani.htm 
- Gujarati: http://mylanguages.org/gujarati_reading.php 
- Haitian Creole: https://www.omniglot.com/writing/haitiancreole.htm 
- Interlingua: https://www.omniglot.com/writing/interlingua.htm 
- Kannada: https://www.omniglot.com/language/phrases/kannada.php 
- Kyrgyz: https://ru.wikipedia.org/wiki/%D0%9A%D0%B8%D1%80%D0%B3%D0%B8%D0%B7%D1%81%D0%BA%D0%B0%D1%8F_%D0%BF%D0%B8%D1%81%D1%8C%D0%BC%D0%B5%D0%BD%D0%BD%D0%BE%D1%81%D1%82%D1%8C 
- Konkani (Devanagari) https://r12a.github.io/scripts/devanagari/ 
- Kurdish https://www.omniglot.com/writing/kurdish.htm 
- Lingua Franca Nova https://www.omniglot.com/writing/lfn.htm 
- Lojban: https://www.omniglot.com/writing/lojban.htm 
- Malay https://www.omniglot.com/writing/malay.htm 
- Maltese https://www.omniglot.com/writing/maltese.htm 
- Marathi https://www.bbc.com/marathi 
- Māori https://www.omniglot.com/writing/maori.htm 
- Nahuatl https://www.gutenberg.org/files/12219/12219-h/12219-h.htm 
- Oriya https://www.omniglot.com/writing/oriya.htm 
- Oromo https://www.omniglot.com/writing/oromo.htm 
- Papiamento https://www.omniglot.com/writing/papiamento.php 
- Punjabi https://pa.wikipedia.org/wiki/%E0%A8%AD%E0%A8%BE%E0%A8%B0%E0%A8%A4_%E0%A8%A6%E0%A8%BE_%E0%A8%B0%E0%A8%BE%E0%A8%B8%E0%A8%BC%E0%A8%9F%E0%A8%B0%E0%A8%AA%E0%A8%A4%E0%A9%80 
- Setswana https://www.omniglot.com/writing/tswana.php 
- Sindhi https://en.wikipedia.org/wiki/Sindhi_language 
- Sinhala https://www.bbc.com/sinhala 
- Tamil http://kermitproject.org/utf8.html 
- Tatar https://www.omniglot.com/writing/tatar.htm 
- Telugu http://kermitproject.org/utf8.html 
- Vietnamese https://www.omniglot.com/writing/vietnamese.htm  
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								d3f2a753f3
								
							 
						 
						
							
									Fix issue #527 — spelling differs for Russian with or without extended dictionary 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								86bbc257b0
								
							 
						 
						
							
									Support matching any length strings in the replacement rules. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								98e9122dfc
								
							 
						 
						
							
									FindReplacementChars: Pass in the source buffer (next characters) instead of next_in. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								4fbcda9c2a
								
							 
						 
						
							
									FindReplacementChars: Use an nc (next character) variable. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								cacc212d4b
								
							 
						 
						
							
									FindReplacementChars: Rename uc to fc. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								a9d4bdd7f7
								
							 
						 
						
							
									Make ignore_next into ignore_next_n to support ignoring multiple next characters. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								3518fbf3ff
								
							 
						 
						
							
									mk: Support additional romanizations (ISO 9, BGN/PCGN, Cadastre, and MJMS/SSO). 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								0b64b04baa
								
							 
						 
						
							
									mk: Remove the Latin script groups -- these are handled by replacement characters. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								27454f56f4
								
							 
						 
						
							
									mk: Don't map đ and ć to Serbian ђ and ћ (use Macedonian ѓ and ќ instead). 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								db3ae0eaec
								
							 
						 
						
							
									mk: Reformat the Latin to Cyrillic romanization support table. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								252f5772ae
								
							 
						 
						
							
									Simplify printing the replace message. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								6a7e31e24e
								
							 
						 
						
							
									Merge remote-tracking branch 'Christianlm/master' 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								55c64036e0
								
							 
						 
						
							
									Use UTF-8 strings in replace rules, instead of a packed UTF-16 pair. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								0e91fcbc04
								
							 
						 
						
							
									Don't use pw when reading the replacement data. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								424f705525
								
							 
						 
						
							
									Revert the new (broken) replacement rule logic. 
							 
							
							
The replacement tests for bs, hr, and sr are no longer marked as
broken as they work using the old code. The mk tests keep the
broken annotation, as they don't work in the old code either.
This reverts commit 801a8d197c64d5701e5e3b51ebf6171fd235d2c09f0667de86 
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								5303b6b570
								
							 
						 
						
							
									IT: added some rules for pronominal verbs and for the suffix *filia* 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								bae92dab38
								
							 
						 
						
							
									ja: Add tests for replacing Katakana (Kana) with Hiragana (Hira). 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								9660df7743
								
							 
						 
						
							
									mk: Add tests for replacing Latin with Cyrillic. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								bfb624824e
								
							 
						 
						
							
									Move the additional English replacement rule test to language-replace.test. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								672c07b3a9
								
							 
						 
						
							
									Reorganize the language pronunciation tests. 
							 
							
							
							
						 
						7 years ago  
				
					
						
							
								 
						
							
								93e23a47c8
								
							 
						 
						
							
									issue #521: add spelling tests for all languages 
							 
							
							
Tests include pangrams from http://clagnut.com/blog/2380/.
Based on a patch by Valdis Vitolins. 
							
						 
						7 years ago