eSpeak NG is an open-source speech synthesizer that supports more than a hundred languages and accents.

ta_rules 1.6KB

// This file is UTF8 encoded
// Spelling to phoneme rules for Tamil
// B means a combining vowel sign, or a virama
.replace
௦ 0 // Tamil numbers
௧ 1
௨ 2
௩ 3
௪ 4
௫ 5
௬ 6
௭ 7
௮ 8
௯ 9
.group 0xe0ae // characters which start with UTF-8 bytes: [e0 ae]
ஂ // anusvara
ஃ // visarga
அ a
ஆ a:
இ i
ஈ i:
உ u
ஊ u:
எ e
ஏ e:
ஐ E:
ஒ o
ஓ o:
ஔ aU
// consonants
க ga // inter-vocalic, unless there is virama before or after
க (B g
க (் g
்) க ga
்) க (B g
_) க ka
_) க (B k
க்க k:a
க்க (B k:
ங Na
ங (B N
ச dZa
ச (B dZ
_) ச sa
_) ச (B s
ச்ச tS:a
ச்ச (B tS:
ஜ dZa
ஜ (B dZ
ஞ n^a
ஞ (B n^
ட d.a
ட (B d.
_) ட t.a
_) ட (B t.
ட்ட t.a
ட்ட (B t.
ண n.a
ண (B n.
த da
த (B d
_) த ta
_) த (B t
த்த t:a
த்த (B t:
ந na
ந (B n
ன na
ன (B n
ப ba
ப (B b
_) ப pa
_) ப (B p
ப்ப p:a
ப்ப (B p:
ம ma
ம (B m
ய ja
ய (B j
ர ra
ர (B r
ற Ra
ற (B R
ற் (ற t // RR -> tR
ல la
ல (B l
ள l/2a // this should be [l.] not [l/2]
ள (B l/2
ழ z.a
ழ (B z.
வ va
வ (B v
ஶ Sa
ஶ (B S
ஷ s.a
ஷ (B s.
ஸ sa
ஸ (B s
ஹ ha
ஹ (B h
// combining vowels
ா a:
ி i
.group 0xe0af // characters which start with UTF-8 bytes: [e0 af]
ீ i:
ு u
ூ u:
ெ e
ே e:
ை E:
ொ o
ோ o:
ௌ aU
் // virama
ௗ : // aU length mark
.group
$ dola