SSML consists of XML-like tags, for example: Did you mean the <emphasis level="strong"><prosody pitch="75">green</prosody></emphasis> beans?
The following markup tags and attributes are recognised:
x-slow
, slow
, medium
, fast
, x-fast
or a percentage such as 125%
)silent
, x-soft
, soft
, medium
, loud
, x-loud
, +1dB
or -1dB
)default
, x-low
, low
, medium
, high
, x-high
)none
, reduced
, moderate
, strong
or x-strong
)eSpeak can speak HTML text directly, or text containing both SSML and HTML markup.
Any unrecognised tags are ignored.
The following tags cause a sentence break:
br
dd
li
img
td
The following tags cause a paragraph break:
h1
h2
h3
h4
hr
Text between the following tags is ignored:
script
style