@@ -32,7 +32,6 @@ EXTRA_DIST += ChangeLog | |||
all-local: \ | |||
espeak-data/phontab \ | |||
docs/speak_lib.h \ | |||
dictionaries \ | |||
mbrola | |||
@@ -67,9 +66,6 @@ docs: docs/index.html \ | |||
README.html \ | |||
src/espeak-ng.1 | |||
docs/speak_lib.h: src/include/espeak-ng/speak_lib.h | |||
cp $< $@ | |||
##### build targets: | |||
libespeak_ng_includedir = $(includedir)/espeak-ng |
@@ -1,69 +0,0 @@ | |||
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd"> | |||
<html> | |||
<head> | |||
<title></title> | |||
<meta name="GENERATOR" content="Quanta Plus"> | |||
<meta http-equiv="Content-Type" content="text/html; charset=utf-8"> | |||
</head> | |||
<body> | |||
<A href="docindex.html">Back</A> | |||
<hr> | |||
<h2>ANALYSIS</h2> | |||
<hr> | |||
(Further notes are needed) | |||
<p> | |||
Recordings of spoken words and phrases can be analysed to try and make eSpeak match a language more closely. | |||
Unlike most other (larger and better quality) synthesizers, eSpeak's data is not produced directly from recorded sounds. To use an analogy, it's like a drawing or sketch compared with a photograph. Or vector graphics compared with a bitmap image. It's smaller, less accurate, with less subtlety, but it can sometimes show some aspects of the picture more clearly than a more accurate image. | |||
<h4>Recording Sounds</h4> | |||
Recordings should be made while speaking slowly, clearly, and firmly and loudly (but not shouting). Speak about half a metre from the microphone. Try to avoid background noise and hum interference from electrical power cables. | |||
<h4>Praat</h4> | |||
I use a modified version of the praat program (<a href="www.praat.org">www.praat.org</a>) to view and analyse both sound recordings and output from eSpeak. The modification adds a new function (<code>Spectrum->To_eSpeak</code>) which analysis a voiced sound and produces a file which can be loaded into espeakedit. Details of the modification are in the <code>"praat-mod"</code> directory in the espeakedit package. | |||
The analysis contains a sequence of frames, one per cycle at the speech's fundamental frequency. Each frame is a short time spectrum, together with praat's estimation of the f1 to f5 formant frequencies at the time of that cycle. | |||
I also use Praat's <code>New->Record_mono_sound</code> function to make sound recordings. | |||
<h3>Vowels and Diphthongs</h3> | |||
<h4>Analysing a Recording</h4> | |||
Make a recording, with a male voice, and trim it in Praat to keep just the required vowel sound. Then use the new <code>Spectrum->To_eSpeak</code> modification (this was named <code>To_Spectrogram2</code> in earlier versions) to analyse the sound. It produces a file named <code>"spectrum.dat"</code>. | |||
Load the <code>"spectrum.dat"</code> file into espeakedit. Espeakedit has two Open functions, <code>File->Open</code> and <code>File->Open2</code>. They are the same, except that they remember different paths. I generally use <code>File->Open2</code> for reading the <code>"spectrum.dat"</code> file. | |||
The data is displayed in espeakedit as a sequence of spectrum frames (see <a href="editor.html">editor.html</a>). | |||
<h4>Tone Quality</h4> | |||
It can be difficult to match the tonal quality of a new vowel to be compatible with existing vowel files. This is determined by the relative heights and widths of the formant peaks. These vary depending on how the recording was made, the microphone, and the strength and tone of the voice. Also the positions of the higher peaks (F3 upwards) can vary depending on the characteristics of the speaker's voice. Formant peaks correspond to resonances within the mouth and throat, and they depend on its size and shape. With a female voice, all the formants (F1 upwards) are generally shifted to higher frequencies. | |||
For these reasons, it's best to use a male voice, and to use its analysed spectra only as guidance. Rather than construct formant-peaks entirely to match the analysed data, instead copy keyframes from a similar existing vowel. Then make small adjustments to match the position of the F1, F2, F3 formant peaks and hopefully produce the required vowel sound. | |||
<h4>Using an Existing Vowel File</h4> | |||
Choose a similar vowel file from <code>phsource/vowel</code> and open it into espeakedit. It may be useful to use <code>phsource/vowel/vowelchart</code> as a map to show how vowel files compare with each other. You can select a keyframe from the vowel file and use CTRL-C and CTRL-V to copy the green formant peaks onto a frame of the new spectrum sequence. Then adjust the peaks to match the new frame. Press F1 to hear the sound of the formant peaks in the selected frame. | |||
The F0 peak is provided in order to adjust the correct balance of low frequencies, below the F1 peak. If the sound is too muffled, or conversely, too "thin", try adjusting the amplitude or position of the F0 peak. | |||
<h4>Length and Amplitude</h4> | |||
Use an existing vowel file as a guide for how to set the amplitude and length of the keyframes. At the right of each keyframe, its length is shown in mS and under that is its relative (RMS) amplitude. | |||
The second keyframe should be marked with a red marker (use CTRL-M to toggle this). This divides the vowel into the front-part (with one frame), and the rest. | |||
Use F2 to play the sound of the new vowel sequence. It will also produce a WAV file (the default name is speech.wav) which you can read into praat to see whether it has a sensible shape. | |||
<h4>Using the New Vowel</h4> | |||
Make a new directory (eg. vwl_xx) in phsource for your new vowels. Save the spectrum sequence with a name which you have chosen for it. | |||
You can then edit the phoneme file for your language (eg. phsource/ph_xxx), and change a phoneme to refer to your new vowel file. Then do <code>Data->Compile_Phoneme_Data</code> from espeakedit's menubar to re-compile the phoneme data. | |||
</body> | |||
</html> |
@@ -1,67 +0,0 @@ | |||
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd"> | |||
<html> | |||
<head> | |||
<title>eSpeak Speech Synthesizer</title> | |||
<meta name="GENERATOR" content="Quanta Plus"> | |||
<meta http-equiv="Content-Type" content="text/html; charset=utf-8"> | |||
</head> | |||
<body> | |||
<table border="1" cellpadding="10" background="images/sand-light.jpg" width="100%"> | |||
<tbody> | |||
<tr> | |||
<td width="15%"> | |||
<a href="http://sourceforge.net"><img src="http://sflogo.sourceforge.net/sflogo.php?group_id=159649&type=2" width="125" height="37" border="0" alt="SourceForge.net Logo" /></a> | |||
</td> | |||
<td> | |||
<div align="center"><h1>eSpeak - Documents</h1></div> | |||
</td> | |||
</tr> | |||
<tr> | |||
<td valign="top"> | |||
<font size="+1"><strong> | |||
<A href="index.html">Home</A> | |||
<p> | |||
<A href="commands.html">Usage</A> | |||
<p> | |||
<A href="languages.html">Languages</A> | |||
</strong></font> | |||
</td> | |||
<td> | |||
<h3><A href="voices.html">Voice Files</A></h3> | |||
Voice files specify a language and other characteristics of a voice. | |||
<h3><A href="mbrola.html">Mbrola Voices</A></h3> | |||
eSpeak can be used as a front-end for Mbrola diphone voices. | |||
<h3><A href="dictionary.html">Pronunciation Dictionary</A></h3> | |||
<ul> | |||
<li>How to add pronunciation corrections. | |||
<li>How to build up pronunciation rules for a new language. | |||
</ul><p> | |||
<h3><A href="add_language.html">Adding a Language</A></h3> | |||
How to add or improve a language. | |||
<h3><A href="phonemes.html">Phonemes</A></h3> | |||
The list of phoneme mnemonics for English, for use in the Pronunciation Dictionary. | |||
<h3><A href="phontab.html">Phoneme Tables</A></h3> | |||
The tables of the phonemes used by each language, with their properties and sound production. | |||
<h3><A href="intonation.html">Intonation</A></h3> | |||
Different intonation "tunes" may be defined for different languages for clauses which end in full-stop, comma, question-mark, and exclamation-mark. | |||
<h3><A href="speak_lib.h">eSpeak Library API</A></h3> | |||
API definition and header file for a shared library version of eSpeak. | |||
<h3><A href="ssml.html">Markup tags</A></h3> | |||
SSML (Speech Synthesis Markup Language) and HTML tags recognized by eSpeak. | |||
<h3><A href="editor.html">The espeakedit program</A></h3> | |||
GUI software to edit vowel files and to compile the phoneme data for use by eSpeak.<br> | |||
<ul> | |||
<li><a href="editor_if.html">espeakedit program GUI details</a> | |||
<li><a href="analyse.html">Analysing sound recordings</a> | |||
<li><a href="makephonemes.html">Adjusting phoneme data</a> (to be written) | |||
</ul> | |||
</td> | |||
</tr> | |||
</tbody> | |||
</table> | |||
</body> | |||
</html> |
@@ -1,75 +0,0 @@ | |||
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd"> | |||
<html> | |||
<head> | |||
<title>espeakedit</title> | |||
<meta name="GENERATOR" content="Quanta Plus"> | |||
<meta http-equiv="Content-Type" content="text/html; charset=utf-8"> | |||
</head> | |||
<body> | |||
<A href="docindex.html">Back</A> | |||
<hr> | |||
<h2>ESPEAKEDIT PROGRAM</h2> | |||
<hr> | |||
The <strong>espeakedit</strong> program is used to prepare phoneme data for the eSpeak speech synthesizer.<p> | |||
It has two main functions: | |||
<ul> | |||
<li>Prepare keyframe files for individual vowels and voiced consonants. These each contain a sequence of keyframes which define how formant peaks (peaks in the frequency spectrum) vary during the sound.<p> | |||
<li>Process the master <strong>phonemes</strong> file which, by including the phoneme files for the various languages, defines all their phonemes and references the keyframe files and the sound sample files which they use. <strong>espeakedit</strong> processes these and compiles them into the <strong>phondata</strong>, <strong>phonindex</strong>, and <strong>phontab</strong> files in the <strong>espeak-data</strong> directory which are used by the eSpeak speech synthesizer. | |||
</ul> | |||
<hr> | |||
<h3>Installation</h3> | |||
<strong>espeakedit</strong> needs the following packages:<br> | |||
(The package names mentioned here are those from the Ubuntu "Dapper" Linux distribution). | |||
<ul> | |||
<li><strong>sox</strong> (a universal sound sample translator) | |||
<li><strong>libwxgtk2.6-0</strong> (wxWidgets Cross-platform C++ GUI toolkit) | |||
<li><strong>portaudio0</strong> (Portaudio V18, portable audio I/O) | |||
</ul> | |||
In addition, a modified version of <strong>praat</strong> (<a href="www.praat.org">www.praat.org</a>) is used to view and analyse WAV sound files. | |||
This needs the package <strong>libmotif3</strong> to run and <strong>libmotif-dev</strong> to compile. | |||
<hr> | |||
<h3>Quick Guide</h3> | |||
This will quickly illustrate the main features. Details of the interface and key commands are given in <a href="editor_if.html">editor_if.html</a><p> | |||
For more detailed information on analysing sound recordings and preparing phoneme definitions and keyframe data see <a href="analyse.html">analyse.html</a> (to be written). | |||
<h4>Compiling Phoneme Data</h4> | |||
<ol> | |||
<li>Run the <strong>espeakedit</strong> program.<p> | |||
<li>Select <b>Data->Compile phoneme data</b> from the menu bar. Dialog boxes will ask you to locate the directory (<b>phsource</b>) which contains the master phonemes file, and the directory (<b>dictsource,</b>) which contains the dictionary files (en_rules, en_list, etc). Once specified, espeakedit will remember their locations, although they can be changed later from <b>Options->Paths</b>.<p> | |||
<li>A message in the status line at the bottom of the espeakedit window will indicate whether there are any errors in the phoneme data, and how many language's dictionary files have been compiled. The compiled data is placed into the <b>espeak-data</b> directory, ready for use by the speak program. If errors are found in the phoneme data, they are listed in a file <b>error_log</b> in the <b>phsource</b> directory.</li> | |||
<p> | |||
NOTE: espeakedit can be used from the command line to compile the phoneme data, with the command: <b> espeakedit --compile</b> | |||
<li>Select <b>Tools->Make vowels chart->From compiled phoneme data</b>. This will look for the vowels in the compiled phoneme data of each language and produce a vowel chart (.png file) in <b>phsource/vowelcharts</b>. These charts plot the vowels' F1 (formant 1) frequency against their F2 frequency, which corresponds approximately to their open/close and front/back positions. The colour in the circle for each vowel indicates its F3 frequency, red indicates a low F3, through yellow and green to blue and violet for a high F3. In the case of a diphthong, a line is drawn from the circle to the position of the end of the vowel. | |||
</ol> | |||
<h4>Keyframe Sequences</h4> | |||
<ol> | |||
<li>Select <b>File->Open</b> from the menu bar and select a vowel file, <b>phsource/vowel/a</b>. This will open a tab in the espeakedit window which contains a sequence of 4 keyframes. Each keyframe shows a black graph, which is the outline of an original analysed spectrum from a sound recording, and also a green line, which shows the formant peaks which have been added (using the black graph as a guide) and which produce the sound.<p> | |||
<li>Click in the "a" tab window and then press the <b>F2</b> key. This will produce and play the sound of the keyframe sequence. The first time you do this, you'll get a save dialog asking where you want the WAV file to be saved. Once you give a location all future sounds will be stored in that same location, although it can be changed from <b>Options->Paths</b>.<p> | |||
<li>Click on the second of the four frames, the one with the red square. Press <b>F1</b>. That plays the sound of just that frame.<p> | |||
<li>Press the <b>1</b> (number one) key. That selects formant F1 and a red triangle appears under the F1 formant peak to indicate that it's selected. Also an = sign appears next to formant 1 in the formants list in the left panel of the window.<p> | |||
<li>Press the left-arrow key a couple of times to move the F1 peak to the left. The red triangle and its associated green formant peak moves lower frequency. Its numeric value in the formants list in the left panel decreases.<p> | |||
<li>Press the <b>F1</b> key again. The frame will give a slightly different vowel sound. As you move the F1 peak slightly up and down and then press <b>F1</b> again, the sound changes. Similarly if you press the <b>2</b> key to select the F2 formant, then moving that will also change the sound. If you move the F1 peak down to about 700 Hz (and reduce its height a bit with the down-arrow key) and move F2 up to 1400 Hz, then you'll hear a "er" schwa [@] sound instead of the original [a].<p> | |||
<li>Select <b>File->Open</b> and choose <b>phsource/vowel/aI</b>. This opens a new tab labelled "aI" which contains more frames. This is the [aI] diphthong and if you click in the tab window and press <b>F2</b> you'll hear the English word "eye". If you click on each frame in turn and press <b>F1</b> then you can hear each of the keyframes in turn. They sound different, starting with an [A] sound (as in "palm"), going through something like [@] in "her" and ending with something like [I] in "kit" (or perhaps a French é). Together they make the diphthong [aI]. | |||
</ol> | |||
<h4>Text and Prosody Windows</h4> | |||
<ol> | |||
<li>Click on the <b>Text</b> tab in the left panel. Two text windows appear in the panel with buttons <b>Translate</b> and <b>Speak</b> below them.<p> | |||
<li>Type some text into the top window and click the <b>Translate</b> button. The phonetic translation will appear in the lower window.<p> | |||
<li>Click the <b>Speak</b> button. The text will be spoken and a <b>Prosody</b> tab will open in the main window.<p> | |||
<li>Click on a vowel phoneme which is displayed in the Prosody tab. A red line appears under it to indicate that it has been selected.<p> | |||
<li>Use the <b>up-arrow</b> or <b>down-arrow</b> key to move the vowel's blue pitch contour up or down. Then click the <b>Speak</b> button again to hear the effect of the altered pitch. If the adjacent phoneme also has a pitch contour then you may hear a discontinuity in the sound if it no longer matches with the one which you have moved.<p> | |||
<li>Hold down the <b>Ctrl</b> key while using the <b>up-arrow</b> or <b>down-arrow</b> keys. The gradient of the pitch contour will change.<p> | |||
<li>Click with the right mouse button over a phoneme. A menu allows you to select a different pitch envelope shape. Details of the currently selected phoneme appear in the Status line at the bottom of the window. The <b>Stress</b> number gives the stress level of the phoneme (see voices.html for a list).<p> | |||
<li>Click the <b>Translate</b> button. This re-translates the text and restores the original pitches.<p> | |||
<li>Click on a vowel phoneme in the Prosody window and use the <b><</b> and <b>></b> keys to shorten or lengthen it.<p> | |||
</ol> | |||
The Prosody window can be used to experiment with different phoneme lengths and different intonation.<p> | |||
<hr> | |||
</body> | |||
</html> | |||
@@ -1,143 +0,0 @@ | |||
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd"> | |||
<html> | |||
<head> | |||
<title>Editor - Spectrum</title> | |||
<meta name="GENERATOR" content="Quanta Plus"> | |||
<meta http-equiv="Content-Type" content="text/html; charset=utf-8"> | |||
</head> | |||
<body> | |||
<A href="docindex.html">Back</A> | |||
<hr> | |||
<h2>USER INTERFACE - FORMANT EDITOR</h2> | |||
<hr> | |||
<h3>Frame Sequence Display</h3> | |||
The eSpeak editor can display a number of frame-sequencies in tabbed windows. Each frame can contain a short-time frequency spectrum, covering the period of one cycle at the sound's pitch. Frames can also show: | |||
<ul> | |||
<LI>Blue vertical lines showing the estimated position of the f1 to f5 formants (if the sequence was produced by praat analysis). These should correspond with the peaks in the spectrum, but may not do so exactly<p> | |||
<li>Numbers at the right side of the frame showing the position from the start of the sequence in mS, and the pitch of the sound.<p> | |||
<li>Up to 9 formant peaks (numbered 0 to 9) added by the user, usually to match the peaks in the spectrum, in order to produce the required sound. These are shown in green, can be moved by keyboard presses as described below, and may merge if they are close together. If a frame has formant peaks then it is a Keyframe and is shown with a pale yellow background.<p> | |||
<li>If formant peaks are present, a relative amplitude (r.m.s.) value is shown at the right side of the frame. | |||
<li> | |||
</ul> | |||
<h3>Text Tab</h3> | |||
Enter text in the top left text window. Click the <b>Translate</b> button to see the phonetic transcription in the text window below. Then click the <b>Speak</b> button to speak the text and show the results in the <b>Prosody</b> tab, if that is open. | |||
<p> | |||
If changes are made in the <b>Prosody</b> tab, then clicking <b>Speak</b> will speak the modified prosody while <b>Translate</b> will revert to the default prosody settings for the text. | |||
<p> | |||
To enter phonetic symbols (Kirschenbaum encoding) in the top left text window, enclose them within [[ ]]. | |||
<h3>Spect Tab</h3> | |||
The "Spect" tab in the left panel of the eSpeak editor shows information about the currently selected frame and sequence. | |||
<ul> | |||
<li>The <strong>Formants</strong> section displays the Frequency, Height, and Width of each formant peak (peaks 0 to 8). Peaks 6, 7, 8 don't have a variable width.<p> | |||
<li><strong>% amp - Frame</strong> can be used to adjust the amplitiude of the frame. If you change this value then the rms amplitude value at the right side of the frame will change. The formant peaks don't change, just the overall amplitude of the frame.<p> | |||
<li><strong>mS</strong> shows the time in mS until the next keyframe (or end of sequence if there is none). The spin control initially shows the same value, but this can be changed in order to increase or decrease the effctive length of a keyframe.<p> | |||
<li><strong>% amp - Sequence</strong> /ul> adjusts the amplitude of the whole sequence. Changing this values changes the rms amplitudes of all the keyframes in the sequence.<p> | |||
<li><strong>% mS - Sequence</strong> /ul> shows the total length of the sequence.<p> | |||
<li><strong>Graph</strong><br> | |||
Yellow vertical lines show the position of keyframes within the sequence.<br> | |||
Black bars on these show the frequencies of formant peaks which have been set at these keyframes.<br> | |||
Thick red lines, if present, show the formants, as detected in the original analysis.<br> | |||
Thin black line, if present, shows the pitch profile measured in the original analysis. | |||
</ul> | |||
</li> | |||
</ul> | |||
<h3>Key Commands</h3> | |||
<ul> | |||
<li><strong>Selection</strong>.<p> | |||
The selected frame(s) are shown with a red border. The selected formant peak is also indicated by an equals ("=") sign next to its number in the "Spect" panel to the right of the window.<p> | |||
The selected formant peak is shown with a red triangle under the peak.<p> | |||
Keyframes are shown with a pale yellow background. A keyframe is any frame with any formant peaks which are not zero height. If all formant peaks become zero height, the frame is no longer a keyframe. If you increase a peak's height the frame becomes a keyframe. | |||
<dl> | |||
<dt><strong>Numbers 0 to 8</strong> | |||
<dd>Select formant peak number 0 to 8. | |||
<dt><strong>Page Up/Down</strong> | |||
<dd>Move to next/previous frame | |||
</dl> | |||
<li><strong>Formant movement</strong>. With the following keys, holding down <b>Shift</b> causes slower movement. | |||
<dl> | |||
<dt>Left | |||
<dd>Moves the selected formant peak to higher frequency. | |||
<dt>Right | |||
<dd>Moves the selected formant peak to lower frequency. | |||
<dt>Up | |||
<dd>Increases height of the selected formant peak. | |||
<dt>Down | |||
<dd>Decreases height of the selected formant peak. | |||
<dt><strong><</strong> | |||
<dd>Narrows the selected formant peak. | |||
<dt><strong>></strong> | |||
<dd>Widens the selected formant peak. | |||
<dt><strong>CTRL <</strong> | |||
<dd>Narrows the selected formant peak. | |||
<dt><strong>CTRL ></strong> | |||
<dd>Widens the selected formant peak. | |||
<dt><b>/</b> | |||
<dd>Makes the selected formant peak symmetrical. | |||
</dl> | |||
<li><strong>Frame Cut and Paste</strong> | |||
<dl> | |||
<dt><b>CTRL A</b> | |||
<dd>Select all frames in the sequence. | |||
<dt><b>CTRL C</b> | |||
<dd>Copy selected frames to (internal) clipboard. | |||
<dt><b>CTRL V</b> | |||
<dd>Paste frames from the clipboard to overwrite the contents of the selected frame and the frames which follow it. Only the formant peaks information is pasted. | |||
<dt><b>CTRL SHIFT V</b> | |||
<dd>Paste frames from the clippoard to insert them above the selected frame. | |||
<dt><b>CTRL X</b> | |||
<dd>Delete the selected frames. | |||
</dl> | |||
<li><strong>Frame editing</strong> | |||
<dl> | |||
<dt><b>CTRL D</b> | |||
<dd>Copy the formant peaks down to the selected frame from the next keyframe above. | |||
<dt><b>CTRL SHIFT D</b> | |||
<dd>Copy the formant peaks up to the selected frame from the next key-frame below. | |||
<dt><b>CTRL Z</b> | |||
<dd>Set all formant peaks in the selected frame to zero height. It is no longer a key-frame. | |||
<dt><b>CTRL I</b> | |||
<dd>Set the formant peaks in the selected frame as an interpolation between the next keyframes above and below it. A dialog box allows you to enter a percentage. 50% gives values half-way between the two adjacent key-frames, 0% gives values equal to the one above, and 100% equal to the one below. | |||
</dl> | |||
<li><strong>Display and Sound</strong> | |||
<dl> | |||
<dt><b>CTRL Q</b> | |||
<dd>Shows interpolated formant peaks on non-keyframes. These frames don't become keyframes until any of the peaks are edited to increase their height. | |||
<dt><b>CTRL SHIFT Q</b> | |||
<dd>Removes the interpolated formant peaks display. | |||
<dt><b>CTRL G</b> | |||
<dd>Toggle grid on and off. | |||
<dt><b>F1</b> | |||
<dd>Play sound made from the one selected keyframe. | |||
<dt><b>F2</b> | |||
<dd>Play sound made from all the keyframes in the sequence. | |||
</ul> | |||
<p> | |||
<hr> | |||
<h2>USER INTERFACE - PROSODY EDITOR</h2> | |||
<hr> | |||
<ul><LI> | |||
<dl> | |||
<dt><b>Left</b> | |||
<dd>Move to previous phoneme. | |||
<dt><b>Right</b> | |||
<dd>Move to next phoneme. | |||
<dt><b>Up</b> | |||
<dd>Increase pitch. | |||
<dt><b>Down</b> | |||
<dd>Decrease pitch. | |||
<dt><b>Ctrl Up</b> | |||
<dd>Increase pitch range. | |||
<dt><b>Ctrl Down</b> | |||
<dd>Decrease pitch range. | |||
<dt><b>></b> | |||
<dd>Increase length. | |||
<dt><b><</b> | |||
<dd>Decrease length. | |||
</dd> | |||
</dl> | |||
</LI> | |||
</ul> | |||
</body> | |||
</html> |
@@ -3,7 +3,14 @@ | |||
- [Features](#features) | |||
- [History](#history) | |||
- [Languages](languages.html) | |||
- [Documents](docindex.html) | |||
- [Adding a Language](add_language.html) | |||
- [Pronunciation Dictionary](dictionary.html) | |||
- [Voice Files](voices.html) | |||
- [MBROLA Voices](mbrola.html) | |||
- [Phonemes](phonemes.html) | |||
- [Phoneme Tables](phontab.html) | |||
- [Intonation](intonation.html) | |||
- [Markup Tags](ssml.html) | |||
- [License](../COPYING) | |||
---------- |