eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

index.md 2.9KB

1234567891011121314151617181920212223242526272829303132333435363738394041424344454647484950515253545556575859606162636465666768697071
  1. # eSpeak NG: Speech Synthesizer
  2. - [Features](#features)
  3. - [History](#history)
  4. - [Adding a Language](add_language.md)
  5. - [Text to Phoneme Translation](dictionary.md)
  6. - [Voice Files](voices.md)
  7. - [MBROLA Voices](mbrola.md)
  8. - [Phoneme Tables](phontab.md)
  9. - [Intonation](intonation.md)
  10. - [Markup Tags](markup.md)
  11. - [License](../COPYING)
  12. ----------
  13. eSpeak NG is a compact open source software speech synthesizer for English and
  14. other languages, for Linux and Windows.
  15. eSpeak NG uses a "formant synthesis" method. This allows many languages to be
  16. provided in a small size. The speech is clear, and can be used at high speeds,
  17. but is not as natural or smooth as larger synthesizers which are based on human
  18. speech recordings.
  19. eSpeak NG is available as:
  20. * A command line program (Linux and Windows) to speak text from a file or
  21. from stdin.
  22. * A shared library version for use by other programs. (On Windows this is
  23. a DLL).
  24. * A SAPI5 version for Windows, so it can be used with screen-readers and
  25. other programs that support the Windows SAPI5 interface.
  26. * eSpeak NG has been ported to other platforms, including Solaris and Mac
  27. OSX.
  28. ## Features
  29. * Includes different Voices, whose characteristics can be altered.
  30. * Can produce speech output as a WAV file.
  31. * SSML (Speech Synthesis Markup Language) is supported (not complete),
  32. and also HTML.
  33. * Compact size. The program and its data, including many languages,
  34. totals about 1.4 Mbytes.
  35. * Can be used as a front-end to [MBROLA diphone voices](mbrola.md).
  36. eSpeak NG converts text to phonemes with pitch and length information.
  37. * Can translate text into phoneme codes, so it could be adapted as a
  38. front end for another speech synthesis engine.
  39. * Potential for other languages. Several are included in varying stages
  40. of progress. Help from native speakers for these or other languages is
  41. welcome.
  42. * Written in C.
  43. The eSpeak speech synthesizer supports over 70 languages, however in many cases
  44. these are initial drafts and need more work to improve them. Assistance from
  45. native speakers is welcome for these, or other new languages. Please contact me
  46. if you want to help.
  47. ## History
  48. The program was originally known as __speak__ and originally written
  49. for Acorn/RISC\_OS computers starting in 1995 by Jonathan Duddington. This was
  50. enhanced and re-written in 2007 as __eSpeak__, including a relaxation of the
  51. original memory and processing power constraints, and with support for additional
  52. languages.
  53. In 2010, Reece H. Dunn started maintaining a version of eSpeak on GitHub that
  54. was designed to make it easier to build eSpeak on POSIX systems, porting the
  55. build system to autotools in 2012. In late 2015, this project was officially
  56. forked to a new __eSpeak NG__ project. The new eSpeak NG project is a significant
  57. departure from the eSpeak project, with the intention of cleaning up the
  58. existing codebase, adding new features and adding and improving to the
  59. supported languages.