ManaTTS is the largest open Persian speech dataset with 86+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

README.md 3.2KB

123456789101112131415161718192021222324252627282930313233343536
  1. # Third-Party Licenses
  2. This directory contains the licenses for the third-party tools and libraries used in this project. Below is a list of the tools along with their licenses.
  3. ## Tools and Licenses
  4. | Tool Name | Usage | Repository Page | License |
  5. |----------------|------------------------------------------------------------------------|--------------------------------------------------------------------------------------|------------|
  6. | Spleeter | Source separation (remove background music) | [GitHub](https://github.com/deezer/spleeter) | MIT |
  7. | Parsi.io | Number extraction & number to text conversion | [GitHub](https://github.com/language-ml/parsi.io) | Apache-2.0 |
  8. | Hazm | Text normalization | [GitHub](https://github.com/roshan-research/hazm) | MIT |
  9. | Pydub | Silence detection/removal | [GitHub](https://github.com/jiaaro/pydub) | MIT |
  10. | Perpos | Part of speech tagging for sentence tokenization | [GitHub](https://github.com/mhbashari/perpos) | MIT |
  11. | Vosk | Forced alignment | [GitHub](https://github.com/alphacep/vosk) | Apache-2.0 |
  12. | Whisper-fa | Forced alignment | [HuggingFace](https://huggingface.co/speechbrain/asr-whisper-large-v2-commonvoice-fa) | Apache-2.0 |
  13. | Wav2vec2-v3 | Forced alignment | [HuggingFace](https://huggingface.co/m3hrdadfi/wav2vec2-large-xlsr-persian-v3) | - |
  14. | Wav2vec2-fa | Forced alignment | [GitHub](https://github.com/Hamtech-ai/wav2vec2-fa) | Apache-3.0 |
  15. | Hezar | Forced alignment | [GitHub](https://github.com/hezarai/hezar) | Apache-2.0 |
  16. | JiWER | CER calculation | [GitHub](https://github.com/jitsi/jiwer) | Apache-2.0 |
  17. ## License Files
  18. This directory also contains the actual license files for each tool:
  19. - [Spleeter License (MIT)](spleeter.txt)
  20. - [Parsi.io License (Apache-2.0)](parsi_io.txt)
  21. - [Hazm License (MIT)](hazm.txt)
  22. - [Pydub License (MIT)](pydub.txt)
  23. - [Perpos License (MIT)](perpos.txt)
  24. - [Vosk License (Apache-2.0)](vosk.txt)
  25. - [Whisper-fa License (Apache-2.0)](whisper_fa.txt)
  26. - [Wav2vec2-fa License (Apache-3.0)](wav2vec2_fa.txt)
  27. - [Hezar License (Apache-2.0)](hezar.txt)
  28. - [JiWER License (Apache-2.0)](jiwer.txt)
  29. Please refer to these files for the full text of each license.