A dataset of informal Persian audio and text chunks, along with a fully open processing pipeline, suitable for ASR and TTS tasks. Created from crawled content on virgool.io.

MahtaFetrat 5d07f4ec0f add third party licenses		1 year ago
..
README.md	add third party licenses	1 year ago
hazm.txt	add third party licenses	1 year ago
hezar.txt	add third party licenses	1 year ago
jiwer.txt	add third party licenses	1 year ago
parsi_io.txt	add third party licenses	1 year ago
pydub.txt	add third party licenses	1 year ago
spleeter.txt	add third party licenses	1 year ago
vosk.txt	add third party licenses	1 year ago
wav2vec_fa.txt	add third party licenses	1 year ago
whisper_fa.txt	add third party licenses	1 year ago

Third-Party Licenses

This directory contains the licenses for the third-party tools and libraries used in this project. Below is a list of the tools along with their licenses.

Tools and Licenses

Tool Name	Usage	Repository Page	License
Parsi.io	Number extraction & number to text conversion	GitHub	Apache-2.0
Hazm	Text normalization	GitHub	MIT
Pydub	Silence detection/removal	GitHub	MIT
Perpos	Part of speech tagging for sentence tokenization	GitHub	MIT
Vosk	Forced alignment	GitHub	Apache-2.0
Whisper-fa	Forced alignment	HuggingFace	Apache-2.0
Wav2vec2-v3	Forced alignment	HuggingFace	-
Wav2vec2-fa	Forced alignment	GitHub	Apache-3.0
Hezar	Forced alignment	GitHub	Apache-2.0
JiWER	CER calculation	GitHub	Apache-2.0

License Files

This directory also contains the actual license files for each tool:

Please refer to these files for the full text of each license.

README.md

Third-Party Licenses

Tools and Licenses

License Files