A dataset of informal Persian audio and text chunks suitable for ASR and TTS tasks, released with a fully open processing pipeline. Created from content crawled from virgool.io.
Mahta Fetrat 90a406d545
Add the dataset processing notebook
1 year ago
LICENSE Initial commit 1 year ago
README.md Initial commit 1 year ago
VirgoolInformal_Dataset_Processing.ipynb Add the dataset processing notebook 1 year ago

VirgoolInformal-Dataset