A dataset of informal Persian audio and text chunks, along with a fully open processing pipeline, suitable for ASR and TTS tasks. Created from crawled content on virgool.io.
Mahta Fetrat 90a406d545
Add the dataset processing notebook
10 months ago
LICENSE Initial commit 10 months ago
README.md Initial commit 10 months ago
VirgoolInformal_Dataset_Processing.ipynb Add the dataset processing notebook 10 months ago

VirgoolInformal-Dataset