|
|
@@ -43,11 +43,13 @@ You can view and run [the processing notebook in Google Colab](https://colab.res |
|
|
|
|
|
|
|
The dataset can be used to train Persian ASR and TTS models, specifically tailored for informal Persian speech. Additionally, it can be utilized to evaluate ASR models in terms of Character Error Rate (CER). For example, see [this repository](https://github.com/MahtaFetrat/Open-Persian-ASRs) for an ASR evaluation setup. |
|
|
|
|
|
|
|
## Links |
|
|
|
## Dataset |
|
|
|
The dataset can be accessed through the following links: |
|
|
|
|
|
|
|
- [Raw Data](link to be updated) |
|
|
|
- [Processed Data](link to be updated) |
|
|
|
- [Processing Notebook in Colab](https://colab.research.google.com/drive/1AjvrRisJYdqvNdSDKdSWfxge6S29mavm?usp=sharing) |
|
|
|
|
|
|
|
You can also see a smaller randomly selected subset of the processed data in [this directory](link to be updated). |
|
|
|
|
|
|
|
## License |
|
|
|
|