Browse Source

Add link to the crawling script on colab

main
Mahta Fetrat 8 months ago
parent
commit
65b3279621
No account linked to committer's email address
1 changed files with 1 additions and 1 deletions
  1. 1
    1
      README.md

+ 1
- 1
README.md View File

@@ -6,7 +6,7 @@ ManaTTS is the largest publicly accessible single-speaker Persian corpus, compri
The ManaTTS dataset can be downloaded from [this link](link to be updated).

## Raw Data Crawling
The raw data for this dataset was crawled from the Nasl-e-Mana magazine website. The crawling script used for this purpose is also provided in this repository.
The raw data for this dataset was crawled from the Nasl-e-Mana magazine website. The crawling script used for this purpose is also provided in this repository and on Google Colab in [this link](https://colab.research.google.com/drive/1_E5KYAwuCr9B8k6EPYjVErsx-7rrr8Vl?usp=sharing).

## Processing Pipeline
The following figure illustrates the overall processing pipeline used to create the ManaTTS dataset, including the steps for preproces

Loading…
Cancel
Save