HomoRich: The first large-scale Persian homograph dataset for G2P conversion, featuring 528K annotated sentences with balanced pronunciation variants and dual phoneme representations.
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Mahta Fetrat a5c775863c
Add files via upload
2 weeks ago
assets Add files via upload 2 weeks ago
data Add files via upload 2 weeks ago
scripts Add files via upload 2 weeks ago
LICENSE Initial commit 2 weeks ago
README.md Initial commit 2 weeks ago

README.md

HomoRich-G2P-Persian