ManaTTS is the largest open Persian speech dataset, with 86+ hours of transcribed audio. It includes the data collection pipeline and tools, and is suitable for Persian text-to-speech models.
Updated 3 days ago
A Python package for detecting informal Persian text using regular expressions and rule-based methods.
Updated 3 days ago
A freely licensed Persian TTS dataset including 6+ hours of audio-text pairs with subject
Updated 3 days ago
Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G2P tasks without additional training, featuring Sentence-Bench and Kaamel-Dict.
Updated 3 days ago
A dataset of informal Persian audio and text chunks, along with a fully open processing pipeline, suitable for ASR and TTS tasks. Created from crawled content on virgool.io.
Updated 3 days ago
Melu project implemented with l2l, using MetaSGD instead of MAML
Updated 3 years ago
Official implementation of the Fake News Revealer paper
Updated 1 year ago
This is the code for Haji's MSc thesis. Use it with extreme caution.
Updated 3 years ago