Organ-aware 3D lesion segmentation dataset and pipeline for abdominal CT analysis (ACM Multimedia 2025 candidate)
Updated 1 hour ago
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
Updated 1 week ago
HomoRich: The first large-scale Persian homograph dataset for G2P conversion, featuring 528K annotated sentences with balanced pronunciation variants and dual phoneme representations.
Updated 1 week ago
A Persian grapheme-to-phoneme (G2P) model designed for homograph disambiguation, fine-tuned using the HomoRich dataset to improve pronunciation accuracy.
Updated 1 week ago
Updated 1 week ago
Benchmarking notebooks for various Persian G2P models, comparing their performance on the SentenceBench dataset, including Homo-GE2PE and Homo-T5.
Updated 1 week ago
DeepTraCDR: Prediction Cancer Drug Response using multimodal deep learning with Transformers
Updated 3 weeks ago
Updated 3 months ago
Updated 3 months ago
A dataset of informal Persian audio and text chunks, along with a fully open processing pipeline, suitable for ASR and TTS tasks. Created from crawled content on virgool.io.
Updated 3 months ago
Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G2P tasks without additional training, featuring Sentence-Bench and Kaamel-Dict.
Updated 3 months ago
A free licensed Persian TTS dataset including 6+ hours of audio-text pairs with subject
Updated 3 months ago
Python package for detecting informal Persian text using regular expressions and rule-based methods
Updated 3 months ago
ManaTTS is the largest open Persian speech dataset with 86+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.
Updated 3 months ago
Updated 6 months ago
Updated 8 months ago
Updated 8 months ago
Updated 9 months ago
Updated 10 months ago
Updated 11 months ago