Persian/Farsi text to speech(TTS) training using coqui tts
-
Updated
Feb 15, 2025 - Jupyter Notebook
Persian/Farsi text to speech(TTS) training using coqui tts
CLIPfa: Connecting Farsi Text and Images
A comprehensive dataset for determining gender based on Persian names, enriched with English representations.
A collection of Farsi (Persian) datasets
An Image Dataset of Printed Farsi Text for OCR Research
The first intelligent Persian reverse dictionary
Persian Datasets including: Wikipedia, Twitter, Hamshahri, Hellokish, NSURL'19, Peyma, Text_mining.ir
This repository hosts BonyadAI, a Persian question answering AI Model. We developed an initial web crawler and scraper to gather the dataset. The second phase involved building a machine learning model based on word embeddings and NLP techniques. This AI model operates end-to-end, receiving user voice input and providing responses in Persian voice.
Official github repository, Persis: A persian font recognition pipeline using convolutional neural networks.
In this repository, the wavLM model is used for quality and poor quality data for speaker verification task, and the PyCM library is used for evaluation.
A Custom ANN Model for Classifying Farsi News Topics With User Interface.
Python library for analyzing Persian texts. With the ability to analyze customer opinions and their offer status, analyzing the seven emotions in Persian sentences at the moment.
Persian to Finglish dataset with all the sentences voice for TTS dataset used to train tacotron2
Welcome to the Persian Last Names Dataset, a comprehensive collection of over 100,000 Persian surnames accompanied by their respective frequencies. This dataset is curated from a substantial real-world sample of more than 10 million records, ensuring reliable and representative data for various applications.
Simple Script To Crawl Data From Persian News Agencies Including Fars, Mehr.
The first dataset for Farsi fact extraction and verification
This repo shows how to finetune the wav2vec2.0 model along with its prerequisites.
A comprehensive repository of classical Persian poetry, curated from Ganjoor.net, designed for Natural Language Processing (NLP), machine learning applications, and literary research.
🎤 Generate high-quality, offline text-to-speech audio with Coqui-TTS for enhanced accessibility and creativity in your projects.
Add a description, image, and links to the farsi-datasets topic page so that developers can more easily learn about it.
To associate your repository with the farsi-datasets topic, visit your repo's landing page and select "manage topics."