Skip to content

Latest commit

 

History

History
20 lines (15 loc) · 604 Bytes

File metadata and controls

20 lines (15 loc) · 604 Bytes

Arabic-Text-Statistics

This script generates some basic statistics such as letters counts from an Arabic text. It will perform some preprocessing which are:

  1. Deleteing Tashkeel (الحركات):
  2. Delete Tatweel (مثال: العـــــــــــربية >> العربية).
  3. Delete symbols (ex: ! ? ,).

Getting Started

This script is written using Python 3.

Prerequisites

The main package that is used here is PyArabic.

All prerequisites can be installed from requirements.txt file.

After enetring the project directiry, run this command:

pip install -r requirements.txt