Interpretable Dialect Classifier

This is the official repository for the paper Extracting Lexical Features from Dialects via Interpretable Dialect Classifiers. It provides the code for training and evaluating the dialect classifiers, as well as the code for extracting and evaluating the lexical features.

Requirements

pip install -r requirements.txt

Data

The data used in this work are the FRMT, LSDC, ITDI, and Europarl v8 datasets. The processed data and data processing scripts are placed in the data directory.

Training

The training code for both the leave-one-out (LOO) and selfExplain approaches is in the model directory.
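
For readers unfamiliar with leave-one-out attribution, the general idea can be sketched as follows: remove one token at a time and record how much the classifier's confidence drops. This is only an illustrative sketch, not the code in the model directory, which may differ. The function names and the toy classifier (toy_predict_proba, with its single hand-picked cue word) are hypothetical stand-ins for a trained dialect classifier.

```python
from typing import Callable, List, Tuple


def leave_one_out_scores(
    tokens: List[str],
    predict_proba: Callable[[List[str]], float],
) -> List[Tuple[str, float]]:
    """Score each token by the drop in predicted probability when it is removed."""
    base = predict_proba(tokens)
    scores = []
    for i, token in enumerate(tokens):
        # Rebuild the input without the i-th token and re-score it.
        reduced = tokens[:i] + tokens[i + 1:]
        scores.append((token, base - predict_proba(reduced)))
    return scores


def toy_predict_proba(tokens: List[str]) -> float:
    """Hypothetical stand-in for a trained dialect classifier.

    Returns the fraction of tokens that appear in a tiny hand-picked cue list.
    """
    cues = {"autocarro"}  # assumption: an illustrative cue word, not from the paper
    if not tokens:
        return 0.0
    return sum(t in cues for t in tokens) / len(tokens)


if __name__ == "__main__":
    sentence = "o autocarro chegou cedo".split()
    ranked = sorted(
        leave_one_out_scores(sentence, toy_predict_proba),
        key=lambda pair: pair[1],
        reverse=True,
    )
    for token, score in ranked:
        print(f"{token}\t{score:+.3f}")
```

Tokens with the largest positive scores are the ones the classifier relies on most for its prediction, which is what makes them candidate lexical features.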

Evaluation

The evaluation code and data, covering plausibility, sufficiency, and human evaluation, are in the evaluation directory.

Citation

If you use our tool, we'd appreciate it if you cited the following paper:
