Skip to content

Latest commit

 

History

History
31 lines (23 loc) · 952 Bytes

File metadata and controls

31 lines (23 loc) · 952 Bytes

⚠️ Note

This project was originally developed in 2019. Some libraries, APIs, and functions used in the code may now be deprecated or updated.

It is recommended to use this repository primarily to understand the overall workflow, feature engineering techniques, and modeling approach.

For practical use, please update the dependencies and modify deprecated functions to align with the latest versions of the libraries.

🧠 NLP Techniques Used

The following NLP techniques are used for feature extraction:

  • Bag of Words (BoW)
  • TF-IDF
  • Word Embeddings
    • Word2Vec
    • Doc2Vec

📌 Each of the below classification algorithms has been trained and evaluated on all of the above NLP techniques.


🤖 Classification Algorithms

  • Logistic Regression
  • Support Vector Machine (SVM)
  • Random Forest
  • Extreme Gradient Boosting (XGBoost)
  • Light Gradient Boosting (LightGBM)