sms-spam-classification-bert

High-precision SMS spam detection system built on BERT. Fine-tuned with PyTorch and Transformers, it reaches 98.0% accuracy on the test set.

📚 About Data

More than 5,500 real, labeled text messages. The dataset was created by Tiago A. Almeida and José M. Gómez Hidalgo and is available here.

🎯 Key Features

Data Visualization: Several charts show relevant data features.
Transformer-based: Leverages bert-base-cased for deep contextual understanding.
Optimized Training: Uses a linear learning rate scheduler with a peak LR of 2e-5 that decays to zero, which helps stabilize convergence and limit overfitting.
High Performance: Reached 98.0% accuracy on the test set.
Ready for Production: Model weights are hosted on Hugging Face for easy integration.
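
The linear schedule mentioned under Optimized Training can be illustrated with a small, dependency-free sketch. The function name `linear_lr`, the `total_steps` value, and the zero-warmup default are assumptions made for illustration and are not taken from the notebooks. In a Transformers training loop the same shape is typically obtained with `get_linear_schedule_with_warmup`.

```python
def linear_lr(step: int, total_steps: int,
              peak_lr: float = 2e-5, warmup_steps: int = 0) -> float:
    """Learning rate at `step` for a linear schedule that decays to zero.

    Hypothetical helper: warmup_steps=0 is an assumption, since the README
    only states the peak LR (2e-5) and the decay to zero.
    """
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    if step < warmup_steps:
        # Optional linear warmup from 0 up to peak_lr.
        return peak_lr * step / max(1, warmup_steps)
    # Linear decay from peak_lr down to 0 at total_steps (clamped at 0 afterwards).
    remaining = max(0, total_steps - step)
    return peak_lr * remaining / max(1, total_steps - warmup_steps)


# Print the rate at a few points of an illustrative 1000-step run.
for s in (0, 250, 500, 750, 1000):
    print(s, linear_lr(s, total_steps=1000))
```

The rate starts at the peak and shrinks by the same amount every step, so late updates are small and less likely to disturb what the model has already learned.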

📊 Results

The model performs strongly across all metrics, with precision, recall, and F1 reported for the spam class:

Metric	Score
Accuracy	0.98
Precision (Spam)	0.99
Recall (Spam)	0.92
F1-Score (Spam)	0.93
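
For readers less familiar with these metrics, the sketch below shows how they are conventionally computed when spam is treated as the positive class. The function name `spam_metrics` and the confusion-matrix counts are hypothetical toy values chosen for illustration; they are not the project's actual test-set counts.

```python
def spam_metrics(tp: int, fp: int, fn: int, tn: int) -> dict:
    """Standard binary metrics with spam as the positive class.

    tp: spam predicted as spam      fp: ham predicted as spam
    fn: spam predicted as ham       tn: ham predicted as ham
    """
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Toy counts (hypothetical, not from this project).
print(spam_metrics(tp=8, fp=2, fn=2, tn=88))
```

High precision means few legitimate messages are flagged as spam, while recall measures how much of the actual spam is caught.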

🚀 Model Hosting

Due to GitHub's file size limitations, the trained model weights are hosted on the Hugging Face Hub.

You can access the model here: https://huggingface.co/mrcsgh/bert-sms-spam-classifier
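
A minimal sketch of loading the hosted weights with the Transformers `pipeline` API follows. It assumes the Hub repository contains the tokenizer files and a sequence-classification config alongside the weights; the helper names `load_classifier` and `classify` are hypothetical, and the label strings returned depend on the model's config (they may be generic, such as `LABEL_0` and `LABEL_1`).

```python
MODEL_ID = "mrcsgh/bert-sms-spam-classifier"  # repository named in this README


def load_classifier(model_id: str = MODEL_ID):
    """Download the model from the Hugging Face Hub and wrap it in a pipeline.

    Assumes `transformers` and `torch` are installed and that the repository
    ships a tokenizer and config (not confirmed by this README).
    """
    from transformers import pipeline  # imported lazily to keep the sketch light
    return pipeline("text-classification", model=model_id)


def classify(messages, classifier):
    """Return (message, label, score) tuples for each SMS in `messages`."""
    messages = list(messages)
    results = classifier(messages)  # one {"label": ..., "score": ...} per message
    return [(m, r["label"], r["score"]) for m, r in zip(messages, results)]


# Example usage (requires network access to download the weights):
# clf = load_classifier()
# for message, label, score in classify(["Free entry! Text WIN to claim"], clf):
#     print(f"{label} ({score:.3f}): {message}")
```

Keeping `classify` independent of how the classifier was created makes it easy to swap in a locally saved checkpoint instead of the Hub copy.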
