This project tackles a binary classification problem: fingerprint spoofing detection, i.e., distinguishing genuine from counterfeit fingerprint images. The dataset consists of features extracted from fingerprint images, with genuine samples labeled 1 (True) and fake samples labeled 0 (False).
The dataset used in this project consists of 6-dimensional feature vectors. The training data is stored in Project/trainData.txt, a CSV file where each row represents a sample with the first 6 values being the features and the last value representing the class label (1 or 0). The samples are not ordered.
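Given the comma-separated layout described above, loading can be sketched as follows (the function name and the features-as-columns convention are illustrative assumptions, not taken from the project code):

```python
import numpy as np

def load_data(path):
    """Load the CSV file: first 6 columns are features, last is the label."""
    data = np.loadtxt(path, delimiter=",")
    X = data[:, :6].T           # features as columns -> D x N matrix
    y = data[:, 6].astype(int)  # labels: 1 genuine, 0 fake
    return X, y
```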
- Load the dataset.
- Plot histograms and pair-wise scatter plots of the features.
- Analyze the distribution and overlap of classes for different feature pairs.
- Apply PCA to reduce dimensionality and analyze the variance captured by each principal component.
- Apply LDA to find the best linear separation between the two classes.
- Evaluate the performance of classifiers using PCA and LDA.
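The PCA and LDA steps above can be sketched with NumPy as follows (a minimal sketch, assuming a D x N data matrix with samples as columns; for the binary case, LDA reduces to a single direction w ∝ Sw⁻¹(μ₁ − μ₀)):

```python
import numpy as np

def pca(X, m):
    """Project a D x N data matrix onto its m leading principal directions.
    Also returns the eigenvalues, i.e. the variance captured per component."""
    mu = X.mean(axis=1, keepdims=True)
    C = (X - mu) @ (X - mu).T / X.shape[1]  # empirical covariance
    eigvals, eigvecs = np.linalg.eigh(C)    # ascending order
    P = eigvecs[:, ::-1][:, :m]             # top-m directions
    return P.T @ X, eigvals[::-1]

def lda_direction(X, y):
    """Binary LDA direction: w proportional to Sw^-1 (mu1 - mu0)."""
    X0, X1 = X[:, y == 0], X[:, y == 1]
    mu0 = X0.mean(axis=1, keepdims=True)
    mu1 = X1.mean(axis=1, keepdims=True)
    Sw = ((X0 - mu0) @ (X0 - mu0).T + (X1 - mu1) @ (X1 - mu1).T) / X.shape[1]
    w = np.linalg.solve(Sw, (mu1 - mu0).ravel())
    return w / np.linalg.norm(w)
```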
- Fit univariate Gaussian models to the features.
- Plot the distribution densities on top of normalized histograms.
- Analyze the goodness of fit for Gaussian models.
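For the univariate fit, the ML estimates are simply the per-feature empirical mean and variance; a density can then be drawn over the normalized histogram. A minimal sketch (the plotting lines are illustrative and commented out):

```python
import numpy as np

def gaussian_logpdf(x, mu, var):
    """Log-density of a univariate Gaussian N(mu, var)."""
    return -0.5 * np.log(2 * np.pi * var) - 0.5 * (x - mu) ** 2 / var

# For one feature of one class:
#   mu, var = feat.mean(), feat.var()
#   plt.hist(feat, bins=30, density=True)
#   grid = np.linspace(feat.min(), feat.max(), 200)
#   plt.plot(grid, np.exp(gaussian_logpdf(grid, mu, var)))
```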
- Split the dataset into training and validation sets.
- Train and evaluate different Gaussian classifiers: MVG (full-covariance multivariate Gaussian), tied-covariance Gaussian, and naive Bayes Gaussian.
- Compute log-likelihood ratios and evaluate classifier performance.
- Analyze classifier performance under different prior probabilities and cost scenarios.
- Compute the actual and minimum Decision Cost Function (DCF) for various applications.
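The LLR and actual-DCF computations above can be sketched as follows (a sketch under the usual assumptions: scores are log-likelihood ratios, thresholded at the Bayes-optimal threshold −log(πC_fn / ((1−π)C_fp)), and the DCF is normalized by the cost of a dummy system):

```python
import numpy as np
from scipy.stats import multivariate_normal

def llr_mvg(X, mu0, C0, mu1, C1):
    """Log-likelihood ratio log p(x|1) - log p(x|0) for each column of X."""
    return (multivariate_normal.logpdf(X.T, mu1, C1)
            - multivariate_normal.logpdf(X.T, mu0, C0))

def actual_dcf(llr, y, pi, Cfn=1.0, Cfp=1.0):
    """Normalized DCF of the Bayes-optimal threshold applied to raw scores."""
    t = -np.log(pi * Cfn / ((1 - pi) * Cfp))
    pred = (llr > t).astype(int)
    Pfn = np.mean(pred[y == 1] == 0)   # miss rate
    Pfp = np.mean(pred[y == 0] == 1)   # false-alarm rate
    dcf = pi * Cfn * Pfn + (1 - pi) * Cfp * Pfp
    return dcf / min(pi * Cfn, (1 - pi) * Cfp)
```

The minimum DCF follows the same formula, but sweeps all thresholds given by the sorted scores and keeps the lowest cost.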
- Train logistic regression models with different regularization parameters.
- Evaluate the impact of regularization on classification performance.
- Analyze the effects of data preprocessing techniques like centering and PCA.
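Regularized logistic regression can be trained with `scipy.optimize.fmin_l_bfgs_b`; a minimal sketch assuming an L2 penalty of λ/2·‖w‖² on the average cross-entropy loss (function names are illustrative):

```python
import numpy as np
from scipy.optimize import fmin_l_bfgs_b

def train_logreg(X, y, lam):
    """Binary logistic regression on a D x N matrix with L2 regularization."""
    z = 2.0 * y - 1.0                      # labels in {-1, +1}
    N = X.shape[1]

    def obj(v):
        w, b = v[:-1], v[-1]
        s = w @ X + b
        J = np.logaddexp(0, -z * s).mean() + 0.5 * lam * (w @ w)
        g = -z / (1.0 + np.exp(z * s))     # per-sample dJ/ds (times N)
        return J, np.append(X @ g / N + lam * w, g.mean())

    v, _, _ = fmin_l_bfgs_b(obj, np.zeros(X.shape[0] + 1))
    return v[:-1], v[-1]                   # weights, bias
```

Centering and PCA are applied to X before calling this function; sweeping `lam` over a log-spaced grid reproduces the regularization analysis.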
- Train support vector machine models with different kernel functions and parameters.
- Evaluate the impact of kernel functions and parameters on classification performance.
- Visualize the decision boundaries of support vector machine models.
- Analyze the effects of data preprocessing techniques like centering and polynomial feature expansion.
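A kernel SVM can be trained in the dual with box constraints only, by folding the bias into the kernel via a constant offset (a common trick that removes the equality constraint). This is a minimal RBF sketch, not the project's actual implementation:

```python
import numpy as np
from scipy.optimize import fmin_l_bfgs_b

def train_svm_rbf(X, y, C, gamma, eps=1.0):
    """Soft-margin RBF SVM solved in the dual; eps adds a constant to the
    kernel so the bias term needs no separate equality constraint."""
    z = 2.0 * y - 1.0
    D2 = ((X[:, :, None] - X[:, None, :]) ** 2).sum(axis=0)
    K = np.exp(-gamma * D2) + eps
    H = (z[:, None] * z[None, :]) * K

    def obj(a):
        Ha = H @ a
        return 0.5 * a @ Ha - a.sum(), Ha - 1.0   # dual loss and gradient

    a, _, _ = fmin_l_bfgs_b(obj, np.zeros(X.shape[1]),
                            bounds=[(0, C)] * X.shape[1])

    def score(Xt):
        """Decision scores for a D x M matrix of test samples."""
        D2t = ((X[:, :, None] - Xt[:, None, :]) ** 2).sum(axis=0)
        return (a * z) @ (np.exp(-gamma * D2t) + eps)

    return score
```

A linear or polynomial SVM follows the same recipe with a different kernel matrix K; polynomial feature expansion amounts to mapping X before the linear kernel.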
- Train Gaussian mixture models with different covariance types and numbers of components.
- Evaluate the impact of covariance type and number of components on classification performance.
- Visualize the decision boundaries of Gaussian mixture models.
- Save the best classifier based on combined score.
- Evaluate the performance of the best classifier using Bayes error plots (actual and minimum DCF as a function of the effective prior).
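The core of GMM training is the EM iteration; a compact full-covariance sketch (the project presumably combines this with LBG-style component splitting, and the small diagonal term added to each covariance is an illustrative regularization):

```python
import numpy as np
from scipy.special import logsumexp
from scipy.stats import multivariate_normal

def gmm_logpdf(X, gmm):
    """Log-density of D x N data under a GMM given as [(w, mu, C), ...]."""
    S = np.array([np.log(w) + multivariate_normal.logpdf(X.T, mu, C)
                  for w, mu, C in gmm])
    return logsumexp(S, axis=0)

def em_step(X, gmm):
    """One EM iteration for a full-covariance GMM."""
    S = np.array([np.log(w) + multivariate_normal.logpdf(X.T, mu, C)
                  for w, mu, C in gmm])
    R = np.exp(S - logsumexp(S, axis=0))       # responsibilities, K x N
    new = []
    for r in R:
        Z = r.sum()
        mu = (X * r).sum(axis=1) / Z
        diff = X - mu[:, None]
        C = (diff * r) @ diff.T / Z + 1e-6 * np.eye(X.shape[0])
        new.append((Z / X.shape[1], mu, C))
    return new
```

Diagonal and tied variants only change how C is re-estimated in the M-step.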
- Load the best classifiers for logistic regression, support vector machines, and Gaussian mixture models.
- Evaluate the performance of the best classifiers using Bayes error plots (actual and minimum DCF vs. effective prior).
- Perform k-fold calibration on the validation scores of the best classifiers.
- Evaluate the performance of the calibrated scores using Bayes error plots (actual and minimum DCF vs. effective prior).
- Fuse the scores of logistic regression, support vector machines, and Gaussian mixture models.
- Evaluate the performance of the fused scores using Bayes error plots (actual and minimum DCF vs. effective prior).
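Score calibration can be implemented as prior-weighted logistic regression on the raw scores: learn an affine map a·s + b whose output, after removing the log-odds of the training prior, behaves as a well-calibrated LLR. A minimal sketch (function names are illustrative):

```python
import numpy as np
from scipy.optimize import fmin_l_bfgs_b

def train_calibration(s, y, prior):
    """Affine calibration of 1-D scores via prior-weighted logistic regression."""
    z = 2.0 * y - 1.0
    w = np.where(y == 1, prior / (y == 1).sum(), (1 - prior) / (y == 0).sum())

    def obj(v):
        a, b = v
        t = a * s + b
        loss = (w * np.logaddexp(0, -z * t)).sum()
        g = w * (-z / (1.0 + np.exp(z * t)))   # per-sample dloss/dt
        return loss, np.array([(g * s).sum(), g.sum()])

    v, _, _ = fmin_l_bfgs_b(obj, np.zeros(2))
    a, b = v
    off = np.log(prior / (1 - prior))
    return lambda snew: a * snew + b - off     # calibrated LLRs
```

Fusion follows the same recipe with the stacked scores of the three systems as a multi-dimensional input; the k-fold variant trains the calibration on held-out folds so calibrated scores are never produced by a model that saw them.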
- Apply the best classifier to the evaluation dataset.
- Evaluate the performance of the best classifier on the evaluation dataset using Bayes error plots (actual and minimum DCF vs. effective prior).
ML-Project/
│
├── Project/
│ ├── trainData.txt # Training dataset
│ ├── project.py # Main project code
│ ├── models/ # Saved models
│ ├── plots/ # Saved plots
│
├── README.md # Project README file
├── Report.tex # LaTeX report
- Python 3.x
- NumPy
- SciPy
- Matplotlib
- LaTeX (for compiling the report)
- Clone the repository:
  git clone https://github.com/yourusername/ML-Project.git
  cd ML-Project
- Run the project code:
  python Project/project.py
- Compile the LaTeX report:
  pdflatex Report.tex
Contributions are welcome! Please fork the repository and create a pull request with your changes.
This project is licensed under the MIT License. See the LICENSE file for more details.