This repository contains a collection of Jupyter notebooks covering machine learning algorithms and techniques, with examples and practical implementations. The notebooks are exercises from the Artificial Intelligence 2024 course at the UIB (Universitat de les Illes Balears), taught by Miquel Miró Nicolau, Gabriel Moyà Alcover, and Dr. Javier Varona Gómez of the XAI (Explainable Artificial Intelligence) research group.
1_ML_i_Perceptró.ipynb: Introduction to Machine Learning concepts and Perceptron implementation with practical activities.
2_Regr_Pràctica.ipynb and 2_Regressió_i_correlació.ipynb: Regression and correlation notebooks covering data exploration, correlation matrices, and model implementation, plus a practice notebook with exercises.
3_Regressió_Logística_i_K-Fold.ipynb and 3_RegrLog_Pràctica.ipynb: Logistic regression with K-Fold cross-validation, plus practical exercises covering model training, accuracy-based evaluation, and confusion matrix visualization.
4_SVM.ipynb and 4_SVM_Pràctica.ipynb: Support Vector Machine implementation with both linear and non-linear kernels, visualization of decision boundaries, and hyperparameter tuning using cross-validation. Includes practical exercises comparing SVM with other classification models (Perceptron, Logistic Regression).
5_Neteja_de_dades_i_DT.ipynb: Data cleaning techniques, including handling missing values, categorical data encoding, feature scaling, and noise reduction, together with decision trees (DT).
ML_assignment.ipynb: Final course assignment applying multiple machine learning algorithms to the forest cover type dataset. The notebook includes data preprocessing (resampling for class balance and PCA for dimensionality reduction), hyperparameter optimization for various models (Perceptron, Logistic Regression, SVM, Decision Tree, Random Forest), and comprehensive model evaluation using confusion matrices and classification metrics.
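As a rough illustration of the assignment's structure, the sketch below chains feature scaling, PCA, and a classifier into a single cross-validated pipeline. It uses scikit-learn's `load_wine` as a small stand-in for the forest cover type data; none of this code is taken from the notebooks.

```python
# Hypothetical sketch of the assignment workflow: scaling, PCA, and a classifier
# evaluated with cross-validation, on a small stand-in dataset.
from sklearn.datasets import load_wine
from sklearn.decomposition import PCA
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_wine(return_X_y=True)

pipeline = Pipeline([
    ("scaler", StandardScaler()),            # feature scaling
    ("pca", PCA(n_components=5)),            # dimensionality reduction
    ("clf", LogisticRegression(max_iter=1000)),
])

scores = cross_val_score(pipeline, X, y, cv=5)
print("Mean CV accuracy:", scores.mean())
```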
- Perceptron: Simple neural network implementation for linear classification problems
- Logistic Regression: Probabilistic classification for binary and multi-class problems
- Support Vector Machines: Classification with both linear and non-linear kernels for optimal decision boundaries
- Decision Trees: Tree-based classification with conditional branching
- Random Forest: Ensemble method combining multiple decision trees for improved performance
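The snippet below is an illustrative comparison of these five models on a synthetic dataset; the dataset and hyperparameters are arbitrary and not taken from the notebooks.

```python
# Illustrative comparison of the classifiers listed above on a toy dataset.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression, Perceptron
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

models = {
    "Perceptron": Perceptron(),
    "Logistic Regression": LogisticRegression(max_iter=1000),
    "SVM (RBF kernel)": SVC(kernel="rbf"),
    "Decision Tree": DecisionTreeClassifier(),
    "Random Forest": RandomForestClassifier(n_estimators=100),
}

for name, model in models.items():
    model.fit(X_train, y_train)
    print(f"{name}: test accuracy = {model.score(X_test, y_test):.3f}")
```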
K-Fold cross-validation is implemented in several notebooks to evaluate model performance. The technique divides the dataset into k subsets (folds) and, in turn, uses each fold as the test set while training on the remaining data.
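A minimal sketch of this procedure with scikit-learn's `KFold` (illustrative; the notebooks may implement it differently):

```python
# Each fold serves once as the test set; the remaining folds are used for training.
import numpy as np
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import KFold

X, y = load_iris(return_X_y=True)
kf = KFold(n_splits=5, shuffle=True, random_state=0)

scores = []
for train_idx, test_idx in kf.split(X):
    model = LogisticRegression(max_iter=1000)
    model.fit(X[train_idx], y[train_idx])
    scores.append(model.score(X[test_idx], y[test_idx]))

print("Fold accuracies:", np.round(scores, 3))
print("Mean accuracy:", np.mean(scores))
```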
- Accuracy scoring
- Confusion matrices
- Classification reports
- Precision, recall, and F1-score
- ROC curves and AUC analysis
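The sketch below shows how these metrics can be computed with scikit-learn, assuming a fitted binary classifier; the dataset and model are placeholders, not taken from the notebooks.

```python
# Accuracy, confusion matrix, classification report, and ROC AUC for a binary classifier.
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import (accuracy_score, classification_report,
                             confusion_matrix, roc_auc_score)
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

clf = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
clf.fit(X_train, y_train)
y_pred = clf.predict(X_test)
y_prob = clf.predict_proba(X_test)[:, 1]

print("Accuracy:", accuracy_score(y_test, y_pred))
print("Confusion matrix:\n", confusion_matrix(y_test, y_pred))
print(classification_report(y_test, y_pred))   # precision, recall, F1-score
print("ROC AUC:", roc_auc_score(y_test, y_prob))
```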
- Grid search for hyperparameter tuning
- Custom product dictionary for parameter combinations
- Cross-validation based optimization
- Regularization parameter selection
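As an illustration, the sketch below tunes an SVM with `GridSearchCV` and also enumerates the same grid manually with `itertools.product`, in the spirit of the "custom product dictionary" approach; the actual notebook code may differ.

```python
# Grid search via GridSearchCV, plus a manual grid enumerated with itertools.product.
from itertools import product

from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV, cross_val_score
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)

# Built-in grid search with cross-validation
param_grid = {"C": [0.1, 1, 10], "kernel": ["linear", "rbf"]}
search = GridSearchCV(SVC(), param_grid, cv=5)
search.fit(X, y)
print("Best params:", search.best_params_)

# Manual alternative: enumerate every parameter combination
best_params, best_score = None, 0.0
for C, kernel in product(param_grid["C"], param_grid["kernel"]):
    score = cross_val_score(SVC(C=C, kernel=kernel), X, y, cv=5).mean()
    if score > best_score:
        best_params, best_score = {"C": C, "kernel": kernel}, score
print("Best manual params:", best_params, "accuracy:", round(best_score, 3))
```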
- Handling missing values
- Categorical data encoding
- Feature scaling and normalization
- Dimensionality reduction using PCA
- Class balancing and resampling techniques
- Noise reduction methods
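A compact sketch of several of these steps (imputation of missing values, one-hot encoding, scaling, and PCA) on a made-up DataFrame; column names and values are invented for illustration.

```python
# Preprocessing sketch: impute, encode, scale, and reduce dimensionality.
import numpy as np
import pandas as pd
from sklearn.decomposition import PCA
from sklearn.impute import SimpleImputer
from sklearn.preprocessing import OneHotEncoder, StandardScaler

df = pd.DataFrame({
    "age":    [25, np.nan, 40, 33],
    "income": [30000, 45000, np.nan, 52000],
    "city":   ["Palma", "Inca", "Palma", "Manacor"],
})

# Handle missing values in the numeric columns
num = SimpleImputer(strategy="mean").fit_transform(df[["age", "income"]])
# Encode the categorical column
cat = OneHotEncoder().fit_transform(df[["city"]]).toarray()
# Scale the combined feature matrix
X = StandardScaler().fit_transform(np.hstack([num, cat]))
# Reduce dimensionality with PCA
X_pca = PCA(n_components=2).fit_transform(X)
print(X_pca.shape)
```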
- Decision boundary visualization
- Feature correlation heatmaps
- Model performance comparison plots
- Learning curves
- Hyperparameter effect visualization
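For example, a decision boundary can be visualized by evaluating a classifier on a dense grid over a 2-D feature space, as in the following sketch (illustrative only, not the notebooks' plotting code):

```python
# Decision boundary plot for an SVM on a 2-D toy dataset.
import matplotlib.pyplot as plt
import numpy as np
from sklearn.datasets import make_moons
from sklearn.svm import SVC

X, y = make_moons(n_samples=200, noise=0.2, random_state=0)
clf = SVC(kernel="rbf").fit(X, y)

# Evaluate the classifier on a grid covering the feature space
xx, yy = np.meshgrid(np.linspace(X[:, 0].min() - 1, X[:, 0].max() + 1, 200),
                     np.linspace(X[:, 1].min() - 1, X[:, 1].max() + 1, 200))
Z = clf.predict(np.c_[xx.ravel(), yy.ravel()]).reshape(xx.shape)

plt.contourf(xx, yy, Z, alpha=0.3)
plt.scatter(X[:, 0], X[:, 1], c=y, edgecolors="k")
plt.title("SVM decision boundary (RBF kernel)")
plt.show()
```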
The notebooks contain comments and explanations in Catalan.
