Machine Learning Recommendation Machine

A hybrid music recommendation engine that suggests songs by combining two ML techniques: collaborative filtering (learning from listening patterns) and content-based filtering (analyzing what makes songs sound similar).

Collaborative filtering uses SVD matrix factorization trained with stochastic gradient descent on Last.fm listening data to discover hidden patterns in user preferences. Content-based filtering uses Spotify audio features (energy, tempo, danceability, etc.) with K-Nearest Neighbors to find songs that sound similar to what a user already likes.

Requirements

Python >= 3.11
uv package manager
Spotify API credentials (client ID and secret from Spotify Developer Dashboard)

Setup

# Create virtual environment
python3.12 -m venv .venv

# Activate virtual environment
source .venv/bin/activate

# Install dependencies
uv sync

Add your Spotify API credentials to .env. See .env.example for the expected keys.
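As a rough illustration of how the credentials could be read at runtime, here is a minimal sketch using only the standard library. The key names SPOTIFY_CLIENT_ID and SPOTIFY_CLIENT_SECRET and both helper functions are hypothetical, chosen for this example; the real key names are the ones listed in .env.example, and the project may load them differently.

```python
import os
from pathlib import Path


def load_env_file(path: str = ".env") -> dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values: dict[str, str] = {}
    env_path = Path(path)
    if not env_path.exists():
        return values
    for raw_line in env_path.read_text().splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def get_spotify_credentials(path: str = ".env") -> tuple[str, str]:
    """Return (client_id, client_secret); real environment variables win over the file."""
    env = {**load_env_file(path), **os.environ}
    # Hypothetical key names: check .env.example for the ones the project uses.
    client_id = env.get("SPOTIFY_CLIENT_ID")
    client_secret = env.get("SPOTIFY_CLIENT_SECRET")
    if not client_id or not client_secret:
        raise RuntimeError("Spotify credentials are missing from .env")
    return client_id, client_secret
```

Failing early with a clear error keeps a missing credential from surfacing later as an opaque authentication failure.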

Commands

1. Parse raw Last.fm data

python scripts/parse_csv_user_song.py

python scripts/parse_csv_unique_song.py

These produce:

user_song_interaction.csv
unique_song_interaction.csv

2. Match audio features

python scripts/match_audio_features.py

Matches our Last.fm songs against a Kaggle dataset (data/raw/song_audio_features.csv) containing pre-collected Spotify audio features (energy, tempo, danceability, etc.). Matches by artist + track name.
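Matching on artist + track name usually needs some normalization, because the two sources rarely agree on case, punctuation, and spacing. The sketch below shows one way to do it; the field names ("artist", "track", "energy"), the normalization rules, and the toy values are assumptions for illustration and are not taken from scripts/match_audio_features.py.

```python
import re


def normalize(text: str) -> str:
    """Lowercase, drop punctuation, and collapse runs of whitespace."""
    text = re.sub(r"[^\w\s]", "", text.lower())
    return re.sub(r"\s+", " ", text).strip()


def match_audio_features(songs: list[dict], features: list[dict]) -> list[dict]:
    """Attach audio features to each song whose (artist, track) key is found."""
    # Index the feature rows by their normalized (artist, track) pair.
    index = {(normalize(f["artist"]), normalize(f["track"])): f for f in features}
    matched = []
    for song in songs:
        key = (normalize(song["artist"]), normalize(song["track"]))
        row = index.get(key)
        if row is None:
            continue  # no audio features available for this song
        extra = {k: v for k, v in row.items() if k not in ("artist", "track")}
        matched.append({**song, **extra})
    return matched


# Toy rows, invented for the example.
songs = [
    {"artist": "Daft Punk", "track": "One More Time"},
    {"artist": "Unknown Artist", "track": "Missing Song"},
]
features = [{"artist": "daft punk ", "track": "One More Time!", "energy": 0.7}]
matched = match_audio_features(songs, features)
```

Songs without a match are dropped here, so the output can be smaller than the input.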

As of Feb 2026, Spotify's audio features endpoint is deprecated, so this step relies on the pre-collected dataset instead of the live API.

Produces: audio_features.csv

3. Train collaborative filtering model (SVD)

python model/collaborative_filtering.py

Trains SVD matrix factorization with stochastic gradient descent on user-song interactions. Splits data 75/12.5/12.5 train/val/test and prints mean squared error each epoch. Saves the trained model to data/models/svd_model.npz for use by the hybrid recommender.
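The training loop described above can be sketched in plain Python as follows. This is a generic biased matrix factorization (prediction = global mean + user bias + item bias + dot product of latent factors), not the code in model/collaborative_filtering.py. The hyperparameters, function names, and the (user, item, rating) tuple format are assumptions for illustration.

```python
import random


def split_interactions(rows, seed=0):
    """Shuffle and split 75 / 12.5 / 12.5 into train, validation, and test."""
    rows = list(rows)
    random.Random(seed).shuffle(rows)
    n_train = int(0.75 * len(rows))
    n_val = int(0.125 * len(rows))
    return rows[:n_train], rows[n_train:n_train + n_val], rows[n_train + n_val:]


def predict(model, user, item):
    mu, user_bias, item_bias, P, Q = model
    dot = sum(p * q for p, q in zip(P[user], Q[item]))
    return mu + user_bias[user] + item_bias[item] + dot


def mse(model, rows):
    return sum((r - predict(model, u, i)) ** 2 for u, i, r in rows) / len(rows)


def train_svd(train, val, n_users, n_items, k=8, lr=0.01, reg=0.05, epochs=30, seed=0):
    """Fit biases and latent factors with SGD; returns (model, per-epoch MSE history)."""
    rng = random.Random(seed)
    train = list(train)
    mu = sum(r for _, _, r in train) / len(train)
    user_bias = [0.0] * n_users
    item_bias = [0.0] * n_items
    P = [[rng.gauss(0, 0.1) for _ in range(k)] for _ in range(n_users)]
    Q = [[rng.gauss(0, 0.1) for _ in range(k)] for _ in range(n_items)]
    model = (mu, user_bias, item_bias, P, Q)  # lists are updated in place
    history = []
    for epoch in range(epochs):
        rng.shuffle(train)
        for u, i, r in train:
            err = r - predict(model, u, i)
            # Gradient step with L2 regularization on every parameter.
            user_bias[u] += lr * (err - reg * user_bias[u])
            item_bias[i] += lr * (err - reg * item_bias[i])
            for f in range(k):
                p_uf, q_if = P[u][f], Q[i][f]
                P[u][f] += lr * (err * q_if - reg * p_uf)
                Q[i][f] += lr * (err * p_uf - reg * q_if)
        train_mse, val_mse = mse(model, train), mse(model, val)
        history.append((train_mse, val_mse))
        print(f"epoch {epoch + 1}: train MSE {train_mse:.4f}, val MSE {val_mse:.4f}")
    return model, history
```

Comparing train and validation MSE per epoch shows when the model starts to overfit. The held-out test split is meant to be scored only once, after training is finished.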

4. Run content-based filtering model (KNN)

python model/content_based_filtering.py

Loads audio features, fits KNN with cosine similarity, and generates 10 recommendations per user based on their listening history.
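To make the idea concrete, here is a brute-force sketch of cosine-similarity nearest neighbors in plain Python. It assumes the user's taste is summarized as the mean feature vector of the songs they have listened to, which is one common choice and may not be what model/content_based_filtering.py does. The function names and toy vectors are hypothetical.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def recommend(features: dict[str, list[float]], history: list[str], n: int = 10) -> list[str]:
    """Return the n unheard songs closest to the user's mean feature vector."""
    liked = [features[s] for s in history if s in features]
    if not liked:
        return []
    dim = len(liked[0])
    # Assumed user profile: average of the listened songs' feature vectors.
    profile = [sum(vec[d] for vec in liked) / len(liked) for d in range(dim)]
    heard = set(history)
    scored = [
        (cosine_similarity(profile, vec), song)
        for song, vec in features.items()
        if song not in heard
    ]
    scored.sort(reverse=True)
    return [song for _, song in scored[:n]]


# Toy two-dimensional feature vectors, invented for the example.
toy_features = {"a": [1.0, 0.0], "b": [0.9, 0.1], "c": [0.0, 1.0], "d": [0.5, 0.5]}
top_two = recommend(toy_features, history=["a"], n=2)
```

Because cosine similarity ignores vector length, features on very different scales (tempo in BPM versus energy in 0 to 1) are normally scaled before fitting.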

5. Run the app (hybrid recommender)

python app/app.py

Launches a Gradio web UI at http://127.0.0.1:7860 (it can also be deployed publicly). Search for songs on Spotify, select up to 5, and get recommendations from a two-stage hybrid pipeline:

KNN candidate generation — finds ~100 sonically similar songs from the dataset
SVD ranking — re-ranks candidates using collaborative filtering item biases (learned song popularity)
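The two stages above can be sketched as a candidate-generation step followed by a re-ranking step. In this minimal version the selected songs are assumed to already have feature vectors, the query is their mean, and songs missing from the SVD model get an item bias of 0.0. These are all assumptions for illustration, as are the function and parameter names; app/app.py may differ.

```python
import math


def cosine(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def hybrid_recommend(
    seed_vectors: list[list[float]],
    catalog: dict[str, list[float]],
    item_bias: dict[str, float],
    n_candidates: int = 100,
    n_results: int = 10,
) -> list[str]:
    # Stage 1: pick the catalog songs most similar to the mean of the seeds.
    dim = len(seed_vectors[0])
    query = [sum(v[d] for v in seed_vectors) / len(seed_vectors) for d in range(dim)]
    by_similarity = sorted(catalog, key=lambda s: cosine(query, catalog[s]), reverse=True)
    candidates = by_similarity[:n_candidates]
    # Stage 2: re-rank only those candidates by learned item bias.
    ranked = sorted(candidates, key=lambda s: item_bias.get(s, 0.0), reverse=True)
    return ranked[:n_results]


# Toy catalog and biases, invented for the example.
catalog = {"a": [1.0, 0.0], "b": [0.9, 0.1], "c": [0.0, 1.0]}
biases = {"a": 0.1, "b": 0.5, "c": 0.9}
picks = hybrid_recommend([[1.0, 0.0]], catalog, biases, n_candidates=2, n_results=2)
```

In the toy call, song "c" has the highest bias but is never returned because it does not survive the similarity stage. That filtering is the reason for generating candidates first.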

6. Run evaluation and generate plots

python model/evaluation.py

Evaluates all three models (SVD, KNN, Hybrid) using ranking metrics: Precision@k, Recall@k, and NDCG@k. Writes the following plots to plots/:

SVD training curves (train/val MSE per epoch)
Audio feature distributions and correlation heatmap
t-SNE embedding of the song feature space
Model comparison bar chart
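For reference, the three ranking metrics named above can be computed as in the sketch below, using binary relevance (a recommended song either is or is not in the user's held-out set). These are the standard textbook definitions, and model/evaluation.py may differ in details such as how ties or empty relevance sets are handled.

```python
import math


def precision_at_k(recommended: list[str], relevant: set[str], k: int) -> float:
    """Fraction of the top-k recommendations that are relevant."""
    if k <= 0:
        raise ValueError("k must be positive")
    hits = sum(1 for song in recommended[:k] if song in relevant)
    return hits / k


def recall_at_k(recommended: list[str], relevant: set[str], k: int) -> float:
    """Fraction of all relevant songs that appear in the top-k."""
    if not relevant:
        return 0.0
    hits = sum(1 for song in recommended[:k] if song in relevant)
    return hits / len(relevant)


def ndcg_at_k(recommended: list[str], relevant: set[str], k: int) -> float:
    """DCG of the top-k divided by the DCG of an ideal ordering."""
    dcg = sum(
        1.0 / math.log2(rank + 2)
        for rank, song in enumerate(recommended[:k])
        if song in relevant
    )
    ideal = sum(1.0 / math.log2(rank + 2) for rank in range(min(k, len(relevant))))
    return dcg / ideal if ideal else 0.0


# Toy ranking, invented for the example.
recommended = ["a", "b", "c", "d"]
relevant = {"a", "c", "e"}
p = precision_at_k(recommended, relevant, k=4)
r = recall_at_k(recommended, relevant, k=4)
g = ndcg_at_k(recommended, relevant, k=4)
```

Precision and recall ignore position within the top k. NDCG rewards placing relevant songs nearer the top, so the two views can rank models differently.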

A Jupyter notebook with the same analysis is also available at notebooks/evaluation.ipynb.
