📈 Multiple Linear Regression

Predict startup profitability using Multiple Linear Regression with Backward Elimination — implemented in both Python and R.

What It Does

Models how R&D Spend, Administration, Marketing Spend, and State influence a startup's Profit using the 50 Startups dataset.

Loads and preprocesses data (one-hot encodes the categorical State column)
Splits into 80/20 train/test sets
Fits a Multiple Linear Regression model and reports R² / RMSE
Performs Backward Elimination (statsmodels OLS) to find the optimal predictor subset

Dataset

50_Startups.csv — 50 records with columns:

Column	Description
R&D Spend	Research & development expenditure
Administration	Administrative costs
Marketing Spend	Marketing expenditure
State	New York, California, or Florida
Profit	Target variable

🛠 Tech Stack

	Language	Libraries
🐍	Python 3.10+	`numpy` · `pandas` · `matplotlib` · `scikit-learn` · `statsmodels`
📊	R	`caTools`

Getting Started

Python

pip install numpy pandas matplotlib scikit-learn statsmodels
python multiple_linear_regression.py

R

install.packages("caTools")   # first time only
source("multiple_linear_regression.R")

Both scripts expect 50_Startups.csv in the same directory.

⚠️ Known Issues

No cross-validation or hyperparameter tuning (simple demonstration project).
The R script encodes State as numeric factor levels (1, 2, 3), which may imply ordinality — acceptable for lm() with factor types but worth noting.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
50_Startups.csv		50_Startups.csv
LICENSE		LICENSE
README.md		README.md
multiple_linear_regression.R		multiple_linear_regression.R
multiple_linear_regression.py		multiple_linear_regression.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📈 Multiple Linear Regression

What It Does

Dataset

🛠 Tech Stack

Getting Started

Python

R

⚠️ Known Issues

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

📈 Multiple Linear Regression

What It Does

Dataset

🛠 Tech Stack

Getting Started

Python

R

⚠️ Known Issues

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages