Insurance Premium Prediction

📊 Machine Learning Regression Project – Kaggle Submission

This project builds a regression model to predict insurance premium amounts using a real-world dataset of 2 million entries. The goal was to optimize model performance while maintaining interpretability and memory efficiency under resource constraints.

🧠 Project Highlights

Trained a Random Forest model on 2M records with custom preprocessing and feature engineering
Applied tailored imputation strategies for missing values and encoded high-cardinality categorical features
Achieved ~1.15 RMSE on Kaggle's private leaderboard
Visualized distribution patterns and evaluated prediction errors to understand model limitations

📈 Tools & Libraries

Python, Pandas, NumPy, Scikit-learn, Matplotlib, Seaborn
Jupyter Notebook (exploratory analysis and training)

📊 Key Metrics

📉 RMSE: ~1.15 (Kaggle)
📉 MAE: ~637
📈 Records Used: 2,000,000+

📂 Dataset

Due to file size constraints, the dataset is not included in this repository.

Please download it directly from the Kaggle competition page:
🔗 Playground Series - Season 4, Episode 12

Place the downloaded files (e.g., train.csv, test.csv) in the same directory as the notebook before running.

🚀 How to Run

Clone the repo
Download the dataset and place it in the same directory as the notebook
Make sure you have Python 3 installed and the following libraries:

pandas
numpy
scikit-learn
matplotlib
seaborn

Launch the notebook:

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Insurance_Premium_Prediction_Regression_Model.ipynb		Insurance_Premium_Prediction_Regression_Model.ipynb
Insurence_Premium_Prediction_Presentation.pptx		Insurence_Premium_Prediction_Presentation.pptx
Prediction_Presentation.pdf		Prediction_Presentation.pdf
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Insurance Premium Prediction

🧠 Project Highlights

📈 Tools & Libraries

📊 Key Metrics

📂 Dataset

🚀 How to Run

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Insurance Premium Prediction

🧠 Project Highlights

📈 Tools & Libraries

📊 Key Metrics

📂 Dataset

🚀 How to Run

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages