Insurance Charge Prediction using Linear Regression

This project focuses on predicting medical insurance charges using a Linear Regression machine learning model. The model estimates insurance costs based on personal and demographic attributes.

Project Overview

Medical insurance costs depend on multiple factors such as age, BMI, smoking habits, and region. The objective of this project is to build a regression model that accurately predicts insurance charges using these features.

This is a supervised learning regression problem where the target variable (charges) is continuous.

Dataset Information

The dataset used in this project is insurance.csv, which contains the following features:

age : Age of the policyholder
sex : Gender (male/female)
bmi : Body Mass Index
children : Number of dependents
smoker : Smoking status (yes/no)
region : Residential region (northeast, northwest, southeast, southwest)
charges : Medical insurance cost (Target Variable)
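As a minimal sketch of the loading step, the snippet below reads a CSV with pandas and checks that every column listed above is present. The helper name load_insurance and the two inline rows are assumptions made for illustration; the rows are invented and are not taken from insurance.csv.

```python
import io

import pandas as pd

# Illustrative rows only: these values are made up to mirror the column
# layout described above and are not taken from insurance.csv.
SAMPLE_CSV = """age,sex,bmi,children,smoker,region,charges
30,male,25.3,1,no,southeast,4000.0
45,female,31.2,2,yes,northwest,30000.0
"""

EXPECTED_COLUMNS = ["age", "sex", "bmi", "children", "smoker", "region", "charges"]


def load_insurance(source):
    """Read the CSV and fail early if a documented column is missing."""
    df = pd.read_csv(source)
    missing = [c for c in EXPECTED_COLUMNS if c not in df.columns]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    return df[EXPECTED_COLUMNS]


# In the project this would presumably be load_insurance("insurance.csv").
df = load_insurance(io.StringIO(SAMPLE_CSV))
print(df.dtypes)
```

Checking the schema up front means a renamed or missing column surfaces as a clear error before any preprocessing runs.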

Technologies Used

Python
Pandas
NumPy
Scikit-learn
Matplotlib
Seaborn
Jupyter Notebook

Project Workflow

Data Loading
Data Exploration and Analysis
Data Preprocessing
- Encoding categorical variables
- Feature selection
Train-test split
Model training using Linear Regression
Model evaluation
Prediction of insurance charges
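The steps above can be sketched roughly as follows. This is not the notebook's code: the data is synthetic, the formula that generates charges is invented so the example has something to fit, and the choices of one-hot encoding with drop_first=True, an 80/20 split, and random_state=42 are assumptions.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

# Synthetic stand-in for insurance.csv; the charge formula below is invented
# purely so the example has something to fit.
rng = np.random.default_rng(0)
n = 200
df = pd.DataFrame({
    "age": rng.integers(18, 65, n),
    "sex": rng.choice(["male", "female"], n),
    "bmi": rng.uniform(18.0, 40.0, n),
    "children": rng.integers(0, 4, n),
    "smoker": rng.choice(["yes", "no"], n),
    "region": rng.choice(["northeast", "northwest", "southeast", "southwest"], n),
})
df["charges"] = (
    250.0 * df["age"]
    + 300.0 * df["bmi"]
    + 20000.0 * (df["smoker"] == "yes")
    + rng.normal(0.0, 1000.0, n)
)

# Encode categorical variables; drop_first avoids redundant dummy columns.
X = pd.get_dummies(df.drop(columns="charges"), drop_first=True, dtype=float)
y = df["charges"]

# Hold out 20% of the rows for evaluation.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

# Fit ordinary least squares and score it on the held-out rows.
model = LinearRegression().fit(X_train, y_train)
print("R² on held-out data:", model.score(X_test, y_test))
```

Evaluating on rows the model never saw during fitting gives a fairer estimate of how it would behave on new policyholders than the training score does.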

Model Evaluation Metrics

The model performance was evaluated using:

R² Score
Mean Absolute Error (MAE)
Mean Squared Error (MSE)

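Together these metrics describe how closely the predictions match the actual charges. R² is the share of variance in charges that the model explains (1.0 is a perfect fit). MAE is the average absolute error, expressed in the same units as charges. MSE averages the squared errors, so it penalizes large misses more heavily than MAE.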
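To make the three metrics concrete, here is a small sketch that computes them from their standard definitions in plain Python. The function name regression_metrics and the four sample values are hypothetical and are not results from this project; scikit-learn offers the same calculations as r2_score, mean_absolute_error, and mean_squared_error.

```python
def regression_metrics(y_true, y_pred):
    """Compute MAE, MSE, and R² from paired actual and predicted values."""
    n = len(y_true)
    mean_true = sum(y_true) / n
    abs_errors = [abs(t - p) for t, p in zip(y_true, y_pred)]
    sq_errors = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    # Total sum of squares: spread of the actual values around their mean.
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return {
        "mae": sum(abs_errors) / n,
        "mse": sum(sq_errors) / n,
        "r2": 1 - sum(sq_errors) / ss_tot,
    }


# Made-up values for illustration only.
y_true = [1000.0, 2000.0, 3000.0, 4000.0]
y_pred = [1100.0, 1900.0, 3200.0, 3800.0]

metrics = regression_metrics(y_true, y_pred)
print(metrics)
```

With these values the errors are 100, 100, 200, and 200, which gives an MAE of 150 and an MSE of 25000. The MSE is dominated by the two larger misses.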

How to Run the Project

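Clone the repository:

git clone https://github.com/Divyansh1802/Insurance-Charge-Predictor.git

Navigate to the project folder:

cd Insurance-Charge-Predictor

Install required dependencies:

pip install -r requirements.txt

If the repository does not include a requirements.txt file, install the libraries listed under Technologies Used directly:

pip install pandas numpy scikit-learn matplotlib seaborn notebook

Open the Jupyter Notebook (Predict_Insurance_Charge.ipynb) and run all cells.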

Example Prediction

Input:

Age: 30
Sex: Male
BMI: 25.3
Children: 1
Smoker: No
Region: Southeast

Output:

Predicted Insurance Charges: (Model Generated Value)
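A single input like the one above has to be turned into the same feature columns the model was trained on. The sketch below shows one way to do that with pandas. The FEATURE_COLUMNS list and the encode_input helper are assumptions for illustration; the notebook's actual column names and order may differ.

```python
import pandas as pd

# Assumed training-time feature order after one-hot encoding with
# drop_first=True; the notebook's actual columns may differ.
FEATURE_COLUMNS = [
    "age", "bmi", "children", "sex_male", "smoker_yes",
    "region_northwest", "region_southeast", "region_southwest",
]


def encode_input(raw):
    """Turn one raw record into a single-row frame matching FEATURE_COLUMNS."""
    # Lowercase text values so "Male" matches the dataset's "male".
    cleaned = {k: v.lower() if isinstance(v, str) else v for k, v in raw.items()}
    encoded = pd.get_dummies(pd.DataFrame([cleaned]), dtype=float)
    # A single row only produces dummies for the categories it contains, so
    # reindex adds the absent columns as 0 and drops any unused ones.
    return encoded.reindex(columns=FEATURE_COLUMNS, fill_value=0.0)


example = {
    "age": 30, "sex": "Male", "bmi": 25.3,
    "children": 1, "smoker": "No", "region": "Southeast",
}
features = encode_input(example)
print(features.iloc[0].to_dict())
# A fitted model would then be called as model.predict(features).
```

Reindexing against a fixed column list keeps the order and number of features identical to training, which scikit-learn expects at prediction time.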

Future Improvements

Apply advanced regression models (Random Forest, Gradient Boosting, XGBoost)
Perform hyperparameter tuning
Deploy the model using Flask or Streamlit
Build a user-friendly interface for real-time predictions

Author

Divyansh Upadhyay
