Amazon

Online shoppers often rely on ratings, reviews, and discounts to make decisions — but how reliable are these signals? In this project, I analyse an Amazon products dataset to explore:

How review volume affects rating reliability
Whether heavy discounts imply poor product quality
The relationship between price, discount, and customer satisfaction
Category-level pricing and rating behaviour

The project emphasises business-driven EDA, not prediction.

🎯 Objectives

Perform structured exploratory data analysis using Python
Identify counterintuitive patterns in customer behaviour
Visualise relationships between price, discount, rating, and review volume
Translate insights into business-relevant conclusions

🧾 Dataset Information

Source: Kaggle

Format: .xlsx

Each row represents a product review/listing

Key columns used:

product_name category actual_price discounted_price discount_percentage rating rating_count

🛠️ Tools & Libraries Used

Python Pandas (data manipulation & aggregation) Matplotlib & Seaborn (visualisation)

📊 Key Analyses & Visualizations

🔹 1. Rating Reliability vs Review Volume 🔹 2. Ratings of Highly Discounted Products (>50%) 🔹 3. Price, Rating & Discount Interaction 🔹 4. Category-Level Insights 🔹 5. Review Behaviour Analysis

🧠 Key Insights

1. ⭐ Ratings with low review counts are highly volatile

2. 📦 Customer trust increases with review volume

3. 💸 High discounts are often promotional, not quality-driven

4. 🏷️ Higher price does not always translate to higher ratings

📁 Project Structure

Amazon/ │ ├── data/ │ └── amazon_data.xlsx │ ├── notebooks/ │ └── Amazon.ipynb │ ├── visuals/ │ └── Amazon Visuals.ipynb │ ├── README.md

📊 EDA is crucial to assess data reliability, not just averages

📁 Project Structure

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Day-1. Amazon		Day-1. Amazon
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Amazon

🎯 Objectives

🧾 Dataset Information

Source: Kaggle

Format: .xlsx

Each row represents a product review/listing

Key columns used:

🛠️ Tools & Libraries Used

📊 Key Analyses & Visualizations

🧠 Key Insights

1. ⭐ Ratings with low review counts are highly volatile

2. 📦 Customer trust increases with review volume

3. 💸 High discounts are often promotional, not quality-driven

4. 🏷️ Higher price does not always translate to higher ratings

📁 Project Structure

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Amazon

🎯 Objectives

🧾 Dataset Information

Source: Kaggle

Format: .xlsx

Each row represents a product review/listing

Key columns used:

🛠️ Tools & Libraries Used

📊 Key Analyses & Visualizations

🧠 Key Insights

1. ⭐ Ratings with low review counts are highly volatile

2. 📦 Customer trust increases with review volume

3. 💸 High discounts are often promotional, not quality-driven

4. 🏷️ Higher price does not always translate to higher ratings

📁 Project Structure

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages