Skip to content

Mlungis/HR_Analytics_Dashboard_Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

3 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸš€ HR Analytics Dashboard Project

From Messy Data β†’ Clean Dataset β†’ Interactive Power BI Dashboard


πŸ“Œ Overview

This project demonstrates a complete end-to-end data analytics workflow, transforming raw, unstructured employee data into a clean, analysis-ready dataset and building an interactive HR dashboard.

It showcases skills in:

  • Data cleaning & preprocessing
  • Data transformation using Python
  • Data modeling & visualization
  • Business insight generation

🎯 Project Objective

To simulate a real-world business scenario where messy HR data is cleaned and transformed into meaningful insights that support decision-making.


πŸ“‚ Dataset

  • Source: messy_employee_data.xlsx
  • Sheet Used: Employee_Data_RAW

The dataset initially contained:

  • Inconsistent column names
  • Missing values
  • Duplicate records
  • Incorrect data formats
  • Invalid entries (emails, phone numbers, etc.)

🧹 Data Cleaning Process

1. Column Standardization

  • Removed extra spaces
  • Converted all column names to lowercase
  • Replaced spaces with underscores
  • Renamed inconsistent columns

2. Removing Invalid Data

  • Dropped completely empty rows
  • Removed rows containing placeholder values (---)
  • Removed duplicate employee records

3. Data Type & Format Cleaning

πŸ‘€ Employee ID

  • Removed prefixes (EMP-)
  • Converted to consistent format

πŸ§‘ Names

  • Trimmed spaces
  • Standardized capitalization

πŸ“§ Email

  • Converted to lowercase
  • Removed invalid entries

πŸ“ž Phone Numbers

  • Extracted digits only
  • Standardized format: XXX-XXX-XXXX

4. Data Normalization

πŸ’° Salary

  • Removed symbols ($, ,)
  • Converted to numeric format

πŸ“… Hire Date

  • Converted to standard date format (YYYY-MM-DD)

🏒 Department

  • Standardized naming (title case)

🌍 Location

  • Cleaned city names (e.g., "nyc" β†’ "New York")
  • Standardized country names (e.g., "us", "usa" β†’ "USA")

5. Data Validation Rules

πŸ“Š Performance Score

  • Ensured values between 0–100

🎁 Bonus Percentage

  • Handled missing and invalid values
  • Defaulted to 0 where necessary

πŸŽ‚ Age

  • Valid range enforced (18–75)

⚧ Gender

  • Standardized categories:

    • Male
    • Female
    • Non-Binary
    • Other
    • Not Specified

πŸ“Œ Status

  • Cleaned into:

    • Active
    • Inactive
    • On Leave
    • Terminated
    • Unknown

πŸ’Ύ Output

  • Cleaned dataset saved as: cleaned_employee_data.xlsx

πŸ“Š Dashboard Features

The cleaned dataset was used to build an interactive HR Analytics Dashboard with the following insights:

πŸ” Key Metrics (KPIs)

  • Total Employees
  • Average Salary
  • Average Performance Score
  • Average Bonus Percentage

πŸ“ˆ Visual Insights

  • Hiring trends over time
  • Department-wise employee distribution
  • Salary vs performance analysis
  • Top job roles by employee count
  • Gender distribution
  • Geographic employee distribution

πŸŽ›οΈ Interactivity

  • Filters (Department, Gender, Country, Status)
  • Drill-down capabilities
  • Dynamic visuals responding to user input

🧠 Key Insights Derived

  • Identification of top-performing departments
  • Salary distribution patterns across roles
  • Workforce composition by region and gender
  • Relationship between salary and performance
  • Hiring trends over time

πŸ› οΈ Technologies Used

  • Python

    • pandas
    • re (regex)
  • Excel

  • Power BI


βš™οΈ How to Run the Project

1. Install Dependencies

pip install pandas openpyxl

2. Run the Script

python your_script_name.py

3. Output

  • Clean dataset will be generated automatically

πŸ’‘ Why This Project Matters

This project demonstrates the ability to:

  • Handle real-world messy datasets
  • Apply data cleaning best practices
  • Transform raw data into business insights
  • Build professional dashboards

πŸš€ Future Improvements

  • Add predictive analytics (employee attrition)
  • Integrate real-time data sources
  • Enhance dashboard storytelling
  • Deploy dashboard to cloud platforms

πŸ“¬ Contact

If you're a recruiter or collaborator interested in this project, feel free to connect!


⭐ If you found this project valuable, consider giving it a star!

Dashboard

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages