GitHub - Abe-E/INFO-3401-Final-Project: The purpose of this project was to investigate whether historical redlining policies influence health outcomes and access.

Project Overview

We investigated whether historical redlining in the city of Denver predicted current neighborhood equity scores. To clarify our results, we utilized CDC data to determine whether redlining predicted specific diseases based on zipcodes.

Method for CDC Disease Analysis

This project investigates the impact of historical housing policy on modern community health in Denver. Utilizing SQL and Excel, I integrated two datasets: historical HOLC "Redlining" equity maps and 2023 CDC places morbidity data. Data for Denver redlining was downloaded as a geojson and was converted to a CSV file via a python script. To extract the data from the large CDC file, data was cleaned within a query within the CDC website, and joined in SQL and exported to Excel. To analyze this data, I performed Z-score standardization on clinical measures to account for population variance across 78 neighborhoods. I then ran a linear regression model, with the historical equity grade as the independent variable to predict the prevalence of modern chronic conditions. This analysis attempted to quantify how early 20th century redlining remains a statistically significant predictor in 21st century health conditions. A pivot table and graph was made to visualize the results.

Findings

1. Clinical Outcome: Cancer (Non-Skin) or Melanoma

Statistical Metric	Value	Statistical Significance
Multiple R	0.530	Moderate-to-Strong Correlation
R-Squared ($R^2$)	0.281	28.1% of variance explained
P-Value ($p$)	0.002	High Significance

2. Clinical Outcome: Routine Doctor Visits

Statistical Metric	Value	Statistical Significance
Multiple R	0.449	Moderate Correlation
R-Squared ($R^2$)	0.201	20.1% of variance explained
P-Value ($p$)	0.013	Significant

3. Clinical Outcome: Diagnosed Diabetes

Statistical Metric	Value	Statistical Significance
Multiple R	0.175	Weak Correlation
R-Squared ($R^2$)	0.031	3.1% of variance explained
P-Value ($p$)	0.355	Not Significant (Trend Only)

Statistics Summary

Clinical Measure	Grade A (Best)	Grade B	Grade C	Grade D (Redlined)
Diagnosed Diabetes	5.4%	5.8%	7.3%	10.1%
Routine Doctor Visits	66.1%	73.2%	68.1%	60.9%
Cancer (non-skin)	5.1%	5.8%	6.8%	3.5%*

Discussion

Our findings indicate that historical redlining functions as a predictor for certain outcomes. For example, Cancer was shown to be highly significant, and that nearly 30% of the difference in cancer rates across Denver neighborhoods is linked to 1930s housing grades. Additionally, routine doctor visits explained 20.1% of the variance in doctor visits. This shows that historically redlined neighborhoods have the lowest percentage of routine visits, suggesting that policies from ~90 years ago are still preventing people from getting preventative care today.

Limitations

There are various limitations due to the nature of the data and method of research. Firstly, geographical boundaries of redlining in Denver from the 1930's does perfectly align with the current neighborhood zip codes. As a result, some redlining area's are in multiple neighborhoods, possibly skewing the data. Additionally, CDC data is self-report data and may not be representative of neighborhoods, and prone to rater and recall bias. Another limitation in this study was the way in which data was standardized.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
CDC_Redline.sql		CDC_Redline.sql
CDC_Redline.xlsx		CDC_Redline.xlsx
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages