Study Limitations
- Small sample size (18 regions) limiting statistical power
- Lack of data on other contributing factors (poverty, geography, etc.)
- Region-level analysis restricts granularity of insights
- No access to 2024 data for more current analysis
Due to the limited metrics available in our current dataset, our sample size of 18 regions was not "sufficiently large" for the data to be treated as normally distributed. Thus, we failed to reject the null hypothesis set in this study: that there is little to no correlation between healthcare facility density and mortality rate in a region.
Be that as it may, we believe our exploration and analysis of the dataset remain relevant to the problems we set out to investigate. The scatter plot shows that, even though the correlation between the two variables was statistically insignificant, there may still be a positive relationship: increasing the number of healthcare facilities in densely populated regions appears to coincide with higher mortality rates in those same regions.
💡
One possible interpretation is that an increase in healthcare facilities does not directly raise mortality rates, but instead leads to more accurate and complete recording of deaths. In other words, improved healthcare infrastructure might enhance logistical capabilities, such as data collection and record-keeping, rather than affect mortality rates directly.
What conclusions or assumptions can be made from this? We believe that due to the lack of data on other possible factors influencing mortality rates—such as poverty levels, unemployment, geographical conditions, and more—we were unable to establish meaningful connections to explain this relationship.