Disaster and Food Insecurity (DataScience Foundations Project)
Link to my GitHub Repository Github Code.
Introduction
Hunger is a global issue affecting millions of people. Food insecurity, defined by the FAO, is the lack of consistent access to adequate food supplies. Climate change, leading to increased natural disasters, can exacerbate food insecurity. This research aims to analyze the impact of disasters on food security, specifically the percentage of malnourishment, to anticipate and prepare for the consequences of disasters on affected populations.
Dataset
The research relies on two main datasets:
- EMDAT Dataset: Contains information on natural disasters worldwide, recorded by the EM-DAT Centre from 1900 to the present day.
- FAOSTAT Dataset: Provided by the Food and Agriculture Organization, it includes global hunger metadata and indicators such as prevalence of undernourishment.
Modeling
- Data Preprocessing: The FAOSTAT dataset was transformed and merged with the EM-DAT dataset. Null values, especially in the disaster magnitude column, were imputed using MICE imputation, and the data was normalized using MinMaxScaler.
- Analytic Approach: The research involved training various models, including CNN, random forest, linear regression, and multiple linear regression, to predict the impact of disasters on food insecurity.
- Results: The models were evaluated using mean squared error, with CNN and random forest showing the best fit. Further analysis on specific disaster types (flood, storm, earthquake) revealed flood and storm to have a more significant impact on food insecurity.
Conclusion
The research concludes that flood and storm disasters have a more substantial impact on food insecurity compared to earthquakes. The best-fitting models were found to be Random Forest and CNN, with mean squared errors of 0.00109 and 0.00430 for flood and storm, respectively.
Repository Contents
- Datasets:
FOASTAT_data.csv: Food insecurity datasetemdat.csv: Disasters datasetfinal_normalized_data.csv: Final dataset with normalized values of magnitude
- Code:
final_project.ipynb: Colab notebook containing the analysis, modeling, and visualizations
- Visualizations:
visualizations.twb: Tableau file with visualizations
- References:
references.txt: Document containing all references and data sources
- Presentation:
Data_5100_Analysis_Project_Presentation.pdf: Presentation slides summarizing the research findings
Limitations
The research acknowledges limitations, including the influence of variables like war, COVID-19, and poverty on malnutrition. Additionally, recovery from food insecurity post-disaster depends on a country’s resources.
Resources
Useful links and resources for further understanding and context are provided in the references section.