Data Projects

Manhattan College Business Analytics Competition 2023

1. Official poster:

2. Background story:

3. Data source:

4. Research links:

https://education.nationalgeographic.org/resource/paradox-undernourishment

Food Safety status in African countries

https://agrilinks.org/post/advancing-food-safety-africa-opportunities-and-action-areas

5. The development of Food Safety in Sub-Saharan Africa:

Based on the data and research we gathered, we concluded that the Food Safety issue in Sub-Saharan Africa is determined by two main factors: Food Quantity and Food Quality. As we analyzed each factor in more depth, we found that there was not enough data to support an analysis of Food Quality. We therefore decided to focus primarily on Food Quantity.

6. Indicator for Food Quantity & Other variables:

We chose the prevalence (number) of undernourished people over 20 years (2001–2020) as the indicator for Food Quantity (Y).
We randomly chose a number of factors that we believed to have an impact on Food Quantity (Y) as the X variables.

7. Data cleaning:

We used RStudio to merge the data tables from the Excel files and to reshape them by converting rows to columns. We then removed every variable with more than 10% missing values, which narrowed down the set of X variables used in the final analysis.
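The cleaning steps above can be sketched roughly as follows. The original work was done in R; this is only a small Python illustration of the same idea, not the code we actually ran. The table contents, country labels, indicator names, and the helper functions `merge_and_pivot` and `drop_sparse_variables` are all made up for the example.

```python
from collections import defaultdict

MISSING_THRESHOLD = 0.10  # drop variables with more than 10% missing values

# Made-up long-format records standing in for two Excel sheets.
# Each row is (country, year, indicator, value); None marks a missing value.
table_y = [
    ("Country A", 2001, "undernourished", 10.0),
    ("Country A", 2002, "undernourished", 11.0),
    ("Country B", 2001, "undernourished", 3.0),
    ("Country B", 2002, "undernourished", 2.5),
]
table_x = [
    ("Country A", 2001, "gdp_per_capita", 400.0),
    ("Country A", 2002, "gdp_per_capita", 395.0),
    ("Country B", 2001, "gdp_per_capita", 270.0),
    ("Country B", 2002, "gdp_per_capita", 310.0),
    ("Country A", 2001, "rainfall", 630.0),
    ("Country A", 2002, "rainfall", None),      # explicitly missing
    ("Country B", 2002, "rainfall", 1180.0),    # (Country B, 2001) is absent
]


def merge_and_pivot(*tables):
    """Combine long-format tables and turn each indicator into a column."""
    wide = defaultdict(dict)
    for table in tables:
        for country, year, indicator, value in table:
            wide[(country, year)][indicator] = value
    return dict(wide)


def drop_sparse_variables(wide, threshold=MISSING_THRESHOLD):
    """Remove indicators whose share of missing values exceeds the threshold."""
    indicators = {name for row in wide.values() for name in row}
    n_rows = len(wide)
    kept = set()
    for name in indicators:
        # A value counts as missing if it is None or not present at all.
        missing = sum(1 for row in wide.values() if row.get(name) is None)
        if missing / n_rows <= threshold:
            kept.add(name)
    return {
        key: {name: row.get(name) for name in sorted(kept)}
        for key, row in wide.items()
    }


cleaned = drop_sparse_variables(merge_and_pivot(table_y, table_x))
for key in sorted(cleaned):
    print(key, cleaned[key])
```

In this toy data, `rainfall` is missing in half of the country-year rows, so it is dropped, while the two complete variables are kept.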

8. My major tasks: gather data, clean data, and develop a K-means clustering analysis at the regional level.

Analysis on Bicycle Sales for SimplyBusiness

EU SUPERSTORE ANALYSIS

Using R for Linear Model

Google BigQuery – Case 1a & 1b

Google BigQuery – Case 2a & 2b