Since the COVID tracking project has discontinued, there is no single federal database with all covid data. However, the Center for Disease Control (CDC) COVID Data Tracker
- Since the COVID tracking project has discontinued, there is no single federal database with all covid data. However, the Center for Disease Control (CDC) COVID Data Tracker contains links out to covid data for states and counties.
- Use the links provided by the CDC COVID data tracker to find the covid data for your state: https://data.cdc.gov/Case-Surveillance/United-States-COVID-19-Cases-and-Deaths-by-State-o/9mfq-cb36/data
- Sort the data by state and then by submission date. Copy all of the data for your state to a new sheet.
Create a new column AR in the database labeled % positive. The formula to calculate % positive should be = AD1/AP1. Format the cell for percentage. Copy the cell down the column for all dates of your state. - Using the data, develop a linear regression time series analysis for deaths (column D), % tested, and % positive. Answer the following questions:
- What is the null and alternative hypothesis for each variable?
- What was the r squared for each variable? What does this mean?
- What was the p-value of each test? What does this mean?
- For those tests which were significant, use the model to predict the value of the variable seven days after the end of the workshop.
- Write a short report (1 to 2 pages for each variable) that includes the results of your analysis. Present the results and discuss the implications of your findings. Include whatever graphs or statistical output you may have generated in answering these questions along with a short explanation of your analysis. What conclusions concerning COVID19 may you draw from your analysis?