Assignment for Data Processing in Python

In this assignment, you will extend the code we started developing in the challenges. The assignment consists of two parts:

COVID-19 Spread Modelling

  • Plot Rt graphs for 5-6 different countries, either on one plot for comparison or as several plots side by side.
  • See how the numbers of deaths and recoveries correlate with the number of infected cases.
  • Find out how long the disease typically lasts by visually correlating the infection rate and the death rate and looking for anomalies. You may need to look at different countries to find that out.
  • Calculate the fatality rate and how it changes over time. Because deaths lag behind infections, you may want to shift one time series by the length of the disease in days before doing the calculation.
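
The shifting step in the last task can be sketched with pandas. This is only an illustration under stated assumptions, not a reference solution: the column names `infected` and `deaths`, the helper `lagged_fatality_rate`, the toy numbers, and the 3-day lag are all made up for the example. With real data, the lag would come from the visual comparison in the previous task.

```python
import pandas as pd

def lagged_fatality_rate(df, lag_days):
    """Cumulative deaths divided by cumulative infections lag_days earlier.

    Assumes df has (hypothetical) columns 'infected' and 'deaths',
    indexed by consecutive days.
    """
    # shift() moves the infected series forward in time, so each day's deaths
    # are compared with the infections recorded lag_days before.
    infected_earlier = df["infected"].shift(lag_days)
    return df["deaths"] / infected_earlier

# Toy cumulative counts, invented for illustration only
toy = pd.DataFrame(
    {
        "infected": [100, 200, 400, 800, 1600, 3200],
        "deaths": [0, 0, 0, 2, 4, 8],
    },
    index=pd.date_range("2020-03-01", periods=6, freq="D"),
)
toy["fatality_rate"] = lagged_fatality_rate(toy, lag_days=3)
print(toy)
```

The first `lag_days` rows have no earlier infection count to compare against, so their rate is missing (NaN). Trying several lags and checking which one gives the most stable rate is one way to cross-check the disease length estimated visually.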

COVID-19 Papers Analysis

  • Build a co-occurrence matrix of different medications and see which medications often occur together (i.e., are mentioned in the same abstract). You can modify the code for building the co-occurrence matrix for medications and diagnoses.
  • Visualize this matrix using a heatmap.
  • As a stretch goal, visualize the co-occurrence of medications using a chord diagram. This library may help you draw a chord diagram.
  • As another stretch goal, extract dosages of different medications (such as 400mg in "take 400mg of chloroquine daily") using regular expressions, and build a dataframe that shows different dosages for different medications. Note: consider numeric values that are in close textual vicinity of the medication name.
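
The pair counting and the dosage extraction described above can be sketched with the standard library alone. Everything here is an assumption made for illustration: the medication list, the three made-up abstracts, the helper names, and the 15-character proximity threshold `max_gap`. It is not the code from the challenges, which you are expected to adapt to the real dataset.

```python
import re
from collections import Counter
from itertools import combinations

# Hypothetical inputs, invented for this example
MEDICATIONS = ["chloroquine", "hydroxychloroquine", "remdesivir"]
ABSTRACTS = [
    "Patients were asked to take 400mg of chloroquine daily.",
    "Remdesivir 200 mg was compared with chloroquine 500 mg.",
    "Hydroxychloroquine and remdesivir were both evaluated.",
]

# A number (optionally with decimals) followed by "mg", e.g. "400mg" or "200 mg"
DOSE_RE = re.compile(r"\b(\d+(?:\.\d+)?)\s*mg\b", re.IGNORECASE)

def name_pattern(med):
    # Word boundaries keep "chloroquine" from matching inside "hydroxychloroquine"
    return re.compile(rf"\b{re.escape(med)}\b", re.IGNORECASE)

def mentioned(text):
    """Sorted list of medications that appear as whole words in text."""
    return sorted(m for m in MEDICATIONS if name_pattern(m).search(text))

def cooccurrence(abstracts):
    """Count the abstracts in which each unordered pair of medications appears."""
    counts = Counter()
    for text in abstracts:
        counts.update(combinations(mentioned(text), 2))
    return counts

def dosages(abstracts, max_gap=15):
    """Pair each medication mention with doses at most max_gap characters away."""
    records = []
    for text in abstracts:
        doses = list(DOSE_RE.finditer(text))
        for med in mentioned(text):
            for name in name_pattern(med).finditer(text):
                for dose in doses:
                    # Characters between the two spans, whichever comes first
                    gap = max(dose.start() - name.end(), name.start() - dose.end())
                    if gap <= max_gap:
                        records.append({"medication": med, "dose_mg": float(dose.group(1))})
    return records

print(cooccurrence(ABSTRACTS))
print(dosages(ABSTRACTS))
```

The pair counts can be arranged into a square matrix for the heatmap, and the list of records can be passed to `pandas.DataFrame` to get the dataframe the task asks for. A fixed character distance is a crude notion of "close textual vicinity": it will mis-assign doses when two medications are named close together, so the threshold is worth tuning on real abstracts.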

Rubric

| Exemplary | Adequate | Needs Improvement |
| --- | --- | --- |
| All tasks are complete, graphically illustrated and explained, including at least one of two stretch goals | More than 5 tasks are complete, no stretch goals are attempted, or the results are not clear | Less than 5 (but more than 3) tasks are complete, visualizations do not help to demonstrate the point |