Choose the right data for survival analysis
Let's say you're estimating the rate at which a new virus strain spreads through a population. Before committing to a statistical model, it's good practice to explore the data you have available and check it against the model's assumptions and data requirements.
Three datasets have been loaded for you, called virus_record_a, virus_record_b, and virus_record_c.
Inspect each DataFrame in the console. Which of the DataFrames would be suitable for survival analysis?
pandas and numpy are loaded as pd and np, respectively.
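As a rough aid for the inspection step, the sketch below checks whether a DataFrame has the two ingredients survival analysis generally needs: a non-negative duration for each subject and an indicator of whether the event was observed. The helper `looks_survival_ready`, the column names `infection_time` and `infected`, and the toy values are all hypothetical and only illustrate the idea; they are not the contents of the exercise's datasets.

```python
import pandas as pd


def looks_survival_ready(df, duration_col, event_col):
    """Rough check for a usable duration column and event-indicator column."""
    # Both columns must be present.
    if duration_col not in df.columns or event_col not in df.columns:
        return False
    durations = df[duration_col]
    events = df[event_col]
    # Durations should be numeric, non-missing, and non-negative.
    if not pd.api.types.is_numeric_dtype(durations):
        return False
    if durations.isna().any() or (durations < 0).any():
        return False
    # The event indicator should be non-missing and contain only 0 and 1.
    if events.isna().any():
        return False
    return bool(events.isin([0, 1]).all())


# Hypothetical toy data, not the exercise's DataFrames.
toy_ok = pd.DataFrame({"infection_time": [5, 8, 12, 3], "infected": [1, 0, 1, 1]})
toy_bad = pd.DataFrame({"infection_time": [5, -2, 12, 3], "infected": [1, 0, 1, 1]})

print(looks_survival_ready(toy_ok, "infection_time", "infected"))
print(looks_survival_ready(toy_bad, "infection_time", "infected"))
```

A check like this is only a starting point. Calling `df.head()` and `df.describe()` in the console is still the quickest way to spot negative durations, missing values, or an absent event column.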
This exercise is part of the course Survival Analysis in Python.