Choose the right data for survival analysis
Let's say you're working on estimating the rate at which a new strain of virus spreads in a population. It's always good practice to explore the data you have available before deciding whether to use a statistical model. Look out for any model assumptions and requirements for the data.
Three datasets have been loaded for you, called
virus_record_a
virus_record_b
virus_record_c
Inspect each DataFrame in the console. Which of the DataFrames would be suitable for survival analysis?
pandas
and numpy
are loaded as pd
and np
, respectively.
This exercise is part of the course
Survival Analysis in Python
Hands-on interactive exercise
Turn theory into action with one of our interactive exercises
