
Choose the right data for survival analysis

Let's say you're estimating how quickly a new strain of virus spreads through a population. Before deciding whether a statistical model applies, it's good practice to explore the data you have available and check it against the model's assumptions and data requirements.

Three DataFrames have been loaded for you:

  • virus_record_a
  • virus_record_b
  • virus_record_c

Inspect each DataFrame in the console. Which of the DataFrames would be suitable for survival analysis?

pandas and numpy are loaded as pd and np, respectively.
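
As a rough guide to what to look for, survival analysis generally needs a duration (time-to-event) column and an event indicator that separates observed events from censored observations. The sketch below is a heuristic check, not part of the exercise solution. The helper `looks_like_survival_data`, the toy DataFrame, and the column names `days_to_infection` and `infected` are hypothetical, since the actual columns in the three records are not shown here.

```python
import pandas as pd

def looks_like_survival_data(df, duration_col, event_col):
    """Heuristic check (hypothetical helper): does df have a usable
    duration column and a 0/1 event indicator?"""
    # Both columns must be present.
    if duration_col not in df.columns or event_col not in df.columns:
        return False
    durations = df[duration_col]
    events = df[event_col]
    # Missing values would need handling before fitting a model.
    if durations.isna().any() or events.isna().any():
        return False
    # Durations should be numeric and non-negative.
    if not pd.api.types.is_numeric_dtype(durations):
        return False
    if (durations < 0).any():
        return False
    # Event indicator: 1 = event observed, 0 = censored.
    return bool(events.isin([0, 1]).all())

# Toy data with assumed column names, for illustration only.
toy = pd.DataFrame({
    "days_to_infection": [3, 7, 7, 12, 20],
    "infected": [1, 1, 0, 1, 0],
})

print(looks_like_survival_data(toy, "days_to_infection", "infected"))

# The same data without an event indicator fails the check.
no_event = toy.drop(columns="infected")
print(looks_like_survival_data(no_event, "days_to_infection", "infected"))
```

In the console, `virus_record_a.head()` and `virus_record_a.info()` show the same things by eye: which columns exist, their dtypes, and whether any values are missing.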

This exercise is part of the course

Survival Analysis in Python
