Importing Stata files
Herein, you'll gain expertise in
importing Stata files as DataFrames using the
pd.read_stata() function from pandas.
The file 'disarea.dta' is already
in your working directory. The data consist of disease extent for several diseases in various countries (more information can be found here).
This exercise is part of the course
Importing Data in Python
Exercise instructions
- Use
pd.read_stata()to load the file'disarea.dta'into the DataFramedf. - Print the head of the DataFrame
df. - Visualize your results by plotting a histogram of the column
disa10. We’ve already provided this code for you, so just run it!
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Import pandas
import pandas as pd
# Load Stata file into a pandas DataFrame: df
# Print the head of the DataFrame df
# Plot histogram of one column of the DataFrame
pd.DataFrame.hist(df[['disa10']])
plt.xlabel('Extent of disease')
plt.ylabel('Number of coutries')
plt.show()