Importing Stata files
Here, you'll gain expertise in importing Stata files as DataFrames using the pd.read_stata() function from pandas. The last exercise's file, 'disarea.dta', is still in your working directory.
This is a part of the course “Introduction to Importing Data in Python”.
Exercise instructions
- Use pd.read_stata() to load the file 'disarea.dta' into the DataFrame df.
- Print the head of the DataFrame df.
- Visualize your results by plotting a histogram of the column disa10. We’ve already provided this code for you, so just run it!
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Import pandas and matplotlib.pyplot (the plotting code below uses plt)
import pandas as pd
import matplotlib.pyplot as plt
# Load Stata file into a pandas DataFrame: df
# Print the head of the DataFrame df
# Plot histogram of one column of the DataFrame
pd.DataFrame.hist(df[['disa10']])
plt.xlabel('Extent of disease')
plt.ylabel('Number of countries')
plt.show()
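To see what pd.read_stata() does without needing the exercise file, here is a minimal sketch that writes a small Stata file and reads it back. The data, the column values, and the file name sample.dta are made up for illustration and are not the contents of 'disarea.dta'. Only the column name disa10 is borrowed from the exercise.

```python
import os
import tempfile

import pandas as pd

# Hypothetical stand-in data; the real 'disarea.dta' has its own columns and values.
sample = pd.DataFrame({
    "country": ["A", "B", "C"],
    "disa10": [0.0, 0.25, 0.5],
})

with tempfile.TemporaryDirectory() as tmp_dir:
    # Hypothetical file name, used only for this round trip
    path = os.path.join(tmp_dir, "sample.dta")

    # Write the DataFrame as a Stata file, leaving out the index column
    sample.to_stata(path, write_index=False)

    # Load Stata file into a pandas DataFrame: df
    df = pd.read_stata(path)

# Print the head of the DataFrame df
print(df.head())
```

In the exercise itself, the same pd.read_stata() call would presumably take 'disarea.dta' as its argument, after which the provided histogram code can run on df.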
This exercise is part of the course
Introduction to Importing Data in Python
Learn to import data into Python from various sources, such as Excel, SQL, SAS and right from the web.
You've learned how to import flat files, but there are many other file types you will potentially have to work with as a data scientist. In this chapter, you'll learn how to import data into Python from a wide array of important file types. These include pickled files, Excel spreadsheets, SAS and Stata files, HDF5 files (a file type for storing large quantities of numerical data), and MATLAB files.