Get Started

Importing Stata files

Here, you'll gain expertise in importing Stata files as DataFrames using the pd.read_stata() function from pandas. The last exercise's file, 'disarea.dta', is still in your working directory.

This is a part of the course

“Introduction to Importing Data in Python”

View Course

Exercise instructions

  • Use pd.read_stata() to load the file 'disarea.dta' into the DataFrame df.
  • Print the head of the DataFrame df.
  • Visualize your results by plotting a histogram of the column disa10. We’ve already provided this code for you, so just run it!

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Import pandas
import pandas as pd

# Load Stata file into a pandas DataFrame: df


# Print the head of the DataFrame df


# Plot histogram of one column of the DataFrame
pd.DataFrame.hist(df[['disa10']])
plt.xlabel('Extent of disease')
plt.ylabel('Number of countries')
plt.show()

This exercise is part of the course

Introduction to Importing Data in Python

BeginnerSkill Level
4.7+
88 reviews

Learn to import data into Python from various sources, such as Excel, SQL, SAS and right from the web.

You've learned how to import flat files, but there are many other file types you will potentially have to work with as a data scientist. In this chapter, you'll learn how to import data into Python from a wide array of important file types. These include pickled files, Excel spreadsheets, SAS and Stata files, HDF5 files, a file type for storing large quantities of numerical data, and MATLAB files.

Exercise 1: Introduction to other file typesExercise 2: Not so flat any moreExercise 3: Loading a pickled fileExercise 4: Listing sheets in Excel filesExercise 5: Importing sheets from Excel filesExercise 6: Customizing your spreadsheet importExercise 7: Importing SAS/Stata files using pandasExercise 8: How to import SAS7BDATExercise 9: Importing SAS filesExercise 10: Using read_stata to import Stata filesExercise 11: Importing Stata files
Exercise 12: Importing HDF5 filesExercise 13: Using File to import HDF5 filesExercise 14: Using h5py to import HDF5 filesExercise 15: Extracting data from your HDF5 fileExercise 16: Importing MATLAB filesExercise 17: Loading .mat filesExercise 18: The structure of .mat in Python

What is DataCamp?

Learn the data skills you need online at your own pace—from non-coding essentials to data science and machine learning.

Start Learning for Free