Exercise

Creating a bag from saved text

This time your colleague has saved the reviews to some text files. There are multiple files and multiple reviews in each file. Each review is on a separate line of the text file.

You want to load these into Dask lazily so you can use parallel processing to analyze them more quickly.

dask.bag has been imported for you as db.

Instructions

  • Use the read_text() function to load all of the .txt files inside the data/tripadvisor_hotel_reviews directory into a bag.
  • Count the number of reviews in the bag.
  • Call .compute() on the lazy count and print the result.
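
The steps above could be sketched roughly as follows. This is one possible approach, not an official solution: the helper name lazy_review_count and the *.txt glob pattern are assumptions made for illustration, and the code imports dask.bag itself so that it runs outside the exercise environment.

```python
import dask.bag as db


def lazy_review_count(pattern):
    """Build a lazy count of lines (one review per line) across matching files.

    `lazy_review_count` is a hypothetical helper name, not part of Dask.
    """
    # read_text accepts a glob pattern and builds a lazy bag;
    # each element of the bag is one line of text, and nothing is loaded yet.
    review_bag = db.read_text(pattern)

    # count() is lazy too: it returns a placeholder for the result,
    # not the number itself.
    return review_bag.count()
```

For this exercise, print(lazy_review_count("data/tripadvisor_hotel_reviews/*.txt").compute()) would trigger the actual file reads and print the number of reviews. Until .compute() is called, only the task graph exists.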