Get startedGet started for free

Quick review of Google reviews

You decide to create a comparison.cloud() of Google's positive and negative reviews for comparison to Amazon. This will give you a quick understanding of top terms without having to spend as much time as you did, examining the Amazon reviews in the previous exercises.

We've provided you with a corpus all_goog_corpus, which has 500 positive and 500 negative reviews for Google. Here, you'll clean the corpus and create a comparison cloud comparing the common words in both pro and con reviews.

This exercise is part of the course

Text Mining with Bag-of-Words in R

View Course

Exercise instructions

The all_goog_corpus object consisting of Google pro and con reviews, is loaded in your workspace.

  • Create all_goog_corp by cleaning all_goog_corpus with the predefined tm_clean() function.
  • Create all_tdm by converting all_goog_corp to a term-document matrix.
  • Create all_m by converting all_tdm to a matrix.
  • Construct a comparison.cloud() from all_m. Set max.words to 100. The colors argument is specified for you.

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Create all_goog_corp
___ <- ___

# Create all_tdm
___ <- ___

# Create all_m
___ <- ___

# Build a comparison cloud
___(___, 
    max.words = ___, 
    colors = c("#F44336", "#2196f3"))
Edit and Run Code