Visualize common words

Now that you have a corpus containing the words from both the chardonnay and coffee tweet files, you can clean it, convert it to a TermDocumentMatrix, and then convert that to a matrix ready for commonality.cloud().

The commonality.cloud() function accepts this matrix object, plus additional arguments like max.words and colors to further customize the plot.

commonality.cloud(tdm_matrix, max.words = 100, colors = "springgreen")
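To make "common words" concrete, here is a minimal base R sketch of the idea behind the plot. The matrix toy_m and its counts are made up for illustration, and the scoring step (normalize each document's counts, then take the smallest share across documents) is an assumption about how the wordcloud package ranks shared terms, not a copy of its source.

```r
# Hypothetical term counts: rows are terms, columns are the two documents.
toy_m <- matrix(
  c(5, 2,   # "cup"
    3, 0,   # "wine"
    0, 4,   # "latte"
    1, 1),  # "morning"
  ncol = 2, byrow = TRUE,
  dimnames = list(c("cup", "wine", "latte", "morning"),
                  c("chardonnay", "coffee"))
)

# Convert raw counts to within-document proportions (each column sums to 1).
toy_prop <- sweep(toy_m, 2, colSums(toy_m), "/")

# Score each term by its smallest proportion across the documents.
# A term that is missing from any document scores 0.
shared_score <- apply(toy_prop, 1, min)

# Keep only the terms that occur in every document.
common_terms <- names(shared_score[shared_score > 0])
print(common_terms)
```

Under these assumptions, passing toy_m to commonality.cloud() should draw only the terms that both columns share, with larger shared proportions drawn bigger.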

This exercise is part of the course

Text Mining with Bag-of-Words in R


Exercise instructions

  • Create all_clean by applying the predefined clean_corpus() function to all_corpus.
  • Create all_tdm, a TermDocumentMatrix from all_clean.
  • Create all_m by converting all_tdm to a matrix object.
  • Create a commonality.cloud() from all_m with max.words = 100 and colors = "steelblue1".
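The same sequence of steps can be sketched on a toy corpus. Everything prefixed with toy_ is hypothetical: the two sentences stand in for the tweet files, and toy_clean_corpus() is only a guess at the kind of cleaning the course's predefined clean_corpus() performs. The sketch assumes the tm package is installed.

```r
library(tm)

# Two made-up documents standing in for the combined tweet files.
toy_docs <- c(
  "Chardonnay with friends tonight, great evening!",
  "Coffee with friends this morning, great start!"
)
toy_corpus <- VCorpus(VectorSource(toy_docs))

# Hypothetical stand-in for the course's clean_corpus() helper.
toy_clean_corpus <- function(corpus) {
  corpus <- tm_map(corpus, content_transformer(tolower))
  corpus <- tm_map(corpus, removePunctuation)
  corpus <- tm_map(corpus, removeWords, stopwords("en"))
  tm_map(corpus, stripWhitespace)
}

# Clean, build the term-document matrix, then convert to a plain matrix.
toy_clean <- toy_clean_corpus(toy_corpus)
toy_tdm <- TermDocumentMatrix(toy_clean)
toy_all_m <- as.matrix(toy_tdm)

# Terms with a nonzero count in both documents are the ones a
# commonality cloud would have available to plot.
toy_shared <- rownames(toy_all_m)[rowSums(toy_all_m > 0) == ncol(toy_all_m)]
print(toy_shared)
```

The resulting matrix has one row per term and one column per document, which is the shape commonality.cloud() expects.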

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

# Clean the corpus
___ <- ___(___)

# Create all_tdm
___ <- ___(___)

# Create all_m
___ <- ___(___)

# Print a commonality cloud
___(___, ___, ___)