Visualize common words
Now that you have a corpus filled with words used in both the chardonnay and coffee tweets files, you can clean the corpus, convert it into a TermDocumentMatrix
, and then a matrix to prepare it for a commonality.cloud()
.
The commonality.cloud()
function accepts this matrix object, plus additional arguments like max.words
and colors
to further customize the plot.
commonality.cloud(tdm_matrix, max.words = 100, colors = "springgreen")
This exercise is part of the course
Text Mining with Bag-of-Words in R
Exercise instructions
- Create
all_clean
by applying the predefinedclean_corpus()
function toall_corpus
. - Create
all_tdm
, aTermDocumentMatrix
fromall_clean
. - Create
all_m
by convertingall_tdm
to a matrix object. - Create a
commonality.cloud()
fromall_m
withmax.words = 100
andcolors = "steelblue1"
.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Clean the corpus
___ <- ___(___)
# Create all_tdm
___ <- ___(___)
# Create all_m
___ <- ___(___)
# Print a commonality cloud
___(___, ___, ___)