The wonderful wizard of NRC
Last but not least, you get to work with the NRC lexicon which labels words across multiple emotional states. Remember Plutchik's wheel of emotion? The NRC lexicon tags words according to Plutchik's 8 emotions plus positive/negative.
In this exercise there is a new operator, %in%
, which matches a vector to another. In the code below %in%
will return FALSE
, FALSE
, TRUE
. This is because within some_vec
, 1
and 2
are not found within some_other_vector
but 3
is found and returns TRUE
. The %in%
is useful to find matches.
some_vec <- c(1, 2, 3)
some_other_vector <- c(3, "a", "b")
some_vec %in% some_other_vector
Another new operator is !
. For logical conditions, adding !
will inverse the result. In the above example, the FALSE
, FALSE
, TRUE
will become TRUE
, TRUE
, FALSE
. Using it in concert with %in%
will inverse the response and is good for removing items that are matched.
!some_vec %in% some_other_vector
We've created oz
which is the tidy version of The Wizard of Oz along with nrc
containing the "NRC" lexicon with renamed columns.
This exercise is part of the course
Sentiment Analysis in R
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
oz_plutchik <- oz %>%
# Join to nrc lexicon by term = word
inner_join(___, by = ___("___" = "___")) %>%
# Only consider Plutchik sentiments
___(!___ %in% c("___", "___")) %>%
# Group by sentiment
___(___) %>%
# Get total count by sentiment
___(total_count = ___(___))