CommencerCommencer gratuitement

Cage match, part 2! Negative reviews

In both organizations, people mentioned "culture" and "smart people", so there are some similar positive aspects between the two companies. However, with the pyramid plot, you can start to infer degrees of positive features of the work environments.

You now decide to turn your attention to negative reviews and make the same visual. This time you already have the common_words data frame in your workspace. However, the common bigrams in this exercise come from negative employee reviews.

Cet exercice fait partie du cours

Text Mining with Bag-of-Words in R

Afficher le cours

Instructions

  • Using slice_max() on common_words, obtain the top 5 bigrams referring to the diff column. The results of the new object will print to your console.
  • Create a pyramid.plot(). Pass in top5_df$AmazonNeg, top5_df$GoogleNeg, and labels = top5_df$terms. For better labeling, set
    • gap to 12.
    • top.labels to c("Amzn", "Neg Words", "Goog")

The main and unit arguments are set for you.

Exercice interactif pratique

Essayez cet exercice en complétant cet exemple de code.

# Extract top 5 common bigrams
(top5_df <- ___ %>% ___(___, n = ___))

# Create a pyramid plot
___(
    # Amazon on the left
    top5_df$___,
    # Google on the right
    top5_df$___,
    # Use terms for labels
    labels = top5_df$___,
    # Set the gap to 12
    ___ = ___,
    # Set top.labels to "Amzn", "Neg Words" & "Goog"
    ___ = ___,
    main = "Words in Common", 
    unit = NULL
)
Modifier et exécuter le code