Cleaning with qdap

The qdap package offers further text cleaning functions. Each is useful on its own and becomes particularly powerful when combined with the others.

  • bracketX(): Remove all text within brackets (e.g. "It's (so) cool" becomes "It's cool")
  • replace_number(): Replace numbers with their word equivalents (e.g. "2" becomes "two")
  • replace_abbreviation(): Replace abbreviations with their full text equivalents (e.g. "Sr." becomes "Senior")
  • replace_contraction(): Convert contractions back to their base words (e.g. "shouldn't" becomes "should not")
  • replace_symbol(): Replace common symbols with their word equivalents (e.g. "$" becomes "dollar")
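
As a rough illustration of the list above, the sketch below calls each function once on a short string. The input sentences and variable names are hypothetical and chosen only to trigger each replacement; the code assumes the qdap package is installed and relies on each function's default arguments.

```r
library(qdap)

# Hypothetical one-line inputs, one per function (not part of the exercise data)
bracket_demo      <- bracketX("It's (so) cool")
number_demo       <- replace_number("I have 2 cats")
abbreviation_demo <- replace_abbreviation("John Smith Sr. spoke first")
contraction_demo  <- replace_contraction("You shouldn't go")
symbol_demo       <- replace_symbol("It costs $5")

# Print each result to inspect what was replaced
print(bracket_demo)
print(number_demo)
print(abbreviation_demo)
print(contraction_demo)
print(symbol_demo)
```

Each function takes a character vector and returns a character vector, so the same calls should work unchanged on a vector holding many documents.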

This exercise is part of the course Text Mining with Bag-of-Words in R.

Exercise instructions

Apply the following functions to the text object from the previous exercise:

  • bracketX()
  • replace_number()
  • replace_abbreviation()
  • replace_contraction()
  • replace_symbol()

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

## text is still loaded in your workspace

# Remove text within brackets
___

# Replace numbers with words
___

# Replace abbreviations
___

# Replace contractions
___

# Replace symbols with words
___
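
One possible way to combine the five steps is sketched below. It is not the exercise's reference solution: the `text` value is a hypothetical stand-in for the object loaded in the workspace, `clean_with_qdap()` is a made-up helper name, and the order of the calls is simply the order used in the instructions.

```r
library(qdap)

# Hypothetical stand-in for the `text` object from the previous exercise
text <- "She (finally) woke up at 6 A.M. and didn't feel 100% awake."

# Hypothetical helper that applies the five qdap cleaners in sequence
clean_with_qdap <- function(x) {
  x <- bracketX(x)              # drop bracketed text
  x <- replace_number(x)        # digits to words
  x <- replace_abbreviation(x)  # abbreviations to full text
  x <- replace_contraction(x)   # contractions to base words
  x <- replace_symbol(x)        # symbols to words
  x
}

cleaned <- clean_with_qdap(text)
print(cleaned)
```

Wrapping the calls in a single function keeps the cleaning order explicit and makes it easy to reuse the same sequence on other text.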