ComenzarEmpieza gratis

Cleaning with qdap

The qdap package offers other text cleaning functions. Each is useful in its own way and is particularly powerful when combined with the others.

  • bracketX(): Remove all text within brackets (e.g. "It's (so) cool" becomes "It's cool")
  • replace_number(): Replace numbers with their word equivalents (e.g. "2" becomes "two")
  • replace_abbreviation(): Replace abbreviations with their full text equivalents (e.g. "Sr" becomes "Senior")
  • replace_contraction(): Convert contractions back to their base words (e.g. "shouldn't" becomes "should not")
  • replace_symbol() Replace common symbols with their word equivalents (e.g. "$" becomes "dollar")

Este ejercicio forma parte del curso

Text Mining with Bag-of-Words in R

Ver curso

Instrucciones del ejercicio

Apply the following functions to the text object from the previous exercise:

  • bracketX()
  • replace_number()
  • replace_abbreviation()
  • replace_contraction()
  • replace_symbol()

Ejercicio interactivo práctico

Prueba este ejercicio completando el código de muestra.

## text is still loaded in your workspace

# Remove text within brackets
___

# Replace numbers with words
___

# Replace abbreviations
___

# Replace contractions
___

# Replace symbols with words
___
Editar y ejecutar código