Scrape an element based on its text
As shown in the video, the text() function also allows you to select elements (and their parents) based on their text. Here's an ordered list:
<h3>The rules of programming</h3>
<ol>
<li>Have <em>fun</em>.</li>
<li><strong>Don't</strong> repeat yourself.</li>
<li>Think <em>twice</em> when naming variables.</li>
</ol>
In this exercise, your goal is to extract the li element where "twice" is emphasized.
You might think that, here, it would be much easier to apply a CSS selector like li:last-child, but wait until you finish this exercise…
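To see why a positional selector can be fragile, here is a minimal sketch. It assumes rvest 1.0 or later (for html_elements() and minimal_html()) and uses a hypothetical reordered copy of the list, reordered_html, which is not part of the exercise. Once the rules change order, li:last-child no longer returns the item about naming variables.

```r
library(rvest)

# Hypothetical variant of the list: the third rule has been moved to the top
reordered_html <- minimal_html("
<ol>
  <li>Think <em>twice</em> when naming variables.</li>
  <li>Have <em>fun</em>.</li>
  <li><strong>Don't</strong> repeat yourself.</li>
</ol>")

# li:last-child picks an element by position, not by content
last_item <- reordered_html %>%
  html_elements(css = "li:last-child") %>%
  html_text()

print(last_item)
```

The selector still matches exactly one li, but which one depends entirely on the order of the list.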
programming_html contains the above HTML document.
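One possible way to select by text is sketched below. It assumes rvest 1.0 or later and rebuilds programming_html inline with minimal_html() so the snippet runs on its own; in the exercise the object is already provided. The XPath predicate [text() = "twice"] matches the em element by its text, and the trailing /.. step moves up to its parent li.

```r
library(rvest)

# Rebuilt here for illustration only; the exercise supplies programming_html
programming_html <- minimal_html("
<h3>The rules of programming</h3>
<ol>
<li>Have <em>fun</em>.</li>
<li><strong>Don't</strong> repeat yourself.</li>
<li>Think <em>twice</em> when naming variables.</li>
</ol>")

# Match the em whose text is exactly 'twice', then step up to its parent li
twice_li <- programming_html %>%
  html_elements(xpath = '//li/em[text() = "twice"]/..')

print(html_text(twice_li))
```

Because the selection depends on the text rather than the position, it should keep working if the list items are reordered.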
This exercise is part of the course Web Scraping in R.
Have a go at this exercise by completing this sample code.
# Select all li elements
programming_html %>%
  html_elements(___ = ___)