Houston, we got a 404!
As you've seen in the video, a fundamental part of the HTTP system are status codes: They tell you if everything is okay or if there is a problem with your request.
It is good practice to always check the status code of a response before you start working with the downloaded page. For this, you can use the status_code() function from the httr() package. It takes as an argument a response object that results from a request method.
Now let's assume you're trying to scrape the same page as before, but somehow you got the URL wrong (Varigott instead of Varigotti).
Deze oefening maakt deel uit van de cursus
Web Scraping in R
Oefeninstructies
- Read out the status code of the response object from the GET request.
Praktische interactieve oefening
Probeer deze oefening eens door deze voorbeeldcode in te vullen.
response <- GET('https://en.wikipedia.org/wiki/Varigott')
# Print status code of inexistent page
___