Get startedGet started for free

Joining questions and answers

1. Joining questions and answers

So far in this chapter you've been joining a dataset of Stack Overflow questions to their associated tags, which lets you understand what packages and keywords appear alongside R questions.

2. The answers table

Next, you'll look at the answers that are associated with each question. Stack Overflow questions are answered by other users on the site, and each could have zero or multiple answers, which means this is a "one-to-many" relationship.

3. The question ID

The answers table has an id, creation date, and score, just like the questions table, but it also has a question ID, which links to the questions table. This means we could join them based on those columns. There's a lot you can discover about this data, depending on the type of join you use and the ways you aggregate afterwards.

4. The joining verbs

Now that you're well practiced in joining tables, you'll be using the exercises to further adjust this join, such as adding suffixes to each column. You'll also use other joins and dplyr verbs to explore this question and answer data.

5. Let's practice!

Let's practice!