Joining and filtering duplicates
Another data.table has been loaded into your R session, framingham, containing a set of 35 genes and their associations with heart disease from an open access study by Joehanes R. et al. published in 2013. Your goal is to compare the results of this study to the results from each of your studies. The cardio_2 and heart_2 data.tables, filtered for missing values but still containing multiple probes for each gene, have been loaded into your R session. The data.table of reproducible associations you created in the last exercise, reproducible, has also been loaded into your R session.
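To see why multiple probes per gene matter for a join, here is a minimal sketch on made-up data. The table heart_toy and its columns gene and probe are hypothetical stand-ins, not the actual structure of heart_2.

```r
library(data.table)

# Hypothetical toy table: gene "A" is measured by two probes
heart_toy <- data.table(
  gene  = c("A", "A", "B", "C"),
  probe = c("p1", "p2", "p3", "p4")
)

# Count probes per gene and keep the genes that have more than one
probes_per_gene <- heart_toy[, .N, by = gene]
multi_probe <- probes_per_gene[N > 1]

# A default join returns every matching row, so gene "A" appears twice
all_matches <- heart_toy[data.table(gene = "A"), on = .(gene)]
```

Here multi_probe flags gene "A", and all_matches has two rows for it. That duplication is what the join in this exercise needs to avoid.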
This exercise is part of the course Joining Data with data.table in R
Exercise instructions
- Using the data.table syntax, right join framingham to heart_2, taking the first probe for each gene in heart_2.
- Anti-join framingham to reproducible to see whether you found any genes that have not previously been seen to have an association with heart disease.
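The two join patterns named above can be sketched on toy data as follows. All table names ending in _toy, their values, and the assumption that the join key is a column called gene are illustrative only; the real tables may use different column names.

```r
library(data.table)

# Hypothetical stand-ins for the exercise tables (values are made up)
heart_toy <- data.table(
  gene  = c("A", "A", "B", "C"),
  probe = c("p1", "p2", "p3", "p4")
)
framingham_toy   <- data.table(gene = c("A", "B", "D"))
reproducible_toy <- data.table(gene = c("A", "C"))

# Right join in data.table syntax: x[i, on = ...] keeps every row of i.
# mult = "first" returns only the first matching row of x for each row of i.
right_first <- heart_toy[framingham_toy, on = .(gene), mult = "first"]

# Anti-join: x[!i, on = ...] keeps the rows of x that have no match in i
novel <- reproducible_toy[!framingham_toy, on = .(gene)]
```

In this sketch right_first has one row per gene in framingham_toy, with probe set to NA for gene "D" because it has no match in heart_toy. novel holds only gene "C", the one gene in reproducible_toy that is absent from framingham_toy.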
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Right join taking the first match
___
# Anti-join
___