Exercise

Removing blacklisted regions

Identifying and removing peaks in blacklisted regions is an important step in preparing the data for further analysis. For this exercise, we use the blacklist included in the ChIPQC package. This is also available from ENCODE directly.

For the purpose of this exercise, peak calls are available in peaks, coverage data is in cover, and blacklisted regions are in blacklist.hg19. The findOverlaps() function will be useful here. You've encountered the concept of overlapping regions in the introductory Bioconductor course and we will revisit it later in this chapter.

It may take a moment to load all required data and R packages for this exercise. Please be patient.

Instructions

100 XP
  • Find all overlaps between peaks and blacklisted regions.
  • Plot read coverage, peak calls, and blacklisted regions using Gviz.
  • Remove all blacklisted peaks.