Annotating peaks
In the previous exercises, you created a table of merged peak calls from all samples (available here as peaks_merged
) and a table with gene annotations (available as human_genes
). Now it is time to combine information from both tables using the annoPeaks()
function provided by the ChIPpeakAnno package. For each peak that is sufficiently close to the start site of a gene (as determined by the bindingRegion
argument), this function will return detailed information on the gene in question in a data frame. This includes a column, insideFeature
, indicating where the peak is located relative to the gene.
Este exercício faz parte do curso
ChIP-seq with Bioconductor in R
Instruções do exercício
- Annotate peaks with the closest gene.
- Determine the number of peaks that were annotated with genes.
- Create a summary table indicating where peaks were located relative to genes.
Exercício interativo prático
Experimente este exercício completando este código de exemplo.
# Annotate peaks with closest gene
peak_anno <- ___(peaks_merged, human_genes, bindingType="startSite", bindingRegion=c(-5000,5000))
# How many peaks were found close to genes?
length(___)
# Where are peaks located relative to genes?
table(peak_anno$___)