Removing duplicates
It is good practice to check how many of your sequence reads are duplicates before you continue with the analysis.
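# Load ShortRead, which provides sread() and srduplicated()
library(ShortRead)
# dfqsample is a ShortReadQ object that contains duplicated reads
dfqsample
# Extract the reads from dfqsample
mydReads <- sread(dfqsample)
# Count duplicates: TRUE marks a read that repeats an earlier one
table(srduplicated(mydReads))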
How would you go about removing duplicated reads in a file? Pay attention to what the condition should be in this filter.
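One possible way to think about the filter is sketched below. Because `dfqsample` is only available inside the course environment, the sketch builds a small stand-in object, `fq_toy`, with made-up reads, qualities, and ids; these names and values are assumptions for illustration and are not part of the exercise. The idea is that `srduplicated()` returns `TRUE` for repeats, so the condition that keeps a read is that `srduplicated()` is `FALSE`.

```r
library(ShortRead)

# Hypothetical toy data standing in for dfqsample (not from the course)
fq_toy <- ShortReadQ(
  sread   = DNAStringSet(c("ACGT", "ACGT", "TTGA", "ACGT", "GGCC")),
  quality = FastqQuality(c("IIII", "IIII", "IIII", "IIII", "IIII")),
  id      = BStringSet(paste0("read", 1:5))
)

# TRUE for every read whose sequence already appeared earlier
is_dup <- srduplicated(fq_toy)
table(is_dup)

# Keep only the reads for which the duplicate flag is FALSE
fq_unique <- fq_toy[is_dup == FALSE]

# Inspect the sequences that remain
sread(fq_unique)
```

Writing the condition as `!is_dup` is equivalent to `is_dup == FALSE`. Applying the filter to the whole `ShortReadQ` object, rather than only to the extracted reads, keeps the quality scores and ids aligned with the reads that remain.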
This exercise is part of the course
Introduction to Bioconductor in R