Letters of the genetic code
The basic building blocks of RNA are four molecules described by a single letter each: adenine (A), cytosine (C), guanine (G), and uracil (U). The information carried by an RNA strand can be represented as a long sequence of these four letters. To read this code, one has to divide this chain into sequences of three letters each (e.g. GCU, ACG, …). These three letter sequences are known as codons. The concept is illustrated in the image below.
Your goal for this exercise is to create a data frame with all possible three letter sequences (codons) from a vector with the four letters representing the RNA building blocks.
This exercise is part of the course
Reshaping Data with tidyr
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
letters <- c("A", "C", "G", "U")
# Create a tibble with all possible 3 way combinations
codon_df <- ___
codon_df