CommencerCommencer gratuitement

Letters of the genetic code

The basic building blocks of RNA are four molecules described by a single letter each: adenine (A), cytosine (C), guanine (G), and uracil (U). The information carried by an RNA strand can be represented as a long sequence of these four letters. To read this code, one has to divide this chain into sequences of three letters each (e.g. GCU, ACG, …). These three letter sequences are known as codons. The concept is illustrated in the image below.

RNA code

Your goal for this exercise is to create a data frame with all possible three letter sequences (codons) from a vector with the four letters representing the RNA building blocks.

Cet exercice fait partie du cours

Reshaping Data with tidyr

Afficher le cours

Exercice interactif pratique

Essayez cet exercice en complétant cet exemple de code.

letters <- c("A", "C", "G", "U")

# Create a tibble with all possible 3 way combinations
codon_df <- ___

codon_df
Modifier et exécuter le code