-
Notifications
You must be signed in to change notification settings - Fork 0
Tutorial 1: BEAUti
Walter Xie (18 November, 2022)
This tutorial introduces the BEAST 2 CodonSubstModels package for codon substitution models through a simple tutorial. The tutorial involves understanding of models describing evolution at the codon level allowing the estimation of measures of the selective forces acting on proteins (Kosiol et. al. 2000).
You will need the following software at your disposal:
-
BEAST, BEAUti, TreeAnnotator and other utility programs. This tutorial is written for BEAST v2.7.x, which is available for download from beast2.org.
-
Tracer - this program is used to explore the output of BEAST (and other Bayesian MCMC programs). It graphically and quantitively summarizes the distributions of continuous parameters and provides diagnostic information. At the time of writing, the current version is v1.7.
-
FigTree - this is an application for displaying and printing molecular phylogenies, in particular those obtained using BEAST. At the time of writing, the current version is v1.4.3.
You also need a basic knowledge of using BEAST and BEAUti. The tutorials are available from beast2.org/tutorials or Taming the BEAST website.
The CodonSubstModels
package can be installed from BEAUti's package manager (File => Manage Packages
), using the Install/Upgrade
button. Please remember to restart the BEAUti after you install the CodonSubstModels package.
The CodonSubstModels
template has to be used (File => Template => CodonSubstModels
) before you are going to setup an analysis from the BEAUti. Make sure the title of BEAUti on the top is changed to "CodonSubstModels".
Load the example NEXUS file codon.nex (click to download) from File => Codon Alignment
.
After you choose the file, there will be a pop-up menu to ask the genetic code. This dataset contains nucleotide sequences of 2 humans, 2 chimpanzees and 2 gorillas, so we need to choose Vertebrate Mitochondrial
.
All codes are defined by the standard genetic code tables from GenBank. Please make sure you are selecting the correct genetic code, otherwise some sequences may incorrectly translate into stop codons (*). As a consequence, it causes the exception in BEAST.
If the data is valid, you will see the partition appeared in the panel. This example dataset has 6 taxa and 333 triplets. The data type is codon
.
Now choose the Site Model
panel to specify codon substitution model for your analysis.
At the moment, only M0 is available, which is also known as GY94, first published by
Goldman & Yang 1994,
and late designated to be one of the M-series (M0-M13) proposed by
Yang et al. 2000.
We keep both omega
and kappa
the same as the default setting.
The type of equilibrium codon frequencies πj consists of
"equal", "F1X4", "F3X4", and "F60/F61"
(Yang 2006).
We choose F60/F61
which is also the default value for the codon frequencies model.
The rest of models are using the same models in the standard BEAST analysis including the clock model. In this analysis we are going to leave the selection at the default value of a strict molecular clock
.
Then, we simply change the tree prior to Coalescent Constant Population
, and keep the other priors as the default.
In this exercise, setting 500,000 for Chain Length
and 1,000 for Log Every
will be good enough to provide reasonable ESS. You can also rename the log to m0.log
and tree log to m0.trees
.
Save the xml in the end.
Next -> BEAST