In this video we'll look at the evolution and growth of types. The types can be anything - it can be genotypes in the population or molecules in a test tube. At first, let's look at the growth of a single type. So, x represents how many of this type we have - how many molecules, what's the concentration or anything. w is the growth rate of this type, and if we go from one time step to the next, x prime is... the concentration of the number of molecules in the next time step. If we want to look t time steps into the future, xt, we simply take w to the power of t, so xt is equal to w to the power of t times x zero. Let's look what happens when we do that. So I'm taking w to be 1.1 and I start with x0 equals 1. We see the growth of this single type. This type increases from 1 to 10 and further. And now, let's see what happens when we have a growth rate lower than one. So, let's take a growth rate of 0.9. Everything else is the same. Now we see that the type - of course - decreases in frequency. It starts at 1 and then exponentially decreases towards 0. This decrease below one is called the "extinction threshold." When a type - when a molecule - is below the extinction threshold, it simply will disappear. Most molecules that we have around us have this property, and only molecules that really replicate themselves as high-fidelity can go above this extinction threshold. The extinction threshold is separate from the error catastrophe. The error catastrophe is what we'll talk about here. The error catastrophe doesn't happen when a type goes extinct, but instead, when it cannot be maintained versus other times. We can represent more than one type as a vector in a population, so let's look at two types. x is now a vector. We have 1 of type 1; 1 of type 2. And, when x is a vector, we can simply represent the gross as a diagonal matrix. W is the growth matrix and here I'm taking 1.1 and 0.95. We go from one time step to the next simply by taking x prime equal to W, the matrix times x, matrix multiplication. If we multiply W times x, you see we get 1.1 and 0.95. So, type 1 grew by 10 percent; type 2 decreased by 5 percent. If we want to apply W twice, we apply W times W times x0, and you see that we put W on the left. So, we add more and more Ws on the left, and this is because I chose to take x to be a column vector. If I had taken x as a row vector, I could have put x on the left and then we would have added Ws on the right. We can take a look at x at time t by simply taking the matrix power of W to the power of t times x0. Let's do that. So, we start again with one one. We take the W matrix that we had before, and here you see the growth of the two types. Type 1 really increases exponentially in frequency, type 2 decreases because its growth rate is lower than 1 and goes to 0. So, type one took over the population. You might think that now this describes any population. If we add mutation, then we would just have type 1 as the wild type, as the type that is present. And, all other types just are represented as small mutants. It turns out that this is not exactly a right description of a species in most cases. In most cases, a species will not simply be represented by one type that is the optimal and all other types that appear at really low rates. Let's plot this plot that we had here, and log as a log plot. Now we can see that type 1 increases exponentially, which in a log plot is a straight line, and type 2 decreases exponentially. Let's add mutations. So now, I add a mutation rate of 10 percent. The... mutations are simply represented by stochastic matrix M, where - in this case - type 1 maintains itself with a probability of 1 minus mu and turns to type 2 with probability mu. Type 2 doesn't mutate - it stays as type 2 and doesn't mutate to type 1 at first. So, we take this M, we start again from the same x0 as we had before. And now, let's see what happens when we mutate. So, since type 1 mutates to type 2, now you see there is less of type 1 and more of type 2. Now we want to represent the dynamics with both mutation and growth. We can simply do this by multiplying the matrices W times M. It's also possible to multiply M times W in a different order, and this simply says which is done first - gross or mutation. Since gross follows mutation, follows gross, follows mutation - it doesn't really matter in which order you do it. I simply chose W times M because the algebra is easier. If we take W times M the matrix and diagonalize it, it's easier to look at the matrix power. So, if W times M is equal to V times D times V minus one, where D is a diagonal matrix and V is the eigenvector matrix of all the eigenvectors of the matrix W times M, it's easy to calculate W times M to the power of T because it's simply V times D to the t times V to the minus 1. D to the t is dominated by the largest eigenvalue just like we saw before - that type 1 grew at a higher rate than all the other types - than type 2. When D is diagonal, it will be dominated by the largest positive eigenvalue. Let's look at this. So, now I'm taking a mutation rate of one percent and the same M matrix as we had before. And, let's look at the growth. So now, with mutation, you see that type 1 increases exponentially just like it did before. Type 2 at first decreases, just like it did... also like it did before, but then it starts increasing - this is a bit weird. Let's continue the graph a bit further. So, instead of doing just 50 steps, let's do 150 steps. What you can see is type 2 decreases first, but then starts to also increase exponentially at the same rate as type 1. Let's look at the eigenvector that corresponds to this growth. So, you see that the matrix W times M has two eigenvectors and with two corresponding eigenvalues. One eigenvector is zero one and the other one is an eigenvector where both type 1 and type 2 are present, which has a growth rate of 1.089. So it has a growth rate of higher than 1, whereas the other one has a growth rate of lower than 1. Therefore, this is the dominant eigenvector. And, this distribution where both type 1 and type 2 are present is called a "quasispecies." So, we see that instead of just having one type in a population with very few... mutants, both types are present and grow exponentially. Let's change the mutation rate. Instead of having a mutation rate of 1 percent, let's change it to 8 percent. Now you see both types grow exponentially, but type 2 - the green type - is now present at a higher frequency than type 1. So, the quasispecies is now mainly composed of type 2 with very few type 1. Let's see what it looks like in the eigenvalue. Again, you see there's two eigenvectors with two corresponding eigenvalues - the largest eigenvalue is this 1.01. And, it has both type 1 and type 2 where type 2 is present at a higher frequency. If we increase the mutation rate even further - let's go to... 0.2... Now, you see.. that both types don't increase - they decrease. Type 1 decreases exponentially at a faster rate and type two decreases exponentially probably with a rate of 0.95. If we now look at the eigenvalues, you see again there's two eigenvalues and two eigenvectors. The larger... eigenvalue this time is the 0.95. It's corresponding eigenvector is this one that has only type 2 present and not type 1. The other eigenvector has a negative component of type 2, so it can't actually exist. This crossing where type 1 is lost and type 2 is the only one that is present in the population at a high enough error rate is what we call the "error catastrophe." Let's look at this a bit further. So now, I will want to plot instead of going manually through different values of S. Let's just go through a large sequence of S and plot what happens with them. So, here's what we get. On the x-axis... actually, I'm on modified mu, so the x-axis represents mu - the mutation rate. And, the y-axis... is the largest eigenvalue. So this is the growth rate of the quasispecies - of the types - that grow to the highest of the various rates. So you see when S is very low we have a growth rate very close to 1.1, and the type that has both type 1 and type 2 increases exponentially. This eigenvector, as mu increases, goes lower and lower, until at this point it actually crosses and becomes lower than the other eigenvalue - the eigenvalue with a growth rate of 0.95. At this point, type 1 will disappear from the population and only type 2 will be present. So, this transition is what we call the "error catastrophe" - the mutation rate is too high. In the mutation matrix that I had before, I only let type 1 mutate. So, type 1 mutated into type 2 but type 2 did not mutate back to type 1. This is where the error catastrophe is strongest. If we allowed back mutation - so, here I'm now working with a matrix where type 2 can mutate to type 1 with a low rate of mu divided by a hundred. Let's see what happens then. I plot... oops! I plot exactly the same... I plot exactly the same as I did before, and now you see that the two eigenvalues don't cross. This eigenvalue decreases and then becomes almost 0.95, and this eigenvalue stays at 0.95 and then decreases almost linearly, but they never cross. The reason is that type 1 is still present in the population - it's simply... generated by mutations from type 2. So... there is not a full crossing, and therefore, it's not a real phase transition. Let's derive this error catastrophe when it happens. So, we have now M without the back mutation one minus mu mu, and W with a growth rate of w for type 1. We multiply the two matrices and this is the result of the multiplication. So, we simply take 1 minus 2 mu times W, mu times 0 and so on. This is the result. Now, in order to calculate an eigenvector, we simply multiply this by a vector, 1 alpha, and we want to calculate when multiplying this vector times this matrix will give us a vector that has exactly the same ratios. This will be an eigenvector of this - of the matrix W times M. In order... for it to stay at the same ratio, the ratio between this and this has to be like the ratio between 1 and alpha. Therefore, mu plus alpha divided by 1 minus mu times w has to be equal to alpha. We can simply do a bit of algebra and then we get that alpha has to be equal to mu divided by w times 1 minus mu minus 1. So, in order for alpha to be positive in order for there to be an eigenvalue where both types can be represented, or both type 1 and type 2 are present, this needs to be positive. For this to be positive, the denominator needs to be positive, which means that w times 1 minus mu has to be bigger than 1. Another way to write it is to write w as 1 plus s where s is the growth rate beyond 1. And then, we need to have 1 plus s times 1 minus mu is bigger than 1, which means that for small s and small mu, s has to be approximately bigger than mu. So, the error threshold happens when the growth rate is bigger than the mutation rate or when the mutation rate is low enough. Let's switch now to a representation of the error catastrophe as it was originally introduced by Manfred Eigen and Peter Schuster where they looked at... instead of just looking at two types, they looked at the genome of length L, which means that they had four to the L types. And, I assume for simplicity that there is one optimal sequence: A-A-G-C and so on. But, I would like to be able to talk about just a binary sequence instead of four bases. So, let's look at the binary genome in which 1 means that the sequence is identical to the optimum and zero means that it's different from the optimum. Let's look at the per site mutation rate instead of a global rate. So, nu is now the mutation rate per site. And, just for simplicity, I assume it's the same to go from optimum to non-optimum and back. So, now we look just at ones and zeros, and... we assume that there's just one mutation per time step. So, nu is low enough in L versus L so that there's only one... mutation per time step. We'll call i the number of zeros, and we'll simply look at the types as the number of zeros that... we have in the sequence. So, i represents number of zeros; L minus i the number of ones. Now we can calculate the probability to go from i zeros to i plus 1. In order for that to happen, a mutation needs to happen, so it's L times nu. And then, the mutation needs to hit a 1 and mutate it into a 0. So, it's L minus i times L... times L nu, which gives us simply L minus i times nu, and similar for decreasing one... the number of zeros. Let's see what it does. So, we have a nu of 1 percent and a length of L, L equals 10. And now, we need to build the mutation matrix. We assume that we can only mutate one up or one down, and I simply take the expression that we just calculated. This gives us the following matrix. You can see that the matrix is different from 0 only along the diagonal and one step away because we only allow one mutation. Now we can also build the... growth rate matrix. It's diagonal and only has one entry that's different from 1. So, I take a growth rate of 1.3, which is for type 1. Now, we can look at the maximal eigenvalue - the eigenvector with the maximum eigenvalue for this matrix W times M, and we see that type 1 is represented at the highest frequency and other types also represented. So, this would be the quasispecies in this case. The population would be represented by the optimal type - types with one mutation, two mutations and so on. What happens now when we increase the mutation rate? Sorry, when we decrease the selection pressure. Let's go now to... a benefit of 0.08, so a growth rate of 1.08. You can see that now the optimal type is represented with quite a low frequency and all the other types are represented quite highly. In order to get a better overview of what happens, let's look at various ss. So, we have plotted four different ss. At sequence 0.3 - like we had before - you see that type 1 has the highest representation, one mutation away a bit lower and so on. s equal to 0.2 - it spreads out a bit more. s equal to 0.1 - it's very far spread. At 0.09, you see that type 1 is already at a fairly low frequency. And, the highest type is the type that you would get mostly if you did just a random coin flip with five ones and five zeros. And then, as s goes to 0.07, you see that type 1 is almost totally non-present, and only this type is represented. We can also represent it as a graph. In a graph, here on the x-axis, is s. s goes from 0 to 0.8, and the y-axis is the frequency of the various types. Black represents the optimal type - this is the type with one mutation, two mutations, three mutations and so on. You see that when the benefit is 0.8 - so growth rate is 1.8 - type 1 is represented at the highest frequency. As s goes down, the representation of type 1 decreases and of the other types increases, until we reach this threshold where type 1 almost disappears from the population and the highest representative type is the type that has five mutations. This is simply at this point... we simply look at coin flips. So, then the selection... doesn't affect the population anymore. Let's calculate where this transition happens. So, we said that the chance to go from i to i plus 1 is L minus i times nu. Now, we want to ask - can we maintain the zero type? So, when i is equal to 0. We want to maintain zero type versus going from 0 to 1. Therefore, we look at just a transition rate from 0 to 1. That rate is L times nu. An imitation will increase the number of... zeros. So, what is now the error catastrophe - the global mutation rate - not precise, but for the whole genome now to go from 0 to 1, is L times nu. And now, we will therefore have the same error threshold as before. s needs to be bigger than mu, which is L times nu. So, the error threshold will happen when s is bigger than L times the per site mutation rate. This is a very nice expression. Eigen and Schuster used it to calculate an error threshold for how good can a molecule replicate. So, if a molecule has a replication of... with a fidelity of one percent, it cannot code for this fidelity of one percent with a molecule that's larger than 100 bases. If it has one per mille, it can't code it with a molecule larger than a thousand bases. This shows that in order to... you need to cross the threshold of fidelity with a short enough molecule. This expression can also represent what it means to be neutral versus non-neutral. How big does the difference have to be between one type and another for evolution to be able to maintain this difference? In order to look at this, let's locate... the population that has not just a difference between the best type and all the others, but instead there's a... a stairs between one type and the next. So, the highest type has zero mutations, then the next highest growth rate is one mutation, two mutations and so on, with some kind of... randomly chosen growth rates. The difference - this threshold between type 0 and type 1 or between type 1 and type 2 - I've called Delta s, the change in growth rate. And now, we need to be able to maintain this difference versus the mutation. If we look at the change in growth rate from i to i plus 1 if that is Delta s i, we want to ask - when can this be maintained versus the mutation? And, what needs to happen is that Delta s i needs to be bigger than L minus i, the chance to go from i to i minus 1... from i to i plus 1 times nu. So, we can take a random example. Here I simply choose random ss with random... steps, so the x-axis in black represents the Delta s i, the step from... the change in growth rate from i to i plus 1. And, the red line represents L minus i times nu. This is the effect of the mutations. In order to be able to maintain a certain number of mutations, the black line has to be above the red line. So, this means that this population can maintain this point versus the effect of mutation, whereas it cannot maintain these points. So, the difference between this and this, as far as this population is concerned, is neutral. This point can be... can be taken...can be maintained, but this point again cannot be maintained. So, we can see that the mutation rate defines what is seen as neutral versus non-neutral. And, the distribution in a population has to be above a narrow threshold, which can be crossed - as you see here - several times. Thank you.