Lead partner: ALife Group, Dublin City
University
Dynamics of Informazyme Systems
Assumptions
In this section we present a
theoretical analysis of some generic dynamics which are possible
in a limited class of informazyme systems. The analysis is based
on the following assumptions:
- A fixed size flow reactor containing a population of
informazyme molecules. The reactor is buffered to maintain ready
availability of free monomers. Dilution outflow is matched to
the reaction rate to ensure a fixed maximum number of molecules
(informazymes).
- In this first instance, the only enzymatic reaction
considered is replication of a (recognised) template. Thus, an
informazyme can act as a replicase for all, and only, those
molecules which it can bind to, as determined by its
“active” or “folded” conformation.
- We assume identity (as opposed to complementary)
replication. This is analogous to, say, DNA (double-stranded)
rather than RNA (single-stranded) replication.
- We assume the reactor is well-stirred, i.e., every molecule
has an equal rate of collision with every other molecule; and
that, in any specific collision, each molecule has an equal rate
of participating as replicase or template.
- Recognition is considered to be all-or-nothing. Once
recognition takes place, replication rate is taken to be the
same for any informazyme acting on any recognised template
(regardless even of template length).
- Replication is error-prone—i.e., the produced molecule
will not always match the template exactly in sequence. However,
in general, we shall first analyse the case where such mutation
is neglected; and then attempt to consider the effect of
mutation as a perturbation of this “underlying”,
mutation-free, dynamic.
These assumptions all represent significant simplifications
of any practical replicator chemistry; so that the dynamic
possibilities of real replicator systems would presumably be
wider than discussed here. Nonetheless, as will be demonstrated,
this simplified replicator framework is sufficient to support
the core phenomena of heredity and evolution in protocell
populations; and has the key advantages of being tractable for
useful theoretical analysis, and for simulation over
evolutionary timescales with practical computational resources.
Note in particular that this approach deliberately preserves
what has already been mentioned as a key evolutionary constraint
of synthetic proto-cells, as compared to modern living cells;
namely that they lack a translational sub-system that can allow
de-coupling of the space of genetic (informational) variation
from the space of phenotypic (enzymatic) variation. The
immediate challenge is to demonstrate that some meaningful
protocell level evolution can be demonstrated despite this
significantly impoverished hereditary mechanism. A longer term
goal (beyond the scope of the immediate investigation reported
here) is to explore precisely the evolutionary emergence of
translation.
General Dynamic Equation
Let

denote the rate with
which species

,
acting as replicase, will replicate species

. Given our assumption of
all-or-nothing recognition, and equal replication rates for all
cases where recognition takes place, we can say, without loss of
generality, that every

(“replication co-efficient”) will have a
normalised value of either

(no recognition) or

(recognition).
Under a continuous
approximation, we will denote the concentration of any species

in the reactor by
![x_i \in [0,1]](wp2-dcu-img/wp2-dcu7x.png)
(with

). Applying mass-action
kinetics, the replication rate of each species will be:
The
total replication rate for the reactor will be:
Under the condition of fixed molecular
population, this must equal the total dilution outflow rate, and
the dilution rate for each individual species will be:
Neglecting mutation (in the first instance) the dynamics
of the reactor is then governed by the following system of
ordinary differential equations (ODE) in each

:
Under the conditions specified, we are therefore dealing
with various cases of a catalytic reaction network, in the sense
of
Stadler
et al. (
1993). To actually examine
the concrete dynamics of any given reactor, we must specify the
total number of species,

, and the corresponding

matrix of
replication co-efficients

. Thus, even with the strong simplification of

, for any given

there are

distinct possible
systems. Exhaustive analysis rapidly becomes infeasible. We
shall limit our detailed analysis to considering all possible
cases for

.
This provides a repertoire of “core” behaviours. We
shall augment this with more qualitative discussions of how
these core behaviours may be generalised or combined for systems
with

and/or perturbed by replication error (molecular level
mutation).
Terminology
Although we are discussing
molecular level evolution, it will be convenient to use the
following, ecologically based, terminology. Consider two
distinct molecular species

where

:
- If
we
say this is a self-replicase; otherwise it is self-inert.
- If both
and
we will say that
and
are mutualists relative to
each other.
- Where
but
we
will say that, relative to each other,
is a host and
a parasite.
- A mutualist, host or parasite may be said to be facultative
if it is also a self-replicase; otherwise it is obligate.
- If
we may also say that
is inert to replication by
(and vice versa); if both
and
we say
they are mutually inert.
“Self-”systems
For
“self”-systems we have

; that is, there is just
one species with concentration

, one co-efficient,

, and two
possible systems:
: The
species is self-inert. Given that this is the only species
present, then there are no reactions, and the system as a whole
is completely inert (
,
).
: The
species is a self-replicase. Given that this is the only species
present, then although there is constant turnover at the maximum
rate (
), in the
absence of mutation this just produces identical replacement
molecules so we still have
. (The concept of mutation necessarily
requires
, so it is not meaningful in systems restricted
to
.)
Pairwise Systems
For pairwise systems we have

; that is, there are two
species with concentrations

and

; and their
interactions are represented by the matrix:
As

, there are exactly

possible such
pairwise interaction matrices. However, some of these are
equivalent under a relabelling of

and

; as a result there are
just 10 dynamically distinct classes of pairwise system. All of
these will be considered in turn. In each case, we shall
initially neglect mutation, so that the reactor operates under
the constraint

, and the (approximate) dynamics is fully
characterised by the single differential equation:
We shall also be interested in the overall replication
rate, given by:
For each distinct class we shall
present the relevant pairwise interaction matrix or pair of
equivalent matrices, the resulting simplified expression for

,
optionally a graph of

, and a brief discussion of the resulting dynamic
behaviour including example trajectories as
appropriate.
Finally we shall return to re-consider the
general case (

) as potentially approximated by a simple
superposition of pairwise systems (

), all instantiated in
the same reactor at the same time.
Class 0
Pairwise interaction(s):
Differential equation:
As there are no
reactions at all, the reactor is completely inert. There is no
turnover, and the rate of change of both species concentrations
is, trivially, 0. In other words, the reactor remains exactly as
it is initialised. In fact this particular result holds for
arbitrary numbers of species (assuming they are all individually
and mutually inert, i.e. all

), and regardless of
whether mutation is in effect (since there is no replication
happening in the first place). This is a degenerate case of
limited practical interest; however, we shall see that there are
circumstances in which this generic situation (of a completely
inert molecular population) is a possible fixed point of the
long term (evolutionary) reactor dynamics.
Class 1
Pairwise interaction(s):
Differential equation:
The two molecular
species are (facultative) mutualists. By symmetry alone, the
rate of change of both species’ concentrations is again 0,
regardless of current state. However, in contrast to class 0,
there is continuing turnover of the reactor contents at the
maximum possible rate (

); accordingly, this would predict, under stochastic
conditions, that there would be slow random drift in the
relative concentrations, with eventual fixation of one or the
other. However, because the intrinsic dynamic here is
essentially neutral, then mutation, even at a low level, would
significantly modify this behaviour, effectively introducing
negative frequency dependent selection. In the simplest case,
with just two possible species, each having an equal rate of
mutation to the other, this would actively stabilise the state

against
drift.
There would still be stochastic fluctuation
around this state, with the level of fluctuation inversely
related to the mutation rate (and, of course, the population
size). With a somewhat larger number of mutationally related
species, still all being pairwise facultative mutualists, this
result may generalise to a stable equilibrium distribution
across all species in the set (though with continuing
replication/turnover at the maximum rate). Such a family of
species would be somewhat analogous to the concept of a
quasi-species in (externally catalysed) replicator systems
(
Eigen
et al.,
1989). If all species have equal
inbound and outbound mutational pathways and rates, within this
set, then this simplifies to a uniform distribution (i.e., all
species having concentration

, where

is the number of species).
This analysis is subject to the reactor size being large enough,
relative to the number of distinct species in the set, that the
equilibrium number of molecules of each species is still
significant—i.e., so that all species can be
simultaneously represented. In particular, we will require all

where

is the fixed
total number of molecules in the reactor. With an approximately
uniform distribution over

species, this means we require

. With a
combinatorially large family of mutant species, as is typical of
modular replicators, this may not, in fact, be the case. In
summary then, this form of interaction, under mutational
conditions, tends in itself to lead to diversification into a
potentially extensive family of mutualists (a quasi-species). If
this mutationally accessible family of common mutualists is
small then there may be stable co-existence; otherwise, because
(by hypothesis) there is no “master” sequence with
some significant selective advantage, there may be continuing
ongoing drift of the quasi-species distribution (denoted say, by
the instantaneous consensus sequence) with only a varying subset
of the family being represented in the reactor at any given
time.
Class 2
Pairwise interaction(s):
Differential equation:
One molecule is a self-replicase and the other is completely
inert. Accordingly, neglecting stochastic effects at very low
absolute numbers of molecules, such a self-replicase will
essentially deterministically invade and displace the inert
species, even if starting from arbitrarily low initial
concentration. This formally follows from the fact, noted above
and in figure
1, that

(where

is taken
as labelling the self-replicase). The growth of the
self-replicase follows a classic “S-shaped” (though
not technically logistic) curve, with a sharp, positive
feedback, “switching” dynamic, as shown in
figure
2. Note that, as the
self-replicase takes over the reactor, the total replication
rate (reactor flow rate,

) also rises to the maximum possible (

).
In this case, even with relatively high mutation rates (but
assuming that all mutants are still individually and mutually
inert), the core dynamic of growth of the self-replicase to the
maximum sustainable concentration is very robust; the only
modification being that this maximum concentration is then
somewhat less than

due to the ongoing mutational load. To analyse this, let

still denote
the concentration of the self-replicase species. If we let the
per-molecule mutation rate be denoted by

, then the dynamics is
represented by:
Setting

, and neglecting the case of

, there is a single
fixed point at:
For

we have

and for

we have

;
accordingly, this is a stable fixed point, and the
self-replicase stably maintains itself at the concentration

. This will hold
as long as the ODE approximation is valid (i.e., even for large

, provided the
absolute number of molecules of

is sufficient to mask
stochastic fluctuations).
Class 3
Pairwise interaction(s):
Differential equation:
Neither species is a self-replicase, but one can act as a
replicase for the other; so the latter is an obligate parasite
of the former (which is classified as an “obligate
host” in our terminology). The functional replicase
(

) is then
effectively a finite initial resource which is irreversibly
“used up” and decays to zero concentration, given
the initial presence of even an arbitrarily small concentration
of the parasite. The overall replication rate may initially rise
(as more “template” molecules,

become available for
replication), but then as the replicase concentration declines,
the replication rate necessarily peaks and then enters a
monotonic decline. As shown in figure
4, over a moderately extended period of decay,
the reactor asymptotically approaches a state containing only
species

, and
thus ultimately becomes completely inert (effectively as already
discussed above under the case of class 0 interactions).
Class 4
Pairwise interaction(s):
Differential equation:
One species
(

) is a
self-replicase, and the other is not; which, so far, is similar
to class 2. However, in the class 4 case, the self-replicase
will also act as a replicase for the other molecule, so there is
a host-parasite relationship as in class 3. In this case we say
the self-replicase is a facultative host for the obligate
parasite. The situation is not as good for the host as in class
2, as some of its replicase activity is “wasted” on
the parasite; nor yet as bad as class 3, because at least some
of its replicase activity is still spent on replicating itself.
In the ODE approximation these effects exactly balance out, and,
as shown above,
![\dot {x}_1 = 0 \; \forall {} x_1 \in [0, 1]](wp2-dcu-img/wp2-dcu113x.png)
. Superficially,
this is formally similar to class 0 and class 1, but the
practical dynamic is again quite different from both of those
previous cases. Although

at all concentrations, the total
replication rate now varies directly with

:
so if

drifts, then total
replication rate will also “drift” in sympathy. It
is under the assumption of non-zero mutation rate that this case
becomes most distinct. The behaviour will be somewhat similar to
that of class 1, in that the “master” sequence will
tend to diversify, through mutation, across the available
family, or quasi-species, of class 4 mutants; with some
equilibrium distribution over this set (if the mutational
pathways and rates are uniform over the set, the equilibrium
concentration of all species will be equal at

). But unlike the case
with class 1, this diversification now means that the overall
replication rate also declines; with uniform distribution in the
concentrations, this would ultimately be to an equilibrium rate
of only

. If,
as discussed for class 1, this family of mutants is actually
combinatorially large, then we may have

(or indeed
greater), with the “equilibrium” concentration
representing less than a single molecule of functional
replicase. In other words, under mutation, this class is more
likely to share the decline toward complete inertness described
for class 3. This decline may be relatively very slow (if the
the mutation rate is low); but would still be essentially
monotonic.
Class 5
Pairwise interaction(s):
Differential equation:
One species is a self-replicase and also a parasite of the other
species; the latter is self-inert, and thus acts as an obligate
host. As with class 2, the self-replicase has a positive rate of
growth at all concentrations

(see figure
5) and will
essentially deterministically invade and displace the second
species, even if starting from arbitrarily low initial
concentration. The difference here is that, even if the
self-replicase itself is initially in low concentration, it will
still achieve an exponential replication rate (because it
receives replication support from its obligate host, which is,
by assumption here, in high concentration). Thus the
displacement can be initiated more rapidly than for class 2 (see
figure
6); in this specific
example, from the same starting concentration of 0.05, the
takeover is complete by time

for class 5,
compared to

for class 2. Formally, the growth law for the
class 5 case is strictly logistic.
The overall rate of replication for this case is simply

, as it was
in class 3; but as

now grows rapidly to take over the reactor, the overall
replication rate similarly rapidly grows to the maximum possible
and there is then continuing turnover at this rate. As with
class 2, even with relatively high mutation rates (but assuming
that all mutants are still obligate hosts of the
self-replicase), the core dynamic of growth of the
self-replicase to the maximum sustainable concentration is very
robust. The differential equation taking account of this
(restricted) form of mutation is:
where

is
the per-molecule mutation rate (to class 5 mutants). Again, this
has a stable fixed point at

. Indeed, this
result would hold for any mix of mutants that are all in class 2
or class 5 relative to the “original” self-replicase
(and are otherwise inert).
Class 6
Pairwise interaction(s):
Differential equation:
Both species are self-replicases; but one (

) is a (facultative)
parasite of the other (

), which acts as a (facultative) host. As with class
3, we have

, so

will
inevitably decline. However, in this case, instead of the
reactor asymptotically approaching a completely inert state, it
is simply taken over by the other self-replicase species,

, which will
then continue to replicate at the maximum rate. This is a
properly “selective”, quasi-deterministic,
displacement of one self-replicase by another, because, at all
relative concentrations, the latter achieves a higher
replication rate. The displacement will take place even if

initially has
essentially arbitrarily low concentration. Figure
8 shows an example trajectory for

. While there is
an initial period of slow growth, the parasite does inevitably
achieve a “critical” concentration (say

at

in this
example) after which the displacement is then completed very
rapidly.
Note that although we describe this as properly
“selective”, or “Darwinian”,
displacement, there is no intrinsic fitness difference between
the two species here. That is, if examined in isolation from
each other, both species show exactly the same dynamics. This is
by deliberate assumption in the current analysis. It is only
when incubated together that the asymmetric
“host-parasite” interaction between them means that
one is consistently favoured over the other, and can
successfully invade from rarity. This case gives rise to a very
distinctive, evolutionary dynamic at the molecular level. In
essence, if there is continuing generation of mutants which are
class 6 relative to a currently dominant self-replicase, we can
predict the possibility of a potentially indefinitely extended
series of displacement events. At least, this will be so,
provided the rate at which these mutations arise is not too
rapid, so that there is time for each displacement event to
complete before another one is initiated. The details of this
evolutionary dynamic will be determined by the pattern of
mutational connectivity. As will be discussed subsequently, this
case is of key significance to the investigation of protocell
evolution (at least within the simplified model being considered
here). This is because this situation allows a reactor to
“switch” from being dominated by one informazyme
species to being dominated by a different informazyme species.
More generally, when the informazymes are contained within a
protocell instead of a static reactor, the dominant informazyme
species will be both heritable (at the level of protocell
reproduction), and potentially coupled to some significant
protocell level trait. A Class 6 “molecular
evolution” displacement may “propagate
upwards” in organisational terms, to manifest as a single
“mutation event” at the protocell level, which can
then be the target for protocell level selection. Indeed, this
is the only case, under the specific protocell modelling
framework analysed here, where the molecular level dynamics can
reliably give rise to such a protocell level mutational
event.
Class 7
Pairwise interaction(s):
Differential equation:
One species is a self-replicase; the other is an obligate
(self-inert) parasite of it. However, unlike class 4, the
parasite also functions as a host for the self-replicase. The
means that the (parasitic) replication service provided by the
self-replicase to the self-inert species is offset by the
(parasitic) replication service provided by the self-inert
species back to the self-replicase. As with class 2 and class 5,
the self-replicase will essentially deterministically invade and
displace the other species, even if starting from arbitrarily
low initial concentration. The duration of the transition is
essentially intermediate between classes 2 and 5, as shown in
figure
10 (compare with
figures
2 and
6, all based on the same example initial state
of

).
Also as with classes 2 and 5, this behaviour generalises to
relatively high mutation rates (assuming that the mutants are,
similarly, both obligate parasites and obligate hosts of the
self-replicase). Neglecting back-mutations (from the mutants
back to species

) the differential equation becomes:
with a stable fixed point at

. Thus,
the self-replicase will still reliably grow to dominate the
reactor, but must carry a significantly larger ongoing
mutational load than in the cases of class 2 or class 5. This
may be qualitatively understood as arising from the fact that
the mutants benefit from a degree of amplification by the
self-replicase, which is not the case for the other two classes.
We can reasonably infer that, in the case of a mix of mutants of
class 2, 5 and 7, this growth and dominance of the
self-replicase will still be observed, with a steady state
concentration somewhere between

and

.
Class 8
Pairwise interaction(s):
Differential equation:
Neither species is a self-replicase, but each can act as a
replicase (obligate host) for the other (obligate parasite).
Setting

, and neglecting the case of

, there is a single
fixed point at:
Of course, we also
therefore have

in this state (which would follow from the
symmetry of the situation in any case). As noted above, for

we
have

and for

we have

. Accordingly this is a stable fixed point.
The reactor will relax back into it, even under essentially
arbitrarily large perturbation. Figure
12 shows an example trajectory, initialised
from

. While not particularly significant
to the investigation here, this case is equivalent to the
canonical situation of single stranded RNA replication where
each complementary strand effectively serves as a catalyst for
the production of the other. More generally, this case is
formally equivalent to a two-component hypercycle (
Eigen and
Schuster,
1977). It can clearly be
generalised to a hypercycle containing an arbitrary number of
components (say

); and in that case, the stable fixed point will be with
equal concentrations (

) across all components (assuming, as usual, that the
absolute number of molecules is still sufficient to mask
statistical fluctuation, i.e.,

). However, this
situation changes significantly once mutation is introduced. In
this case, a hypercycle is always vulnerable to collapse. There
are several scenarios here; but, in general, any mutant species
which is able to exploit any one of the components of the
hypercycle as a replicase, without passing this support on to
the next component of the hypercycle, is a potential trigger for
such degeneration.
Class 9
Pairwise interaction(s):
Differential equation:
Both molecular species are independent self-replicases. In a
certain sense this is exactly complementary to class 8. The
expression for

is precisely the negation of that in class 8.
Accordingly, the dynamics has exactly the same fixed points: the
two states of

,

where one or the other species is no longer present, and the
state

.
However, the latter state is now unstable rather than stable.
The effect is that any perturbation means the system rapidly
collapses into a state where one species displaces the other.
Figure
14 shows an example
trajectory initialised from

.
In effect, there is positive frequency dependent selection, so
that whichever species first achieves a higher concentration
than the other benefits from a positive feedback effect which
further amplifies its concentration until it takes over the
complete reactor. The effect is well known, and is termed the
survival of the common (
Szathmáry and
Maynard Smith,
1997). It is characteristic
of any system of replicators undergoing hyperbolic rather than
exponential growth.
This result generalises directly to the

species case. There is an
unstable fixed point with

; and

stable fixed points with
a single

and all other concentrations equal to

. That is, whichever
species chances to achieve a greater concentration than the
others will benefit from the positive feedback and rapidly
displace all other species. In the case of continuing generation
of class 9 mutants (at rate V per replication of the dominant
species) there is still no possibility for any of these to
invade the established dominant self-replicase. Instead, as with
classes 2, 5 and 7, there will be an equilibrium mutant
concentration. The equilibrium analysis is more complex is this
case, and will not be considered in detail here. However, we can
say that the load will be somewhat greater than class 2 or 5 (as
the mutants can, in principle, achieve some limited degree of
amplification through self-replication) but less than class 7
(as the degree of amplification will be much smaller than would
be the case if supported by the dominant species); i.e., the
equilibrium mutant concentration will be between a minimum of

and a maximum of

(but
generally much closer to the minimum).
Protocell Inheritance Mechanism
We now
discuss more carefully the operation of a molecular population
of informazymes as the “informational subsystem”
(inheritance mechanism) of a protocell, in the light of the
theoretical analysis above. Consider a protocell containing a
population of informazyme molecules which is
“self-sustaining”. By that we mean that the
molecules in some set of informazyme species are replicated
sufficiently consistently that the relative concentrations of
these species remain essentially constant, as the protocell
grows. Under independent assortment of molecules to offspring,
both daughter cells will then inherit populations with this same
pattern of informazyme species concentrations; and thus the
potential for indefinitely extended inheritance through a
protocell lineage is established. The simplest possibility for
this is an informazyme population dominated by a single
self-replicase species. Assuming the initial existence of such
an established self-replicase species, this will generate a
spectrum of molecular-level mutants, on an ongoing basis. We now
summarise the potential impact of each pairwise class of
possible mutant species which may arise:
- Class 0, 3, 8: Not applicable (by definition, these classes
arise only when neither species is a self-replicase).
- Class 2, 5, 7, 9: Such mutants are actively selected against
relative to the established dominant. Continuing mutation will
mean an equilibrium load of such mutants, at a concentration no
higher than
(where
is the total rate of generating mutants of these classes,
per replication of the dominant species). Prima facie then,
these have no negative effect on the operation of the
informazyme inheritance mechanism; and arguably have the
positive benefit of continuously sampling the space of molecular
mutants. However, it should also be noted that in focussing on
the pairwise interactions between these mutants and the dominant
species, we are neglecting the possible impact of their
interactions among themselves. This is a significant limitation
of the analysis attempted here.
- Class 6: Such a mutant will be actively selected for
relative to the established dominant species, and will displace
it, becoming a new dominant in turn. Indeed, the key
significance of the “survival of the common” (class
9) dynamic is that, of all the dynamic behaviours analysed here,
the only one that allows one established self-replicase to be
reliably displaced by another is the class 6 facultative
parasite. Provided the expected time between such mutations
(given some average protocell size) is small compared to the
protocell gestation time, this provides an effective molecular
mechanism for protocell level mutation.
- Class 1: These are selectively neutral at the molecular
level. Accordingly, if such mutations are possible, it is
expected that the originally dominant species will progressively
diversify across a family of mutually class 1 species which can
stably co-exist. Prima facie, all of these might be considered
as members of a single “quasi-species” which is
still (collectively) self-replicating at a high rate, and is
therefore still capable of being effectively transmitted to
offspring protocells. However, this conclusion assumes that the
number of distinct species constituting the quasi-species is not
too large (otherwise there may not be enough molecules for each
to be represented), and that they are all class 1 relative to
each other (as well as to the original “master”
species). There is no particular reason for these assumptions to
hold. Accordingly, in the face of even modest rates of class 1
mutation, the informazyme population may degenerate into a
relatively chaotic mix of many dissimilar species, each present
in small absolute numbers, and with impaired overall reaction
rate. Such a population would no longer serve to transmit
coherent “information” to offspring protocells,
i.e., it could not function as an effective inheritance
mechanism at the protocell level.
- Class 4: This is similar to class 1, in that again these are
selectively neutral at the molecular level, and the originally
dominant species will progressively diversify across these.
However, a key difference from class 1 is that in this case the
overall replication rate will unconditionally decrease as the
concentration of class 4 mutants increases.
In summary then, given that we wish to formulate a
protocell model which can support evolution of computation, this
theoretical analysis suggests that we should devise an
informazyme sub-system that:
- Supports self-replicase activity.
- Supports emergence of class 6 molecular mutants.
- May permit class 2, 5, 7, 9 molecular mutants.
- Either prevents class 1 and 4 molecular mutants or ensures
that there is some effective mechanism for controlling their
impact.