| In Silico Biology 4, 0023 (2004); ©2004, Bioinformation Systems e.V. |
1Faculty of Science, Yamaguchi
University, Japan
Email:
matsuno@sci.yamaguchi-u.ac.jp
2Human Genome Center,
Institute of Medical Science,
University of Tokyo, Japan
* corresponding author
Edited by E. Wingender; received January 16, 2004; revised and accepted March 12, 2004; published April 07, 2004
In many research projects on modeling and analyzing biological pathways, the Petri net has been recognized as a promising method for representing biological pathways. From the pioneering works by Reddy et al., 1993, and Hofestädt, 1994, that model metabolic pathways by traditional Petri net, several enhanced Petri nets such as colored Petri net, stochastic Petri net, and hybrid Petri net have been used for modeling biological phenomena. Recently, Matsuno et al., 2003b, introduced the hybrid functional Petri net (HFPN) in order to give a more intuitive and natural modeling method for biological pathways than these existing Petri nets. Although the paper demonstrates the effectiveness of HFPN with two examples of gene regulation mechanism for circadian rhythms and apoptosis signaling pathway, there has been no detailed explanation about the method of HFPN construction for these examples. The purpose of this paper is to describe method to construct biological pathways with the HFPN step-by-step. The method is demonstrated by the well-known glycolytic pathway controlled by the lac operon gene regulatory mechanism.
Key words: hybrid functional Petri net, lac operon, biological pathway, simulation, Genomic Object Net
A Petri net [Reisig, 1985] is method to describe and model concurrent systems. It has been mainly used so far to model artificial systems such as manufacturing systems [Proth, 1997] and communication protocols [Wheeler, 1999]. The first attempt to use Petri nets for modeling biological pathways was made by Reddy et al., 1993, giving a method to represent metabolic pathways. Hofestädt expanded this method to model metabolic networks [Hofestädt, 1994]. Subsequently, several enhanced Petri nets have been used to model biological phenomena. Genrich et al., 2001, modeled metabolic pathways with a colored Petri net by assigning enzymatic reaction speeds to the transitions, and simulated a chain of these reactions quantitatively. Voss et al., 2003, used the colored Petri net in a different way, accomplishing a qualitative analysis of steady state in metabolic pathways [Voss et al., 2003]. The stochastic Petri net has been applied to model a variety of biological pathways; the ColE1 plasmid replication [Goss and Peccoud, 1998], the response of the σ32 transcription factor to a heat shock [Srivastava et al., 2001], and the interaction kinetics of a viral invasion [Srivastava et al., 2002]. On the other hand, we have shown that the gene regulatory network of λl phage can be more naturally modeled as a hybrid system of "discrete" and "continuous" dynamics [Matsuno et al., 2000] by employing a hybrid Petri net (HPN) architecture [Alla and David, 1998; Drath, 1998]. It has also been observed [Ghosh and Tomlin, 2001] that biological pathways can be handled as hybrid systems. For example, protein concentration dynamics, which behave continuously, being coupled with discrete switches. Another example is protein production that is switched on or off depending on the expression of other genes, i. e. the presence or absence of other proteins in sufficient concentrations.
Recently, by extending the notion of HPN, Matsuno et al., 2003b, introduced the hybrid functional Petri net (HFPN) in order to give a more intuitive and natural modeling method for biological pathways than the existing Petri nets. Although the paper demonstrates the effectiveness of an HFPN with two examples, gene regulation mechanism for circadian rhythms and apoptosis signaling pathway, it only gives the constructed HFPN models for these examples. The purpose of this paper is to demonstrate how biological pathways can be created with an HFPN describing step-by-step the process to construct a model of the lac operon gene regulatory mechanism and glycolytic pathway. The constructed HFPN model is verified by simulations of five mutants of lac operon on the Genomic Object Net (GON) software package (http://www.GenomicObject.Net/), [Matsuno et al., 2003b]. This software was developed based on the notion of the HFPN together with a GUI specially designed for biological pathway modeling.
In this chapter, first we give a brief definition of the hybrid functional Petri net and give a summary for the modeling method of biological pathways with the hybrid Petri net.
Hybrid functional Petri net: an extended hybrid Petri net for modeling biological reactions
Petri net is a network consisting of place, transition, arc, and token. A place can hold tokens as its content. A transition has arcs coming from places and arcs going out from the transition to some places. A transition with these arcs defines a firing rule in terms of the contents of the places where the arcs are attached.
Hybrid Petri net (HPN) [Alla and David, 1998] has two kinds of places discrete place and continuous place and two kinds of transitions discrete transition and continuous transition. A discrete place and a discrete transition are the same notions as used in the traditional discrete Petri net [Reddy et al., 1993]. A continuous place can hold a nonnegative real number as its content. A continuous transition fires continuously at the speed of a parameter assigned at the continuous transition. The graphical notations of a discrete transition, a discrete place, a continuous transition, and a continuous place are shown in Fig. 1, together with three types of arcs. A specific value is assigned to each arc as a weight. When a normal arc is attached to a discrete/continuous transition, w tokens are transferred through the normal arc, in either of normal arcs coming from places or going out to places.An inhibitory arc with weight w enables the transition to fire only if the content of the place at the source of the arc is less than or equal to w. For example, an inhibitory arc can be used to represent repressive activity in gene regulation. A test arc does not consume any content of the place at the source of the arc by firing. For example, a test arc can be used to represent enzyme activity, since the enzyme itself is not consumed.
Hybrid dynamic net (HDN) [Drath, 1998] has a similar structure to the HPN, using the same kinds of places and transitions as the HPN. The main difference between HPN and HDN is the firing rule of continuous transition. As we can know from the description about HPN above, for a continuous transition of HPN, the different amounts of tokens can be flowed through the two types of arcs, coming from/going out the continuous transition. In contrast, the definition of HDN does not allow to transfer different amount through these two types of arcs. However, HDN has the following firing feature of continuous transition which HPN does not have; "the speed of continuous transition of HDN can be given as a function of values in the places".
From the above discussion, we can know that each of HPN and HDN has its own feature for the firing mechanism of continuous transition. As a matter of fact, both of these features of HPN and HDN are essentially required for modeling common biological reactions. (See the example of Fig. 6 in which four monomers compose one tetramer.)
This motivated us to propose hybrid functional Petri net (HFPN) [Matsuno et al., 2003b] which includes both of these features of HPN and HDN. Moreover, HFPN has the third feature for arcs, that is, a function of values of the places can be assigned to any arc. This feature was originated from the idea in the paper [Hofestädt and Thelen, 1998] which was introduced in order to realize the calculation of dynamic biological catalytic process on Petri net based biological pathway modeling. In fact, these three features are realized in HFPN by introducing a new type transition called a functional continuous transition which allows us to assign any functions to arcs and transitions for controlling the speed/condition of consumption/production/firing. An example to use the functional continuous transition is given in the beginning of section "HFPN modeling of the lac operon gene regulatory mechanism and glycolytic pathway".
Usage of discrete and continuous elements
Biological pathways essentially consist of discrete parts such as a genetic switch control and continuous parts such as a metabolic reaction. These discrete and continuous parts can be represented by discrete elements (discrete place and discrete transition) and continuous elements (continuous place and continuous transition) of HFPN.
For example, a control system turning on or off the gene expression with operator site can be represented by discrete elements. That is, if the discrete place has a token, the protein necessary for activating the operator site has bound to the operator, that means the gene expression turn on. In addition, using the delay concept of the discrete transition, wean easily describe the transcription which happens after a certain time.
On the other hand, biological phenomenon such as transcription, translation, and enzymic and metabolic reactions have been treated as events whose conditions change continuously. By modeling the transcription and the translation with continuous elements in the following way, expression levels of mRNA and proteins can be simulated. Continuous places are used for storing concentrations of mRNA and protein. Reaction speeds of transcription and translation are assigned at parameters of continuous transitions. In addition, common formulas for biological reactions such as Michaelis-Menten's equation can be modeled almost directly with assigning concentrations of substrate and product to continuous places and the formula for the reaction of them to the continuous transition between these two places.
This section demonstrates how we can model the lac operon gene regulatory mechanism and glycolytic pathway in E. coli with an HFPN. Biological facts used for constructing this model are described in the biological literature [Alberts et al., 1994; Lewin, 1997; Watson et al., 1987]. The HFPN modeling will start with a "transcription control switch" (Fig. 3), then it will be enhanced gradually by adding "positive regulation" (Fig. 4), "negative regulation" (Fig. 5), and "hydrolyzing lactose to glucose and galactose" (Fig. 7). This step-by-step explanation based on the well-known biological facts helps readers to understand how HFPNs are created according to the biological knowledge.
All the parameters in the transitions of the HFPN model are summarized in Tables 1 and 2. In this example, we shall show a rough guideline for usage of continuous and discrete transitions.In accordance with the guideline below, readers can understand the modeling manner of biological pathway with HFPN described in the subsections from "Transcription control switch" to "Hydrolyzing lactose to glucose and galactose". For the detail explanation about firing rules, refer to the paper [Matsuno et al., 2003b].
| Table 1: | Transitions in Figures 7 and 9. |
| From | To | |||||||
| Name | Type | Delay/Speed | Variable | Weight | Type | Variable | Weight | Comment |
| T1 | C | m4/10 | m4 | 0 | N | -- | -- | degradation rate of CAP |
| T2 | C | m14/10 | m14 | 0 | N | -- | -- | degradation rate of mRNA repressor |
| T3 | C | m15/10 | m15 | 0 | N | -- | -- | degradation rate of repressor |
| T4 | C | m17/10 | m17 | 0 | N | -- | -- | degradation rate of repressor binding to DNA |
| T5 | C | m18/10 | m18 | 0 | N | -- | -- | degradation rate of repressor not binding to DNA |
| T6 | C | m7/10 | m7 | 0 | N | -- | -- | degradation rate of repressor binding to operator region |
| T7 | C | m19/10 | m19 | 0 | N | -- | -- | degradation rate of lacZ mRNA |
| T8 | C | m20/10000 | m20 | 0 | N | -- | -- | degradation rate of LacZ |
| T9 | C | m23/10 | m23 | 0 | N | -- | -- | degradation rate of lacY mRNA |
| T10 | C | m24/10 | m24 | 0 | N | -- | -- | degradation rate of LacY |
| T11 | C | m27/10 | m27 | 0 | N | -- | -- | degradation rate of lacA mRNA |
| T12 | C | m28/10000 | m28 | 0 | N | -- | -- | degradation rate of LacA |
| T13 | C | m29/10000 | m29 | 0 | N | -- | -- | degradation rate of lactose outside of cell of lactose outside of cell |
| T14 | C | m9/10000 | m9 | 0 | N | -- | -- | degradation rate of lactose |
| T15 | C | m8/2 | m8 | 0 | N | -- | -- | degradation rate of arolactose |
| T16 | C | m30/10000 | m30 | 0 | N | -- | -- | degradation rate of galactose |
| T17 | C | m17/10000 | m17 | 0 | N | -- | -- | degradation rate of glucose |
| T18 | C | m10/10000 | m10 | 0 | N | -- | -- | degradation rate of complex |
| T19 | C | m5/10000 | m5 | 0 | N | -- | -- | degradation rate of cAMP |
| T20 | C | m11/10000 | m11 | 0 | N | -- | -- | degradation rate of AMP |
| T21 | C | m12/10 | m12 | 0 | N | -- | -- | degradation rate of ADP |
| T42 | D | 1 | m2 | 1 | N | -- | -- | CAP releasing rate |
| T43 | D | 1 | m3 | 1 | N | -- | -- | repressor releasing rate |
| T45 | C | 1 | -- | -- | N | m4 | -- | CAP production rate |
| T46 | D | 1 | -- | -- | -- | m13 | 1 | activation of repressor gene |
| T57 | D | 1.082 | m13 | 1 | N | m14 | 1 | transcription rate of repressor |
| T58 | C | m14 | m14 | 0 | N | m15 | -- | translation rate of repressor mRNA |
| T59 | C | -- | m15 | -- | C | m16 | -- | conformation rate of repressor |
| T60 | C | 96 × m16/100 | m16 | 0 | N | m7 | -- | repressor binding rate to operator |
| T61 | C | 399 × m16/10000 | m16 | 0 | N | m17 | -- | repressor binding rate to the DNA other than repressor site |
| T62 | C | m16/1000 | m16 | 0 | N | m18 | -- | rate of repressor which does not bind any DNA |
| T63 | D | 1 | m4 | 1 | T | m2 | 1 | binding rate of CAP to the CAP site |
| m5 | 100 | T | ||||||
| T64 | D | 1 | m7 | 1 | T | m3 | 1 | binding rate of repressor to the operon |
| m8 | 4 | I | ||||||
| T65 | D | 1 | m2 | 1 | T | m1 | 1 | logical operation of the places "CAP site" and "operator" |
| m3 | 1 | I | ||||||
| T66 | D | 3.075 | m1 | 1 | T | m19 | 1 | transription rate of lacZ |
| m21 | 1 | |||||||
| T67 | C | m19 | m19 | 1 | T | m20 | -- | translation rate of lacZ |
| T68 | D | 0.051 | m21 | 1 | N | m22 | 1 | moving rate of RNA polymerase |
| T69 | D | 1.254 | m22 | 1 | N | m23 | 1 | transription rate of lacY |
| m25 | 1 | |||||||
| T70 | C | m23/2 | m23 | 1 | T | m24 | -- | translation rate of lacY |
| T71 | D | 0.065 | m25 | 1 | N | m26 | 1 | moving rate of RNA polymerase |
| T72 | D | 0.682 | m26 | 1 | N | m27 | 1 | transription rate of lacA |
| T73 | C | m27/5 | m27 | 1 | T | m28 | -- | translation rate of lacA |
| T74 | C | m29 | 0 | N | m9 | -- | transforming rate of into a cell | |
| m24 | 2.5 | T | ||||||
| T75 | C | m9 | 0 | N | m30 | -- | decomposing rate of lactose to galactose and glucose | |
| m20 | 5 | T | m6 | -- | ||||
| T76 | C | m9/5 | m9 | 1 | T | m8 | -- | producing rate of allolactose from lactose inside of a cell |
| T77 | D | 0.5 | m3 | 1 | N | m10 | 1 | conformation rate of repressor and allolactose |
| m8 | 1 | N | ||||||
| m16 | 1 | N | ||||||
| T79 | C | m5/10 | m5 | 0 | N | m11 | -- | reaction rate: cAMP to AMP |
| T80 | C | m11/10 | m11 | 0 | N | m5 | -- | reaction rate: AMP to cAMP |
| m6 | 5 | I | ||||||
| T81 | C | m11/10 | m11 | 0 | N | m12 | -- | reaction rate: AMP to ADP |
| T82 | C | m12/10 | m12 | 0 | N | m11 | -- | reaction rate: ADP to AMP |
| T94 | C | m29/10 | m29 | 0 | T | m8 | -- | producing rate of allolactose from lactose outside of a cell |
| All transitions in this figure are listed in the "Name" column. The symbol D or C in "Type" column represents the type of transition, discrete transition or continuous transition, respectively. In the "Delay/Speed" column, the firing speed of continuous transition or the delay time of discrete transition is described according to the type of transition. The column "From", which represents incoming arc(s) to a transition, is divided into three sub-columns, "variable" (variable names of the places attached to the incoming arcs), "weight" (weight of the incoming arcs), and "type" (N, T, and I represent normal, test, and inhibitory arcs, respectively). The column "To", which represents outgoing arcs from a transition, is divided into two sub-columns, "variable" (variable names of the places attached to the outgoing arcs) and "weight" (weight of the outgoing arcs). |
| Table 2: | Places having non-zero initial values in Figure 7. |
| Name | Variable | Initial value | Comment |
| CAP | m4 | 5 | concentration of CAP |
| cAMP | m5 | 100 | concentration of cAMP |
| glucose | m6 | 50 | concentration of glucose |
| AMP | m11 | 200 | concentration of AMP |
| ADP | m12 | 200 | concentration of ADP |
| LacZ | m20 | 5 | concentration of LacX |
| LacY | m24 | 2.5 | concentration of LacY |
| LacA | m28 | 1 | concentration of LacA |
| lactose outside of a cell | m29 | 50 | lactose outside of a cell |
Continuous transition
The firing speed of a continuous transition is described by a simple arithmetic formula such as mX / a, mX × mY, (mX × mY) / (mX + a × mY), etc., where mX, mY are variables representing the contents of the input places going into the transition, and a is a constant parameter to be tuned manually. With this speed, the contents of input places will be consumed and simultaneously each output place will receive the corresponding amount through the arc. For example, there is a description about the reaction of the protein LacZ, which says that the degradation speed of LacZ is much slower than its production rate. Based on this description (biological knowledge), we used the formula m20/10000 as the degradation speed (transition T8) and m19 as the production speed of LacZ. Basically, all transition speeds are given in the same way. That is, in our modeling method, a speed of biological reaction is defined relatively without a unit.
To each continuous place representing the concentration of some substance, a continuous transition is attached to model the degradation of the substance. The degradation rate is set to be mX/10 for high speed degradation (e. g. T1) and mX/10000 for low speed degradation (e. g. T8). Following the definition, no weight is assigned to the arc going out from a continuous transition.
The default weight of the arc going into a continuous transition
is set to be zero. If necessary, an appropriate weight is assigned to the arc
according to the underlying biological knowledge. (See the explanation in the
subsection "Hydrolyzing lactose to glucose and galactose" for the transitions T67,
T70, T73).
Discrete transition
The default delay time of a discrete transition is set to be one and the default weight of an arc from/to a discrete transition is one. If possible, the delay time and the weight are appropriately chosen according to the underlying biological knowledge. (See the explanation in the subsection "Hydrolyzing lactose to glucose and galactose" for the transitions T66 — T73).
Functional continuous transition
A functional continuous transition is used for T59, which describes a reaction converting four monomers to one tetramer (Fig.6 (a)). Recall that HPN can not have functions as parameters of the transitions. We can not model this type of reaction with HPN, since, a reaction speed of this type is always assigned to a transition as a function of values of the places as shown in the above. In contrast, it is possible to model this reaction with the HDN as shown in Fig.6 (b). However, as is shown in Fig.6 (b), the constructed HDN model is not natural or intuitive. Recall that a functional continuous transition of HFPN allows us to assign any functions to arcs and transitions for controlling the speed/condition of consumption, production, or firing. From Fig. 6 (c), we can see that this reaction can be modeled naturally and intuitively with a functional continuous transition.
At an arc of HFPN, two kinds of parameters "threshold" and "speed" are assigned. The continuous transition attached at the head of the arc can not fire unless the content of the place attached at the tail of the arc exceeds the "threshold" of the arc. On the other hand, "speed" is a value or a function, defining the amount flowing through the arc.
Initial values of places
The places which have initial values greater than zero are listed in Table 3.
| Table 3: | Mapping between the numbers in Fig. 8 and the metabolites in reactions |
| Index | Enzyme / Reaction | Index | Enzyme / Reaction | |
| 1 | hexokinase | 2 | phosphoglucose isomerase | |
| 3 | phosphofructokinase | 4 | aldolase | |
| 5 | triosephosphate isomerase | 6 | glyceraldehyde-3-phosphate dehydrogenase | |
| 7 | phosphoglycerate kinase | 8 | phosphoglycerate mutase | |
| 9 | enolase | 10 | pyruvate kinase | |
| 11 | lactate dehydrogenase |
Transcription control switch
We will sketch how places, transitions and arcs are determined in the process of modeling.
Fig. 2 shows the structure of the lac operon.
In the absence of lactose, a repressor is bound to the operator. The repressor prevents RNA polymerase from starting RNA synthesis at the promoter. On the other hand, in the presence of lactose and the absence of glucose, the catabolite activator protein (CAP) is bound to the CAP site. Since the CAP helps RNA polymerase to bind to the promoter, the transcription of the lac operon can begin. This regulation mechanism can be expressed by the HFPN of Fig. 3, consisting of only discrete elements. The place "promoter" (m1) represents the status of the transcription of the lac operon. If this place contains token(s), the lac operon is being transcribed, whereas if this place contains no tokens, transcription of the operon does not begin. The rates of releasing CAP and repressor from the DNA are assigned to the transitions T42 and T43 as the delay times, respectively. The production rates of CAP and repressor are assigned to the transitions T63 and T64, respectively, as the delay times. Each time the transition T65 fires, the place "promoter" receives one token. This transition fires when both of the following conditions hold: (1) The place "CAP site" (m2) contains tokens (this is the case in which the protein CAP is bound to the CAP site). (2) The place "operator" (m3) has no token (this is the case in which the repressor is not bound to the operator).
|
Figure 3: The HFPN representation of the control mechanism of the lac operon transcription switch. Only discrete elements are used for representing the switching mechanism. |
Positive regulation
The DNA binding of CAP is regulated by glucose to ensure that the transcription of the lac operon begins only in the absence of glucose. In fact, glucose starvation induces an increase in the intracellular levels of cyclic AMP (cAMP). Transcription of the lac operon is activated with the help of CAP, to which cAMP binds. When glucose is plentiful, cAMP levels drop; cAMP therefore dissociates from CAP, which reverts to an inactive form that can no longer bind DNA. This regulatory mechanism by CAP and cAMP is called positive regulation of the lac operon. Fig. 4 shows an HFPN model representing positive regulation of the lac operon.
|
Figure 4: Positive regulation mechanism: This figure contains the HFPN model of Fig. 3. |
Continuous places are used for representing the concentrations of the substances CAP, cAMP, AMP, ADP, and glucose. Tokens in the places "CAP" (m4) and "cAMP" (m5) should not be consumed by the firing of the transition T63, since both CAP and cAMP are not lost when forming a complex. Accordingly, two test arcs are used from the places "CAP" and "cAMP" to the transition T63. The weight of the arc from the place cAMP to the transition T63 is set to 100, while the weight of the arc from the place "CAP" is 1, which was determined by manual tuning and referring to the simulation results. After the concentrations both of CAP and cAMP exceed the thresholds which are given to these two arcs as weights, the transition T63 can fire, transferring a token from the transition T63 to the place "CAP site".
In general, reactions among cAMP, AMP, and ADP are reversed. The transition T80 (the transition T82) between the places "cAMP" (m5) and "AMP" (m11) (the places "AMP" and "ADP" (m12)) represents the reversible reaction together with the transition T79 (the transition T81). To the places "cAMP", "AMP", and "ADP", 1, 200, and 200 are assigned as initial values, respectively.
Recall that when glucose is plentiful, cAMP levels drop. This phenomenon is represented by the inhibitory arc from the place "glucose" (m6) to the transition T80. When the concentration in the place "glucose" exceeds the threshold given at this inhibitory arc, the transition T80 stops firing.
In this model, since we suppose that CAP is produced continuously, its production mechanism is modeled with the place "CAP" and the transitions T45 and T1. The initial amount of the place "CAP" is set to 5, since CAP is produced by a production mechanism independent of the mechanism being described here.
Finally, the transitions T1, T17, T19, T20, and T21 represent the natural degradation of the corresponding substances. Since these transitions represent only degradation and no production, no arcs are going out from these transitions.
Negative regulation
In the presence of lactose, a small sugar molecule called allolactose is formed in a cell. Allolactose binds to the repressor protein, and when it reaches a sufficiently high concentration, transcription is turned on by decreasing the affinity of the repressor protein for the operator site. The repressor protein is the product of the lacI gene, which is located upstream of the lac operon. Actually, after forming a tetramer, the repressor protein can bind to the operator site.
By adding this negative regulation mechanism to Fig. 4, Fig. 5 is obtained. Since it is known from the literature [Lewin, 1997] that repressor should be produced sufficiently prior to the production of other substances, parameters relating to this negative regulation are set to faster values than other parameters. In our model, a discrete place is used for representing the promoter site of a gene. When the discrete place "repressor promoter"(m13) receives a token, the transcription of the lacI gene begins. We can determine the transcription frequency by the delay rate of the transition T46. Transcription and translation mechanisms are modeled by the places "mRNA repressor" (m14) and "repressor" (m15) and the transitions T57, T2, T58, and T3.
|
Figure 5: Negative regulation mechanism: This figure contains the HFPN model of Fig. 4. |
The reaction forming a tetramer from monomers (Fig. 6 (a)) can be represented by the HDN as is shown in Fig. 6 (b). (HPN can not model this type of reaction, since, any reaction speed should be realized by assigning a
function of values of the places to the continuous transition.) By comparing this representation and the representation of Fig. 6 (c), we recognize that an HFPN allows us to represent such reaction naturally and intuitively. Tetramer formation is represented in the HFPN by places "repressor" and "4 repressor" (m16)
and three transitions T3, T59, and T60. The function
is assigned for the
input (output) arcs to (from) the transition T59 as a flow speed. Note that the speed of the input arc is four times faster than the speed of the output arc.
For the repressor forming tetramer, we determined that about 96% of them bind to the operator
site, about 3.99% of them bind to the other DNA sites, and about 0.01% of them
do not bind to DNA. These percentages are determined from the description in
the literature [Lewin, 1997]. The places "operator bind" (m7), "DNA bind" (m17), and "not bind" (m18) represent the
amount of these repressors. According to these binding rates, the firing speeds
of the transitions T60, T61, and T62 were given by
,
, and
respectively.
We separate the concentration of lactose to two places "lactose outside of a cell" (m29) and "lactose" (m9) for the convenience of describing the function of the lacY gene in the next subsection. Concentration of the allolactose is represented by the place "allolactose" (m8) whose accumulation rates are given at the transitions T94 and T76. It is known [Hofestädt, 1994] that allolactose is produced from the lactose existing outside of a cell as well as the lactose inside a cell. Since it is natural to consider that a production speed of allolactose is faster than the passing rate of allolactose through the cell membrane, the speed of transition T76 (m9 / 5) is set to be faster than the speed of transition T94 (m29 / 10).
The negative regulation in Fig. 5 was realized in the following way:
The place "operator" receives tokens if
Four molecules of allolactose need to bind with one molecule of tetramer repressor. Accordingly the function of input arc from the place "allolactose" to the transition T78 should be four times faster than that of input arc from the place "4 repressor". In summary, we gave formulas,
The transition T78 gives the complex forming rate of the allolactose and the tetramer of repressor proteins. The place "complex" (m10) represents the concentration of the complex. Allolactose can also release the tetramer of repressor proteins from the operator site by forming a complex with it. The transition T77 and the arcs from/to the transition realize this mechanism. Discrete transitions are used for the transitions T77, since only a discrete transition is available for the arc from the discrete place (a continuous amount can not be removed from a discrete place). In order to realize a smooth removal of allolactose, a small delay time (0.5) is assigned to the transition T77.
The transitions T4, T5, T
6, T13, T14, T15 and T18 represent the natural degradation
of the corresponding substances.
Hydrolyzing Lactose to Glucose and Galactose
The lac operon transcription and translation mechanisms are described in Fig. 7 together with the effects of two products of the genes lacZ and lacY on hydrolyzing lactose to glucose and galactose. The effect of the gene lacA is not included in this figure, since the lacA gene does not play a role in either the lac operon gene regulatory mechanism or the glycolytic pathway.
The places "mRNA lacZ" (m19), "mRNA lacY" (m23), and "mRNA lacA" (m27) represent the concentrations of mRNAs transcribed from the genes lacZ, lacY, and lacA, respectively, and the transcription rates are given at the discrete transitions T66, T69, and T72 as the delay times.
The places "LacZ" (m20),
"LacY" (m24), and "LacA" (m28) represent the concentrations of
proteins translated from the lacZ, lacY, and lacA mRNAs, and the translation rates are given at the continuous
transitions T67, T70, and T73. As is shown in Table 3, we set the initial values of proteins
LacZ, LacY, and LacA to 5, 2.5, 1, respectively. These values are chosen
according to the production ratios of LacZ, LacY, and LacA proteins [Lewin, 1997].
Actually, the formulas m19,
, and
are assigned to the transitions T67, T70, and T73 as the speeds, according to the fact that the proteins of lacZ, lacY, and lacA are
produced in the ratio
.
Degradation rates of mRNAs (proteins) are assigned to the transitions T7, T9, and T11 (T8, T10, and T12).
In this model, to represent the lac operon DNA, only discrete elements, discrete transitions T66, T68, T69, T71, and T72, and discrete places X1 (m21), X2 (m22), X3 (m25), and X4 (m26), are used. The discrete places X1 (X3) represents the Boolean status of the transcription of lacZ gene (lacY gene). That is, each time transcription of lacZ (lacY) is finished, the place X1 (X3) receives a token. At the discrete transition T68 (T71), the delay time 0.051 (0.065), which is required for the RNA polymerase moving from the end of the lacZ gene to the beginning of the lacY gene (the end of the lacY gene to the beginning of the lacA gene), is assigned. The delay times 3.075 and 1.254 are assigned to the transitions T66 and T69, respectively, according to the fact that the length of the lacZ gene (lacY gene) is 3075bp (1254bp). The delay time 0.682 at the transition T72 represents the length of the lacA gene (the length of the lacA gene is 682bp). The lengths of the genes are obtained from the website http://genolist.pasteur.fr/Colibri/genome.cgi. Note that we can know the transcription status of the gene lacY (the gene lacA) by observing whether the discrete places X2 (X4) contains tokens or not.
Recall that the product of the gene lacZ is an enzyme which hydrolyzes lactose to glucose and galactose. This reaction is modeled by using the places "lactose", "galactose" (m30), and "glucose", and the transitions T75, T16, and T17. In our model, 20 is assigned to each of the places "lactose" and "lactose outside of a cell" as an initial value. A test arc is used from the place "lacZ" to the transition T75, since the enzyme is not consumed.
We consider that the production rates of glucose and lactose
depend on both the concentration of lactose and the concentration of the
product of the lacZ gene. The formula
representing the
speed of the transition T74 reflects
this idea.
We mentioned that the gene lacY encodes the permease
that brings lactose into the cell. In Fig. 7, this function is realized with the places "LacY", "lactose outside of a cell", and "lactose", and the
transitions T74, T13, and T14. Since the product of lacY gene is an enzyme, test arc is used from the place "LacY" to the transition
T74, and the speed of this transition is given by the formula
similarly above.
The weight 2.5 (5) of the arc from the place LacY (m24) (LacZ (m20)) to the transition T74 (T75) corresponds to the basal concentration of LacY (LacZ) presented in Table 3.
|
Figure 7: HFPN modeling of lac operon gene regulatory mechanism. This figure contains the HFPN model in Fig. 5. |
Glycolytic pathway
In the glycolytic pathway, glucose is converted into two molecules of pyruvate. The enzymatic reactions shown in Fig. 8 and Table 3 are involved in this pathway.
More details about this pathway are explained in the following modeling process. Fig. 9 shows the HFPN model of the glycolytic pathway.
|
Figure 9: HFPN model of glycolytic pathway. Michaelis-Menten’s equation is applied to the reactions in the main pathway from glucose to pyruvic acid. Test arcs are used to represent enzyme reactions. |
Main pathway from glucose to pyruvate acid
First we create continuous places corresponding to glucose (m6), intermediates (m31 – m39), and pyruvate (m40). Default continuous places are
introduced at this step though these places are meant to represent the
concentrations of the corresponding substrates. Then, by following the pathway in Fig. 8, we put continuous transitions (T83 – T92) together with normal arcs between two consecutive places in the pathway. These transitions and arcs
shall represent the reactions, but default transitions and arcs are initially
introduced without any parameter tuning. By considering the natural degradation
of substrates, we put continuous transitions T17 for glucose and T23 – T31 for intermediates with normal arcs.
By taking into account the fact that natural degradation is very slow in
glycolysis, the firing speed of these transitions is given by the formula mX / 10000, where X = 17, 23, 24,
. . . , 31.
Production of ATP from ADP and NADH from phosphoric acid
Next we consider ADP, ATP, Pi (phosphoric acid) and NADH. In the
pathway shown in Fig. 8, two ADP molecules and two Pi’s are invested to produce
four ATP molecules and two NADH molecules. Continuous places “ADP” (m12), “ATP” (m51), "Pi" (m52, initial
value=200), and "NADH" (m53) are
created to represent ADP, ATP, Pi and NADH. We attach the continuous
transitions T21 and T22 representing the natural
degradation of ADP and ATP. Their firing speeds are set to be very slow (T21 : m21 / 10000, T22 : m22 / 10000), similarly as
for the intermediates above. For the process of ATP
ADP in the reaction (1), the normal arc from the place "ATP"
to the transition T83 and the normal arc from the transition T83 to the
place "ADP" are introduced. In the same way, normal arcs connected to T85 are introduced for the process of ATP
ADP in the reaction (3). Similarly, to represent reactions (7) and (10), transitions T89 and T92 are used, respectively. Reaction (6) (Pi
NADH) is represented by the transition T88. Since we consider ATP and ADP to degrade slowly, the formulas
m22/10000 and m21/10000 are assigned to the transitions T22 and T21 as degradation speed, respectively.
Including enzyme reactions
Places having variables m41, m42,..., m50 represent enzyme concentrations. Initial values of these variables are set to 5. The production rates of these enzymes are assigned to the transitions T47,T48,..., T56 whose speeds are set to 1. The transitions T32, T33,..., T41 have the firing speeds mX / 10 (X = 41, 42, ..., 50), which represent the speed of natural degradation of these enzymes.
Test arcs are used for enzyme reactions since an enzyme itself is
not consumed in the reaction. To each of these test arcs, the value 3 is chosen
as a weight of the arc.
Reaction speeds in the main pathway
To represent the reaction speeds in the main pathway, we adopt the Michaelis-Menten equation:

where [S] is the substance concentration, Vmax is the maximum reaction
speed, and Km is a Michaelis
constant. In our model, we let Vmax = 1 and Km = 1. The two independent
places "Pv" and "Pk" in Fig. 9 represent these variables Vmax and Km. The values of Vmax and Km can easily be manipulated by changing the contents of the places
Pv and Pk, respectively. The Michaelis-Menten equation is used for representing
each of the firing speeds of the transitions T83, T84,..., and T92. For example, for the transition T84, the formula
is used.
ADP molecules and two Pi’s are invested to produce four ATP molecules and two NADH molecules.
In the previous section, we demonstrated how a biological pathway is modeled with HFPN through an example of the lac operon gene regulatory mechanism and glycolytic pathway. This section verifies the HFPN model by analyzing simulation results obtained from GON.
GON is the software based on the notion of the HFPN and has a GUI specially designed for biological pathway modeling. GON was developed in Java so that it can run on any platforms. The modeled biological pathways is saved as an original XML format. One of the interesting function of GON is that any place or transition of HFPN can be replaced to an image having biological meaning. Fig. 10 is a screen shot of GON after applying this function several times. In this screenshot, for example, the discrete places "CAP site", "operator", and "promoter" in Fig.3 were replaced to the corresponding images.
Simulation Results of five mutants of the lac operon
The five mutants taken into consideration in this simulation are listed below together with the realization methods of the mutants in the HFPN model of Fig. 8.
Behavior of the wild type
In both of Figures 11 and 12, the concentration behaviors of lactose (outside of a cell), lactose, glucose, LacZ (β -galactosidase), and LacY (β -galactoside permease) are shown. From the beginning, glucose is degraded, since it is consumed in the glycolytic pathway. At time point 55, the glucose was consumed, the transcription of lac operon begins, producing the LacZ protein (time = 60), and successively, LacY protein begins to be produced. By comparing the concentration behavior of lactose and lactose (outside of a cell), we can know that the LacY protein works well. At time point 65 the concentration of LacZ exceeds 10, decomposition of lactose to glucose and galactose starts, increasing the concentration of glucose again. Just after the glucose is once again completely consumed, the transcription of the lac operon is stopped, keeping the concentration of LacZ and LacY proteins at some levels (from the assumption that the degradation speed of these proteins is very slow) [Alberts et al., 1994; Lewin, 1997].
|
Figure 12: Simulation results of lacZ - and lacY - mutants. We can see that the glucose is not produced in the lacZ - mutant and the lactose can not enter a cell in the lacY - mutant. |
Behavior of the lac repressor mutant
Fig. 11 shows the simulation results of the mutants lacI -, lacI s, and lacI -d obtained from GON.
In the lacI - and lacI -d mutants, LacZ
protein and LacY protein are produced, while these proteins are not produced in
the lacI s mutant.
Furthermore, in the lacI - and lacI -d mutants, the
concentrations of LacZ and LacY proteins keep growing (except during the period
of glucose re-production), even after the decomposition of lactose has ended.
Note that these simulation results support the biological experimental results
[Alberts et al., 1994; Lewin, 1997].
Behavior of the lac operon mutant
Fig. 12 shows the simulation results of the mutants lacZ - and lacY - obtained from GON.
From this figure, we can observe that, in the lacZ - mutant, glucose is never produced again once it is completely
consumed. On the other hand, in the lacY - mutant, the
concentration of lactose (inside the cell) never grows, since lactose can not
pass a cell membrane in this mutant. Note that these observations from the
simulation results support the experimental results [Alberts et al., 1994; Lewin, 1997].
In this paper, we demonstrated how to construct HFPNs in modeling biological pathways with the example of the lac operon gene regulatory mechanism and glycolytic pathway. This example was simulated by GON and the results of five mutants of lac operon gene were shown, which correspond well to the facts described in the literature.> The purpose of this paper was to describe method to construct biological pathways with the HFPN. Therefore we introduced the well-known glycolytic pathway controlled by the lac operon gene regulatory mechanism.
Although this paper deals with a known biological phenomenon, with GON, we succeeded in discovering one unknown biological phenomenon in multicellular systems [Matsuno et al., 2003a]. We analyzed the mechanism of Notch-dependent boundary formation in the Drosophila large intestine, by experimental manipulation of Delta expression and computational modeling and simulation by GON. Boundary formation representing the situation in the normal large intestine was shown in the simulation. By manipulating Delta expression in the large intestine, a few types of disorder in boundary cell differentiation were observed, and similar abnormal patterns were generated by the simulation. Simulation results suggest that parameter values representing the strength of cell-autonomous suppression of Notch signaling by Delta are essential for generating two different modes of patterning: lateral inhibition and boundary formation, which could explain how a common gene regulatory network results in two different patterning modes in vivo.
We should emphasize that, essentially, any type of differential equations can be modeled with HFPN. This means that GON has the potential to simulate biological pathway models for other biosimulation tools such as E-Cell and Gepasi. More concretely, if the users have a skill of programming, they can develop original functions as the Java class files, which can be called from the transitions or the arcs of HFPN. This means that, any function of E-Cell or Gepasi can be included as Java class file in GON.
With GON, we have succeeded in modeling many kinds of biological pathways [http://www.GenomicObject.Net/]. However, at the same time, we recognized that the current notion of HFPN is still insufficient to model more sophisticated biological pathways, including more complex information such as localization, cell interaction, etc.
This is one of the reasons to motivate us to create a "hybrid functional Petri net with extension (HFPNe)" by enhancing the concept of HFPN [Nagasaki et al., 2003]. The HFPNe allows more "types" for places (integer, real, boolean, string, vector) with which complex information can be handled. In other words, the definition of these types allow us to treat other existing Petri nets as subsets of the HFPNe. For example, a traditional Petri net (with only discrete elements) is treated as the HFPNe using only "integer type". Furthermore, HFPNe can define a hybrid system of continuous and discrete events together with a hierarchization of objects for an intuitive creation of complex objects. Genomic Object Net is developed based on the notion of an HFPNe. The biological pathway models constructed with the HFPNe will be reported in the near future.
This work was partially supported by the Grand-in-Aid for Scientific Research on Priority Areas "Genome Information Science" from the Ministry of Education, Culture, Sports, Science and Technology in Japan.