- Research
- Open Access
- Published:

# A simplified method for power-law modelling of metabolic pathways from time-course data and steady-state flux profiles

*Theoretical Biology and Medical Modelling***volume 3**, Article number: 24 (2006)

## Abstract

### Background

In order to improve understanding of metabolic systems there have been attempts to construct S-system models from time courses. Conventionally, non-linear curve-fitting algorithms have been used for modelling, because of the non-linear properties of parameter estimation from time series. However, the huge iterative calculations required have hindered the development of large-scale metabolic pathway models. To solve this problem we propose a novel method involving power-law modelling of metabolic pathways from the Jacobian of the targeted system and the steady-state flux profiles by linearization of S-systems.

### Results

The results of two case studies modelling a straight and a branched pathway, respectively, showed that our method reduced the number of unknown parameters needing to be estimated. The time-courses simulated by conventional kinetic models and those described by our method behaved similarly under a wide range of perturbations of metabolite concentrations.

### Conclusion

The proposed method reduces calculation complexity and facilitates the construction of large-scale S-system models of metabolic pathways, realizing a practical application of reverse engineering of dynamic simulation models from the Jacobian of the targeted system and steady-state flux profiles.

## Background

Systematic modelling has emerged as a powerful tool for understanding the mathematical properties of metabolic systems. The rapid development of metabolic measurement techniques has driven advances in modelling, especially using data on the effects of perturbations of metabolite concentrations, which contain valuable information about metabolic pathway structure and regulation [1]. A power-law approximation for representing enzyme-catalyzed reactions, known as Biochemical Systems Theory, is an effective approach for understanding metabolic systems [2, 3]. Generalized Mass Action (GMA) and S-systems [4, 5], which are often used as power-law modelling approaches, have wide representational spaces that permit adequate expression of enzyme kinetics [6] in spite of their simple fixed forms. Moreover since S-system forms have a smaller number of parameters than GMA forms, the S-system is an appropriate modelling framework. The derivation of an S-system model from given experimental data is a powerful tool not only for understanding non-linear properties but also for determining the regulatory structure of the system [7, 8].

S-system modelling from time-course data is often difficult due to its non-linear properties. Non-linear-fitting algorithms, such as genetic algorithms or artificial neural networks, have been used to resolve this problem [9–14]. Although these methods can be applied to metabolic pathways, massive computing power is required in the case of targeted models involving a number of closely-connected, underspecified parameters [13]. Moreover, the wider the range of targeted metabolic pathways, the more likely is the occurrence of local minima due to the expansion of the parameter search space. The network-structures-segmentation method can reduce the total parameter search range in genetic network modelling [15] but it is difficult to apply this to metabolic pathways because of the close relationships between reversible reactions and because of allosteric regulation. Diaz-Sierra and Fairén have proposed an approach, based on the steady-state assumption, that allows the construction of S-system models from a Jacobian matrix of the system [16]. Since the Jacobian constrains the search range of underspecified parameters at the optimization stage, these authors' method allows efficient parameter estimation. However, the problem of an excess number of parameters requiring estimation remains unsolved.

We present an approach to power-law modelling of metabolic pathways from the Jacobian of the targeted system and steady-state flux profiles with linearization of the S-system. This reduces the number of underspecified parameters. Two numerical experiments show that the S-system model generated by this method describes similar dynamic behaviour to that indicated by conventional kinetic models.

## Methods

### Retrieving the Jacobian from time-course data

As a first step, the Jacobian must be obtained from metabolic time-course data. In this section, we summarize the method of Sorribas *et al* [17], which we use in this work.

In biochemical systems, the Jacobian can be defined as:

where **J** is the Jacobian matrix, and δ**X** represents a small perturbation and contains the concentration X_{i} as its elements. The elements of the Jacobian can be obtained from perturbed time-courses using linear least-squares fitting [17, 18]. This method is based on the fact that transients yield linear responses to small perturbations under steady-state conditions [5]. The mathematical basis for this is that the linear representation constitutes the first-order term of a Taylor series expansion, which is sufficiently accurate in this situation.

### Determination of the kinetic orders of the S-system

The S-system is a power-law representation constructed of two terms: the production rate and the degradation rate:

where α_{i} and β_{i} are rate constants, g_{ij} and h_{ij} are kinetic orders, and x_{i} represents the concentration of a compound.

In steady state conditions, it can be expressed simply as:

*Flux* = *V*^{+} = *V* ^{-} (3)

where Flux is the sum of the steady-state fluxes into x_{i}, and V^{+} and V^{-} are production and consumption terms, respectively.

In steady state conditions, the production term and the consumption term in Eq.(2) can therefore be represented as:

where x_{j,0} represents the steady-state concentration of x_{j}.

Consequently α_{i} and β_{i} yield:

Defining X_{i} as:

= *y*_{
i
}(*x*_{1}, ..., *x*_{
n
}) (*i* = 1, ..., *n*) (7)

the Jacobian of Eq. (2) can be represented as:

In steady state conditions, Eq. (8) can be simplified by substitution of Eq. (5) with α_{i} and Eq. (6) with β_{i}, giving:

and thus:

Once Jacobian *J*_{ij}, x_{j,0}, and Flux are given, Eq. (10) is a constraint for the determination of the kinetic orders g_{ij} and h_{ij}. Savageau has described the linearization of S-system as an "F-factor" for stability analysis [19]. We use this representation to estimate parameter values.

In most cases g_{ij} and/or h_{ij} are available from the structure of the metabolic pathway; however, in the absence of known kinetic orders, parameter estimates are needed to determine them. In such cases Eq. (10) is adopted as limiting the parameter search range.

## Results

### Case study 1: a linear biochemical pathway

In the first case study, we applied this approach to the published biochemical model of yeast galactose metabolism shown in Figure 1[20]. This model consists of five metabolites, and four enzyme reactions mainly described by Michaelis-Menten equations. The kinetic equations, systems parameters, and initial conditions are listed in Appendix 1.

In this case, all parameters of the S-system model were determined, without the need for estimation. The biochemical model could be converted into the following S-system form.

Since X_{1} is an independent variable, it was omitted from the listed rate equations.

The following constraints were provided from the pathway structure: h_{22} = g_{32}, h_{23} = g_{33}, h_{33} = h_{43} = g_{53}, h_{34} = h_{44} = g_{54}, h_{54} = g_{44}, h_{55} = g_{45}, β_{2} = α_{3}, β_{3} = β_{4} = α_{5}, and α_{4} = β_{5}. The steady-state concentrations were: X_{1,0} = 0.50 mM (fixed value), X_{2,0} = 0.146 mM, X_{3,0} = 0.007 mM, X_{4,0} = 0.817 mM, and X_{5,0} = 0.243 mM. The steady-state fluxes were: V_{1} = V_{2} = V_{3} = V_{4} = 0.081 mM/min. The Jacobian was:

Our method was able to generate the S-system model of the linear pathway from the steady-state metabolite concentrations, the steady-state fluxes, and its Jacobian. For example, g_{21} and g_{32} were determined by:

Because h_{22} is equal to g_{32},

In this modelling process, all the parameters were determined by simple calculations of this kind. The resulting S-system model was:

### Case study 2: A branched biochemical pathway

The branched biochemical pathway shown in Figure 2 was tested in the second example. This pathway has the typical features of a biochemical pathway: branching, feedback regulation, and product inhibition (see Appendix 2). X_{3} is the inhibitor of both V_{3} and V_{4}.

Most of the parameters were obtained by our method. However one parameter could not be determined by calculation and a parameter estimation method was used. The model could be converted into the following S-system form:

The following constraint was provided by the pathway structure: h_{11} = g_{21}. Steady-state concentrations were: X_{1,0} = 0.067, X_{2,0} = 0.049, X_{3,0} = 0.081, X_{4,0} = 0.041, and steady-state fluxes were: V_{1} = V_{2} = 0.1, V_{3} = V_{5} = 0.043, and V_{4} = V_{6} = 0.057. The Jacobian was:

More than half of the parameters were obtained by simple calculations using the steady-state concentrations, the steady-state fluxes, and the Jacobian. The following four parameters could not be determined: α_{3}, g_{33}, β_{3}, and h_{33}. For example, h_{11} is a parameter which could be determined by:

Because g_{33} is a parameter that could not be determined by a calculation, an estimate provided by some constraint was needed. As α_{3}, h_{33}, and β_{3} could be treated as dependent parameters, once g_{33} was obtained all four parameters were determined:

Moreover, the search range of g_{33} could be limited by the definition that h_{33} has a positive value since X_{3} is the substrate of the reaction V_{3}. The search range of g_{33} was set from 0 to -0.95. When the number of underspecified parameters was one whose search range was limited, a linear-fitting algorithm could be used to fit the time-course data in the original kinetic model. In this modelling, g_{33} was determined by a linear optimization method. Consequently, the following S-system model was generated:

The list of parameters used in the above calculation is shown in Table 1.

### Comparing transient dynamic responses after perturbation

To evaluate the versatility of this method, the transient dynamics of the S-system model in response to various perturbations were compared with those of the original Michaelis-Menten model, by calculating the mean relative errors (MRE):

where n is the number of metabolites in the model, m, the number of sampling points in the time course, x'_{i}(t) represents the time course calculated from the created S-system model, and x_{i}(t), the time course calculated from the original Michaelis-Menten model. In this experiment there were 10 sampling points, the interval between sampling points was 0.5, and X_{2} in case study 1 and X_{1} in case study 2 were the targets of perturbations ranging from 0% to 200%. The time-courses of metabolites in response to perturbations of 100% are shown in Figure 3. In both examples, similar dynamic behaviour was observed in the S-system model and the reference model as a response to the perturbation.

The changes of MREs in response to the perturbation range are shown in Figure 4. The initial MREs with no perturbation were within 1%. Although a slight increase in MREs was observed in both case studies, they remained within 4% at a perturbation range of 200%.

## Discussion

Power-law modelling from time-course data of metabolite concentrations often requires parameter estimates that depend on the size of the target metabolic network. Especially when developing a large-scale metabolic model, this requires problem-specific simplifications. Our method can reduce the number of underspecified parameters by using steady-state flux profiles and the Jacobian of the targeted system derived from time courses of metabolites, and is thus suitable for large-scale power-law modelling. To validate our methods we used two existing biochemical pathway models, the straight and branched pathway models, described by dynamic equations. In the S-system, modelling of a linear metabolic pathway with 12 parameters (case study 1), our method determines all parameters accurately, whereas the method developed by Diaz-Sierra and Fairén [16] leaves at least four parameters underspecified that include error correction parameters. In the case of the branched metabolic pathway with 16 parameters (case study 2), our method determines all but one parameter, whereas the method of Diaz-Sierra and Fairén has 12 underspecified parameters that include error correction parameters. The case studies demonstrate that our method of developing S-system models from time series can reduce the number of underspecified parameters more efficiently than the previously reported method [16]. Furthermore, the perturbation response experiments show that the models created by our method can reproduce dynamics similar to the reference models, since the MRE was around 3% when the perturbation range was 200% (Fig. 4). S-system models generated by our method can provide accurate simulations within a wide range of the steady-state point. This limitation does not prevent the modelling and analysis of metabolic pathways, as it does with many power-law metabolic models [4, 5].

Robustness against experimental noise is an important requirement for the practical application of modelling methods. Since the Jacobian is rather sensitive to the noise in time series [18], a robust corrective response to the numerical errors contained in a Jacobian is important in developing a model involving its use. In the method of Diaz-Sierra and Fairén [16], error correction parameters are incorporated into the S-system model to reduce the effect of experimental noise. However, this approach may lead to an increase in the number of underspecified parameters. As an alternative approach to reducing the experimental noise in time-course data, the estimation of appropriate slopes using a non-linear neural network model is effective [10], and can provide an error-controlled time course that enables the Jacobian to be obtained with high accuracy. Methods for obtaining accurate estimates of the specific effects of general types of perturbation have been discussed [21] and might enable more precise analysis of data such as time-scale metabolite concentrations obtained under perturbations.

To assess the robustness of our method against experimental noise, we measured the calculation error when numerical errors were manually inserted into the Jacobian or steady state fluxes. Table 2 summarizes the MREs of the simulated time trajectories of the S-system and the original Michaelis-Menten model. It is evident that the Jacobian is quite sensitive to numerical error because of the direct effect on a kinetic order of a Jacobian. Furthermore, the steady-state flux profile data may include experimental noise. However, the models created by our method are relatively robust to errors in the Jacobian and steady-state fluxes, indicating that these methods will be useful in practice.

Sorribas and Cascante proposed a method for identifying regulatory patterns using a given set of logarithmic gain measurements [7]. In their paper, they suggested that one strategy for selecting possible patterns is to perform perturbation experiments and to measure the corresponding dynamic response. For practical application of this approach, it is crucial to develop appropriate ways of performing the required experiments; however, measuring the logarithmic gain resulting from various steady-state fluxes is not practical due to a lack of exhaustive measurement method. Our approach can be used to develop reasonably accurate models of metabolic pathways by using a single set of appropriate steady-state flux profiles. Although steady-state flux profile data remains difficult to be measured directly and comprehensively, several methods of measuring steady-state flux profiles by using isotopes have been developed [22, 23]. Our method assumes that the Jacobian obtained is accurate. Therefore it is important to obtain time-course data reflecting transient dynamics after a suitably small perturbation in which the Jacobian behaves in a linear manner [17].

Comprehensive metabolome data will undoubtedly accumulate as a consequence of advances in metabolic measurement techniques [24–26]. In our laboratory we have developed a high-throughput technique using capillary-electrophoresis mass spectrometry that provides effective time-course data involving a few hundred ionic metabolites [27, 28]. Our method promises to provide high-throughput modelling of large-scale metabolic pathways by exploiting the accumulating metabolome and steady-state flux profile data along with the anticipated developments in metabolome measurement techniques.

## Conclusion

Our method provides stable and high-throughput S-system modelling of metabolic pathways because it drastically reduces underspecified parameters by employing the steady-state flux profile and Jacobian retrieved from time-course data. S-system models generated by this method can provide accurate simulations within a wide range around the steady-state point. In combination with the metabolome measurement techniques it should permit high-throughput modelling of large-scale metabolic pathways.

## Appendices

### Appendix 1: Biochemical model of yeast galactose metabolism

The model of the yeast galactose utilization pathway was constructed by Atauri *et al*. [20], whose reaction map is presented in Figure 1.

The rate equations of the model are:

where the rate expressions are:

The parameters used are available in reference [20]. The calculated steady-state conditions were: X_{1}; 0.5 mM (fixed), X_{2}; 0.146 mM, X_{3}; 0.00703 mM, X_{4}; 0.817 mM, X_{5}; 0.243 mM, and the flux through the pathway; 0.0081 mM·s^{-1}.

### Appendix 2: Branched biochemical pathway

The rate equations of the branched model the reaction scheme of which is shown in Figure 2 are:

= *V*_{1} - *V*_{2}

= *V*_{2} - *V*_{3} - *V*_{4}

= *V*_{3} - *V*_{5}

= *V*_{4} - *V*_{6}

where the rate expressions are:

*V* = 0.1

## References

- 1.
Voit EO, Marino S, Lall R: Challenges for the identification of biological systems from in vivo time series data. In Silico Biol. 2005, 5 (2): 83-92.

- 2.
Savageau MA: Biochemical systems analysis. I. Some mathematical properties of the rate law for the component enzymatic reactions. J Theor Biol. 1969, 25 (3): 365-369.

- 3.
Savageau MA: Biochemical systems analysis. II. The steady-state solutions for an n-pool system using a power-law approximation. J Theor Biol. 1969, 25 (3): 370-379.

- 4.
Voit EO, Ferreira AEN: Computational analysis of biochemical systems : a practical guide for biochemists and molecular biologists. 2000, Cambridge ; New York , Cambridge University Press, xii, 531 p., [8] p. of plates-

- 5.
Torres NV, Voit EO: Pathway analysis and optimization in metabolic engineering. 2002, New York , Cambridge University Press, xiv, 305 p.-

- 6.
Savageau MA: Mathematics of organizationally complex systems. Biomed Biochim Acta. 1985, 44 (6): 839-844.

- 7.
Sorribas A, Cascante M: Structure identifiability in metabolic pathways: parameter estimation in models based on the power-law formalism. Biochem J. 1994, 298 ( Pt 2): 303-311.

- 8.
Voit EO: Models-of-data and models-of-processes in the post-genomic era. Math Biosci. 2002, 180: 263-274. 10.1016/S0025-5564(02)00115-3.

- 9.
Veflingstad SR, Almeida J, Voit EO: Priming nonlinear searches for pathway identification. Theor Biol Med Model. 2004, 1 (1): 8-10.1186/1742-4682-1-8.

- 10.
Voit EO, Almeida J: Decoupling dynamical systems for pathway identification from metabolic profiles. Bioinformatics. 2004, 20 (11): 1670-1681. 10.1093/bioinformatics/bth140.

- 11.
Sakamoto E, Iba H: Inferring a system of differential equations for a gene regulatory network by using genetic programming. 2001, 720-726.

- 12.
Maki Y, Ueda T, Okamoto M, Uematsu N, Inamura Y, Eguchi Y: Inference of genetic network using the expression profile time course data of mouse P19 cells. 2002, 13: 382-383.

- 13.
Kikuchi S, Tominaga D, Arita M, Takahashi K, Tomita M: Dynamic modeling of genetic networks using genetic algorithm and S-system. Bioinformatics. 2003, 19 (5): 643-650. 10.1093/bioinformatics/btg027.

- 14.
Kimura S, Ide K, Kashihara A, Kano M, Hatakeyama M, Masui R, Nakagawa N, Yokoyama S, Kuramitsu S, Konagaya A: Inference of S-system models of genetic networks using a cooperative coevolutionary algorithm. Bioinformatics. 2005, 21 (7): 1154-1163. 10.1093/bioinformatics/bti071.

- 15.
Maki Y, Tominaga D, Okamoto M, Watanabe S, Eguchi Y: Development of a system for the inference of large scale genetic networks. 2001, 446-458.

- 16.
Diaz-Sierra R, Fairen V: Simplified method for the computation of parameters of power-law rate equations from time-series. Math Biosci. 2001, 171 (1): 1-19. 10.1016/S0025-5564(01)00051-7.

- 17.
Sorribas A, Lozano JB, Fairen V: Deriving chemical and biochemical model networks from experimental measurements. Recent Res Devel in Physical Chem. 1998, 2: 553-573.

- 18.
Diaz-Sierra R, Lozano JB, Fairen V: Deduction of chemical mechanisms from the linear response around steady state. J Phys Chem A. 1999, 103 (3): 337-343. 10.1021/jp983041e.

- 19.
Savageau MA: Biochemical systems analysis: A study of function and design in molecular biology . 1976, Addison-Wesley Pub. Co.

- 20.
de Atauri P, Orrell D, Ramsey S, Bolouri H: Is the regulation of galactose 1-phosphate tuned against gene expression noise?. Biochem J. 2005, 387 (Pt 1): 77-84.

- 21.
Schmidt H, Cho KH, Jacobsen EW: Identification of small scale biochemical networks based on general type system perturbations. Febs J. 2005, 272 (9): 2141-2151. 10.1111/j.1742-4658.2005.04605.x.

- 22.
Zhao J, Shimizu K: Metabolic flux analysis of Escherichia coli K12 grown on 13C-labeled acetate and glucose using GC-MS and powerful flux calculation method. J Biotechnol. 2003, 101 (2): 101-117. 10.1016/S0168-1656(02)00316-4.

- 23.
Fischer E, Sauer U: Metabolic flux profiling of Escherichia coli mutants in central carbon metabolism using GC-MS. Eur J Biochem. 2003, 270 (5): 880-891. 10.1046/j.1432-1033.2003.03448.x.

- 24.
Fiehn O, Kopka J, Dormann P, Altmann T, Trethewey RN, Willmitzer L: Metabolite profiling for plant functional genomics. Nat Biotechnol. 2000, 18 (11): 1157-1161. 10.1038/81137.

- 25.
Ostergaard S, Olsson L, Nielsen J: In vivo dynamics of galactose metabolism in Saccharomyces cerevisiae: metabolic fluxes and metabolite levels. Biotechnol Bioeng. 2001, 73 (5): 412-425. 10.1002/bit.1075.

- 26.
Neves AR, Ventura R, Mansour N, Shearman C, Gasson MJ, Maycock C, Ramos A, Santos H: Is the glycolytic flux in Lactococcus lactis primarily controlled by the redox charge? Kinetics of NAD(+) and NADH pools determined in vivo by 13C NMR. J Biol Chem. 2002, 277 (31): 28088-28098. 10.1074/jbc.M202573200.

- 27.
Soga T, Ueno Y, Naraoka H, Matsuda K, Tomita M, Nishioka T: Pressure-assisted capillary electrophoresis electrospray ionization mass spectrometry for analysis of multivalent anions. Anal Chem. 2002, 74 (24): 6224-6229. 10.1021/ac0202684.

- 28.
Sugimoto M, Kikuchi S, Arita M, Soga T, Nishioka T, Tomita M: Large-scale prediction of cationic metabolite identity and migration time in capillary electrophoresis mass spectrometry using artificial neural networks. Anal Chem. 2005, 77 (1): 78-84. 10.1021/ac048950g.

## Acknowledgements

We thank Kazunari Kaizu and Katsuyuki Yugi for insightful discussions and providing technical advice. This work was supported in part by a grant from Leading Project for Biosimulation, Keio University, The Ministry of Education, Culture, Sports, Science and Technology (MEXT); a grant from CREST, JST; a grant from New Energy and Industrial Technology Development and Organization (NEDO) of the Ministry of Economy, Trade and Industry of Japan (Development of a Technological Infrastructure for Industrial Bioprocess Project); and a grant-in-aid from the Ministry of Education, Culture, Sports, Science and Technology for the 21st Century Centre of Excellence (COE) Program (Understanding and Control of Life's Function via Systems Biology).

## Author information

## Additional information

### Competing interests

The author(s) declare that they have no competing interests.

### Authors' contributions

Kitayama contributed to the development of the modelling method. Kinoshita supported the development of the mathematical theory of this method and wrote this manuscript. Sugimoto designed two experiments for method verification. Nakayama provided the basic ideas and directed the project, and Tomita was the project leader.

Tomoya Kitayama, Ayako Kinoshita contributed equally to this work.

## Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

## Rights and permissions

## About this article

#### Received

#### Accepted

#### Published

#### DOI

### Keywords

- Experimental Noise
- Mean Relative Error
- Generalize Mass Action
- Logarithmic Gain
- Perturbation Range