Mathematical explanation of the predictive power of the X-level approach reaction noise estimator method

Konkoli, Zoran

doi:10.1186/1742-4682-9-12

Research
Open access
Published: 13 April 2012

Theoretical Biology and Medical Modelling volume 9, Article number: 12 (2012)

Abstract

The X-level Approach Reaction Noise Estimator (XARNES) method has been developed previously to study reaction noise in well mixed reaction volumes. The method is a typical moment closure method and it works by closing the infinite hierarchy of equations that describe moments of the particle number distribution function. This is done by using correlation forms which describe correlation effects in a strict mathematical way. The variable X is used to specify which correlation effects (forms) are included in the description. Previously, it was argued, in a rather informal way, that the method should work well in situations where the particle number distribution function is Poisson-like. Numerical tests confirmed this. It was shown that the predictive power of the method increases, i.e. the agreement between the theory and simulations improves, if X is increased. Here, these features of the method are explained by using rigorous mathematical reasoning. Three derivative matching theorems are proven which show that the observed numerical behavior is generic to the method.

Introduction

Noise is an integral part of the workings of the living cell biochemistry [1]. There are many types of noise and this work focuses on the intrinsic noise. If reactant copy numbers are low they can fluctuate widely [2]. These fluctuations can severely influence the dynamics of the cell and need to be carefully controlled [3].

Describing intrinsic noise has attracted a lot of effort. A range of theoretical methods have been developed to study intrinsic noise. However, an accurate characterization of the reaction noise is not easy. A direct solution of the chemical master equation for the system is often not possible since the number of configurations can be exponentially large. Numerical simulation methods can be used to avoid this problem, and are often implemented by using the Gillespie algorithm [4]. However, to obtain accurate prediction for moments of the particle number distribution function, e.g. the variance, sampling with a relatively large number of runs (simulations) is needed. This becomes impractical if the number of particle types is very large. A range of methods have been suggested to complement or replace these techniques. The focus of this work is on moment closure techniques [5–15].

The main idea behind moment closure approaches is to construct the equation system that can describe various moments of the particle number distribution function. In such a way there is no need to directly solve the master equation or perform a large number of computer simulations. The problem is that the equation system that describes these moments is, in principle, infinite. The main issue is to cut (or close) the hierarchy. This is done in two ways. First, one can try to make some specific assumptions about the reacting system which can be used to express higher order moments in terms of lower order ones. This procedure defines the moment closure function for the problem. The second possibility is to take the distribution function centered approach and assume that the particle number distribution function has a well-defined form, parameterized by a finite set of parameters. Variational calculus can be used to obtain the parameters [7, 16]. In both cases the complicated many body dynamics is reduced to a set of ordinary differential equations that govern quantities of interest.

Two moment closure methods that were suggested previously will be of a particular interest. The PARNES method [14, 15] is based on the assumption that pair effects dominate the dynamics. The method has been generalized into the XARNES method [17] so that higher order correlation effects can be included in the description. In fact, various choices for X result in a series of methods, e.g. X = P (pair effects), T (triplets), Q (quadruples), etc. Thus the PARNES method is the special case of the XARNES method with X = P. Both methods are based on the rather generic formalism of correlation forms which is used in statistical physics to model spatially extended many body effects. Each correlation form describes a particular correlation effect (single, pair, triple, and so forth). Correlation forms are used to perform a cluster like expansion of relevant quantities of interest. In the previous studies, the original correlation form formalism [18, 19] used to model spatially extended diffusion controlled reactions has been adapted for describing well mixed reaction volumes.

A moment closure method works well in the intended domain of application, but it might easily fail if used elsewhere. Thus for a moment closure method to be useful it is necessary to, if possible, specify precisely in which situations the method is expected to work well. In [17] it was argued that the XARNES method should describe well systems with a Poisson-like particle number distribution function. This was confirmed by numerical studies. Here, it will be shown rigorously that such behavior is generic to the method. Also, there are some ambiguities in adapting the correlation form formalism to the well mixed situation, and the procedure is not unique. The problem is that a large number of spatial degrees of freedom need to be projected onto a much smaller number of variables, and there are many ways in which this can be done. Here, it will be shown that the choice made in the previous studies is in some sense optimal.

To show all of the above the derivative matching procedure introduced in [9–11] will be used. In particular, the procedure from the technical report [9] will be closely followed. This report was formalized in much shorter form in [10]. A very general multiplicative ansatz was suggested for the moment closure function. The precise form of the ansatz was found by using the derivative matching procedure. The function was parameterized by a finite set of parameters which were found by matching time derivatives of exact and approximate moments. This was done for the system in the pure state. The condition for a good match was derived in the form of a generic formula that involves the moment closure function parameters.

The same formula was obtained in a different context where the XARNES method was suggested [17]. This fact motivated the present work. The derivative matching procedure will be applied to investigate the accuracy of the XARNES method. There are some important differences between the setups in [10, 11] and the present work. Here, the original procedure from [10, 11] will be carried out in the opposite direction. The XARNES method is based on the well-defined multiplicative ansatz for factorial moments. It will be shown that the ansatz implies good derivative matching properties. Also, while the original work focused on the pure state, here the Poisson state will be of interest.

The mathematical setup

A reaction system is defined as follows. It is assumed that reacting particles mix well and that particle positions are irrelevant. A configuration of the system can be specified by listing how many particles of each type there are in the system

\vec{n} = (n_{1}, n_{2}, \dots, n_{i}, \dots, n_{T})

(1)

where the variables n_i, i = 1, 2,..., T, are non-negative integers that denote the particle numbers. A configuration of the system changes in time due to the presence of reactions.

The full list of reactions is given by r₁, r₂,..., r_R and a reaction r_α is formally defined in the usual chemical notation as

u_{α 1} X_{1} + \dots + u_{α T} X_{T} \overset{λ_{α}}{\to} v_{α 1} X_{1} + \dots + v_{α T} X_{T}

(2)

The non-negative integer vectors

{\vec{u}}_{α} = (u_{α 1}, u_{α 2}, \dots, u_{α T})

(3)

{\vec{v}}_{α} = (v_{α 1}, v_{α 2}, \dots, v_{α T})

(4)

with α = 1,2,..., R contain the stoichiometry coefficients for the reactions. It is assumed that the dynamics can be modeled as a Markov process where λ_α is the reaction rate for a reaction r_α, having the unit of inverse time. The dynamics defined in such a way is stochastic.

One possible way to describe the dynamics and characterize noise is to construct the master equation, solve it, and obtain the particle number distribution function $P (\vec{n}, t)$ which describes all stochastic properties of the system. However, both the computation and the direct inspection of the particle number distribution function are often tedious. It is more useful to investigate certain properties of the distribution function.

In practice this is done by computing various observables, or ensemble averages, as

⟨ f (\vec{n}) ⟩ \equiv \sum_{\vec{n}} f (\vec{n}) P (\vec{n}, t)

(5)

where f is the function that suitably parameterizes an observable of interest. A typical observable could be some low level moment such as the mean or the variance. Clearly, the direct use of (5) is not practical for large systems and one needs to avoid this route somehow. The idea is to project the details of the dynamics and obtain a coarse grained description by monitoring a selected set of observables instead of all configurations. These observables will be classified, i.e. labeled, in a precise mathematical way. For a fixed number of particle types, it is useful to introduce the vector space of all admissible labels. Formally, this is done through the following set of definitions.

Definition 1. The space of vectors that are used to label various correlation effects or, depending on the context, observables, will be denoted by Ω,

Ω \equiv \{\vec{m} | m_{i} \geq 0, i = 1, 2, \dots, T\}

(6)

where a variable m_i is a non-negative integer. For example, the average number of particles of a type i will be labeled as ${\vec{e}}_{i} = (0, 0, \dots, 0, 1, 0, \dots, 0)$ where the digit 1 appears in the i-th place.

Definition 2. It is useful to introduce the order, or norm, of a vector $\vec{m} \in Ω$ as the sum of its components,

∥\vec{m}∥ = \sum_{i = 1}^{T} m_{i}

(7)

This order will be used to classify various correlation effects. It is clear from the definition that

∥{\vec{m}}_{1} + {\vec{m}}_{2}∥ = ∥{\vec{m}}_{1}∥ + ∥{\vec{m}}_{2}∥

(8)

The full set of observables will be split into two disjoint subsets. The first subset contains the observables that are being included in the projection and the second set contains the observables that are omitted. The observables in the second set need to be expressed somehow in terms of the observables in the first set. The following definitions will be used to classify such sets.

Definition 3. The set of vectors $\vec{m} \in Ω$ that contains all vectors with orders 0,1,2,...,ξ will be denoted by Ω_ξ,

Ω_{ξ} = \{\vec{m} \in Ω | ∥\vec{m}∥ \leq ξ\}

(9)

where ξ ≥ 1 is an arbitrary integer used to classify various theories.
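Definitions 1 to 3 can be made concrete with a short enumeration. The following is only an illustrative sketch, not part of the original method description; the function names `norm` and `omega_xi` are chosen here for convenience.

```python
from itertools import product

def norm(m):
    """Order (norm) of a label vector, Eq. (7): the sum of its components."""
    return sum(m)

def omega_xi(T, xi):
    """All vectors with T non-negative integer components and norm <= xi, Eq. (9)."""
    candidates = product(range(xi + 1), repeat=T)
    return sorted((m for m in candidates if norm(m) <= xi),
                  key=lambda m: (norm(m), m))

# Two particle types, pair level (xi = 2): six label vectors in total.
for m in omega_xi(T=2, xi=2):
    print(m, "order", norm(m))
```

For T = 2 and ξ = 2 the set contains the null vector, the two single-particle labels, and the three pair labels.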

Definition 4. In a similar way,

Ω_{\vec{m}} = \{{\vec{m}}^{'} \in Ω | m_{1}^{'} \leq m_{1}, m_{2}^{'} \leq m_{2}, \dots, m_{T}^{'} \leq m_{T}\}

(10)

will denote the set of vectors ${\vec{m}}^{'} \in Ω$ that are smaller than or equal to a vector $\vec{m}$ in the component-wise sense expressed in (10).

The set Ω_ξ will be used to denote the set of observables being explicitly considered in the theory. For example, the mean field theory (the classical chemical kinetics) that neglects effects of fluctuations works with the first order effects where ξ = 1 (X = Singles). Thus the mean field theory would work by constructing equations for all $ρ_{{\vec{e}}_{i}}$ with ${\vec{e}}_{i} \in Ω_{1}$ . Sets of the type $Ω_{\vec{m}}$ will be used to limit various sums.

Definition 5. Finally, the set of all vectors that are not in Ω_ξ will be denoted as,

{\bar{Ω}}_{ξ} = Ω \setminus Ω_{ξ}

(11)

A vector from this set will be denoted by the bar character above the vector symbol, e.g. $\bar{m} \in {\bar{Ω}}_{ξ}$ .

Further, several definitions will prove useful that generalize well known operations on numbers to similar operations in Ω. These generalizations greatly compactify some of the mathematical expressions that will be discussed.

Definition 6. Let ${\vec{ω}}_{1}$ and ${\vec{ω}}_{2}$ be two arbitrary tuples of real numbers with rank T, such as $\vec{ω} = (ω_{1}, ω_{2}, \dots, ω_{T})$ . The set of all such vectors will be denoted by R^T . For any pair of such vectors the generalized power will be defined as

{\vec{ω}}_{1}^{{\vec{ω}}_{2}} \equiv ω_{11}^{ω_{21}} ω_{12}^{ω_{22}} ω_{13}^{ω_{23}} \dots ω_{1 i}^{ω_{2 i}} \dots ω_{1 T}^{ω_{2 T}}

(12)

Obviously, Ω⊂R^T and this definition can be also used for vectors in Ω. It is clear from the definition that

{\vec{ω}}^{{\vec{ω}}_{1} + {\vec{ω}}_{2}} = {\vec{ω}}^{{\vec{ω}}_{1}} {\vec{ω}}^{{\vec{ω}}_{2}}

(13)

Definition 7. The binomial like coefficient that involves a pair of vectors ${\vec{m}}_{1}$ and ${\vec{m}}_{2}$ from Ω is formally defined as

(\begin{matrix} {\vec{m}}_{1} \\ {\vec{m}}_{2} \end{matrix}) \equiv (\begin{matrix} m_{11} \\ m_{21} \end{matrix}) (\begin{matrix} m_{12} \\ m_{22} \end{matrix}) (\begin{matrix} m_{13} \\ m_{23} \end{matrix}) \dots (\begin{matrix} m_{1 i} \\ m_{2 i} \end{matrix}) \dots (\begin{matrix} m_{1 T} \\ m_{2 T} \end{matrix})

(14)

and it is assumed that a binomial coefficient $(\begin{matrix} m_{1 i} \\ m_{2 i} \end{matrix})$ in the product is zero if $m_{2 i} > m_{1 i}$ .

Definition 8. The factorial-like symbol applied to a vector $\vec{m}$ from Ω is generalized as

\vec{m}! \equiv m_{1}! m_{2}! m_{3}! \dots m_{i}! \dots m_{T}!

(15)

Definition 9. The product of two real numbers is generalized to the component-wise product of two vectors,

{\vec{ω}}_{1} ⊙ {\vec{ω}}_{2} = (ω_{11} ω_{21}, \dots, ω_{1 i} ω_{2 i}, \dots, ω_{1 T} ω_{2 T})

(16)

where ${\vec{ω}}_{1}$ and ${\vec{ω}}_{2}$ are two arbitrary vectors from R^T . Please note that the result of the operation is an element in the same set, ${\vec{ω}}_{1} ⊙ {\vec{ω}}_{2} \in R^{T}$ . Also, the following identity related to this definition will be useful later on,

{({\vec{ω}}_{1} ⊙ {\vec{ω}}_{2})}^{\vec{m}} = {\vec{ω}}_{1}^{\vec{m}} {\vec{ω}}_{2}^{\vec{m}}

(17)
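The vector operations of Definitions 6 to 9 translate directly into a few helper functions. This is a minimal sketch with hypothetical names (`vpow`, `vbinom`, `vfact`, `hadamard`), intended only to show that identities (13) and (17) follow component by component.

```python
from math import comb, factorial, isclose, prod

def vpow(w1, w2):
    """Generalized power, Eq. (12): product over i of w1[i] ** w2[i]."""
    return prod(a ** b for a, b in zip(w1, w2))

def vbinom(m1, m2):
    """Binomial-like coefficient, Eq. (14); math.comb gives 0 when m2[i] > m1[i]."""
    return prod(comb(a, b) for a, b in zip(m1, m2))

def vfact(m):
    """Factorial-like symbol, Eq. (15)."""
    return prod(factorial(a) for a in m)

def hadamard(w1, w2):
    """Component-wise product, Eq. (16)."""
    return tuple(a * b for a, b in zip(w1, w2))

# Arbitrary illustrative values (assumptions, not taken from the paper).
w1, w2 = (2.0, 3.0), (0.5, 4.0)
m1, m2 = (1, 2), (2, 0)
m_sum = tuple(a + b for a, b in zip(m1, m2))

# Identity (13): w^(m1 + m2) = w^m1 * w^m2
print(isclose(vpow(w1, m_sum), vpow(w1, m1) * vpow(w1, m2)))
# Identity (17): (w1 ⊙ w2)^m = w1^m * w2^m
print(isclose(vpow(hadamard(w1, w2), m1), vpow(w1, m1) * vpow(w2, m1)))
```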

Exact equations of motion

There are several types of observables one could choose to work with. In this work factorial moments will be used. A factorial moment will be labeled by the related non-negative integer vector $\vec{m} = (m_{1}, m_{2}, \dots, m_{T}) \in Ω$ and is defined as

ρ_{\vec{m}} = ⟨ (\begin{matrix} \vec{n} \\ \vec{m} \end{matrix}) \vec{m}! ⟩

(18)

It was shown in [17] that the equation system for the exact factorial moments is given by

\frac{d}{d t} ρ_{\vec{m}} (t) = \sum_{α = 1}^{R} λ_{α} \sum_{\vec{c} \in Ω_{R}} (\begin{matrix} \vec{m} \\ \vec{c} \end{matrix}) Γ_{α} (\vec{c}) ρ_{\vec{m} - \vec{c} + {\vec{u}}_{α}} (t)

(19)

and the structure of the equations will be explained in the following. Γ coefficients in the sum are given by

Γ_{α} (\vec{c}) = [(\begin{matrix} {\vec{v}}_{α} \\ \vec{c} \end{matrix}) - (\begin{matrix} {\vec{u}}_{α} \\ \vec{c} \end{matrix})] \frac{\vec{c}!}{{\vec{u}}_{α}!}

(20)

for all $\vec{m} \in Ω$ . A vector $\vec{c} \in Ω_{R}$ has non-negative integer components and will be referred to as a contraction vector. Contraction vectors emerge during the field theoretic derivation of the equations of motion when one uses the Wick theorem. More details regarding the field theoretic setup can be found in [14, 15]. One can derive the same set of equations in another way (not shown). Sums over contraction vectors will be of central interest in the following.

The space of all possible contractions is defined by reactions that can occur in the system.

From the definition of Γ coefficients in (20) one can see that for a fixed ${\vec{u}}_{α}$ or ${\vec{v}}_{α}$ the sum over the contraction vectors in (19) is restricted.

This is suggestive of the following formal definition of the space Ω_R⊂Ω:

Ω_{R} = ⋃_{α = 1}^{R} (Ω_{{\vec{u}}_{α}} ⋃ Ω_{{\vec{v}}_{α}})

(21)
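To illustrate Eqs. (20) and (21), the sketch below builds the contraction space and the Γ coefficients for a single hypothetical reaction 2A → A, written as u = (2,), v = (1,). The reaction and the function names are assumptions made for this example only.

```python
from fractions import Fraction
from itertools import product
from math import comb, factorial, prod

def vbinom(m1, m2):
    """Binomial-like coefficient of Eq. (14)."""
    return prod(comb(a, b) for a, b in zip(m1, m2))

def vfact(m):
    """Factorial-like symbol of Eq. (15)."""
    return prod(factorial(a) for a in m)

def omega_below(m):
    """The set of Eq. (10): all vectors m' with m'_i <= m_i for every i."""
    return set(product(*(range(a + 1) for a in m)))

def contraction_space(reactions):
    """Omega_R of Eq. (21): union of the sets generated by every u and v."""
    space = set()
    for u, v in reactions:
        space |= omega_below(u) | omega_below(v)
    return space

def big_gamma(u, v, c):
    """Gamma_alpha(c) of Eq. (20), kept exact with rational arithmetic."""
    return Fraction((vbinom(v, c) - vbinom(u, c)) * vfact(c), vfact(u))

# Hypothetical example reaction: 2A -> A.
reactions = [((2,), (1,))]
for c in sorted(contraction_space(reactions)):
    print(c, big_gamma((2,), (1,), c))
```

For this example the coefficients are 0, -1/2 and -1 for c = 0, 1, 2. The vanishing value at the null vector holds for any reaction, because both binomial-like coefficients in (20) equal one there.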

Although the sum over contraction vectors is finite, the equation system for the exact moments represents, in principle, an infinite hierarchy of equations: the right hand side of the equation for a given $ρ_{\vec{m}}$ can involve higher order moments, since it may happen that $∥\vec{m} - \vec{c} + {\vec{u}}_{α}∥ > ∥\vec{m}∥$ . The hierarchy of equations for the exact moments is not particularly useful as it stands. However, it can be used to devise approximation schemes. If higher order moments can be expressed in terms of a few lower order ones then the equation system closes.

A way of closing the hierarchy: the XARNES method

The XARNES method is based on the closure ansatz which is constructed by using the concept of correlation forms [14, 15, 17]:

ln ν_{\vec{m}} (t) = \sum_{\hat{m} \in Ω_{ξ}} (\begin{matrix} \vec{m} \\ \hat{m} \end{matrix}) w_{\hat{m}} (t)

(22)

where $w_{\hat{m}}$ denotes the correlation form labeled by a vector $\hat{m} \in Ω$ . The detailed motivation behind this equation is given in [17]. It is assumed that correlation forms with orders above a given threshold ξ are small, $w_{\vec{m}} (t) \approx 0 \Leftrightarrow ∥\vec{m}∥ > ξ$ . Since a moment $ν_{\vec{m}} (t)$ is not exactly equal to the related moment $ρ_{\vec{m}} (t)$ , two different symbols, ν and ρ, are used to denote them. However, if the approximation above works well, possibly when the number of vectors in Ω_ξ becomes large, their values should be close.

Please note that if a factorial moment is zero, the left hand side of Eq. (22) becomes infinite owing to the singularity of the logarithmic function. This implies that the related correlation form also becomes infinite. Clearly, by construction, the XARNES ansatz is somewhat ambiguous for cases when some factorial moments vanish. In what follows it will be assumed that all factorial moments are strictly larger than zero but they can be arbitrarily close to zero.

It was shown previously [14, 15, 17] that the assumption (22) is an implicit ansatz for defining the function that expresses the higher order non-base factorial moments $ν_{\bar{m}} (t)$ with $\bar{m} \in {\bar{Ω}}_{ξ}$ in terms of the base moments $\vec{ν} (t) \equiv (\dots, ν_{\hat{m}} (t), \dots)$ with $\hat{m} \in Ω_{ξ}$ :

ν_{\bar{m}} (t) = ψ_{\bar{m}} (\vec{ν} (t)); \bar{m} \in {\bar{Ω}}_{ξ}

(23)

where the moment closure function is given by

ψ_{\bar{m}} (\vec{ν} (t)) = \prod_{\hat{m} \in Ω_{ξ}} {ν_{\hat{m}} (t)}^{γ_{\hat{m}}^{\bar{m}}}

(24)

and γ coefficients are defined as

γ_{\hat{m}}^{\bar{m}} = \sum_{{\hat{m}}^{'} \in Ω_{ξ}} (\begin{matrix} \bar{m} \\ {\hat{m}}^{'} \end{matrix}) {(C^{- 1})}_{{\hat{m}}^{'}, \hat{m}}

(25)

with the matrix C specified as

C_{{\hat{m}}_{1}, {\hat{m}}_{2}} = (\begin{matrix} {\hat{m}}_{1} \\ {\hat{m}}_{2} \end{matrix})

(26)

Also, by construction one has that $ψ_{\hat{m}} (\vec{ν} (t)) = ν_{\hat{m}} (t)$ .
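The γ coefficients of Eq. (25) can be computed without forming the inverse matrix explicitly. The sketch below is an assumption about one convenient implementation, not the procedure used in the cited works: it solves the equivalent linear system Σ γ_{m̂₂} C_{m̂₂, m̂₁} = binom(m̄, m̂₁) by back-substitution, which works because C_{m̂₂, m̂₁} vanishes unless m̂₁ ≤ m̂₂ component-wise and the diagonal entries equal one.

```python
from itertools import product
from math import comb, prod

def vbinom(m1, m2):
    """Binomial-like coefficient of Eq. (14)."""
    return prod(comb(a, b) for a, b in zip(m1, m2))

def omega_xi(T, xi):
    """Base label vectors: non-negative integer vectors with norm <= xi."""
    return [m for m in product(range(xi + 1), repeat=T) if sum(m) <= xi]

def closure_exponents(m_bar, xi):
    """Exponents gamma of Eq. (25) for a vector m_bar, as a dict keyed by m_hat.

    Vectors are processed from the highest order down, so every gamma that
    can contribute to the equation for m1 is already known at that point.
    """
    base = sorted(omega_xi(len(m_bar), xi), key=sum, reverse=True)
    gamma = {}
    for m1 in base:
        known = sum(g * vbinom(m2, m1) for m2, g in gamma.items())
        gamma[m1] = vbinom(m_bar, m1) - known
    return gamma

# One species, pair level (xi = 2), third factorial moment.
print(closure_exponents((3,), 2))
```

For one species with ξ = 2 and m̄ = 3 the exponents are 3, -3 and 1 for the labels 2, 1 and 0, so the closure function reads ψ₃ = ν₂³ ν₁⁻³ ν₀.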

Finally, combining (24) with (19) gives the XARNES system of equations,

\frac{d}{d t} ν_{\hat{m}} (t) = \sum_{α = 1}^{R} λ_{α} \sum_{\vec{c} \in Ω_{R}} (\begin{matrix} \hat{m} \\ \vec{c} \end{matrix}) Γ_{α} (\vec{c}) ψ_{\hat{m} - \vec{c} + {\vec{u}}_{α}} (\vec{ν} (t))

(27)

for $\hat{m} \in Ω_{ξ}$ .
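As an illustration of how the right hand side of Eq. (27) is assembled, consider a single species (T = 1) at the pair level (ξ = 2) with one hypothetical reaction 2A → A. The reaction, the rate value, and all function names below are assumptions made for this sketch; it is not the implementation used in [17].

```python
from fractions import Fraction
from math import comb, factorial

def big_gamma(u, v, c):
    """Gamma_alpha(c) of Eq. (20) for a single species."""
    return Fraction((comb(v, c) - comb(u, c)) * factorial(c), factorial(u))

def closure_exponents(m_bar, xi):
    """Exponents of Eq. (25) for T = 1, found by back-substitution."""
    gamma = {}
    for m1 in range(xi, -1, -1):
        known = sum(g * comb(m2, m1) for m2, g in gamma.items())
        gamma[m1] = comb(m_bar, m1) - known
    return gamma

def psi(m, nu, xi):
    """Closure function: base moments pass through, others follow Eq. (24)."""
    if m <= xi:
        return nu[m]
    value = 1.0
    for m_hat, g in closure_exponents(m, xi).items():
        value *= nu[m_hat] ** g
    return value

def xarnes_rhs(nu, reactions, xi):
    """Right hand side of Eq. (27) for m_hat = 0..xi; nu[m] holds nu_m."""
    rhs = []
    for m_hat in range(xi + 1):
        total = 0.0
        for rate, u, v in reactions:
            # Gamma vanishes for c > max(u, v), so larger c need not be visited.
            for c in range(max(u, v) + 1):
                coeff = comb(m_hat, c) * big_gamma(u, v, c)
                if coeff != 0:
                    total += rate * float(coeff) * psi(m_hat - c + u, nu, xi)
        rhs.append(total)
    return rhs

# Base moments of a Poisson state with mean 2: nu_m = 2 ** m.
nu0 = [1.0, 2.0, 4.0]
print(xarnes_rhs(nu0, [(1.0, 2, 1)], xi=2))
```

With these inputs the closure gives ψ₃ = 8, which equals μ³ for μ = 2, and the three derivatives evaluate to 0, -2 and -12. The zero in the first slot reflects probability conservation.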

It is clear from the form of equation (19) that $ρ_{\vec{0}} = ⟨1⟩$ does not depend on time. For $\vec{m} = \vec{0}$ the sum over contraction vectors can only contain one vector, $\vec{c} = \vec{0}$ , and since $Γ_{α} (\vec{0}) = 0$ the time derivative of $ρ_{\vec{0}}$ vanishes. This expresses the fundamental probability conservation requirement for any reasonable theory. In contrast to the $ρ_{\vec{0}}$ moment, a correlation moment with $\vec{m} \neq \vec{0}$ is a real dynamic quantity. Based on this one might partition the Ω_ξ space into two spaces. The first space should consist of the null vector, while the second space would consist of all other vectors in Ω_ξ. It is possible to show that the values of γ are the same regardless of whether the zero vector is singled out or not.

An intriguing similarity

A set of equations similar to (24)-(26) has been obtained in a slightly different context [10, 11], where the multiplicative ansatz given in (24) was the starting point in developing a moment closure method for zero-centered moments, $η_{\vec{m}}$ , which in the notation of the present work would be given by

η_{\vec{m}} \equiv ⟨{\vec{n}}^{\vec{m}}⟩

(28)

The goal was to determine the values of the γ coefficients in Eq (24) from the requirement that time derivatives of exact and approximate moments match at the time instance when the distribution function resembles the distribution function of the pure system. It was shown that the derivatives match best if the coefficients γ are given exactly by (25). Thus the equations would be identical if not for the fact that the related equations in [10, 11] are for zero-centered moments.

This remarkable coincidence, where the same set of coefficients is obtained in two different ways, is rather intriguing. In particular, it strongly suggests that the XARNES method might have advantageous properties when it comes to the derivative matching between the exact and the approximate moments computed for a suitable initial condition. It will be shown by employing a strict mathematical analysis, in the same way as done in [10, 11], that this is indeed the case for the Poisson initial condition. This in turn explains the previous numerical observation from [14, 15, 17] that the XARNES method performs better when the particle number distribution function is Poisson-like as compared to the situation when the distribution resembles the one of the pure system.

Derivative matching setup

Here a procedure similar to that in [10, 11] will be carried out to show that the XARNES ansatz expressed in (24) and (25) has some advantageous derivative matching properties. The original procedure will be implemented as follows. Consider Eqs. (19) and (27) at time t = t₀ where the particle number distribution function is strictly given by the uncorrelated multivariate Poisson distribution. In such a case all correlation forms are zero except the ones specified by vectors ${\vec{e}}_{i}, i = 1, \dots, T$ . Thus the multivariate (uncorrelated) Poisson distribution is parameterized by parameters $\vec{μ} = (μ_{1}, μ_{2}, \dots, μ_{T}) \in R^{T}$ where $μ_{i} = \exp (w_{{\vec{e}}_{i}})$ . By a trivial application of Eq. (22) one can see that the exact factorial moments of the reacting system computed at t = t₀ are given by

ρ_{\vec{m}} (t_{0}) \equiv {\vec{μ}}^{\vec{m}}

(29)

for any $\vec{m} \in Ω$ .
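Eq. (29) can be checked numerically by summing the definition (18) over a truncated Poisson distribution. The sketch below assumes independent species, so that the multivariate moment factorizes; the truncation point `n_max` and the parameter values are arbitrary choices made for illustration.

```python
from math import exp, isclose, lgamma, log, perm, prod

def poisson_pmf(n, mu):
    """Poisson probability, evaluated through logarithms to avoid overflow."""
    return exp(n * log(mu) - mu - lgamma(n + 1))

def factorial_moment_1d(m, mu, n_max=200):
    """<binom(n, m) m!> = <n (n-1) ... (n-m+1)>, truncated at n_max."""
    return sum(perm(n, m) * poisson_pmf(n, mu) for n in range(n_max + 1))

def factorial_moment(m_vec, mu_vec):
    """For the uncorrelated case the sum in Eq. (5) factorizes over species."""
    return prod(factorial_moment_1d(m, mu) for m, mu in zip(m_vec, mu_vec))

mu_vec = (1.5, 0.7)   # hypothetical Poisson parameters
m_vec = (2, 1)
exact = prod(mu ** m for mu, m in zip(mu_vec, m_vec))
print(isclose(factorial_moment(m_vec, mu_vec), exact, rel_tol=1e-9))
```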

At t = t₀ the base factorial moments that are used in the XARNES method have to be chosen. This choice specifies the boundary condition for the XARNES equation of motion in (27). Naturally, for the purposes of comparing time derivatives it will be assumed that

ν_{\hat{m}} (t_{0}) = ρ_{\hat{m}} (t_{0}) = {\vec{μ}}^{\hat{m}}; \hat{m} \in Ω_{ξ}

(30)

since if the values of the exact and the approximate base moments do not match, their derivatives are unlikely to match either. The question is: given that the values of the base and the exact moments are the same, can one expect that time derivatives of these quantities match as well?

In order to make some progress in answering this question a couple of identities involving γ coefficients will be needed. These identities are hard to prove by using the explicit form of these coefficients given in (25). The formalism of generating functions will be used instead. Before doing that the following two definitions need to be stated.

Definition 10. Symbol $P_{ε} (\vec{ω})$ will be used to denote a polynomial

P_{ε} (\vec{ω}) = \sum_{\vec{m} \in Ω}^{∥\vec{m}∥ \geq ε} A_{\vec{m}} {\vec{ω}}^{\vec{m}}

(31)

where $A_{\vec{m}}$ are arbitrary real valued coefficients. No other restrictions are imposed on the sum.

Definition 11. Let $P_{ε} (\vec{ω})$ be a polynomial as defined previously. Symbol $[{\vec{ω}}^{\vec{m}}]$ will be used to denote the operator that extracts the coefficient in front of a particular ${\vec{ω}}^{\vec{m}}$ term in the polynomial, i.e. $A_{\vec{m}}$ :

[{\vec{ω}}^{{\vec{m}}_{0}}] \sum_{\vec{m} \in Ω_{*}} A_{\vec{m}} {\vec{ω}}^{\vec{m}} = \begin{cases} A_{{\vec{m}}_{0}} & {\vec{m}}_{0} \in Ω_{*} \\ 0 & {\vec{m}}_{0} \notin Ω_{*} \end{cases}

(32)

for any Ω_* ⊂ Ω. Likewise,

[{\vec{ω}}^{{\vec{m}}_{0}}] P_{ε} (\vec{ω}) = \begin{cases} A_{{\vec{m}}_{0}} & ∥{\vec{m}}_{0}∥ \geq ε \\ 0 & ∥{\vec{m}}_{0}∥ < ε \end{cases}

(33)

The following lemma is extremely useful for proving a couple of identities that will be needed later. The lemma has the form of a generating function identity.

Lemma 1. Let coefficients γ be defined by Eqs. (22) and (35). In other words, the only requirement imposed on these coefficients is that they are used to parameterize higher order factorial moments in terms of the base moments with the XARNES ansatz implied. In such a case these coefficients obey the following identity

\begin{gathered} \sum_{\hat{m} \in Ω_{ξ}} γ_{\hat{m}}^{\bar{m}} {(1 + \vec{ω})}^{\hat{m}} = \\ {(1 + \vec{ω})}^{\bar{m}} + P_{ξ + 1} (\vec{ω}) \end{gathered}

(34)

where $\vec{ω}$ and 1 ≡ (1,1,...,1) are vectors in R^T and $\bar{m} \in {\bar{Ω}}_{ξ}$ . Eq. (34) will be referred to as the generating function equation.

Proof. This identity can be proven as follows. First, one combines the XARNES ansatz (24) rewritten as

ln ν_{\bar{m}} = \sum_{\hat{m} \in Ω_{ξ}} γ_{\hat{m}}^{\bar{m}} ln ν_{\hat{m}}

(35)

with the definition of the correlation forms (22) to obtain

\sum_{\hat{m} \in Ω_{ξ}} γ_{\hat{m}}^{\bar{m}} ln ν_{\hat{m}} = \sum_{\hat{m} \in Ω_{ξ}} (\begin{matrix} \bar{m} \\ \hat{m} \end{matrix}) w_{\hat{m}}

(36)

Assuming that all correlation forms have predefined values given by

w_{\hat{m}} = {\vec{ω}}^{\hat{m}}

(37)

where $\vec{ω} \in R^{T}$ is arbitrary but fixed, leads to the equation that is central for the proof of the lemma,

\sum_{\hat{m} \in Ω_{ξ}} γ_{\hat{m}}^{\bar{m}} ln ν_{\hat{m}} = \sum_{\hat{m} \in Ω_{ξ}} (\begin{matrix} \bar{m} \\ \hat{m} \end{matrix}) {\vec{ω}}^{\hat{m}}

(38)

First, we focus on the left hand side of the above equation. From (37), and (22) with $\vec{m} = {\hat{m}}_{1} \in Ω_{ξ}$ ,

ln ν_{{\hat{m}}_{1}} = \sum_{{\hat{m}}_{2} \in Ω_{ξ}} (\begin{matrix} {\hat{m}}_{1} \\ {\hat{m}}_{2} \end{matrix}) {\vec{ω}}^{{\hat{m}}_{2}}

(39)

which by the use of the binomial theorem can be recognized as

ln ν_{\hat{m}} = {(1 + \vec{ω})}^{\hat{m}}

(40)

This turns the left hand side of (38) into

\sum_{\hat{m} \in Ω_{ξ}} γ_{\hat{m}}^{\bar{m}} ln ν_{\hat{m}} = \sum_{\hat{m} \in Ω_{ξ}} γ_{\hat{m}}^{\bar{m}} {(1 + \vec{ω})}^{\hat{m}}

(41)

The sum over the vectors in Ω_ξ in the right hand side of (38) can be extended to include all vectors in $Ω_{\bar{m}}$ . Naturally, these terms have to be subtracted afterwards. By doing that one obtains

\sum_{\hat{m} \in Ω_{ξ}} (\begin{matrix} \bar{m} \\ \hat{m} \end{matrix}) {\vec{ω}}^{\hat{m}} = \sum_{\vec{m} \in Ω_{\bar{m}}} (\begin{matrix} \bar{m} \\ \vec{m} \end{matrix}) {\vec{ω}}^{\vec{m}} - \sum_{{\bar{m}}^{'} \in Ω_{\bar{m}} \setminus Ω_{ξ}} (\begin{matrix} \bar{m} \\ {\bar{m}}^{'} \end{matrix}) {\vec{ω}}^{{\bar{m}}^{'}}

(42)

By the use of the binomial theorem the first term on the right hand side of the equation can be recognized as the first term on the right hand side of (34). Likewise, since every vector in $Ω_{\bar{m}} \setminus Ω_{ξ}$ has order of at least ξ + 1, the second term in the equation can be characterized as $P_{ξ + 1} (\vec{ω})$ . Thus the equation above becomes,

\sum_{\hat{m} \in Ω_{ξ}} (\begin{matrix} \bar{m} \\ \hat{m} \end{matrix}) {\vec{ω}}^{\hat{m}} = {(1 + \vec{ω})}^{\bar{m}} + P_{ξ + 1} (\vec{ω})

(43)

Finally, the Lemma follows by using (38), (41), and (43).
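The generating function equation (34) can be verified directly for a single species by expanding both sides in powers of ω. The sketch below is an illustrative check under the assumption T = 1, with hypothetical function names; it confirms that the difference of the two sides contains no powers of ω below ξ + 1.

```python
from math import comb

def closure_exponents(m_bar, xi):
    """Exponents of Eq. (25) for T = 1, found by back-substitution."""
    gamma = {}
    for m1 in range(xi, -1, -1):
        known = sum(g * comb(m2, m1) for m2, g in gamma.items())
        gamma[m1] = comb(m_bar, m1) - known
    return gamma

def residual_coefficients(m_bar, xi):
    """Coefficients of sum_m gamma_m (1 + w)^m - (1 + w)^m_bar in powers of w.

    The coefficient of w^k in (1 + w)^m is comb(m, k), so the k-th entry
    of the returned list is the coefficient of w^k in P_{xi+1}(w).
    """
    gamma = closure_exponents(m_bar, xi)
    return [
        sum(g * comb(m, k) for m, g in gamma.items()) - comb(m_bar, k)
        for k in range(m_bar + 1)
    ]

print(residual_coefficients(3, 2))
print(residual_coefficients(5, 2))
```

For m̄ = 3 and ξ = 2 the residual is -ω³, and for m̄ = 5 it is -10ω³ - 5ω⁴ - ω⁵. In both cases the lowest power is ξ + 1 = 3, as the Lemma requires.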

A couple of useful identities will be proven that follow from this Lemma and are stated as three corollaries. The first two corollaries can be proven easily without using the Lemma, e.g. as in [10, 11]. In here they are proven in a different way to illustrate how to use the Lemma. The third corollary is a highly non trivial statement that would be very hard to prove by direct use of the explicit form of γ coefficients.

Corollary 1. Let coefficients γ be given and let them satisfy the condition of Lemma 1. Then,

\sum_{{\hat{m}}_{2} \in Ω_{ξ}} γ_{{\hat{m}}_{2}}^{\bar{m}} (\begin{matrix} {\hat{m}}_{2} \\ {\hat{m}}_{1} \end{matrix}) = (\begin{matrix} \bar{m} \\ {\hat{m}}_{1} \end{matrix}); {\hat{m}}_{1} \in Ω_{ξ}

(44)

Proof. The proof is trivial. One only needs to apply the operator $[{\vec{ω}}^{{\hat{m}}_{1}}]$ to both sides of the generating function equation (34).

Corollary 2. Let coefficients γ be given and let them satisfy the condition of Lemma 1. Then,

\sum_{\hat{m} \in Ω_{ξ}} \hat{m} γ_{\hat{m}}^{\bar{m}} = \bar{m}

(45)

Proof. The identity can be obtained by evaluating the gradient of Eq. (34) with respect to $\vec{ω}$ at the point $\vec{ω} = \vec{0}$ .
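The gradient step in this proof can be spelled out component by component. The following is a short worked version under the definitions above; it uses only the fact that every term of $P_{ξ + 1}$ has order of at least ξ + 1 ≥ 2, so its first derivatives vanish at the origin.

```latex
\frac{\partial}{\partial ω_{i}} {(1 + \vec{ω})}^{\hat{m}} \Big|_{\vec{ω} = \vec{0}} = \hat{m}_{i},
\qquad
\frac{\partial}{\partial ω_{i}} P_{ξ + 1} (\vec{ω}) \Big|_{\vec{ω} = \vec{0}} = 0
\quad (ξ \geq 1)

\Rightarrow \quad
\sum_{\hat{m} \in Ω_{ξ}} γ_{\hat{m}}^{\bar{m}} \hat{m}_{i} = \bar{m}_{i},
\qquad i = 1, 2, \dots, T
```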

Corollary 3. Let the coefficients γ satisfy the generating function equation (34). Then the following identity holds

\sum_{\hat{m} \in Ω_{ξ}} γ_{\hat{m}}^{\bar{m}} (\begin{matrix} \hat{m} \\ {\vec{c}}_{1} \end{matrix}) (\begin{matrix} \hat{m} \\ {\vec{c}}_{2} \end{matrix}) = (\begin{matrix} \bar{m} \\ {\vec{c}}_{1} \end{matrix}) (\begin{matrix} \bar{m} \\ {\vec{c}}_{2} \end{matrix})

(46)

provided vectors ${\vec{c}}_{1}$ and ${\vec{c}}_{2}$ satisfy $∥{\vec{c}}_{1} + {\vec{c}}_{2}∥ \leq ξ$ .

Proof. First, one has to assume that

\vec{ω} = {\vec{ω}}_{1} + {\vec{ω}}_{2} + {\vec{ω}}_{1} ⊙ {\vec{ω}}_{2}

(47)

where ${\vec{ω}}_{1}, {\vec{ω}}_{2} \in R^{T}$ are arbitrary but bound by the above constraint. Also, it is useful to realize that

1 + \vec{ω} = 1 + {\vec{ω}}_{1} + {\vec{ω}}_{2} + {\vec{ω}}_{1} ⊙ {\vec{ω}}_{2} = (1 + {\vec{ω}}_{1}) ⊙ (1 + {\vec{ω}}_{2})

(48)

By using the above expression in the generating function formula, and (17), one obtains

\begin{gathered} \sum_{\hat{m} \in Ω_{ξ}} γ_{\hat{m}}^{\bar{m}} {(1 + {\vec{ω}}_{1})}^{\hat{m}} {(1 + {\vec{ω}}_{2})}^{\hat{m}} = \\ {(1 + {\vec{ω}}_{1})}^{\bar{m}} {(1 + {\vec{ω}}_{2})}^{\bar{m}} + \\ P_{ξ + 1} ({\vec{ω}}_{1} + {\vec{ω}}_{2} + {\vec{ω}}_{1} ⊙ {\vec{ω}}_{2}) \end{gathered}

(49)

In the third step one has to apply the operators $[{\vec{ω}}_{1}^{{\vec{c}}_{1}}]$ and $[{\vec{ω}}_{2}^{{\vec{c}}_{2}}]$ to the generating function expression above. Applying the operators to the left hand side of the equation above gives the left hand side of Eq. (46). Likewise, applying the operators to the first term on the right hand side of the equation gives the right hand side of Eq. (46). Thus what is left to show is that the action of the operators on the remaining second term results in zero.

The second term has the following structure

\begin{gathered} P_{ξ + 1} ({\vec{ω}}_{1} + {\vec{ω}}_{2} + {\vec{ω}}_{1} ⊙ {\vec{ω}}_{2}) = \\ \sum_{\vec{p}, \vec{q}, \vec{s} \in Ω}^{∥\vec{p} + \vec{q} + \vec{s}∥ \geq ξ + 1} A_{\vec{p}, \vec{q}, \vec{s}} {\vec{ω}}_{1}^{\vec{p}} {\vec{ω}}_{2}^{\vec{q}} {({\vec{ω}}_{1} ⊙ {\vec{ω}}_{2})}^{\vec{s}} \end{gathered}

(50)

where A coefficients can be found but their exact form is not relevant. Next, by using (17) and (13) the equation becomes

\begin{gathered} P_{ξ + 1} ({\vec{ω}}_{1} + {\vec{ω}}_{2} + {\vec{ω}}_{1} ⊙ {\vec{ω}}_{2}) = \\ \sum_{\vec{p}, \vec{q}, \vec{s} \in Ω}^{∥\vec{p} + \vec{q} + \vec{s}∥ \geq ξ + 1} A_{\vec{p}, \vec{q}, \vec{s}} {\vec{ω}}_{1}^{\vec{p} + \vec{s}} {\vec{ω}}_{2}^{\vec{q} + \vec{s}} \end{gathered}

(51)

and by a simple change of variables $\vec{p} + \vec{s} = {\vec{c}}_{1}$ and $\vec{q} + \vec{s} = {\vec{c}}_{2}$ one arrives at

\begin{gathered} P_{ξ + 1} ({\vec{ω}}_{1} + {\vec{ω}}_{2} + {\vec{ω}}_{1} ⊙ {\vec{ω}}_{2}) = \\ \sum_{{\vec{c}}_{1}, {\vec{c}}_{2} \in Ω}^{∥{\vec{c}}_{1} + {\vec{c}}_{2}∥ \geq ξ + 1} A_{{\vec{c}}_{1}, {\vec{c}}_{2}} {\vec{ω}}_{1}^{{\vec{c}}_{1}} {\vec{ω}}_{2}^{{\vec{c}}_{2}} \end{gathered}

(52)

To obtain the final form of the equation, and in particular the condition specified in the sum, one has to use the fact that $∥{\vec{c}}_{1} + {\vec{c}}_{2}∥ = ∥\vec{p} + \vec{q} + 2 \vec{s}∥ \geq ∥\vec{p} + \vec{q} + \vec{s}∥ \geq ξ + 1$ . Finally, from the last equation one can see that, indeed, the application of the operator $[{\vec{ω}}_{1}^{{\vec{c}}_{1}}] [{\vec{ω}}_{2}^{{\vec{c}}_{2}}]$ gives zero provided that $∥{\vec{c}}_{1} + {\vec{c}}_{2}∥ \leq ξ$ . This proves the corollary. Please note that it would be very hard, if not impossible, to obtain such a result from the explicit expression for γ coefficients given in (25).

Higher order generalizations of this corollary are possible. For example, by using

1 + \vec{ω} = (1 + {\vec{ω}}_{1}) ⊙ (1 + {\vec{ω}}_{2}) ⊙ (1 + {\vec{ω}}_{3})

(53)

one can easily prove that

\sum_{\hat{m} \in Ω_{ξ}} γ_{\hat{m}}^{\bar{m}} (\begin{matrix} \hat{m} \\ {\vec{c}}_{1} \end{matrix}) (\begin{matrix} \hat{m} \\ {\vec{c}}_{2} \end{matrix}) (\begin{matrix} \hat{m} \\ {\vec{c}}_{3} \end{matrix}) = (\begin{matrix} \bar{m} \\ {\vec{c}}_{1} \end{matrix}) (\begin{matrix} \bar{m} \\ {\vec{c}}_{2} \end{matrix}) (\begin{matrix} \bar{m} \\ {\vec{c}}_{3} \end{matrix})

(54)

provided that vectors ${\vec{c}}_{1}$ , ${\vec{c}}_{2}$ , and ${\vec{c}}_{3}$ satisfy $∥{\vec{c}}_{1} + {\vec{c}}_{2} + {\vec{c}}_{3}∥ \leq ξ$ .
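The corollary and its three-vector generalization (54) can be spot-checked numerically. The sketch below is an illustration only and rests on assumptions that are not spelled out in this section: a single species, base orders $\hat{m} = 1, \dots, ξ$, and γ coefficients obtained from the scalar form of the defining relation, $\sum_{\hat{m}} γ_{\hat{m}}^{\bar{m}} (\begin{matrix} \hat{m} \\ k \end{matrix}) = (\begin{matrix} \bar{m} \\ k \end{matrix})$ for k = 1, ..., ξ. All function names are hypothetical.

```python
from itertools import product
from math import comb


def gamma_coefficients(xi, m_bar):
    """Solve sum_{m=1..xi} gamma[m] * C(m, k) = C(m_bar, k) for k = 1..xi.

    Assumed single-species form of the relation that defines the gamma
    coefficients. Since C(m, k) = 0 for k > m and C(k, k) = 1, the system
    is triangular and is solved by back-substitution from k = xi down to 1.
    """
    gamma = {}
    for k in range(xi, 0, -1):
        rhs = comb(m_bar, k)
        for m in range(k + 1, xi + 1):
            rhs -= gamma[m] * comb(m, k)
        gamma[k] = rhs
    return gamma


def product_identity_holds(xi, m_bar, contractions):
    """Check sum_m gamma[m] * prod_i C(m, c_i) == prod_i C(m_bar, c_i)."""
    gamma = gamma_coefficients(xi, m_bar)
    lhs = 0
    for m, g in gamma.items():
        term = g
        for c in contractions:
            term *= comb(m, c)
        lhs += term
    rhs = 1
    for c in contractions:
        rhs *= comb(m_bar, c)
    return lhs == rhs


def check_all(xi, m_bar, n_vectors):
    """Test every tuple of n_vectors contraction orders >= 1 with total <= xi."""
    tuples = [
        cs for cs in product(range(1, xi + 1), repeat=n_vectors) if sum(cs) <= xi
    ]
    return all(product_identity_holds(xi, m_bar, cs) for cs in tuples)
```

Under these assumptions, `gamma_coefficients(2, 3)` gives $γ_{1}^{3} = -3$ and $γ_{2}^{3} = 3$, and `check_all(xi, m_bar, 2)` and `check_all(xi, m_bar, 3)` test the two-vector and three-vector identities, respectively, for all admissible contraction orders.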

Now we are ready to prove some derivative matching results. The first question is whether the XARNES ansatz and the related moment closure function are consistent for the Poisson initial condition. This is answered in the form of the following lemma.

Lemma 2. If the base and the exact factorial moments match at t = t₀, and the particle number distribution function at this time instance is the Poisson distribution, i.e.

ρ_{\hat{m}} (t_{0}) = ν_{\hat{m}} (t_{0}) = {\vec{μ}}^{\hat{m}}; \hat{m} \in Ω_{ξ}

(55)

then non-base moments also match:

ρ_{\bar{m}} (t_{0}) = ν_{\bar{m}} (t_{0}) = ψ_{\bar{m}} (ν (t_{0})) = {\vec{μ}}^{\bar{m}}

(56)

for all $\bar{m} \in {\bar{Ω}}_{ξ}$ .

Proof. The direct use of the XARNES ansatz gives

\ln ν_{\bar{m}} = \sum_{\hat{m} \in Ω_{ξ}} \hat{m} γ_{\hat{m}}^{\bar{m}} \ln \vec{μ} = \bar{m} \ln \vec{μ}

(57)

where (45) was used in the last step.
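As a minimal worked instance, assume a single species with ξ = 2 and the non-base order $\bar{m} = 3$. Solving the defining relation for the γ coefficients in this scalar setting (an assumption made here purely for illustration) gives $γ_{1}^{3} = -3$ and $γ_{2}^{3} = 3$, so that Eq. (57) reads

\ln ν_{3} = (1 \cdot γ_{1}^{3} + 2 \cdot γ_{2}^{3}) \ln μ = (-3 + 6) \ln μ = 3 \ln μ

which is the statement $ν_{3} = ν_{2}^{3} / ν_{1}^{3} = μ^{6} / μ^{3} = μ^{3}$.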

Finally, we arrive at the central discussion of this work, i.e. the discussion of various time derivatives of exact and approximate moments and when and how they differ. To do this it is instructive to investigate the difference between the exact and the XARNES equation systems. In this context one can easily show that the following equation system is valid,

\begin{gathered} ρ_{\hat{m}}^{[h]} - ν_{\hat{m}}^{[h]} = \sum_{α = 1}^{R} λ_{α} \sum_{\vec{c} \in Ω_{R}} (\begin{matrix} \hat{m} \\ \vec{c} \end{matrix}) Γ_{α} (\vec{c}) \times \\ \{\sum_{{\hat{m}}_{1} \in Ω_{ξ}} [ρ_{{\hat{m}}_{1}}^{[h - 1]} - ψ_{{\hat{m}}_{1}}^{[h - 1]}] δ_{{\hat{m}}_{1}, \hat{m} - \vec{c} + {\vec{u}}_{α}} + \\ \sum_{\bar{m} \in {\bar{Ω}}_{ξ}} [ρ_{\bar{m}}^{[h - 1]} - ψ_{\bar{m}}^{[h - 1]}] δ_{\bar{m}, \hat{m} - \vec{c} + {\vec{u}}_{α}}\} \end{gathered}

(58)

where the notation $φ^{[h]} \equiv \frac{d^{h}}{d t^{h}} φ (t) |_{t = t_{0}}$ with h = 1, 2, 3, ... is implied for any function φ(t), and φ^[0] ≡ φ(t₀). The equation system above will be useful for proving a series of derivative matching theorems, in a similar vein to [10, 11].

Three derivative matching theorems

Three derivative matching theorems will be proven, one theorem per derivative. The first two theorems have been proven in [10, 11] for the pure state. Here they are proven for the Poisson state. The third theorem is entirely new.

The structure of the proofs is somewhat different from that in [10, 11] since here the focus is on factorial moments. It seems that the equation system for factorial moments is more compact than for other types of moments. As a consequence, the theorems do not contain the error terms ε that were used in [10, 11]. The theorems proven here are more general since they hold even for multi-particle reactions, not just binary reactions. Again, as stated previously, all components of $\vec{μ}$ are taken to be strictly larger than zero. If one of the components is zero the XARNES ansatz does not work.

Theorem 1. If the base and the exact factorial moments match at t = t₀, and the particle number distribution function is the Poisson distribution, i.e., if

ρ_{\hat{m}} (t_{0}) = ν_{\hat{m}} (t_{0}) = {\vec{μ}}^{\hat{m}}; \hat{m} \in Ω_{ξ}

(59)

then the first order derivatives in time also match for the base factorial moments:

\frac{d}{d t} ρ_{\hat{m}} (t) |_{t = t_{0}} = \frac{d}{d t} ν_{\hat{m}} (t) |_{t = t_{0}}

(60)

for all $\hat{m} \in Ω_{ξ}$ .

Proof. The theorem can be easily proven by considering Eq. (58) with h = 1. By the assumptions of the theorem one has $ρ_{\hat{m}}^{[0]} - ψ_{\hat{m}}^{[0]} = 0$, which eliminates the sum over $\hat{m} \in Ω_{ξ}$ in (58). From Lemma 2 it follows that $ρ_{\bar{m}}^{[0]} - ψ_{\bar{m}}^{[0]} = 0$, which eliminates the sum over $\bar{m} \in {\bar{Ω}}_{ξ}$ . This proves the theorem.

Theorem 2. If the base and the exact factorial moments match at t = t₀, and the particle number distribution function is the Poisson distribution, i.e., if

ρ_{\hat{m}} (t_{0}) = ν_{\hat{m}} (t_{0}) = {\vec{μ}}^{\hat{m}}; \hat{m} \in Ω_{ξ}

(61)

and if $Ω_{R} \subset Ω_{ξ}$ , then the second order derivatives in time also match:

\frac{d^{2}}{d t^{2}} ρ_{\hat{m}} (t) |_{t = t_{0}} = \frac{d^{2}}{d t^{2}} ν_{\hat{m}} (t) |_{t = t_{0}}; \hat{m} \in Ω_{ξ}

(62)

Proof. The proof of this theorem is somewhat lengthier. To prove the theorem one needs to consider Eq. (58) with h = 2. If the assumptions of the theorem are valid then, by theorem 1, $ρ_{\hat{m}}^{[1]} - ψ_{\hat{m}}^{[1]} = 0$, which eliminates the sum over $\hat{m} \in Ω_{ξ}$ in (58). Thus what is left to show is that the difference $ρ_{\bar{m}}^{[1]} - ψ_{\bar{m}}^{[1]}$ with $\bar{m} \in {\bar{Ω}}_{ξ}$ vanishes.

A straightforward application of the time derivative to the moment closure function leads to the following expression,

ψ_{\bar{m}}^{[1]} = \sum_{\hat{m} \in Ω_{ξ}} \frac{\partial ψ_{\bar{m}}}{\partial ψ_{\hat{m}}} ν_{\hat{m}}^{[1]} = \sum_{\hat{m} \in Ω_{ξ}} γ_{\hat{m}}^{\bar{m}} {\vec{μ}}^{\bar{m} - \hat{m}} ν_{\hat{m}}^{[1]}

(63)

Also, the use of the exact and the XARNES equation systems to evaluate $ρ_{\bar{m}}^{[1]}$ and $ψ_{\bar{m}}^{[1]}$ gives

\begin{gathered} ρ_{\bar{m}}^{[1]} - ψ_{\bar{m}}^{[1]} = \sum_{\vec{c} \in Ω_{R}} [(\begin{matrix} \bar{m} \\ \vec{c} \end{matrix}) - \sum_{\hat{m} \in Ω_{ξ}} γ_{\hat{m}}^{\bar{m}} (\begin{matrix} \hat{m} \\ \vec{c} \end{matrix})] \times \\ \sum_{α = 1}^{R} λ_{α} Γ_{α} (\vec{c}) {\vec{μ}}^{\bar{m} + {\vec{u}}_{α} - \vec{c}} \end{gathered}

(64)

This difference is zero provided

\sum_{\hat{m} \in Ω_{ξ}} γ_{\hat{m}}^{\bar{m}} (\begin{matrix} \hat{m} \\ \vec{c} \end{matrix}) = (\begin{matrix} \bar{m} \\ \vec{c} \end{matrix})

(65)

for every $\vec{c} \in Ω_{R}$ . Please note that this condition is almost identical to the equation that characterizes the γ coefficients of the XARNES ansatz. If one replaces $\vec{c}$ with $\hat{m}$ , and $Ω_{R}$ with $Ω_{ξ}$ in the equation above, then the equation obtained in this way is identical to Eq. (25) or (44). Thus if $Ω_{R} \subset Ω_{ξ}$ then the equation above is contained in the condition that defines the γ coefficients, and the equation is automatically valid. This proves the theorem.
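For illustration, consider a single species with ξ = 2 and the reaction 2X₁ → 0, and assume that the exact factorial moments obey $\frac{d}{d t} ρ_{m} = - λ [m ρ_{m + 1} + (\begin{matrix} m \\ 2 \end{matrix}) ρ_{m}]$ (this explicit form and its rate normalization are assumptions made for this example; only the structure matters). The contraction orders are then c = 1, 2, so $Ω_{R} \subset Ω_{ξ}$ holds. With $γ_{1}^{3} = -3$ , $γ_{2}^{3} = 3$ and Poisson initial moments, Eq. (63) gives

ψ_{3}^{[1]} = -3 μ^{2} ν_{1}^{[1]} + 3 μ ν_{2}^{[1]} = -3 μ^{2} (- λ μ^{2}) + 3 μ (- λ (2 μ^{3} + μ^{2})) = - λ (3 μ^{4} + 3 μ^{3})

which coincides with the exact value $ρ_{3}^{[1]} = - λ (3 ρ_{4} + 3 ρ_{3}) |_{t = t_{0}} = - λ (3 μ^{4} + 3 μ^{3})$ .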

The third order derivatives will be investigated in the same vein as the first and the second order derivatives. The result will be formulated in a precise mathematical theorem. However, before stating the next theorem, it is useful to generalize the space of contraction vectors as follows.

Definition 12. The space of sums of contraction vectors ${\vec{c}}_{1} + {\vec{c}}_{2} + \dots + {\vec{c}}_{h}$ , where each vector in the sum is from $Ω_{R}$ , will be denoted as

Ω_{h \otimes R} = \{{\vec{c}}_{1} + {\vec{c}}_{2} + \dots + {\vec{c}}_{h} | {\vec{c}}_{1}, {\vec{c}}_{2}, \dots, {\vec{c}}_{h} \in Ω_{R}\}

(66)

where h is an integer with h ≥ 1.
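The construction in Definition 12 can be sketched in a few lines. The code below is illustrative only: it represents contraction vectors as integer tuples and assumes that membership in $Ω_{ξ}$ means "nonzero, with components summing to at most ξ", which is an assumption about a definition given outside this section. The function names are hypothetical.

```python
from itertools import product


def sum_space(omega_r, h):
    """Return all sums c_1 + ... + c_h with every c_i taken from omega_r."""
    if h < 1:
        raise ValueError("h must be an integer >= 1")
    return {
        tuple(sum(components) for components in zip(*combo))
        for combo in product(omega_r, repeat=h)
    }


def in_omega_xi(vec, xi):
    """Assumed membership test: nonzero vector with component sum <= xi."""
    return 1 <= sum(vec) <= xi


def largest_admissible_h(omega_r, xi):
    """Largest h >= 1 with sum_space(omega_r, h) inside Omega_xi, or 0 if none."""
    if not omega_r or any(sum(c) < 1 for c in omega_r):
        raise ValueError("omega_r must be non-empty and contain no zero vector")
    h = 0
    # Every h-fold sum has component total >= h, so the loop stops by h = xi.
    while all(in_omega_xi(v, xi) for v in sum_space(omega_r, h + 1)):
        h += 1
    return h
```

For example, with the single-species contraction orders {(1,), (2,)} (assumed here for a binary reaction such as 2X₁ → 0), `sum_space` with h = 2 returns the orders 2, 3 and 4, so the inclusion in $Ω_{ξ}$ first holds at ξ = 4.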

Theorem 3. If the base and the exact factorial moments match at t = t₀ and the particle number distribution function is the Poisson distribution, i.e., if

ρ_{\hat{m}} (t_{0}) = ν_{\hat{m}} (t_{0}) = {\vec{μ}}^{\hat{m}}; \hat{m} \in Ω_{ξ}

(67)

and if $Ω_{2 \otimes R} \subset Ω_{ξ}$ , then the third order derivatives in time also match:

\frac{d^{3}}{d t^{3}} ρ_{\hat{m}} (t) |_{t = t_{0}} = \frac{d^{3}}{d t^{3}} ν_{\hat{m}} (t) |_{t = t_{0}}; \hat{m} \in Ω_{ξ}

(68)

Proof. The theorem can be proven by considering Eq. (58) with h = 3. By theorem 2 one has that $ρ_{\hat{m}}^{[2]} - ψ_{\hat{m}}^{[2]} = 0$ . What is left to show is that all differences $ρ_{\bar{m}}^{[2]} - ψ_{\bar{m}}^{[2]}$ with $\bar{m} \in {\bar{Ω}}_{ξ}$ vanish as well. Unfortunately this is a highly nontrivial task.

By the use of standard calculus one can show that

ψ_{\bar{m}}^{[2]} = ψ_{\bar{m}, †}^{[2]} + ψ_{\bar{m}, ‡}^{[2]}

(69)

where

\begin{gathered} ψ_{\bar{m}, †}^{[2]} = \sum_{{\hat{m}}_{1}, {\hat{m}}_{2} \in Ω_{ξ}} (γ_{{\hat{m}}_{1}}^{\bar{m}} γ_{{\hat{m}}_{2}}^{\bar{m}} - γ_{{\hat{m}}_{1}}^{\bar{m}} δ_{{\hat{m}}_{1}, {\hat{m}}_{2}}) \times \\ {\vec{μ}}^{\bar{m} - {\hat{m}}_{1} - {\hat{m}}_{2}} ν_{{\hat{m}}_{1}}^{[1]} ν_{{\hat{m}}_{2}}^{[1]} \end{gathered}

(70)

and

ψ_{\bar{m}, ‡}^{[2]} = \sum_{\hat{m} \in Ω_{ξ}} γ_{\hat{m}}^{\bar{m}} {\vec{μ}}^{\bar{m} - \hat{m}} ν_{\hat{m}}^{[2]}

(71)

By using the XARNES equations and identity (44) for the γ coefficients, the term $ψ_{\bar{m}, †}^{[2]}$ can be expressed as

\begin{gathered} ψ_{\bar{m}, †}^{[2]} = \sum_{{\vec{c}}_{1}, {\vec{c}}_{2} \in Ω_{R}} [(\begin{matrix} \bar{m} \\ {\vec{c}}_{1} \end{matrix}) (\begin{matrix} \bar{m} \\ {\vec{c}}_{2} \end{matrix}) - \sum_{\hat{m} \in Ω_{ξ}} γ_{\hat{m}}^{\bar{m}} (\begin{matrix} \hat{m} \\ {\vec{c}}_{1} \end{matrix}) (\begin{matrix} \hat{m} \\ {\vec{c}}_{2} \end{matrix})] \times \\ \sum_{α, β} λ_{α} λ_{β} Γ_{α} ({\vec{c}}_{1}) Γ_{β} ({\vec{c}}_{2}) {\vec{μ}}^{\bar{m} + {\vec{u}}_{α} + {\vec{u}}_{β} - {\vec{c}}_{1} - {\vec{c}}_{2}} \end{gathered}

(72)

which vanishes provided

\sum_{\hat{m} \in Ω_{ξ}} γ_{\hat{m}}^{\bar{m}} (\begin{matrix} \hat{m} \\ {\vec{c}}_{1} \end{matrix}) (\begin{matrix} \hat{m} \\ {\vec{c}}_{2} \end{matrix}) = (\begin{matrix} \bar{m} \\ {\vec{c}}_{1} \end{matrix}) (\begin{matrix} \bar{m} \\ {\vec{c}}_{2} \end{matrix})

(73)

for any ${\vec{c}}_{1}, {\vec{c}}_{2} \in Ω_{R}$ . This is indeed true by corollary 3 under the assumptions of the theorem.

What is left to show is that $ρ_{\bar{m}}^{[2]} - ψ_{\bar{m}, ‡}^{[2]} = 0$ . A strategy for proving this is as follows. If one could show that the following identity holds for the exact moments

Δ_{\bar{m}} \equiv ρ_{\bar{m}}^{[2]} - \sum_{\hat{m} \in Ω_{ξ}} γ_{\hat{m}}^{\bar{m}} {\vec{μ}}^{\bar{m} - \hat{m}} ρ_{\hat{m}}^{[2]} = 0

(74)

then it is clear that $ρ_{\bar{m}}^{[2]} - ψ_{\bar{m}, ‡}^{[2]} = 0$ would be true since one could write

ρ_{\bar{m}}^{[2]} - ψ_{\bar{m}, ‡}^{[2]} = \sum_{\hat{m} \in Ω_{ξ}} γ_{\hat{m}}^{\bar{m}} {\vec{μ}}^{\bar{m} - \hat{m}} (ρ_{\hat{m}}^{[2]} - ψ_{\hat{m}}^{[2]})

(75)

and this expression would vanish by Theorem 2.

A somewhat naive recursive application of the exact equations of motion results in

\begin{gathered} ρ_{\bar{m}}^{[2]} = \sum_{α, β} λ_{α} λ_{β} \sum_{{\vec{c}}_{1}, {\vec{c}}_{2} \in Ω_{R}} (\begin{matrix} \bar{m} \\ {\vec{c}}_{1} \end{matrix}) (\begin{matrix} \bar{m} + {\vec{u}}_{α} - {\vec{c}}_{1} \\ {\vec{c}}_{2} \end{matrix}) \times \\ Γ_{α} ({\vec{c}}_{1}) Γ_{β} ({\vec{c}}_{2}) {\vec{μ}}^{\bar{m} + {\vec{u}}_{α} + {\vec{u}}_{β} - {\vec{c}}_{1} - {\vec{c}}_{2}} \end{gathered}

(76)

By using a tedious manipulation of the binomial coefficients the expression above can be converted into a more useful form

\begin{gathered} ρ_{\bar{m}}^{[2]} = \sum_{α, β} λ_{α} λ_{β} \sum_{{\vec{c}}_{1}, {\vec{c}}_{2}, \vec{d} \in Ω_{R}} (\begin{matrix} \bar{m} \\ {\vec{c}}_{1} \end{matrix}) (\begin{matrix} \bar{m} - {\vec{c}}_{1} \\ {\vec{c}}_{2} \end{matrix}) \times \\ (\begin{matrix} {\vec{u}}_{α} \\ \vec{d} \end{matrix}) Γ_{α} ({\vec{c}}_{1}) Γ_{β} ({\vec{c}}_{2} + \vec{d}) {\vec{μ}}^{\bar{m} + {\vec{u}}_{α} + {\vec{u}}_{β} - {\vec{c}}_{1} - {\vec{c}}_{2} - \vec{d}} \end{gathered}

(77)

In fact, it is easier to start from (77) and obtain (76). First, the sum over ${\vec{c}}_{2}$ and $\vec{d}$ is changed into a sum over ${\vec{c}}_{2}' = {\vec{c}}_{2} + \vec{d}$ and ${\vec{c}}_{3}' = {\vec{c}}_{2}$ . After that the Vandermonde identity is used, which consumes the sum over ${\vec{c}}_{3}'$ , resulting finally in (76).

By using the fact that

(\begin{matrix} \bar{m} \\ {\vec{c}}_{1} \end{matrix}) (\begin{matrix} \bar{m} - {\vec{c}}_{1} \\ {\vec{c}}_{2} \end{matrix}) = (\begin{matrix} \bar{m} \\ {\vec{c}}_{1} + {\vec{c}}_{2} \end{matrix}) (\begin{matrix} {\vec{c}}_{1} + {\vec{c}}_{2} \\ {\vec{c}}_{2} \end{matrix})

(78)

Eq. (77) can be written in the most useful form as

\begin{gathered} ρ_{\bar{m}}^{[2]} = \sum_{{\vec{c}}_{1}, {\vec{c}}_{2} \in Ω_{R}} (\begin{matrix} \bar{m} \\ {\vec{c}}_{1} + {\vec{c}}_{2} \end{matrix}) \times \\ \sum_{α, β} Λ_{α, β} (\vec{μ}, {\vec{c}}_{1}, {\vec{c}}_{2}) {\vec{μ}}^{\bar{m} + {\vec{u}}_{α} + {\vec{u}}_{β} - {\vec{c}}_{1} - {\vec{c}}_{2}} \end{gathered}

(79)

The exact form of a coefficient $Λ_{α, β} (\vec{μ}, {\vec{c}}_{1}, {\vec{c}}_{2})$ can be found if needed and, in fact, it is a series in $\vec{μ}$ . However, the exact form of these coefficients is not relevant for the discussion that follows.
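The two binomial identities used in passing from (76) to (79), namely the Vandermonde identity and Eq. (78), can be checked by brute force in the scalar (single-species) case. The sketch below is only a sanity check over a small range of non-negative integers, not a proof; the function names are hypothetical, and n plays the role of $\bar{m} - {\vec{c}}_{1}$ while u plays the role of ${\vec{u}}_{α}$.

```python
from math import comb


def identity_78_holds(m, c1, c2):
    """C(m, c1) * C(m - c1, c2) == C(m, c1 + c2) * C(c1 + c2, c2), for m >= c1."""
    return comb(m, c1) * comb(m - c1, c2) == comb(m, c1 + c2) * comb(c1 + c2, c2)


def vandermonde_holds(n, u, c):
    """C(n + u, c) == sum_{d=0..c} C(n, c - d) * C(u, d)."""
    return comb(n + u, c) == sum(comb(n, c - d) * comb(u, d) for d in range(c + 1))


def check(limit=8):
    """Check both identities for all non-negative arguments below `limit`."""
    ok_78 = all(
        identity_78_holds(m, c1, c2)
        for m in range(limit)
        for c1 in range(m + 1)  # keep m - c1 >= 0 so comb() is defined
        for c2 in range(limit)
    )
    ok_vandermonde = all(
        vandermonde_holds(n, u, c)
        for n in range(limit)
        for u in range(limit)
        for c in range(limit)
    )
    return ok_78, ok_vandermonde
```

In the multi-species case both identities factorize over the components of the vectors, so the scalar check covers each component separately.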

Let us use (79) in (74). This gives

\begin{gathered} Δ_{\bar{m}} = \sum_{{\vec{c}}_{1}, {\vec{c}}_{2} \in Ω_{R}} [(\begin{matrix} \bar{m} \\ {\vec{c}}_{1} + {\vec{c}}_{2} \end{matrix}) - \sum_{{\hat{m}}_{1} \in Ω_{ξ}} γ_{{\hat{m}}_{1}}^{\bar{m}} (\begin{matrix} {\hat{m}}_{1} \\ {\vec{c}}_{1} + {\vec{c}}_{2} \end{matrix})] \times \\ \sum_{α, β} Λ_{α, β} (\vec{μ}, {\vec{c}}_{1}, {\vec{c}}_{2}) {\vec{μ}}^{\bar{m} + {\vec{u}}_{α} + {\vec{u}}_{β} - {\vec{c}}_{1} - {\vec{c}}_{2}} \end{gathered}

(80)

and $Δ_{\bar{m}}$ is zero if the following condition is met,

\sum_{\hat{m} \in Ω_{ξ}} γ_{\hat{m}}^{\bar{m}} (\begin{matrix} \hat{m} \\ {\vec{c}}_{1, 2} \end{matrix}) = (\begin{matrix} \bar{m} \\ {\vec{c}}_{1, 2} \end{matrix})

(81)

for every ${\vec{c}}_{1, 2} \in Ω_{2 \otimes R}$ . Eq. (81) is satisfied by the assumption of the theorem, which states that $Ω_{2 \otimes R} \subset Ω_{ξ}$ . In such a case the equations in (81) are a subset of the equations satisfied by the γ coefficients, which are given in (44), and are automatically valid.
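The statements of Theorems 1-3 can be checked on a small example with exact rational arithmetic. The sketch below is an illustration under several assumptions that are not taken from this section: a single species, the reaction 2X₁ → 0, exact factorial moment equations of the form $\frac{d}{d t} ρ_{m} = - λ [m ρ_{m + 1} + (\begin{matrix} m \\ 2 \end{matrix}) ρ_{m}]$ , and a closure $ψ_{ξ + 1} = \prod_{m} ν_{m}^{γ_{m}}$ with γ taken from the scalar form of the defining relation. It compares Taylor coefficients at t₀ (derivatives divided by factorials) of the exact and closed systems; all function names are hypothetical.

```python
from fractions import Fraction
from math import comb


def gamma_coefficients(xi, m_bar):
    """Back-substitution for sum_m gamma[m] * C(m, k) = C(m_bar, k), k = 1..xi."""
    gamma = {}
    for k in range(xi, 0, -1):
        gamma[k] = comb(m_bar, k) - sum(
            gamma[m] * comb(m, k) for m in range(k + 1, xi + 1)
        )
    return gamma


def series_mul(a, b):
    """Product of two truncated power series of equal length."""
    out = [Fraction(0)] * len(a)
    for i, ai in enumerate(a):
        for j in range(len(a) - i):
            out[i + j] += ai * b[j]
    return out


def series_inv(a):
    """Reciprocal of a truncated power series with a[0] != 0."""
    out = [1 / a[0]] + [Fraction(0)] * (len(a) - 1)
    for k in range(1, len(a)):
        out[k] = -sum(a[j] * out[k - j] for j in range(1, k + 1)) / a[0]
    return out


def series_pow(a, e):
    """Integer power (possibly negative) of a truncated power series."""
    base = a if e >= 0 else series_inv(a)
    out = [Fraction(1)] + [Fraction(0)] * (len(a) - 1)
    for _ in range(abs(e)):
        out = series_mul(out, base)
    return out


def exact_taylor(mu, xi, order, lam=Fraction(1)):
    """Taylor coefficients of rho_1..rho_xi from the open hierarchy, Poisson start."""
    top = xi + order
    r = {m: [mu ** m] for m in range(1, top + 2)}
    for k in range(order):
        for m in range(1, top - k + 1):
            r[m].append(-lam * (m * r[m + 1][k] + comb(m, 2) * r[m][k]) / (k + 1))
    return {m: r[m] for m in range(1, xi + 1)}


def xarnes_taylor(mu, xi, order, lam=Fraction(1)):
    """Taylor coefficients of nu_1..nu_xi with nu_{xi+1} replaced by the closure."""
    gamma = gamma_coefficients(xi, xi + 1)
    v = {m: [mu ** m] + [Fraction(0)] * order for m in range(1, xi + 1)}
    for k in range(order):
        # Coefficient k of the closure depends only on coefficients <= k of v.
        psi = [Fraction(1)] + [Fraction(0)] * order
        for m in range(1, xi + 1):
            psi = series_mul(psi, series_pow(v[m], gamma[m]))
        for m in range(1, xi + 1):
            upper = psi[k] if m == xi else v[m + 1][k]
            v[m][k + 1] = -lam * (m * upper + comb(m, 2) * v[m][k]) / (k + 1)
    return v


def matching_orders(mu, xi, order):
    """For each D = 0..order, report whether all base moments agree at order D."""
    exact, approx = exact_taylor(mu, xi, order), xarnes_taylor(mu, xi, order)
    return [
        all(exact[m][d] == approx[m][d] for m in range(1, xi + 1))
        for d in range(order + 1)
    ]
```

Calling `matching_orders(Fraction(3, 2), 2, 2)` checks derivative orders up to two for ξ = 2, and `matching_orders(Fraction(3, 2), 4, 3)` checks orders up to three for ξ = 4. The conditions of the theorems are sufficient, so agreement at orders beyond those they cover is not excluded for a particular reaction.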

The three theorems proven so far suggest the following conjecture.

Conjecture 1. If the base and the exact factorial moments match at t = t₀, and the particle number distribution function is the Poisson distribution, i.e., if

ρ_{\hat{m}} (t_{0}) = ν_{\hat{m}} (t_{0}) = {\vec{μ}}^{\hat{m}}; \hat{m} \in Ω_{ξ}

(82)

and if $Ω_{h \otimes R} \subset Ω_{ξ}$ , where h is an arbitrary integer such that h ≥ 1, then the time derivatives with orders D = 0, 1, 2, ..., h + 1 will also match

\frac{d^{D}}{d t^{D}} ρ_{\hat{m}} (t) |_{t = t_{0}} = \frac{d^{D}}{d t^{D}} ν_{\hat{m}} (t) |_{t = t_{0}}; \hat{m} \in Ω_{ξ}

(83)

Possible proof. An inductive proof for general h could be attempted. However, the problem is that one would need to inspect the difference $ρ_{\bar{m}}^{[h]} - ψ_{\bar{m}}^{[h]}$ for arbitrary h and show that it vanishes under some conditions. Presumably, the main condition, apart from the standard requirements such as the Poisson initial condition, would be that $Ω_{h \otimes R} \subset Ω_{ξ}$ . For example, one can easily see that expressions such as the one shown in (53) will appear if one tries to calculate higher time derivatives of $ψ_{\bar{m}}$ . However, as demonstrated for the h = 2 (D = 3) case, the computation of $ψ_{\bar{m}}^{[h]}$ for higher values of h is a rather cumbersome and technical procedure. Generalization to higher orders is apparently very hard but not impossible.

There are a couple of reasons why such a conjecture might be valid. First, the structure of the proofs of theorems 1-3 (the cases h = 0, 1, 2) suggests such a possibility. Second, there is numerical evidence from a previous study [17] that an increase in ξ improves the accuracy of the XARNES method. In the context of the theorems discussed here, an increase in ξ implies that the initial $Ω_{ξ}$ set becomes larger. This in turn can make the condition $Ω_{h \otimes R} \subset Ω_{ξ}$ valid for a larger value of h. As a result, a larger number of derivatives would match, which would explain the observed accuracy improvements in the studied benchmark cases. Third, this conjecture has been checked with Mathematica for the T = 1 case and two binary reactions 2X₁ → 0 and 2X₁ → X₁, both with h = 0, 1, 2, 3, 4, 5 and ξ = 2, 4, 6, 8, 10 respectively, and one multi-particle reaction 3X₁ → 2X₁ with h = 0, 1, 2, 3 and ξ = 3, 6, 9.

Conclusions

The three theorems explain the mechanism behind the numerically observed fact that the XARNES method works well if the particle number distribution function is close to the Poisson distribution. Thus if all correlation forms are in some sense small, the XARNES method will provide a very accurate result.

For very small molecule counts one should expect problems with moment closure formulas like (24) with negative γ coefficients. The XARNES ansatz can describe such a situation, but one needs to consider a limiting process where factorial moments approach zero from above. For example, one might want to start the system from a state with the Poisson distribution, which is natural in many biologically relevant cases, but with some components of $\vec{μ}$ equal to zero. This cannot be done directly since the XARNES ansatz breaks down. In more practical terms, any decent numerical Ordinary Differential Equation (ODE) solver should issue a warning for such an initial state. To start the system from a state where some copy numbers are zero it is necessary to consider increasingly smaller values for such copy numbers.
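The limiting procedure can be sketched as follows. The code assumes a single species, ξ = 2, and the product-form closure $ψ_{3} = ν_{2}^{3} / ν_{1}^{3}$ with $γ_{1}^{3} = -3$ , $γ_{2}^{3} = 3$ (values assumed for illustration); the function names are hypothetical. It rejects an initial state with a vanishing base moment and evaluates the closure for a sequence of decreasing μ instead.

```python
from fractions import Fraction


def closure(base_moments, gamma):
    """Evaluate prod_m base_moments[m] ** gamma[m] for strictly positive moments."""
    if any(base_moments[m] <= 0 for m in gamma):
        # Negative exponents make the closure singular at a vanishing moment.
        raise ValueError("all base factorial moments must be strictly positive")
    value = Fraction(1)
    for m, g in gamma.items():
        value *= Fraction(base_moments[m]) ** g
    return value


def poisson_base_moments(mu, xi):
    """Factorial moments mu**m, m = 1..xi, of a Poisson distribution with mean mu."""
    return {m: Fraction(mu) ** m for m in range(1, xi + 1)}


def closure_along_limit(mus, gamma, xi):
    """Closure values for a sequence of means approaching zero from above."""
    return [closure(poisson_base_moments(mu, xi), gamma) for mu in mus]


if __name__ == "__main__":
    gamma = {1: -3, 2: 3}  # assumed closure exponents for xi = 2, m_bar = 3
    mus = [Fraction(1, 10), Fraction(1, 1000), Fraction(1, 10**6)]
    for mu, psi in zip(mus, closure_along_limit(mus, gamma, 2)):
        print(mu, psi)
```

For Poisson base moments the closure value equals μ³ for every μ > 0, so it tends to zero smoothly along the sequence even though it cannot be evaluated at μ = 0 itself.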

Previous numerical studies showed that there are systems for which the XARNES ODE system develops a singularity and the numerical solver has to stop [14, 15, 17]. Unfortunately, the theorems that have been proven cannot say anything about such singularities since the particle number distribution function becomes increasingly different from the Poisson distribution.

References

1. Eldar A, Elowitz MB: Functional roles for noise in genetic circuits. Nature. 2010, 467: 167-173.
2. Paulsson J: Summing up the noise in gene networks. Nature. 2004, 427: 415-418.
3. Lestas I, Vinnicombe G, Paulsson J: Fundamental limits on the suppression of molecular fluctuations. Nature. 2010, 467: 174-178.
4. Gillespie DT: Exact stochastic simulation of coupled chemical-reactions. J Phys Chem. 1977, 81: 2340-2361.
5. Elf J, Ehrenberg M: Fast evaluation of fluctuations in biochemical networks with the linear noise approximation. Genome Res. 2003, 13: 2475-2484.
6. Nasell I: An extension of the moment closure method. Theor Popul Biol. 2003, 64: 233-239.
7. Sasai M, Wolynes PG: Stochastic gene expression as a many-body problem. Proc Natl Acad Sci USA. 2003, 100: 2374-2379.
8. Engblom S: Computing the moments of high dimensional solutions of the master equation. Appl Math Comput. 2006, 180: 498-515.
9. Singh A, Hespanha JP: Approximate Stochastic Models for Chemically Reacting Systems. 2006, http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.135.8468
10. Singh A, Hespanha JP: Lognormal moment closures for biochemical reactions. Proceedings of the 45th IEEE Conference on Decision and Control, Vols 1-14. 2006, 2063-2068.
11. Singh A, Hespanha JP: A derivative matching approach to moment closure for the stochastic logistic model. Bull Math Biol. 2007, 69: 1909-1925.
12. Lee CH, Kim KH, Kim P: A moment closure method for stochastic reaction networks. J Chem Phys. 2009, 130: 134107.
13. Grima R: Investigating the robustness of the classical enzyme kinetic equations in small intracellular compartments. BMC Syst Biol. 2009, 3: 101.
14. Konkoli Z: Multiparticle reaction noise characteristics. J Theor Biol. 2011, 271: 78-86.
15. Konkoli Z: Spontaneous noise reduction in a strongly cooperative reaction model. J Theor Biol. 2011, 285: 96-102.
16. Walczak AM, Sasai M, Wolynes PG: Self-consistent proteomic field theory of stochastic gene switches. Biophys J. 2005, 88: 828-850.
17. Konkoli Z: Modelling reaction noise with a desired accuracy by using the X level Approach Reaction Noise Estimator (XARNES) method. J Theor Biol. 2011, submitted.
18. Kotomin E, Kuzovkov V: Modern aspects of diffusion-controlled reactions: cooperative phenomena in bimolecular processes, Volume 34 of Comprehensive chemical kinetics. 1996, Amsterdam: Elsevier.
19. Konkoli Z: Application of Bogolyubov's theory of weakly nonideal Bose gases to the A + A, A + B, B + B reaction-diffusion system. Phys Rev E. 2004, 69: 011106.

Author information

Authors and Affiliations

Chalmers University of Technology, Gothenburg, Sweden
Zoran Konkoli

Corresponding author

Correspondence to Zoran Konkoli.

Additional information

Competing interests

The authors declare that they have no competing interests.

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Konkoli, Z. Mathematical explanation of the predictive power of the X-level approach reaction noise estimator method. Theor Biol Med Model 9, 12 (2012). https://doi.org/10.1186/1742-4682-9-12

Received: 11 January 2012
Accepted: 13 April 2012
Published: 13 April 2012
DOI: https://doi.org/10.1186/1742-4682-9-12
