A danger of low copy numbers for inferring incorrect cooperativity degree
© Konkoli; licensee BioMed Central Ltd. 2010
Received: 23 August 2010
Accepted: 1 November 2010
Published: 1 November 2010
Skip to main content
© Konkoli; licensee BioMed Central Ltd. 2010
Received: 23 August 2010
Accepted: 1 November 2010
Published: 1 November 2010
A dose-response curve depicts the fraction of bound proteins as a function of unbound ligands. Dose-response curves are used to measure the cooperativity degree of a ligand binding process. Frequently, the Hill function is used to fit the experimental data. The Hill function is parameterized by the value of the dissociation constant and the Hill coefficient, which describes the cooperativity degree. The use of Hill's model and the Hill function has been heavily criticised in this context, predominantly the assumption that all ligands bind at once, which resulted in further refinements of the model. In this work, the validity of the Hill function has been studied from an entirely different point of view. In the limit of low copy numbers the dynamics of the system becomes noisy. The goal was to asses the validity of the Hill function in this limit, and to see in what ways the effects of the fluctuations change the form of the dose-response curves.
Dose-response curves were computed taking into account effects of fluctuations. The effects of fluctuations were described at the lowest order (the second moment of the particle number distribution) by using the previously developed Pair Approach Reaction Noise EStimator (PARNES) method. The stationary state of the system is described by nine equations with nine unknowns. To obtain fluctuation-corrected dose-response curves the equations have been investigated numerically.
The Hill function cannot describe dose-response curves in a low particle limit. First, dose-response curves are not solely parameterized by the dissociation constant and the Hill coefficient. In general, the shape of a dose-response curve depends on the variables that describe how an experiment (ensemble) is designed. Second, dose-response curves are multi-valued in a rather non-trivial way.
The Hill function is frequently used to infer the degree of cooperativity of the chemical reaction in which ligand molecules bind to a protein . Often, the binding of a ligand increases the association rate for the binding of the next ligand. Such reactions are said to be (positively) cooperative. There are examples of cooperative reactions in cell biology. The classical example is the binding of oxygen molecules by hemoglobin . Other perhaps less well-known examples would be parts of the Notch signaling and 30 S ribosome assembly processes , as well as the assembly of cholesterol-sphingomyelin complexes . Also, the noise characteristics of various ligand binding reactions were studied theoretically in  and some of the experimental systems could be classified as cooperative reactions. A cooperative reaction builds a final complex successively. If strong cooperativity is present, the dynamics of the system can be studied using Hill's model, at least to a first approximation .
where C0 denotes a protein that binds ligands A, and C h is the ligand-protein complex. The Hill coefficient h describes the number of binding sites on the protein. Both the forward and the back reactions are allowed.
Strictly speaking, the Hill coefficient in Hill's model (1) is a stoichiometry coefficient and should be an integer number larger than zero. However, in the calculations that follow, h will be allowed non-integer values. Thus in the context of this work the Hill model should be understood more from a model average perspective, where the Hill coefficient is an effective parameter.
where c0, c h , a are used to denote the amounts of unbound proteins, bound proteins, and free ligands, respectively. Please note that the Hill function is only parameterized by K and h. When fitting experimental data to extract K and h, it is useful to allow h to be a real number. Also, the Hill function is used frequently in theoretical studies to model cooperativity effects.
In general, c0, c h and a can denote average particle numbers, particle concentrations or partial pressures. It really depends on the types of experiments one wishes to describe. The dissociation constant is essentially controlled by the ratio of the forward and the backward reaction rates.
The original Hill's model is unrealistic since a truly multiparticle reaction with a high Hill's coefficient would be a very unlikely reaction event. The probability that all required ligand molecules meet at the right place, at the right time, is very small. The model was already criticised by Hill himself [6, 7]. Subsequently, more realistic models were suggested in a series of studies: Adair ; Monod, Wyman, Changeux ; and Koshland, Nemethy, Filmer . The difference between the models was critically investigated on the mean field level in , which confirmed Hill's original claim that the Hill equation can be used in a case of strong cooperativity when intermediate states are short-lived. For a reaction set that appears strongly cooperative as in (1), the Hill coefficient provides a rough measure of the cooperativity degree of the reaction.
Despite the problems discussed above, the use of Hill's model has some merits , and the Hill equation is used frequently in many fields as discussed in review article . Accordingly, in this work, Hill's model will be taken as a basic standard for describing multiparticle (cooperative) reactions. The validity of the model has been extensively investigated previously. The conditions for safe usage of Hill's model can be easily verified.
From now on, it will be assumed that the Hill model under investigation is a valid alias for a more complicated multiparticle-like reaction scheme. The focus will be on investigating the correctness of the resulting Hill's function φ H (a) in a low particle number limit. The ultimate goal of this study is to investigate in what ways the effects of the noise related to the low copy numbers affect the form of the dose-response curve predicted by Hill. Please note that such a goal enforces consideration of a closed system. For an open system, where injection and the decay of particles are allowed, one cannot use the Hill function at all.
The expression for K in (4) can be obtained from the stationary state equations that describe the system in the mean field limit. Use of equations (27-29) and (30) in the methods section leads to the desired result. Strictly speaking, the variable K is not a dissociation constant, but it can be related to it by trivial rescaling by the volume of the system.
For any type of initial conditions the dynamical system at hand will reach equilibrium. The focus will be on investigating the equilibrium state of the model, which in turn will enable us to compute the dose response curve φ (a).
The central technical result of this paper is the derivation of the nine (non-linear) equations (5-13) with nine unknowns. These equations describe the equilibrium state of the model. The derivation of the equations is described in the methods section. The equations can help in analytical understanding of the problem.
In equation (5), and in the following, the symbol χ with a subscript denotes a correlation function. Correlation functions were introduced previously (Konkoli, Z.: Multiparticle reaction noise characteristics, submitted) and describe fluctuations. The situation when all χ = 1 corresponds to the mean field limit, where the effects of fluctuations are absent. It is easy to see that in such a case equations (5-7) combine to give the classical Hill function in (3). However, the correlation functions do not equal one in general, and the expression for the Hill function in equation (3) might be invalid.
Equations (6) and (7) express the fact that the total number of protein complexes (with and without ligands) P0, and the total number of ligands in the system (both free and bound) L0, cannot change over time. Averages ⟨P0⟩ and ⟨L0⟩ need to be used; depending on an ensemble, these quantities might be stochastic. It ultimately depends on how the system is prepared during an experiment.
The nine equations with the nine unknowns (5-13) are the central result of the paper. The equations are non-linear and fully describe the stationary state of the system when the effects of particle number fluctuations are taken into account. The observables of interest (average numbers of particles and correlation functions) are implicit functions of the ensemble properties ⟨P0⟩, ⟨L0⟩,, , and ⟨P0L0⟩.
The equations are not exact. They were derived using the Pair Approach Reaction Noise Estimator (PARNES) method introduced previously (Konkoli, Z.: Multiparticle reaction noise characteristics, submitted). The PARNES method works by approximating higher order moments of a particle number distribution by second order moments. Should the need arise, the method can be easily extended beyond the pair approach level.
The PARNES method is based on the usage of correlation forms. The correlation forms are used in studies of spatially extended diffusion controlled reactions . They are employed to close the hierarchy of many-point density functions. In the present work, the particular methods discussed in  were adopted to study a well mixed reaction system. Because a second quantization formalism is used, the PARNES approximation is naturally expressed as a closure relationship for factorial moments of a particle number distribution. The implementation of the closure procedure is shown in the methods section. There are other ways to perform the closure [4, 14–18].
Clearly, once moments are given it should be possible to work backwards and extract the form of the particle number distribution function. This is a rather non-trivial problem and will be studied else-where. Essentially, the PARNES approximation is an expansion around the Poisson distribution. For χ ≈ 1 the distribution function is Poisson-like. Situations with χ < 1 and χ > 1 describe sub- and supra-Poisson regimes respectively.
It is possible to see that when particle numbers become large the correlation functions approach the mean field limit in which all correlation functions are equal to one. For example, by neglecting the a-h2c h , ⟨P0⟩ and hc h terms in equations (11), (12) and (13) respectively, and assuming that , and ⟨L0P0⟩ ≈ ⟨L0⟩⟨P0⟩, the resulting equations can be solved by the mean field ansatz. This shows that the Hill function can be used in a large particle number limit.
The issue is whether all solutions of the central equation system are such that φ can be expressed solely as a function of a. If this is the case then there is only one equation to use, and there should be no ambiguity regarding the proper choice of Hill's coefficient. By inspecting the form of the central equations it can be seen that this is not the case in general. For example, depending on the procedure used to compute the points in the plot that depicts φ (a), many curves can be obtained. Equivalently, in more technical terms, for a given reaction system, repeating the experiment to determine φ (a) with different ensemble setups (the ways the system is prepared), one can obtain different curves for φ (a). Fitting the curves to φ H (a) would result in different Hill's coefficient for each curve. Thus, the fact that the central equations depend on ensemble properties has far reaching consequences when it comes to extracting the correct Hill coefficient from experiments.
Analysis of both figures shows that for large particle numbers the mean field result (the Hill function) is obtained. This is expected, since the mean field description should be correct for large copy numbers. However, in general, the discrepancy from the mean field case can be significant. For Poisson-like initial conditions the reference curve is approached from below. In the case of pure initial states, the reference curve is approached from above (below) for high (low) values of a.
For pure initial states, and in the intermediate regions of a, φ curves are much steeper that the corresponding Hill function. Please note that the curves for pure states are multi-valued since for a given value of a there can be more than one value of φ (e.g. all thin curves in Figure 2 for values of a slightly greater than one are multi-valued). Similar behaviour is observed for Poisson-like initial states but the onset occurs at smaller values of a (e.g. the dotted line in Figure 1). The question is whether such behavior is an artefact of using the PARNES approximation.
Many dangers have already been recognized in using the Hill function to fit experimental data. The difficulties discussed so far in the literature are mostly related to the fact that the Hill model is only an approximation of a more complicated reaction scheme. This work points to a yet another danger, but in terms of principles.
The findings of this work point to the fact that one should be careful in using the Hill function to fit experimental data when the number of particles in the system is low. The actual dependence of φ on a is much more complex than predicted by the Hill function φ H (a). First, dose-response curves depend on the way the experiment is done. Repeating the experiment with different ensemble properties could result in a number of distinct curves. Accordingly, equally many values for the Hill coefficient could be extracted. Second, dose-response curves are multivalued in a rather non-trivial way, which has to do with the fact that some ligands will always be unbound, depending on the number of ligands in the system.
The discrepancy between fluctuation-corrected dose-response curves and the Hill function has nothing to with a fundamental flaw in the Hill model itself. The features are rather generic. Similar behaviour is likely to be observed for any more realistic model of ligand binding.
The nine equations obtained in this work could aid experimental studies in which the Hill coefficient is measured. Clearly, to obtain the correct value for the Hill coefficient, one needs to use the correct curve. The nine equations that define dose-response curves could be investigated further to obtain analytical approximations for fluctuation-corrected dose-response curves.
This work can be extended in many ways. The uniqueness conditions for the equations have not been investigated yet. Preliminary numerical investigations show that the structure of the solutions is rather complex, since Mathematica solver had to be fine-tuned to find the solutions. Also, the nine equations allow for non-physical solutions with negative densities or negative correlation functions. This problem can be solved by proper parameterization of the densities. The question is whether some of the features observed here are an artefact of the "all or none" reaction principle that is intrinsic to Hill's model. For example, it is not clear whether the multi-value character of dose response curves will still be observed in more realistic ligand binding models. Some of the issues discussed above will be investigated in forthcoming publications.
where any combination of the plus and the minus signs can be picked at will. The particle number probability distribution function P(c, t) defines the probability that the system is found in a configuration c at a time t. Please note that the equation contains binomial coefficients that count ways of choosing clusters of h particles.
where f is an arbitrary function of state c. In principle, to compute the averages using (16) is hard. Such a procedure would require the direct solution of the master equation, which is computationally rather demanding. To avoid using equation (16), the equations of motion for the observables of interest will derived. Once in place, these equations of motion can be studied directly. To derive the equations, the problem is mapped on to a quantum field theory using the standard techniques . Thereafter, it is possible to derive the desired equations of motion in a straightforward manner. Please note that any other approach can be used to derive the equations. The filed theory is used in here since it is a useful book-keeping device.
and the operators in parentheses denote the creation operators for C0, C h and A particles: and â† respectively. The operators without the dagger sign, ĉ0, ĉ h and â, denote the corresponding annihilation operators. The generating function is the linear combination of all possible configurations of the system, where each configuration is weighted by the corresponding probability of occurrence.
The equation follows from (19) and the fact that ⟨1| = 0. In the following, to simplify the notation, an expression of the form will be abbreviated to . This should cause no confusion between (16) and (21). If the expression contains field theoretic creation and annihilation operators, the expression should be interpreted as in (21).
Please note that the equations contain the expression ⟨ĉ0â h ⟩, so it appears that we need an equation for that quantity as well. This will be dealt with later.
and express the fact that the total number of protein complexes (with and without ligands) P0, and the total number of ligands in the system (both free and bound) L0, cannot change over time. For example, the first conservation law can be obtained by adding equations (27) and (28).
Please note that the conservation laws involve only quantities that describe the ensemble that was used to prepare the system. The ensemble is defined by five independent parameters ⟨P0⟩, ⟨L0⟩, , and ⟨P0L0⟩.
where x, y and z are integers greater than or equal to zero. The accuracy of the PARNES approximation has been investigated on a similar model where it was confirmed that it provides a semi-quantitative description (Konkoli, Z.: Multiparticle reaction noise characteristics, submitted). For large particle numbers it is rather accurate. A similar investigation for Hill's model (Figure 4) leads to the same conclusions. The PARNES approximation provides a qualitative description of the stationary state of Hill's model in the low particle number limit.
Finally, using the PARNES method (52), an approximative form of equations (42-45) can be derived. Carrying out the procedure, and combining the result with the conservation laws (37-41), results in the nine equations with nine unknowns listed in (5-13), which were introduced in the results section.
The central equations (5-13) can be obtained roughly as follows. Equation (5) results from the stationary state condition (42), and equations (6-7) are the first two conservation laws (37) and (38) expressed in a new notation. Equations (8-10) result from the stationary state conditions (43-45). Equations (11-13) are derived from the conservation laws for second moments (39-41).
In the general case, the equations are rather involved and cannot be solved analytically. The numerical procedure for solving the equations naturally suggests itself as follows. First, one solves equations (5-7) assuming that all correlation functions are one. This gives the first guess for the average particle numbers c0, c h and a. The values obtained are inserted into (8-13) to evaluate a guess for the correlation functions. The resulting values can be again used again in (5-7) to obtain even better values for the average particle numbers. The procedure continues until results converge to the fixed point values.
However, the procedure discussed above is numerically unstable in the low particle number limit. The plots in the work were generated by a method similar to the analytic continuation. The equations were solved in the large particle number limit by the method outlined in the previous paragraph, after which the desired point in the ensemble parameter space can be approached incrementally along a line. In every step, the solution from the previous point is used as a guess for the point that follows.
The financial support of the Chalmers Biocenter and an internal MC2 grant for strategic development is greatly acknowledged.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.