Partial Isotope Profiles Are Sufficient for Protein Turnover Analysis Using Closed-Form Equations of Mass Isotopomer Dynamics.

Sadygov RG

doi:10.1021/acs.analchem.0c03343

Partial Isotope Profiles Are Sufficient for Protein Turnover Analysis Using Closed-Form Equations of Mass Isotopomer Dynamics.

Sadygov RG ¹

Affiliations

1. Department of Biochemistry and Molecular Biology, The University of Texas Medical Branch, 301 University of Blvd, Galveston, Texas 77555, United States.
Authors
Sadygov RG¹
(1 author)

ORCIDs linked to this article

Sadygov RG | 0000-0003-1590-155X

Analytical Chemistry, 21 Oct 2020, 92(21):14747-14753
https://doi.org/10.1021/acs.analchem.0c03343 PMID: 33084301 PMCID: PMC8880304

Free full text in Europe PMC

Abstract

Metabolic labeling with atom-based heavy isotopes, followed by liquid chromatography coupled with mass spectrometry (LC-MS), has been a powerful technique for studies of proteome and metabolome. In proteomics, the protein turnover of thousands of proteins can be estimated from the gradual incorporation of ²H or ¹⁵N in the diet. Software tools have been developed to automate the estimation of protein turnover. Traditionally, the turnover has been estimated using the time course of the depletion of the normalized abundance of monoisotopes. While the bioinformatic aspects of peak detection and integration, time course modeling, and uncertainty estimation have progressed, mass isotopomer dynamics during label incorporation has only been modeled from approximate approaches or numerical simulations. We derive closed-form equations that describe the dynamics of mass isotopomers during metabolic labeling with an atom-based stable isotope. The derived equations create an alternative method for estimating label incorporation. They also provide opportunities for estimation of precursor-product relationships in species or systems where they are unknown. The equations are useful in bioinformatic tools for analyzing mass spectral data from metabolic labeling.

Free full text

Anal Chem. Author manuscript; available in PMC 2022 Feb 25.

Published in final edited form as:

Anal Chem. 2020 Nov 3; 92(21): 14747–14753.

Published online 2020 Oct 21. https://doi.org/10.1021/acs.analchem.0c03343

PMCID: PMC8880304

NIHMSID: NIHMS1778742

PMID: 33084301

Partial Isotope Profiles are Sufficient for Protein Turnover Analysis using Closed-form Equations of Mass Isotopomer Dynamics

Rovshan G. Sadygov

Author information Copyright and License information Disclaimer

The publisher's final edited version of this article is available at Anal Chem

See other articles in PMC that cite the published article.

Go to:

Associated Data

Supplementary Materials: Supporting.
NIHMS1778742-supplement-Supporting.docx (699K)

Go to:

Abstract

Metabolic labeling with atom-based heavy isotopes, followed by LC-MS, has been a powerful technique for studies of proteome and metabolome. In proteomics, protein turnover of thousands of proteins can be estimated from the gradual incorporation of ²H or ¹⁵N in the diet. Software tools have been developed to automate the estimation of protein turnover. Traditionally, the turnover has been estimated using the time-course of the depletion of the normalized abundance of monoisotope. While the bioinformatics aspects of peak detection and integration, time-course modeling, and uncertainty estimation have progressed, mass isotopomer dynamics during label incorporation has only been modeled from approximate approaches or numerical simulations.

We derive closed-form equations that describe the dynamics of mass isotopomers during metabolic labeling with an atom-based stable isotope. The derived equations create an alternative method for estimating label incorporation. They also provide opportunities for estimations of precursor-product relationships in species or systems where they are unknown. The equations are useful in bioinformatics tools for analyzing mass spectral data from metabolic labeling.

Keywords: peptide isotope distribution during metabolic labeling, closed-form equations for mass isotopomers, protein turnover, metabolic labeling and LC-MS, exchangeable hydrogens in heavy water labeling, enrichment percent in heavy atom

Go to:

Graphical Abstract

An external file that holds a picture, illustration, etc.
Object name is nihms-1778742-f0006.jpg

Go to:

Introduction

Metabolic labeling with stable isotopes followed by liquid-chromatography coupled into mass spectrometry (LC-MS) has been used to study the turnover of proteins, lipids, and other biomolecules in vivo¹. For protein turnover studies, labeled samples are processed via standard proteomics workflows². The labeling is quantified via changes in the isotope profiles of peptides. In a widely-used approach, the time-course of the relative abundance (normalized by the sum of all mass isotopomers) of the monoisotope is used in an exponential decay model to extract the degradation rate constant (DRC) of a protein³.

Atom-based labeling⁴ (such as labeling with ²H or ¹⁵N) often creates overlapping isotope profiles of labeled and unlabeled peptides. For example, labeling with heavy water (²H labeling) is partial, since less than 10% enriched water is provided to model organisms (high doses of heavy water are toxic). The isotope profiles of labeled species are simulated using a model where the labeling atom is designated as a new atom “type” with a different isotope pattern⁵. For example, in heavy water labeling, hydrogens, which are accessible to the deuterium in heavy water, are designated as belonging to an atom “type” with the ²H isotope abundance of (p_H + p_X), and ¹H isotope abundance of (1 - p_H - p_X). Here, p_H is the relative abundance (RA) of ²H in nature; p_X is the RA of ²H incorporated from the heavy water into the biomolecule. Numerical techniques^6–8 (for example, Fast-Fourier transforms, or multinomial distribution) are used to calculate the isotope profiles of labeled species at various values of p_X. The calculations are used to determine p_X from the fit of the theoretical profiles into the experimental observation⁵, or for the estimation of the number of ²H atoms incorporated into the biomolecule at the enrichment plateau⁹. In this work, we show that closed-form equations govern the dynamics of the abundances of the mass isotopomers during labeling.

The abundances of mass isotopomers of a labeled species are time-dependent during the labeling. However, for heavy mass isotopomers, only the isotopologues that originate from the labeling atom “type” change during the labeling. The rest of the heavy isotopologues are time-invariant. Therefore, we separate the isotopologues of the labeling atom “type” from those of the rest of the atoms in a biomolecule. The modeling of the mass isotopologues of the labeling atom “type” leads to closed-form equations linking the abundances (both raw and normalized) of mass isotopomers during metabolic labeling. The equations can be used in alternative approaches to analyzing mass spectral data from metabolic labeling. As an example, we show how to calculate rate constants using raw abundances of two mass isotopomers. The derivations are presented for deuterium enrichment of peptides resulting from heavy water labeling. However, the equations are general. They can be used for other atom-based labeling agents^{4, 10–11}, such as ¹⁵N and ¹²C, or for other biomolecules such as lipids¹².

2.1. Data Description.

We used a large-scale data set of the liver proteome of (low-density lipoprotein receptor knock-out) LDLR^−/− mice fed a normal diet¹³. The data set consists of 127 LC-MS experiments. They were performed in six times points of label duration. Peptide samples were prepared from eleven SDS-PAGE bands. Each sample was run in duplicate. Mass spectral data were collected using Q Exactive™ Plus Hybrid Quadrupole-Orbitrap™ Mass Spectrometer (Thermo Scientific, CA). Mascot¹⁴ database search engine was used for peptide identification. It calculates the global false discovery rate (we set it at 5%) by using matches to the reversed sequences. The data set is available at ProteomeXChange site (PXD009493). It contains 21706 unique peptides, which have been quantified in at least four out of the six labeling timepoints. The R scripts written for numerical simulations in this work are available at http://dynamic-proteome.utmb.edu/MIDynamics/MIDynamics.aspx.

2.2. Novel closed-form equations of mass isotopomer dynamics during metabolic labeling

We will use three forms of a reference to mass isotopomers of a peptide. M_k refers to the k^th mass isotopomer (k = 0 is the monoisotope). It will not designate any abundance. I_k(t) applies to the RA of the k^th mass isotopomer at the labeling time t. A_k(t) applies to the MS measured raw (non-normalized) abundance of the k^th mass isotopomer. Mass isotopomers result from the combination of the isotopologues with the same nominal mass. The time-course of I₀(t) is used to determine the protein degradation rate constant, k, via an exponential decay model³:

I_{0} (t) = I_{0}^{a s y m p} + (I_{0} (0) - I_{0}^{a s y m p t}) * e^{- k t}

Eq. (1)

where I₀^asymp is the asymptotic (after reaching the plateau of labeling) RA of the monoisotope.

The isotope distribution of a peptide in metabolic labeling with water is modeled by separating peptides’ hydrogen atoms into two groups^{5, 15}. In the first group are the hydrogens that are non-accessible to the deuterium. In the second group are the hydrogens that can be labeled by the deuterium in heavy water. The number of hydrogens in the second group we denote as N_EH. We refer to this hydrogen type as X_H. The relevant probabilities of ²H for each group are p_H and (p_H + p_X(t)), respectively. Here, p_H is the natural abundance of deuterium, and p_X(t) is the deuterium enrichment in a peptide from the heavy water at the labeling duration time t. Details of the derivations of the following closed-form equations are provided in Supporting Information. Here, we note that the time-dependent isotopologues originate from the deuteriums of X_H. The equation for the time-course evolution of I₀(t) is:

I_{0} (t) = I_{0} (0) {(1 - \frac{p_{X} (t)}{1 - p_{H}})}^{N_{E H}}

Eq. (2)

p_X(t) is also referred to as molar percent excess (MPE). When the labeling of a protein reaches its plateau, (p_X(t)+p_H) is equal to p_W (²H enrichment of heavy water). An important feature of Eq. (2) is that it depends on the normalization at two different timepoints of labeling; 0 and t. Therefore, only RAs can be used in this equation.

By separating the probability of the isotopologue originating from the labeling of X_H from the other isotopologues of M₁, one obtains the following equation for I₁(t):

I_{1} (t) = {(1 - \frac{p_{X} (t)}{1 - p_{H}})}^{N_{E H} - 1} \frac{p_{X} (t)}{{(1 - p_{H})}^{2}} N_{E H} I_{0} (0) + {(1 - \frac{p_{X} (t)}{1 - p_{H}})}^{N_{E H}} I_{1} (0)

Eq. (3)

The first term in the sum originates from the M₁ isotopologue of X_H. The second term is the sum of all M₁ isotopologues, except the isotopologue of X_H. By substituting Eq. (2) into Eq. (3), we obtain the following relationship for the time-course of the I₁(t)/I₀(t) ratio:

\frac{I_{1} (t)}{I_{0} (t)} = N_{E H} \frac{p_{X} (t)}{(1 - p_{H}) (1 - p_{H} - p_{X} (t))} + \frac{I_{1} (0)}{I_{0} (0)} = \frac{A_{1} (t)}{A_{0} (t)}

Eq. (4)

Previously, Anderson and coworkers¹⁶ have discussed that the I₁(t)/I₀(t) ratio can be expanded as a sum of probabilities of the isotopologues of M₁. Eq. (4) presents the explicit form of the time dependency of the I₁(t)/I₀(t) ratio on the label enrichment, p_X(t), of a peptide. An important aspect of Eq. (4) is that it can use non-normalized abundances of the monoisotope and the first heavy mass isotopomer as measured in LC-MS. From Eq. (4), we determine the deuterium enrichment, p_X(t), of a peptide:

p_{X} (t) = \frac{(A_{1} (t) / A_{0} (t) - A_{1} (0) / A_{0} (0)) {(1 - p_{H})}^{2}}{N_{E H} + (1 - p_{H}) (A_{1} (t) / A_{0} (t) - A_{1} (0) / A_{0} (0))}

Eq. (5)

Eq. (5) is important because, on the right-hand side, there is a dependency on the A₁(t)/A₀(t) ratio and $N_{E H}$ . The ratio is determined from the raw abundances of only two mass isotopomers at each labeling timepoint. For comparison, we note that DeuteRater¹⁷, a bioinformatics tool for protein turnover estimations, uses EMass algorithm¹⁸ to compute I_k(t) for a given deuterium enrichment via a convolution-like procedure. The changes in RAs of mass isotopomers, (I_k(t) – I_k(0)), are determined as a function of p_X(t) from the least-squares regression. We point out two advantages that Eq. (5) presents. First, it is exact. Second, in practical applications, there is no need for a complete isotope profile to determine p_X(t): only the raw abundances of the monoisotope, A₀(t), and the first heavy isotope, A₁(t), are needed. The results of an approach using only two (for peptides, usually the most abundant) mass isotopomers will be less susceptible to co-elution interferences compared to the approaches using the complete isotope profile.

Starting from the probabilities of all isotopologues of M₂, we have derived the following closed-form equation for the time-course evolution of the ratio of raw abundances of M₂ and M₀:

\frac{A_{2} (t)}{A_{0} (t)} = \frac{A_{2} (0)}{A_{0} (0)} - \frac{A_{1} (0)}{A_{0} (0)} \frac{p_{H} N_{E H}}{(1 - p_{H})} + {(\frac{p_{H}}{1 - p_{H}})}^{2} \frac{N_{E H} (N_{E H} + 1)}{2} - {(\frac{p_{X} (t) + p_{H}}{1 - p_{H} - p_{X} (t)})}^{2} \frac{N_{E H} (N_{E H} + 1)}{2} + \frac{N_{E H} (p_{X} (t) + p_{H})}{(1 - p_{H} - p_{X} (t))} \frac{A_{1} (t)}{A_{0} (t)}

Eq. (6)

The derivations of Eqs. (3) and (6) are provided in Supporting Information. Eq. (6) links three raw abundances, A₀(t), A₁(t), and A₂(t), with the deuterium enrichment of a peptide, p_X(t), and its number of exchangeable hydrogens, N_EH. Eq. (6) is exact; no approximations have been made in its derivation. The correctness of the equation can be seen from the asymptotic behaviors and will numerically be validated below.

Eq. (5) relates N_EH and p_X(t) via the time-course of the A₁(t)/A₀(t) ratio. We recast the equation to the form shown below to emphasize the calculations of N_EH:

N_{E H} = \frac{(1 - p_{H} - p_{X} (t))}{p_{X} (t)} {\frac{A_{1} (t)}{A_{0} (t)} - \frac{A_{1} (0)}{A_{0} (0)}} (1 - p_{H})

Eq. (7)

Eqs. (4) and (6) constitute a system of two equations for two variables - N_EH and p_X(t). Theoretically, they should determine the variables exactly at every time point of labeling from raw abundances of the first three mass isotopomers. Hellerstein and coworkers⁹ have determined N_EH values from the mass isotopomer distributions of peptides at the plateau enrichment of a protein. Their approach⁹ generated mass isotopomer distributions from combinatorial simulations with different N_EH values to determine the optimal value. At the plateau of label incorporation, (p_X(t)+p_H) in Eq. (7) is equal to p_W. Given body water enrichment with deuterium, p_W, Eq. (7) determines N_EH values without simulations and using only the raw abundances of the first two mass isotopomers.

We have also derived the time evolution equation for the third heavy mass isotopomer, M3:

I_{3} (t) = a (t) I_{0} (t) + b (t) * {I_{1} (t) - c (t) I_{0} (t)} + c (t) * {I_{2} (t) - c (t) (I_{1} (t) - c (t) I_{0} (t)) - b (t) I_{0} (t)} + I_{0} (t) * {\frac{I_{3} (0)}{I_{0} (0)} - \frac{N_{E H} p_{H}}{1 - p_{H}} {\frac{I_{2} (0)}{I_{0} (0)} - \frac{(N_{E H} + 1) p_{H}}{2 (1 - p_{H})} (\frac{I_{1} (0)}{I_{0} (0)} - \frac{N_{T} p_{H}}{1 - p_{H}}) - (3 N_{T} N_{E H} + 3 N_{T} - N_{E H}^{2} - 3 N_{E H} - 2) {(\frac{p_{H}}{1 - p_{H}})}^{2} \frac{1}{6}}}

Eq. (8)

In Eq. (8), N_T is the number of all hydrogen atoms in a molecule, a(t), b(t), and c(t) are time-dependent coefficients defined below:

a (t) = (\begin{matrix} N_{E H} \\ 3 \end{matrix}) {(\frac{p_{X} (t) + p_{H}}{1 - p_{H} - p_{X} (t)})}^{3}, b (t) = (\begin{matrix} N_{E H} \\ 2 \end{matrix}) {(\frac{p_{X} (t) + p_{H}}{1 - p_{H} - p_{X} (t)})}^{2}, c (t) = \frac{N_{E H} (p_{X} (t) + p_{H})}{1 - p_{H} - p_{X} (t)}

$(\begin{matrix} N_{E H} \\ k \end{matrix})$ is the binomial coefficient. Eqs. (2) and (8) can be used to determine the A₃(t)/A₀(t) ratio, which we provide in Supporting Information, Eq. (S14). The ratio is rather complex to be used for solutions of p_X(t) or N_EH. However, as we discuss below, the asymptotic value of I₃(t) (at the plateau of enrichment) is useful for determining if the M₃ or higher mass isotopomers are needed for the analysis of the label incorporation into a peptide. Low abundance high mass isotopomers are preferably omitted in peak detection and integration to reduce the chances of interferences from the co-eluting contaminants.

3.1. Simulations validate the derived equations

We used four computational simulations to numerically validate Eqs. (4) and (6). Each simulation used a peptide sequence, its N_EH, and a preassigned value of enrichment, p_X(t), to generate the peptides’ isotope profile (using FFTs). The four examples (peptide sequence(N_EH, p_X(t)) are: IQDAGLVLADALR(23, 0.004), VAQAPWK(15, 0.005), SFPFVSK(9, 0.006), and GLASYYEISVDDGPWEK(31, 0.007). Thus, for each peptide, we know the actual value of A₂/A₀. Next, we obtained values of A₂(t)/A₀(t) for p_X(t) values in the range of (0,1) from Eq. (6). p_X(t) was incrementally increased by 0.001. The A₁(t)/A₀(t) ratio in Eq. (6) was calculated using Eq. (4). For each peptide, the difference, (A₂/A₀ - A₂(t)/A₀(t)), should pass through zero at the p_X(t) value assigned to the corresponding peptide. In Figure 1, we show the differences in the vicinity of zero for the four peptides. The peptides are denoted P₁ through P₄, respectively. As seen from the figure, the computations have exactly reproduced the solutions that corresponded to the true “enrichments,” which were used to simulate the mass isotopomers of “labeled” peptides. All N_EH values were exactly predicted, as well. The figure shows that the difference is semi-linear in p_X(t), near its solution. However, in the (0.,1.) interval of p_X(t), the dependency is non-linear. The system of equations had two solutions for p_X(t) and N_EH. One of the solutions was readily discarded, as it corresponded to N_EH values far larger than the number of all hydrogens in the peptide. The simulations used non-contaminated, exact mass isotopomer abundances.

An external file that holds a picture, illustration, etc.
Object name is nihms-1778742-f0001.jpg

Figure 1.

Numerical simulations validate the closed-form equations for A₂(t)/A₀(t) and A₁(t)/A₀(t) ratios. The y-axis shows the difference between the true value of A₂/A₀ and the value computed from Eq. (6) at various values of p_X(t) (x-axis). The blue line is the y=0 line. Black circles on each curve are the values for one specific peptide, denoted as P₁ through P₄.

In Supporting Information, we provide a schematic and description of steps used in generating Figure 1. In Figure S1, we show the simulations of I₀(t), I₁(t), I₂(t), and I₃(t) as a function p_X(t) for the peptide sequence, IQDAGLVLADALR, over the range of complete depletion of these mass isotopomers due to the labeling with deuterium. The closed-form equations determine p_w values at which each mass isotopomer is completely depleted.

3.2. Tests on simulated rate constants

Using the R environment¹⁹, we implemented an approach to estimate DRC from the time-course of the A₁(t)/A₀(t) ratio using Eqs. (5), (2) and (1). We applied this method and compared it with the traditional method, which uses the time-course of I₀(t) (obtained from complete isotope profile) to calculate the DRC. As the first test, we used the simulations because, in the simulations, we know the actual values of DRCs. We used the same 21706 DRCs (from the murine liver proteome) for each method.

For determinations from I₀(t), the simulations were done using Eq. (1) for each DRC and adding to the actual time-course the errors sampled from the Laplace distribution (whose parameters were determined from the above-referenced data set of the murine liver)²⁰. We then determined the simulated DRC using the noisy I₀(t). In the following text, we refer to this as the traditional method.

For the A₁(t)/A₀(t) ratio, we assumed the Laplace distributed error as well. First, we computed the true value of A₁(t)/A₀(t). Then, we added Laplace distributed noise to the theoretical values. From the noisy-added A₁(t)/A₀(t) values, we determined p_X(t) at every time point of labeling using Eq. (5). The p_X(t) values were inserted into Eq. (2) to obtain the reconstructed time-course of I₀(t). The time-course of the reconstructed I₀(t) was used to determine DRC in this alternative method.

First, we checked if the two methods produced the same results when simulations used very small errors. Figure S2 of Supplementary Information shows the scatter plot of the rate constants produced by the two methods. The scaling parameter of Laplace distribution for errors was set to exp(−8). As expected from an exact solution, the scatter plot is the identity line.

When we increased the error parameter to make it equal to the estimations from the mouse liver data set (the scale parameter was exp(−4.14)), the rate constants computed by two methods showed some differences, Figure S3. We compared the calculated rate constants with the actual values. The scatter plot of the relative errors is shown in Figure S4. 19634 simulations using A₁(t)/A₀(t) ratio produced rate constant that agreed with actual values better than ten percent (of the actual values). The corresponding number from the I₀(t) simulations was 9203. 8460 simulations produced better than ten percent agreement with the true values in both methods. Thus, assuming the same error distribution, A₁(t)/A₀(t) ratio method produced better results in these simulations.

Figure 2 shows the scatter plot of the absolute values of the DRC errors calculated using I₀(t) (y-axis) and A₁(t)/A₀(t) (x-axis) using actual data from the liver proteome. Less than 0.02% of the data were outside of the axis ranges. The red line is the identity line. For 74% of DRC, the absolute errors of simulated DRCs using I₀(t) was higher than that from A₁(t)/A₀(t). The slope of the linear regression of the absolute value of the relative error on the correlations (between the “experimental” and theoretical time-course data) was −0.64 for A₁(t)/A₀(t) method and −0.32 for I₀(t) method. Thus, for high values of the correlation (better agreement between the experimental data and theoretical fit), the relative error’s absolute value was smaller for both methods. Figure 3 shows the density plots of the computed-to-actual DRC ratios for the two methods. The data was filtered to keep only those that have corresponding correlations higher than 0.8. 89% and 65% of data passed the filtering for A₁(t)/A₀(t) and I₀(t) methods, respectively. Both density functions have a mode near one, as expected. However, the method using A₁(t)/A₀(t) (blue curve) retained more data (after the filtering by correlation) and had a smaller standard deviation.

An external file that holds a picture, illustration, etc.
Object name is nihms-1778742-f0002.jpg

Figure 2.

For many peptides, abssolute errors of rate constants obtained from the alternative approach were smaller than those from the traditional method. Scatter plot of absolute errors of rate constants estimations using A₁(t)/A₀(t) (x-axis) and I₀(t) (y-axis). The red line is the line of identity.

An external file that holds a picture, illustration, etc.
Object name is nihms-1778742-f0003.jpg

Figure 3.

In simulations, rate constants obtained using A₁(t)/A₀(t) ratio showed better agreement with actual values. Density plots of the computed-to-actual DRC ratios from the traditional (black line) and alternative (blue line) methods.

3.3. Direct computations of rate constants of the murine liver proteome

Next, we used both approaches to calculate the DRCs from the murine liver proteome. Out of the 21706 peptide entries, 6935 passed the correlation (0.975) and residual sum of squares (RSS) (0.001) filtering from both methods. 8059 and 11110 peptide entries passed the thresholds in the traditional and new methods, respectively. The scatter plot of the DRCs that passed more stringent filtering by both methods is shown in Figure 4. On the x-axis are the DRCs using two mass isotopomers, on the y-axis are the DRCs computed using the traditional method. In this figure, the rate constants were filtered to include only those that correlated with the experimental data with the Pearson correlation of 0.995 or better (from both methods).

An external file that holds a picture, illustration, etc.
Object name is nihms-1778742-f0004.jpg

Figure 4.

The rate constants obtained by the alternative and traditional methods are close for the data filtered by correlation. This is shown by the scatter plot of DRCs computed using I₀(t) (y-axis) and A₁(t)/A₀(t) ratio (x-axis) for the murine liver proteome data set.

4.1. Results and Discussions.

We have derived novel, closed-form equations that describe mass isotopomer dynamics of a peptide during deuterium incorporation from heavy water metabolic labeling. The equations provide mechanistic insights into the changes occurring in the isotope profiles. Eq. (3) shows that the time-course evolution of M₁ mass isotopomer is contributed by two terms. The contribution from I₀(0) increases over the duration of labeling, while the contribution from I₁(0) decreases. The time-course of the M₂ mass isotopomer, Eq. (6), is similar. It is determined by two terms: I₁(t) and I₀(t). It can also be expressed solely by the enrichment in the deuterium, p_X(t), and the N_EH value of a peptide. Using the derived equations, for every peptide, complete dynamics of the first four mass isotopomers can be generated from the values of I₀(0), I₁(0), I₂(0), and I₃(0). Figure S1 of Supplementary Information shows the dynamics of the mass isotopomers for the peptide sequence, IQDAGLVLADALR, generated theoretically. Previously, the generation of the mass isotopomer profiles required theoretical simulations of the relative abundances of the peptide, given the enrichment with the deuterium (and the N_EH value). Using the closed-form equations, the dynamics is easily computed.

The developed equations can be used in various practical applications and to update analytical techniques currently used to analyze LC-MS data from metabolic labeling. Recently, Ilchenko and colleagues²¹ developed an approximation for determining N_EH at the plateau enrichment. They approximated RAs by considering the first two mass isotopomers only. From Eq. (4), there is an exact formula between the raw abundances obtained from two mass isotopomers and the N_EH value:

\frac{A_{0} (t)}{A_{0} (t) + A_{1} (t)} = \frac{I_{0} (0)}{I_{0} (0) + I_{1} (0) + I_{0} (0) N_{E H} \frac{p_{W} - p_{H}}{(1 - p_{H}) (1 - p_{W})}}

The approximate equation of Ilchenko did not contain the division of the third term in the denominator by $((1 - p_{H}) (1 - p_{W}))$ . Alternatively, Eq. (7) can be used to compute N_EH directly from the ratio of the abundances of the first two mass isotopomers.

Formula based results provide several advantages. The estimation of RA of monoisotope at the plateau has been an integral part of the models for extracting protein turnover rates. Because of the known formula, we can now readily compute the exact form of the expected abundances for heavy mass isotopomers. For example, for the first and second heavy mass isotopomers, they are:

I_{1}^{a s y m p} = {(1 - \frac{p_{W} - p_{H}}{1 - p_{H}})}^{N_{E H} - 1} \frac{p_{W} - p_{H}}{{(1 - p_{H})}^{2}} N_{E H} I_{0} (0) + {(1 - \frac{p_{W} - p_{H}}{1 - p_{H}})}^{N_{E H}} I_{1} (0)

I_{2}^{a s y m p} = (\frac{I_{2} (0)}{I_{0} (0)} - \frac{I_{1} (0)}{I_{0} (0)} \frac{p_{H} N_{E H}}{(1 - p_{H})} + {(\frac{p_{H}}{1 - p_{H}})}^{2} \frac{N_{E H} (N_{E H} + 1)}{2} - {(\frac{p_{W}}{1 - p_{W}})}^{2} \frac{N_{E H} (N_{E H} + 1)}{2}) I_{0}^{a s y m p} + \frac{N_{E H} p_{W}}{(1 - p_{W})} I_{1}^{a s y m p}

In previous approaches²², these quantities were obtained from simulations. The explicit form of I₀ at the asymptote is well known and obtained from Eq. (1) by replacing (p_X(t)+p_H) with p_W.

The newly derived equations provide an alternative approach to the estimation of rate constants. Currently, only 30-40% of all quantified peptides are viable for protein turnover estimations²³. For the rest of the peptides, goodness-of-fit measures (RSS and the correlation between experimental data and theoretical fit) are below acceptable thresholds. One of the reasons for the low quality of quantification is the complexity of proteomes in mammalian samples. Even in high-resolution and mass accuracy mass analyzers, there are many co-eluting species that interfere with the peak detection and quantification of target peptides. When using many mass isotopomers, the chance of co-elution with at least one of them increases, and, through the normalization, the co-elution affects the RAs for all other mass isotopomers. One alternative approach, which the new closed-form equations allow, is to estimate the DRCs from two mass isotopomers only. As was noted by Anderson and coworkers¹⁶, the use of the two mass isotopomers, M₀(t) and M₁(t), is expected to be more accurate. For peptides, these two signals are, in general, the most abundant in the isotope profile. The time-course of A₁(t)/A₀(t) requires no normalization. Besides, a detailed analysis of spectral accuracy has found that Orbitrap mass spectrometers exhibit lesser errors in determining A₁(t)/A₀(t) ratios compared to those of the other mass isotopmers²⁴. In Supplementary Information, we describe simulations and comparison of the errors in estimating I₀(t) and A₁(t)/A₀(t) using the murine liver proteome data set. Figure S4 in Supplementary Information shows the comparison of the relative errors. Combining two methods increased the number of peptide entries with the errors of 5% or less (from the theoretical value) by 31%.

We have tested an alternative approach to estimating rate constants using only two mass isotopomers. At each time point of labeling, one estimates p_X(t) using N_EH and A₁(t)/A₀(t) ratio in Eq. (5). The calculated p_X(t), N_EH, and the theoretical value of I₀(0) are inserted into Eq. (2) to reconstruct the time-course of I₀(t). The time-course data is, then, used in non-linear least-square regression, Eq. (1), to estimate k. It is a numerically simple approach that uses the smallest number of mass isotopomers and does not employ approximations or expensive numerical calculations. The errors in the estimation of A₁(t)/A₀(t) and I₀(t) in each case will determine which method to use for DRC. If both errors are small, they produce the same DRCs, Figure S2.

Figure 5 shows an example of DRC estimation using the A₁(t)/A₀(t) ratio. For the peptide sequence QIAAVMQR (cytoplasmic Aspartate aminotransferase), the non-linear least-squares fit to the experimental data of I₀(t) (black circles) fails to converge. The fit to the data from A₁(t)/A₀(t) (blue diamonds) converges. The Pearson correlation between the fit and the experimental data was 0.99. The RSS was 0.0001. The calculated DRC was 0.11 day^-1. The median of DRCs of 27 other peptides of the protein was 0.13 day⁻¹.

An external file that holds a picture, illustration, etc.
Object name is nihms-1778742-f0005.jpg

Figure 5.

The alternative method computes the rate constant when the traditional method does not converge due to the contamination of the isotope profile of a peptide by a co-eluting species. The y-axis is the RA of the monoisotope, and the x-axis is the labeling duration.

The closed-form equations for extracting deuterium enrichment from the abundance of only two mass isotopomers will potentially be useful for efforts to quantify label incorporation using isotope distributions of fragment ions in MS². We²⁵ and others¹⁶ have shown that label incorporation is also exhibited in the MS² spectra. However, most of the time, Orbitrap mass analyzers report truncated isotope profile (only two mass isotopomers) of fragment ions²⁵, which is enough for the closed-form equations derived here.

Eqs. (4) and (6) uniquely define the time-course of the A₂(t)/A₁(t) ratio, as well. In the cases when a contaminant interferes with the monoisotopic peak of a peptide, the A₂(t)/A₁(t) ratio can be used for the analysis of the label incorporation.

In metabolic labeling using heavy water, deuterium atoms are incorporated into non-essential amino acids (NEAAs) and subsequently into proteins. There are a certain number of hydrogens (mostly in sidechains of NEAAs) in a peptide that are accessible to the deuterium in the heavy water. Thus, the label (deuterium) incorporation into a peptide is a function of the rate constant, N_EH, p_W, and the labeling duration. The estimation of the rate constant is sensitive to the accuracy of N_EH determination. In principle, the N_EH for each amino acid can be determined using GC-MS²⁶. In practice, the N_EH values of mouse amino acids, initially determined by Commerford and coworkers²⁷ (using metabolic labeling with tritiated water), have been used for other species⁹. Eqs. (6) and (7) allow to uniquely determine N_EH and p_X(t) simultaneously from the ratios of the raw abundances of the first three mass isotopomers, Figure 1. For improved accuracy of the rate constant estimation and extending its applications to other species, it will be important to determine N_EH values for each species, and preferably from LC-MS. The newly derived closed-form equations provide capabilities for such determinations, while still requiring only a partial isotope profile of a peptide.

Another practical application of the derived equations will be the use of their asymptotic forms to determine how many mass isotopomers are important in the analysis given the enrichment in the diet water, the number of exchangeable hydrogens, the natural isotope profile of a peptide. Currently, one cannot discard any of the high mass isotopomers a priori, because as the peptide is labeled with heavy water, the relative abundances of the high mass isotopomers increase. Using the derived equations, one obtains exact RAs of high mass isotopomers at the asymptote of labeling. An algorithm, then, can make an informed decision about which mass isotopomers to keep for the quantification. Keeping the most important (but a small number) of mass isotopomers is important for reducing the chances of interferences by the co-eluting contaminants. This application will help improve accuracy in the traditional method.

The developed closed-form equations are not specific to the labeling with deuterium. They apply to other atom-based labeling agents. For example, to describe the dynamics of RAs resulting from ¹⁵N labeling⁴, one needs to replace N_EH with the number of accessible nitrogen sites, and p_H with the natural abundance of ¹⁵N isotope. The p_X(t) determined from the Eq. (5) will then be the enrichment of peptide with ¹⁵N.

The developments in this work complement our previous work on automating rate constant estimations¹³. That work developed an algorithm for peak detection and integration in a three-dimensional (m/z, elution time, and abundance) space. It then used the complete isotope profile of a peptide to determine the RA of the monoisotope. The time-course of RA during metabolic labeling was used to determine the rate constant. The process was automated to analyze data from high-throughput experiments. In that work, we did not employ any mass isotopomer algebra beyond calculating the RA of monoisotope from the complete isotope profile. In this work, we developed the closed-form equations for the first four mass isotopomers during the metabolic labeling with heavy water. The use of these equations in facilitating the data analysis is discussed.

Go to:

Conclusion.

We present new closed-form equations that link the dynamics of the abundances of the first four mass isotopomers in metabolic labeling with heavy atoms. We show that the mechanistic insights gained into the complex mass isotopomer profiles can be used in practice to help in analyses proteome turnover. Thus, we show that only two mass isotopomers are enough to extract the deuterium enrichment precisely. Using the enrichment, one can reconstruct the RA of monoisotope and use it for rate constant estimation. The theoretical simulations prove the accuracy of the equations. Application to a real data set shows the relevance of the equations for improving DRC estimations in a practical setting.

From any triple of mass isotopomers, the equations allow simultaneous determinations of the number of sites that are accessible to heavy isotope atom and the enrichment in the heavy isotope. Equations for all four mass isotopomers can help to make an informed decision on the importance of the high mass isotopomers for proteome turnover.

Go to:

Supplementary Material

Supporting

Click here to view.^{(699K, docx)}

Go to:

Acknowledgements.

The research reported in this publication was supported in part by the NIGMS of the NIH under Award Number R01GM112044. The content is solely the responsibility of the author and does not necessarily represent the official views of the National Institutes of Health.

Go to:

Abbreviations

Eq	equation
FFT	fast-Fourier transform
GC	gas chromatography
LC-MS	liquid chromatography and mass spectrometry
LDLR	low-density lipoprotein receptor
MPE	molar percent excess
NEAA	non-essential amino acid
NEH	number of exchangeable hydrogens
RA	relative abundance
RSS	residual sum of squares

Go to:

Footnotes

Supporting Information. Supporting Note: The derivations, numerical validation of the closed-form equations, and tests using LC-MS data from heavy water metabolic labeling of mice. Figure S1: Profiles of the first four mass isotopomers of peptide sequence “IQDAGLVLADALR”. Figure S2: The scatter plot of the simulations of rate constants calculations using traditional and alternative approaches under small noise conditions. Figure S3: The scatter plot of the simulations of rate constants calculations using traditional and alternative approaches under experimental noise. Figure S4: Relative errors of rate constant estimations from two alternative approaches. Figure S5: The scatter plot of the relative errors of A₁(t)/A₀(t) and I₀(t).

Conflict of Interest.

The author declares no conflict of interest.

Go to:

References.

1. Wilkinson DJ, Historical and contemporary stable isotope tracer approaches to studying mammalian protein metabolism. Mass Spectrom Rev 2016. [Europe PMC free article] [Abstract] [Google Scholar]

2. Zhang Y; Fonslow BR; Shan B; Baek MC; Yates JR III, Protein analysis by shotgun/bottom-up proteomics. Chem. Rev 2013, 113 (4), 2343–2394. [Europe PMC free article] [Abstract] [Google Scholar]

3. Papageorgopoulos C; Caldwell K; Shackleton C; Schweingrubber H; Hellerstein MK, Measuring protein synthesis by mass isotopomer distribution analysis (MIDA). Anal Biochem 1999, 267 (1), 1–16. [Abstract] [Google Scholar]

4. Rauniyar N; McClatchy DB; Yates JR 3rd, Stable isotope labeling of mammals (SILAM) for in vivo quantitative proteomic analysis. Methods 2013, 61 (3), 260–8. [Abstract] [Google Scholar]

5. Kasumov T; Ilchenko S; Li L; Rachdaoui N; Sadygov RG; Willard B; McCullough AJ; Previs S, Measuring protein synthesis using metabolic H-2 labeling, high-resolution mass spectrometry, and an algorithm. Analytical Biochemistry 2011, 412 (1), 47–55. [Europe PMC free article] [Abstract] [Google Scholar]

6. Claesen J; Dittwald P; Burzykowski T; Valkenborg D, An efficient method to calculate the aggregated isotopic distribution and exact center-masses. J Am Soc Mass Spectrom 2012, 23 (4), 753–63. [Abstract] [Google Scholar]

7. Alves G; Ogurtsov AY; Yu YK, Molecular Isotopic Distribution Analysis (MIDAs) with adjustable mass accuracy. J Am Soc Mass Spectrom 2014, 25 (1), 57–70. [Europe PMC free article] [Abstract] [Google Scholar]

8. Hellerstein MK; Neese RA, Mass isotopomer distribution analysis at eight years: theoretical, analytic, and experimental considerations. Am J Physiol 1999, 276 (6 Pt 1), E1146–70. [Abstract] [Google Scholar]

9. Price JC; Holmes WE; Li KW; Floreani NA; Neese RA; Turner SM; Hellerstein MK, Measurement of human plasma proteome dynamics with (2)H(2)O and liquid chromatography tandem mass spectrometry. Anal. Biochem 2012, 420 (1), 73–83. [Abstract] [Google Scholar]

10. Guan S; Price JC; Prusiner SB; Ghaemmaghami S; Burlingame AL, A data processing pipeline for mammalian proteome dynamics studies using stable isotope metabolic labeling. Mol. Cell Proteomics 2011, 10 (12), M111. [Europe PMC free article] [Abstract] [Google Scholar]

11. Leger T; Garcia C; Collomb L; Camadro JM, A Simple Light Isotope Metabolic Labeling (SLIM-labeling) Strategy: A Powerful Tool to Address the Dynamics of Proteome Variations In Vivo. Mol Cell Proteomics 2017, 16 (11), 2017–2031. [Europe PMC free article] [Abstract] [Google Scholar]

12. Goh B; Kim J; Seo S; Kim TY, High-Throughput Measurement of Lipid Turnover Rates Using Partial Metabolic Heavy Water Labeling. Anal Chem 2018, 90 (11), 6509–6518. [Abstract] [Google Scholar]

13. Sadygov RG; Avva J; Rahman M; Lee K; Ilchenko S; Kasumov T; Borzou A, d2ome, Software for in Vivo Protein Turnover Analysis Using Heavy Water Labeling and LC-MS, Reveals Alterations of Hepatic Proteome Dynamics in a Mouse Model of NAFLD. J Proteome Res 2018, 17 (11), 3740–3748. [Europe PMC free article] [Abstract] [Google Scholar]

14. Perkins DN; Pappin DJ; Creasy DM; Cottrell JS, Probability-based protein identification by searching sequence databases using mass spectrometry data. Electrophoresis 1999, 20 (18), 3551–3567. [Abstract] [Google Scholar]

15. Angel TE; Naylor BC; Price JC; Evans C; Szapacs M, Improved Sensitivity for Protein Turnover Quantification by Monitoring Immonium Ion Isotopologue Abundance. Anal Chem 2019, 91 (15), 9732–9740. [Abstract] [Google Scholar]

16. Wang B; Sun G; Anderson DR; Jia M; Previs S; Anderson VE, Isotopologue distributions of peptide product ions by tandem mass spectrometry: quantitation of low levels of deuterium incorporation. Anal Biochem 2007, 367 (1), 40–8. [Europe PMC free article] [Abstract] [Google Scholar]

17. Naylor BC; Porter MT; Wilson E; Herring A; Lofthouse S; Hannemann A; Piccolo SR; Rockwood AL; Price JC, DeuteRater: a tool for quantifying peptide isotope precision and kinetic proteomics. Bioinformatics 2017, 33 (10), 1514–1520. [Abstract] [Google Scholar]

18. Rockwood AL; Haimi P, Efficient calculation of accurate masses of isotopic peaks. J. Am. Soc. Mass Spectrom 2006, 17 (3), 415–419. [Abstract] [Google Scholar]

19. R Core Team R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing: Vienna, Austria, 2019. [Google Scholar]

20. Sadygov VR; Zhang W; Sadygov RG, Timepoint Selection Strategy for In Vivo Proteome Dynamics from Heavy Water Metabolic Labeling and LC-MS. J Proteome Res 2020, 19 (5), 2105–2112. [Europe PMC free article] [Abstract] [Google Scholar]

21. Ilchenko S; Haddad A; Sadana P; Recchia FA; Sadygov RG; Kasumov T, Calculation of the Protein Turnover Rate Using the Number of Incorporated (2)H Atoms and Proteomics Analysis of a Single Labeled Sample. Anal Chem 2019, 91 (22), 14340–14351. [Europe PMC free article] [Abstract] [Google Scholar]

22. Busch R; Kim YK; Neese RA; Schade-Serin V; Collins M; Awada M; Gardner JL; Beysen C; Marino ME; Misell LM; Hellerstein MK, Measurement of protein turnover rates by heavy water labeling of nonessential amino acids. Biochim. Biophys. Acta 2006, 1760 (5), 730–744. [Abstract] [Google Scholar]

23. Lau E; Cao Q; Ng DC; Bleakley BJ; Dincer TU; Bot BM; Wang D; Liem DA; Lam MP; Ge J; Ping P, A large dataset of protein dynamics in the mammalian heart proteome. Sci Data 2016, 3, 160015. [Europe PMC free article] [Abstract] [Google Scholar]

24. Su X; Lu W; Rabinowitz JD, Metabolite Spectral Accuracy on Orbitraps. Anal Chem 2017, 89 (11), 5940–5948. [Europe PMC free article] [Abstract] [Google Scholar]

25. Borzou A; Sadygov VR; Zhang W; Sadygov RG, Proteome dynamics from heavy water metabolic labeling and peptide tandem mass spectrometry. International Journal of Mass Spectrometry 2019, 445, 116194. [Europe PMC free article] [Abstract] [Google Scholar]

26. Herath K; Bhat G; Miller PL; Wang SP; Kulick A; Andrews-Kelly G; Johnson C; Rohm RJ; Lassman ME; Previs SF; Johns DG; Hubbard BK; Roddy TP, Equilibration of (2)H labeling between body water and free amino acids: enabling studies of proteome synthesis. Anal Biochem 2011, 415 (2), 197–9. [Abstract] [Google Scholar]

27. Commerford SL; Carsten AL; Cronkite EP, The distribution of tritium among the amino acids of proteins obtained from mice exposed to tritiated water. Radiat Res 1983, 94 (1), 151–5. [Abstract] [Google Scholar]

Full text links

Read article at publisher's site: https://doi.org/10.1021/acs.analchem.0c03343

Read article for free, from open access legal sources, via Unpaywall: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8880304

Citations & impact

Impact metrics

Citations

Jump to Citations

Citations of article over time

Alternative metrics

Altmetric item for https://www.altmetric.com/details/92818098

Altmetric
Discover the attention surrounding your research
https://www.altmetric.com/details/92818098

Article citations

Flexible Quality Control for Protein Turnover Rates Using d2ome.
Deberneh HM, Sadygov RG
Int J Mol Sci, 24(21):15553, 25 Oct 2023
Cited by: 0 articles | PMID: 37958536 | PMCID: PMC10649227
This article is in the Europe PMC Open access subset. Refer to the copyright information in the article for licensing details.
Free full text in Europe PMC
Quantifying label enrichment from two mass isotopomers increases proteome coverage for in vivo protein turnover using heavy water metabolic labeling.
Deberneh HM, Abdelrahman DR, Verma SK, Linares JJ, Murton AJ, Russell WK, Kuyumcu-Martinez MN, Miller BF, Sadygov RG
Commun Chem, 6(1):72, 17 Apr 2023
Cited by: 4 articles | PMID: 37069333 | PMCID: PMC10110577
This article is in the Europe PMC Open access subset. Refer to the copyright information in the article for licensing details.
Free full text in Europe PMC
FAMetA: a mass isotopologue-based tool for the comprehensive analysis of fatty acid metabolism.
Alcoriza-Balaguer MI, García-Cañaveras JC, Benet M, Juan-Vidal O, Lahoz A
Brief Bioinform, 24(2):bbad064, 01 Mar 2023
Cited by: 1 article | PMID: 36857618 | PMCID: PMC10025582
This article is in the Europe PMC Open access subset. Refer to the copyright information in the article for licensing details.
Free full text in Europe PMC
Software Tool for Visualization and Validation of Protein Turnover Rates Using Heavy Water Metabolic Labeling and LC-MS.
Deberneh HM, Sadygov RG
Int J Mol Sci, 23(23):14620, 23 Nov 2022
Cited by: 4 articles | PMID: 36498948 | PMCID: PMC9740640
This article is in the Europe PMC Open access subset. Refer to the copyright information in the article for licensing details.
Free full text in Europe PMC
TurnoveR: A Skyline External Tool for Analysis of Protein Turnover in Metabolic Labeling Studies.
Basisty N, Shulman N, Wehrfritz C, Marsh AN, Shah S, Rose J, Ebert S, Miller M, Dai DF, Rabinovitch PS, Adams CM, MacCoss MJ, MacLean B, Schilling B
J Proteome Res, 22(2):311-322, 27 Sep 2022
Cited by: 4 articles | PMID: 36165806 | PMCID: PMC10066879
Free full text in Europe PMC

Go to all (9) article citations

Data

Data behind the article

This data has been text mined from the article, or deposited into data resources.

BioStudies: supplemental material and supporting data

http://www.ebi.ac.uk/biostudies/studies/S-EPMC8880304?xr=true

ProteomeXchange

(1 citation) ProteomeXchange - PXD009493

Funding

Funders who supported this work.

NIGMS NIH HHS (1)

Grant ID: R01 GM112044
38 publications

National Institute of General Medical Sciences (1)

Grant ID: R01GM112044
11 publications

Search life-sciences literature (45,103,589 articles, preprints and more)

Partial Isotope Profiles Are Sufficient for Protein Turnover Analysis Using Closed-Form Equations of Mass Isotopomer Dynamics.

Author information

Affiliations

Authors

ORCIDs linked to this article

Abstract

Free full text

Partial Isotope Profiles are Sufficient for Protein Turnover Analysis using Closed-form Equations of Mass Isotopomer Dynamics

Associated Data

Abstract

Graphical Abstract

Introduction

2.1. Data Description.

2.2. Novel closed-form equations of mass isotopomer dynamics during metabolic labeling

3.1. Simulations validate the derived equations

3.2. Tests on simulated rate constants

3.3. Direct computations of rate constants of the murine liver proteome

4.1. Results and Discussions.

Conclusion.

Supplementary Material

Supporting

Acknowledgements.

Abbreviations

Footnotes

References.

Full text links

Citations & impact

Impact metrics

Citations of article over time

Alternative metrics

Article citations

Data

Data behind the article

BioStudies: supplemental material and supporting data

ProteomeXchange

Similar Articles

Funding

NIGMS NIH HHS (1)﻿

National Institute of General Medical Sciences (1)﻿

Partnerships & funding

NIGMS NIH HHS (1)

National Institute of General Medical Sciences (1)