Summary of Study ST001430
This data is available at the NIH Common Fund's National Metabolomics Data Repository (NMDR) website, the Metabolomics Workbench, https://www.metabolomicsworkbench.org, where it has been assigned Project ID PR000918. The data can be accessed directly via its Project DOI: 10.21228/M81H58. This work is supported by NIH grant U2C-DK119886.
See: https://www.metabolomicsworkbench.org/about/howtocite.php
This study contains a large results data set that is not included in the mwTab file. The results are only available for download via FTP as data file(s).
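The study-level metadata listed below can also be requested programmatically. The following is a minimal sketch, assuming the Metabolomics Workbench REST service follows the URL pattern `<base>/study/study_id/<ID>/<output item>`; the helper names `study_url` and `fetch_json` are hypothetical, and the set of supported output items should be checked against the REST documentation before relying on it.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Assumed base URL of the Metabolomics Workbench REST service.
BASE_URL = "https://www.metabolomicsworkbench.org/rest"


def study_url(study_id: str, output_item: str = "summary") -> str:
    """Build a study-context REST URL (hypothetical helper)."""
    return f"{BASE_URL}/study/study_id/{quote(study_id)}/{quote(output_item)}"


def fetch_json(url: str, timeout: float = 30.0):
    """Download and decode a JSON response (requires network access)."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


# Building the URL needs no network access.
print(study_url("ST001430"))
# To retrieve the record itself, uncomment:
# summary = fetch_json(study_url("ST001430"))
```

Only the study summary is requested here; the large results table mentioned above is distributed separately via FTP and is not covered by this sketch.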
Study ID | ST001430 |
Study Title | Metabolic dynamics and prediction of gestational age and time to delivery in pregnant women |
Study Summary | Metabolism during pregnancy is a constantly changing yet precisely programmed process, the failure of which may have devastating consequences for the fetus. To capture in high resolution the sequence of metabolic events underlying the normal human pregnancy, we carried out an untargeted metabolome investigation on 784 weekly blood samples collected from 30 Danish pregnant women. The study revealed extensive metabolome alterations over the course of normal pregnancy: of 9,651 detected metabolic features, 4,995 were significantly changed (FDR < 0.05). Many metabolic changes were timed precisely according to pregnancy progression so that the overall metabolic profile demonstrated a highly choreographed pattern. Using machine-learning methods, we were able to build a linear model with five metabolites (four steroids and one phospholipid) that predicts gestational age with high accuracy (Pearson correlation coefficient, R = 0.95). |
Institute | Stanford University |
Laboratory | Snyder lab |
Last Name | Liang |
First Name | Liang |
Address | Alway M339, 300 Pasteur Drive, Palo Alto, California, 94305, USA |
Email | liangtro@stanford.edu |
Phone | +1 8167852490 |
Submit Date | 2019-08-30 |
Raw Data Available | Yes |
Raw Data File Type(s) | mzXML |
Analysis Type Detail | LC-MS |
Release Date | 2020-07-24 |
Release Version | 1 |
Project:
Project ID: | PR000918 |
Project DOI: | doi: 10.21228/M81H58 |
Project Title: | Metabolic dynamics and prediction of gestational age and time to delivery in pregnant women |
Project Summary: | Metabolism during pregnancy is a constantly changing yet precisely programmed process, the failure of which may have devastating consequences for the fetus. To capture in high resolution the sequence of metabolic events underlying the normal human pregnancy, we carried out an untargeted metabolome investigation on 784 weekly blood samples (3 outlier samples were removed) collected from 30 Danish pregnant women. The study revealed extensive metabolome alterations over the course of normal pregnancy: of 9,651 detected metabolic features, 4,995 were significantly changed (FDR < 0.05). Many metabolic changes were timed precisely according to pregnancy progression so that the overall metabolic profile demonstrated a highly choreographed pattern. Using machine-learning methods, we were able to build a linear model with five metabolites (four steroids and one phospholipid) that predicts gestational age with high accuracy (Pearson correlation coefficient, R = 0.95). |
Institute: | Stanford University |
Last Name: | Liang |
First Name: | Liang |
Address: | Alway M339, 300 Pasteur Drive, Palo Alto, California, 94305, USA |
Email: | liangtro@stanford.edu |
Phone: | 8167852490 |
Publications: | https://doi.org/10.1016/j.cell.2020.05.002 |
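The project summary reports a five-feature linear model evaluated by the Pearson correlation between predicted and observed gestational age. The sketch below illustrates that kind of evaluation on synthetic data only. Ordinary least squares is used as a stand-in, since the record does not state the fitting procedure, and the feature selection step is not shown. The weights, sample counts, and train/test split are arbitrary assumptions.

```python
import numpy as np


def fit_linear(X, y):
    """Ordinary least squares with an intercept; returns [intercept, w1, ..., wp]."""
    A = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef


def predict(coef, X):
    """Apply the fitted intercept and weights to new samples."""
    return coef[0] + X @ coef[1:]


def pearson_r(a, b):
    """Pearson correlation coefficient between two 1-D arrays."""
    return float(np.corrcoef(a, b)[0, 1])


def simulate(n_samples=200, seed=0):
    """Synthetic stand-in: five feature columns and a gestational age in weeks."""
    rng = np.random.default_rng(seed)
    X = rng.normal(size=(n_samples, 5))
    weights = np.array([2.0, -1.0, 0.5, 1.5, -0.7])  # arbitrary, not from the study
    y = 22.0 + X @ weights + rng.normal(scale=1.0, size=n_samples)
    return X, y


X, y = simulate()
train, test = slice(0, 140), slice(140, None)  # arbitrary split
coef = fit_linear(X[train], y[train])
r = pearson_r(predict(coef, X[test]), y[test])
print(f"held-out Pearson R = {r:.2f}")
```

Reporting R on held-out samples rather than on the training samples mirrors the discovery/validation design described in the collection summary.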
Subject:
Subject ID: | SU001504 |
Subject Type: | Human |
Subject Species: | Homo sapiens |
Taxonomy ID: | 9606 |
Gender: | Female |
Species Group: | Mammals |
Factors:
Subject type: Human; Subject species: Homo sapiens
mb_sample_id | local_sample_id | Gestational age Range |
---|---|---|
SA120934 | 635 | >20 |
SA120935 | 235 | >20 |
SA120936 | 147 | >20 |
SA120937 | 166 | >20 |
SA120938 | 551 | >20 |
SA120939 | 173 | >20 |
SA120940 | 255 | >20 |
SA120941 | 16 | >20 |
SA120942 | 644 | >20 |
SA120943 | 215 | >20 |
SA120944 | 313 | >20 |
SA120945 | 209 | >20 |
SA120946 | 42 | >20 |
SA120947 | 460 | >20 |
SA120948 | 1 | >20 |
SA120949 | 394 | >20 |
SA120950 | 699 | >20 |
SA120951 | 225 | >20 |
SA120952 | 23 | >20 |
SA120953 | 89 | >20 |
SA120954 | 420 | >20 |
SA120955 | 256 | >20 |
SA120956 | 95 | >20 |
SA120957 | 485 | >20 |
SA120958 | 740 | >20 |
SA120959 | 662 | >20 |
SA120960 | 643 | >20 |
SA120961 | 743 | >20 |
SA120962 | 38 | >20 |
SA120963 | 248 | >20 |
SA120964 | 788 | >20 |
SA120965 | 590 | >20 |
SA120966 | 609 | >20 |
SA120967 | 619 | >20 |
SA120968 | 78 | >20 |
SA120969 | 167 | >20 |
SA120970 | 728 | >20 |
SA120971 | 597 | >20 |
SA120972 | 473 | >20 |
SA120973 | 130 | >20 |
SA120974 | 732 | >20 |
SA120975 | 278 | >20 |
SA120976 | 368 | >20 |
SA120977 | 88 | >20 |
SA120978 | 132 | >20 |
SA120979 | 20 | >20 |
SA120980 | 19 | >20 |
SA120981 | 655 | >20 |
SA120982 | 36 | >20 |
SA120983 | 238 | >20 |
SA120984 | 123 | >20 |
SA120985 | 34 | >20 |
SA120986 | 247 | >20 |
SA120987 | 717 | >20 |
SA120988 | 752 | >20 |
SA120989 | 742 | >20 |
SA120990 | 486 | >20 |
SA120991 | 312 | >20 |
SA120992 | 385 | >20 |
SA120993 | 335 | >20 |
SA120994 | 283 | >20 |
SA120995 | 631 | >20 |
SA120996 | 570 | >20 |
SA120997 | 49 | >20 |
SA120998 | 588 | >20 |
SA120999 | 554 | >20 |
SA121000 | 322 | >20 |
SA121001 | 310 | >20 |
SA121002 | 469 | >20 |
SA121003 | 304 | >20 |
SA121004 | 766 | >20 |
SA121005 | 663 | >20 |
SA121006 | 555 | >20 |
SA121007 | 471 | >20 |
SA121008 | 457 | >20 |
SA121009 | 287 | >20 |
SA121010 | 746 | >20 |
SA121011 | 521 | >20 |
SA121012 | 776 | >20 |
SA121013 | 390 | >20 |
SA121014 | 279 | >20 |
SA121015 | 793 | >20 |
SA121016 | 496 | >20 |
SA121017 | 458 | >20 |
SA121018 | 376 | >20 |
SA121019 | 715 | >20 |
SA121020 | 432 | >20 |
SA121021 | 526 | >20 |
SA121022 | 673 | >20 |
SA121023 | 375 | >20 |
SA121024 | 580 | >20 |
SA121025 | 523 | >20 |
SA121026 | 726 | >20 |
SA121027 | 545 | >20 |
SA121028 | 488 | >20 |
SA121029 | 510 | >24 |
SA121030 | 628 | >24 |
SA121031 | 585 | >24 |
SA121032 | 149 | >24 |
SA121033 | 155 | >24 |
Collection:
Collection ID: | CO001499 |
Collection Summary: | To capture the highly dynamic pregnancy process, we established a multi-year single-center Danish normal pregnancy cohort with a unique design of high-density blood sampling. Consented female participants submitted weekly blood draws beginning in week 5 of pregnancy until the postpartum period. A total of 30 women with weekly blood samples were assigned to a discovery (N=21) and a validation (Validation-1, N=9) cohort, whose samples were analyzed in two separate years. |
Sample Type: | Blood (plasma) |
Storage Conditions: | -80℃ |
Treatment:
Treatment ID: | TR001519 |
Treatment Summary: | No treatment. |
Sample Preparation:
Sampleprep ID: | SP001512 |
Sampleprep Summary: | 784 normal pregnancy samples (3 outlier samples were removed) were completely randomized within each cohort (Discovery and Validation-1) and analyzed in 12 batches across two years. 200 μL of plasma was extracted by mixing it with 800 μL of 1:1:1 acetone:acetonitrile:methanol with the internal standard mixture. The extraction mixture was vortexed and mixed for 15 min at 4 °C and incubated at -20 °C for 2 hours to allow protein precipitation. The supernatant was collected after centrifugation and evaporated to dryness under nitrogen (Biotage Turbovap). The dry extracts were reconstituted with 200 μL of 1:1 methanol:water before analysis. |
Combined analysis:
Analysis ID | AN002391 | AN002392 |
---|---|---|
Analysis type | MS | MS |
Chromatography type | Reversed phase | Reversed phase |
Chromatography system | Thermo Dionex Ultimate 3000 | Thermo Dionex Ultimate 3000 |
Column | Agilent Zorbax Eclipse Plus C18 (100 x 2.1mm, 1.8 um) | Agilent Zorbax Eclipse Plus C18 (100 x 2.1mm, 1.8 um) |
MS Type | ESI | ESI |
MS instrument type | Orbitrap | Orbitrap |
MS instrument name | Thermo Q Exactive Plus Orbitrap | Thermo Q Exactive Plus Orbitrap |
Ion Mode | POSITIVE | NEGATIVE |
Units | peak area | peak area |
Chromatography:
Chromatography ID: | CH001758 |
Chromatography Summary: | Chromatographic conditions: RPLC separation was performed using Zorbax SB columns (2.1 × 50 mm, 1.8 μm, 600 bar; 827700-914) purchased from Agilent Technologies (Santa Clara, CA, USA). Mobile phases for RPLC consisted of 0.06% acetic acid in water (phase A) and 0.06% acetic acid in MeOH (phase B). Metabolites were eluted from the column at a flow rate of 0.6 mL/min, leading to a backpressure of 220–280 bar at 99% phase A. A linear 1%–80% phase B gradient was applied over 9–10 min. The oven temperature was set to 60 °C, and the sample injection volume was 5 μL. |
Instrument Name: | Thermo Dionex Ultimate 3000 |
Column Name: | Agilent Zorbax Eclipse Plus C18 (100 x 2.1mm, 1.8 um) |
Chromatography Type: | Reversed phase |
MS:
MS ID: | MS002233 |
Analysis ID: | AN002391 |
Instrument Name: | Thermo Q Exactive Plus Orbitrap |
Instrument Type: | Orbitrap |
MS Type: | ESI |
MS Comments: | MS acquisition: Metabolic extracts were analyzed by reversed-phase liquid chromatography (RPLC)-mass spectrometry (MS) in both positive and negative ionization modes. Thermo Q Exactive Plus hybrid quadrupole-Orbitrap and Q Exactive mass spectrometers (Xcalibur, Thermo Scientific, San Jose, CA, USA) were operated in full MS-scan mode for data acquisition (acquisition from m/z 500 to 2,000) with a scan rate of approximately 4 Hz and a resolution set at 30,000 (at m/z 400). The MS/MS spectra of the top 10 parent ions in the QC sample were acquired at two fragmentation energies (25 NCE and 50 NCE). The resulting mass spectra were exported into Progenesis QI software (Nonlinear Dynamics, Durham, NC, USA) for further processing. Metabolomics data processing: Metabolomic features, each defined by a unique mass-to-charge ratio and retention time, were extracted, aligned, and quantified with the Progenesis QI software (http://www.nonlinear.com/progenesis/qi/). Peak deconvolution was performed under default settings in Progenesis QI. Acquired data were processed using an analysis pipeline written in R (https://www.R-project.org). Progenesis QI output was first filtered by removing all metabolites that were quantified in fewer than 30% of the samples or had a median intensity of less than twofold signal over the noise threshold (S/N < 2). The noise threshold was estimated as the median signal across all the blank runs (if no quantitation was reported in any of the blank runs, the feature was also included in the analysis, as it likely had good S/N characteristics). The data were then log-transformed and normalized. For each run, the median of all features was centered to correct for variation in the sample amount.
Then, for each analyte, a linear correction was applied per batch to correct for any linear decrease or increase in abundance during the acquisition of a batch. In short, for each analyte and each batch, a linear model was fitted with the log-abundance of the analyte as the dependent variable and the acquisition number [run order (randomized)] as the independent variable. The model prediction was interpreted as an underlying drift in mass spectrometric sensitivity and subtracted from the analyte level to yield within-batch normalized abundances. Finally, for each analyte, the abundances were median centered by batch to correct for sensitivity differences between batches. The positive- and negative-mode features were then concatenated for downstream analysis. In total, 9,651 features were included in the final analysis. In addition, any sample with more than 50% of its values missing was removed (one sample in total). The remaining missing values were imputed from the nearest 10 neighbors using the k-Nearest Neighbor algorithm (Altman, 1992). Note that Discovery and Test Set 1 were normalized together, while samples of Test Set 2 were normalized independently. We applied principal component analysis (PCA) to examine the overall distribution of the sample data (with all 9,651 features) and check the run quality. The gestational ages (based on first-trimester ultrasound measurements) were superimposed to facilitate the analysis. The vast majority of the samples were separated into pre- and postpartum groups in the PCA space defined by the two components that explained the largest variation (PC1 and PC2, Figure 1B), while two samples from the same subject (the last two in her collection, before and after childbirth) displayed irregular behavior in PCA and unsupervised clustering analysis. These two samples were treated as outliers and excluded from further analysis.
We also performed partial least-squares discriminant analysis (PLSDA) according to the categories of gestational age (using the mixOmics package). |
Ion Mode: | POSITIVE |
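The normalization described in the MS comments above (per-run median centering, per-batch linear drift correction against run order, then per-batch median centering) can be sketched as follows. This is an illustrative NumPy version, not the authors' R pipeline. The function name `normalize` and the array layout (runs in rows, features in columns) are assumptions, and the input is assumed to be already log-transformed with no missing values.

```python
import numpy as np


def normalize(log_abund, run_order, batch):
    """Sketch of the three normalization steps described in the MS comments.

    log_abund: (n_runs, n_features) log-transformed abundances, no NaNs (assumed).
    run_order: (n_runs,) acquisition number of each run.
    batch:     (n_runs,) batch label of each run.
    """
    X = np.asarray(log_abund, dtype=float).copy()
    run_order = np.asarray(run_order, dtype=float)
    batch = np.asarray(batch)

    # Step 1: center the median of all features in each run.
    X -= np.median(X, axis=1, keepdims=True)

    for b in np.unique(batch):
        idx = batch == b
        t = run_order[idx]
        # Step 2: fit abundance ~ run order for every feature in this batch
        # (np.polyfit fits each column of a 2-D y independently) and
        # subtract the model prediction, interpreted as sensitivity drift.
        slope, intercept = np.polyfit(t, X[idx], deg=1)
        X[idx] -= np.outer(t, slope) + intercept
        # Step 3: median-center each feature within the batch.
        X[idx] -= np.median(X[idx], axis=0, keepdims=True)
    return X
```

After this correction, a straight-line fit of any feature against run order within a batch has zero slope, and each feature's per-batch median is zero.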
MS ID: | MS002234 |
Analysis ID: | AN002392 |
Instrument Name: | Thermo Q Exactive Plus Orbitrap |
Instrument Type: | Orbitrap |
MS Type: | ESI |
MS Comments: | MS acquisition: Metabolic extracts were analyzed by reversed-phase liquid chromatography (RPLC)-mass spectrometry (MS) in both positive and negative ionization modes. Thermo Q Exactive Plus hybrid quadrupole-Orbitrap and Q Exactive mass spectrometers (Xcalibur, Thermo Scientific, San Jose, CA, USA) were operated in full MS-scan mode for data acquisition (acquisition from m/z 500 to 2,000) with a scan rate of approximately 4 Hz and a resolution set at 30,000 (at m/z 400). The MS/MS spectra of the top 10 parent ions in the QC sample were acquired at two fragmentation energies (25 NCE and 50 NCE). The resulting mass spectra were exported into Progenesis QI software (Nonlinear Dynamics, Durham, NC, USA) for further processing. Metabolomics data processing: Metabolomic features, each defined by a unique mass-to-charge ratio and retention time, were extracted, aligned, and quantified with the Progenesis QI software (http://www.nonlinear.com/progenesis/qi/). Peak deconvolution was performed under default settings in Progenesis QI. Acquired data were processed using an analysis pipeline written in R (https://www.R-project.org). Progenesis QI output was first filtered by removing all metabolites that were quantified in fewer than 30% of the samples or had a median intensity of less than twofold signal over the noise threshold (S/N < 2). The noise threshold was estimated as the median signal across all the blank runs (if no quantitation was reported in any of the blank runs, the feature was also included in the analysis, as it likely had good S/N characteristics). The data were then log-transformed and normalized. For each run, the median of all features was centered to correct for variation in the sample amount.
Then, for each analyte, a linear correction was applied per batch to correct for any linear decrease or increase in abundance during the acquisition of a batch. In short, for each analyte and each batch, a linear model was fitted with the log-abundance of the analyte as the dependent variable and the acquisition number [run order (randomized)] as the independent variable. The model prediction was interpreted as an underlying drift in mass spectrometric sensitivity and subtracted from the analyte level to yield within-batch normalized abundances. Finally, for each analyte, the abundances were median centered by batch to correct for sensitivity differences between batches. The positive- and negative-mode features were then concatenated for downstream analysis. In total, 9,651 features were included in the final analysis. In addition, any sample with more than 50% of its values missing was removed (one sample in total). The remaining missing values were imputed from the nearest 10 neighbors using the k-Nearest Neighbor algorithm (Altman, 1992). Note that Discovery and Test Set 1 were normalized together, while samples of Test Set 2 were normalized independently. We applied principal component analysis (PCA) to examine the overall distribution of the sample data (with all 9,651 features) and check the run quality. The gestational ages (based on first-trimester ultrasound measurements) were superimposed to facilitate the analysis. The vast majority of the samples were separated into pre- and postpartum groups in the PCA space defined by the two components that explained the largest variation (PC1 and PC2, Figure 1B), while two samples from the same subject (the last two in her collection, before and after childbirth) displayed irregular behavior in PCA and unsupervised clustering analysis. These two samples were treated as outliers and excluded from further analysis.
We also performed partial least-squares discriminant analysis (PLSDA) according to the categories of gestational age (using the mixOmics package). |
Ion Mode: | NEGATIVE |
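The missing-value handling described in the MS comments (removing samples with more than 50% missing values, then imputing the rest from the 10 nearest neighbors) can be sketched as below. The record does not say whether neighbors are samples or features, nor which distance was used. This sketch assumes neighbors are samples and uses a root-mean-square distance over features observed in both samples. The function names are hypothetical.

```python
import numpy as np


def drop_sparse_samples(X, max_missing=0.5):
    """Remove rows (samples) whose fraction of NaN values exceeds max_missing."""
    keep = np.isnan(X).mean(axis=1) <= max_missing
    return X[keep], keep


def knn_impute(X, k=10):
    """Fill each NaN with the mean of that feature over the k nearest samples.

    Assumption: distance is the root-mean-square difference over features
    observed in both samples; only samples that observe the feature donate.
    """
    X = np.asarray(X, dtype=float)
    out = X.copy()
    missing = np.isnan(X)
    for i in np.flatnonzero(missing.any(axis=1)):
        shared = ~missing[i] & ~missing          # features observed in both rows
        diff = np.where(shared, X - X[i], 0.0)   # ignore unshared features
        n_shared = shared.sum(axis=1)
        with np.errstate(invalid="ignore", divide="ignore"):
            dist = np.sqrt((diff ** 2).sum(axis=1) / n_shared)
        dist[n_shared == 0] = np.inf             # nothing in common: not comparable
        dist[i] = np.inf                         # a sample is not its own neighbor
        ranked = np.argsort(dist)
        for j in np.flatnonzero(missing[i]):
            donors = [r for r in ranked
                      if not missing[r, j] and np.isfinite(dist[r])][:k]
            if donors:
                out[i, j] = X[donors, j].mean()
    return out
```

Observed values are left untouched; a missing value stays NaN only when no comparable sample observes that feature.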