Consistency Analysis and Uncertainty in ‘omic Data

Summary

As ‘omics moves into industrial and commercial applications, there is a need for quality and process control to ensure safety and efficacy of products and services, especially when machine learning may be involved. This underscores the need for suitable reference materials certified for their ‘omic profiles. We use informatic techniques to compare spectral data and determine their consensus distribution and uncertainty.

Description

Illustration showing the consensus of data points indicated in red, inside shaded concentric circles, and other data points outside of the circles indicated in blue. — Characterization of omic profile data is essential to quality and process control.

Credit: NIST

‘Omics is increasingly moving out of the laboratory and towards use in industrial and commercial applications. For instance, in biomanufacturing, there is a need for quality control when developing biotherapeutics, which will almost certainly require a machine-learning classifier to separate acceptable batches of a drug from unacceptable. Likewise, if metabolomic data are to be used for clinical purposes such as precision medicine, machine-learning classifiers are needed to accomplish these ends.

With new applications comes a need for reference materials relevant to these applications, and therefore the need to determine what values to certify and how to measure their uncertainty. This means RMs need to be certified for spectral data, and the NIST processes for estimating uncertainty need to be updated to enable comparisons between different spectra. In this project, we develop software tools to automate the process of comparing spectral data. One such tool, interlab_py (pages.nist.gov/interlab_py), compares data such as NMR spectra from interlaboratory studies, calculates a consensus distribution of the spectra, and identifies those spectra that fall outside the consensus.

Related Publications

Principal Component Analysis for Automated Classification of 2D Spectra and Interferograms of Protein Therapeutics: Influence of Noise, Reconstruction Details, and Data Preparation

Bioscience, Biomanufacturing, Glycomics, Metabolomics, Proteomics, Chemistry, Analytical chemistry, Health, Biopharmaceuticals, Precision medicine, Information technology, Artificial intelligence, Data and informatics, Manufacturing quality assurance, Mathematics and statistics, Uncertainty quantification, Standards and Reference materials

Consistency Analysis and Uncertainty in ‘omic Data

Share

Summary

Description

Related Publications