A Feature-Based Approach to Big Data Analysis of Medical Images

Inf Process Med Imaging. 2015:24:339-50. doi: 10.1007/978-3-319-19992-4_26.

Abstract

This paper proposes an inference method well-suited to large sets of medical images. The method is based upon a framework where distinctive 3D scale-invariant features are indexed efficiently to identify approximate nearest-neighbor (NN) feature matches-in O (log N) computational complexity in the number of images N. It thus scales well to large data sets, in contrast to methods based on pair-wise image registration or feature matching requiring O(N) complexity. Our theoretical contribution is a density estimator based on a generative model that generalizes kernel density estimation and K-nearest neighbor (KNN) methods.. The estimator can be used for on-the-fly queries, without requiring explicit parametric models or an off-line training phase. The method is validated on a large multi-site data set of 95,000,000 features extracted from 19,000 lung CT scans. Subject-level classification identifies all images of the same subjects across the entire data set despite deformation due to breathing state, including unintentional duplicate scans. State-of-the-art performance is achieved in predicting chronic pulmonary obstructive disorder (COPD) severity across the 5-category GOLD clinical rating, with an accuracy of 89% if both exact and one-off predictions are considered correct.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Data Interpretation, Statistical*
  • Databases, Factual
  • Humans
  • Imaging, Three-Dimensional / methods*
  • Information Storage and Retrieval / methods*
  • Lung / diagnostic imaging
  • Pattern Recognition, Automated / methods*
  • Pulmonary Disease, Chronic Obstructive / diagnostic imaging*
  • Radiographic Image Enhancement / methods
  • Radiographic Image Interpretation, Computer-Assisted / methods*
  • Reproducibility of Results
  • Sensitivity and Specificity
  • Tomography, X-Ray Computed / methods*