Skip to Main Content U.S. Department of Energy
Data Intensive Computing

Data Intensive Computing

High Performance Data Analysis Tools for Intelligent Sensor Pipelines

Principal Investigator: Navdeep Jaitly

Challenge

Large-scale biological experiments using mass spectrometry incur a repeated acquisition and analysis of patterns in redundant compounds. As a result, a large number of replicates is required to capture in-depth information. An intelligent control system to guide the mass spectrometers could reduce the number of required experiments.

Approach

  • Develop a high performance pipeline that manages 1) data as they are captured by the mass spectrometers, and 2) the processing algorithms that analyze this data on-the-fly.
  • Use the MeDICI framework to tie together the different processing components control the instruments.

Impact

A significant reduction in the number of required experiments by percentage is expected. This approach can potentially be applied to other situations requiring mass input from a

Collaborations

  • PNNL's Biological Sciences Division

Accomplishments

  • Feature discovery algorithm implemented
  • Alignment algorithm prototyped
  • Prototype pipeline in development
  • Plans for software development and testing on actual biological research system
see caption
Pipeline

Data Intensive Computing

Research Areas

Demonstrations

Highlights

Medici Technology to be Highlighted in Special Issue of Scientific Computing

USCD Director Describes How Global Platform "OptIPuter" Opens New Frontiers

Research Projects

Projects Overview