Cord blood is widely used as surrogate tissue in epigenome-wide association studies of prenatal conditions. Cell type composition variation across samples can be an important confounder of epigenome-wide association studies in blood that constitute a mixture of cells. We evaluated a newly developed cord blood reference panel to impute cell type composition from DNA methylation levels, including nucleated red blood cells (nRBCs). We estimated cell type composition from 154 unique cord blood samples with available DNA methylation data as well as direct measurements of nucleated cell types. We observed high correlations between the estimated and measured composition for nRBCs (r = 0.92, R(2) = 0.85), lymphocytes (r = 0.77, R(2) = 0.58), and granulocytes (r = 0.72, R(2) = 0.52), and a moderate correlation for monocytes (r = 0.51, R(2) = 0.25) as well as relatively low root mean square errors from the residuals ranging from 1.4 to 5.4%. These results validate the use of the cord blood reference panel and highlight its utility and limitations for epidemiological studies.
Validation of a DNA Methylation Reference Panel for the Estimation of Nucleated Cells Types in Cord Blood.