Publications

2020

Martin-Trujillo, A., Patel, N., Richter, F., Jadhav, B., Garg, P., Morton, S. U., McKean, D. M., DePalma, S. R., Goldmuntz, E., Gruber, D., Kim, R., Newburger, J. W., Porter, G. A., Giardini, A., Bernstein, D., Tristani-Firouzi, M., Seidman, J. G., Seidman, C. E., Chung, W. K., … Sharp, A. J. (2020). Rare genetic variation at transcription factor binding sites modulates local DNA methylation profiles.. PLoS Genetics, 16(11), e1009189. https://doi.org/10.1371/journal.pgen.1009189 (Original work published 2020)

Although DNA methylation is the best characterized epigenetic mark, the mechanism by which it is targeted to specific regions in the genome remains unclear. Recent studies have revealed that local DNA methylation profiles might be dictated by cis-regulatory DNA sequences that mainly operate via DNA-binding factors. Consistent with this finding, we have recently shown that disruption of CTCF-binding sites by rare single nucleotide variants (SNVs) can underlie cis-linked DNA methylation changes in patients with congenital anomalies. These data raise the hypothesis that rare genetic variation at transcription factor binding sites (TFBSs) might contribute to local DNA methylation patterning. In this work, by combining blood genome-wide DNA methylation profiles, whole genome sequencing-derived SNVs from 247 unrelated individuals along with 133 predicted TFBS motifs derived from ENCODE ChIP-Seq data, we observed an association between the disruption of binding sites for multiple TFs by rare SNVs and extreme DNA methylation values at both local and, to a lesser extent, distant CpGs. While the majority of these changes affected only single CpGs, 24% were associated with multiple outlier CpGs within ±1kb of the disrupted TFBS. Interestingly, disruption of functionally constrained sites within TF motifs lead to larger DNA methylation changes at nearby CpG sites. Altogether, these findings suggest that rare SNVs at TFBS negatively influence TF-DNA binding, which can lead to an altered local DNA methylation profile. Furthermore, subsequent integration of DNA methylation and RNA-Seq profiles from cardiac tissues enabled us to observe an association between rare SNV-directed DNA methylation and outlier expression of nearby genes. In conclusion, our findings not only provide insights into the effect of rare genetic variation at TFBS on shaping local DNA methylation and its consequences on genome regulation, but also provide a rationale to incorporate DNA methylation data to interpret the functional role of rare variants.

Paulson, J. N., Williams, B. L., Hehnly, C., Mishra, N., Sinnar, S. A., Zhang, L., Ssentongo, P., Mbabazi-Kabachelor, E., Wijetunge, D. S. S., von Bredow, B., Mulondo, R., Kiwanuka, J., Bajunirwe, F., Bazira, J., Bebell, L. M., Burgoine, K., Couto-Rodriguez, M., Ericson, J. E., Erickson, T., … Schiff, S. J. (2020). Paenibacillus infection with frequent viral coinfection contributes to postinfectious hydrocephalus in Ugandan infants.. Science Translational Medicine, 12(563). https://doi.org/10.1126/scitranslmed.aba0565 (Original work published 2020)

Postinfectious hydrocephalus (PIH), which often follows neonatal sepsis, is the most common cause of pediatric hydrocephalus worldwide, yet the microbial pathogens underlying this disease remain to be elucidated. Characterization of the microbial agents causing PIH would enable a shift from surgical palliation of cerebrospinal fluid (CSF) accumulation to prevention of the disease. Here, we examined blood and CSF samples collected from 100 consecutive infant cases of PIH and control cases comprising infants with non-postinfectious hydrocephalus in Uganda. Genomic sequencing of samples was undertaken to test for bacterial, fungal, and parasitic DNA; DNA and RNA sequencing was used to identify viruses; and bacterial culture recovery was used to identify potential causative organisms. We found that infection with the bacterium Paenibacillus, together with frequent cytomegalovirus (CMV) coinfection, was associated with PIH in our infant cohort. Assembly of the genome of a facultative anaerobic bacterial isolate recovered from cultures of CSF samples from PIH cases identified a strain of Paenibacillus thiaminolyticus This strain, designated Mbale, was lethal when injected into mice in contrast to the benign reference Paenibacillus strain. These findings show that an unbiased pan-microbial approach enabled characterization of Paenibacillus in CSF samples from PIH cases, and point toward a pathway of more optimal treatment and prevention for PIH and other proximate neonatal infections.

Hsieh, A., Morton, S. U., Willcox, J. A. L., Gorham, J. M., Tai, A. C., Qi, H., DePalma, S., McKean, D., Griffin, E., Manheimer, K. B., Bernstein, D., Kim, R. W., Newburger, J. W., Porter, G. A., Srivastava, D., Tristani-Firouzi, M., Brueckner, M., Lifton, R. P., Goldmuntz, E., … Shen, Y. (2020). EM-mosaic detects mosaic point mutations that contribute to congenital heart disease.. Genome Medicine, 12(1), 42. https://doi.org/10.1186/s13073-020-00738-1 (Original work published 2020)

BACKGROUND: The contribution of somatic mosaicism, or genetic mutations arising after oocyte fertilization, to congenital heart disease (CHD) is not well understood. Further, the relationship between mosaicism in blood and cardiovascular tissue has not been determined.

METHODS: We developed a new computational method, EM-mosaic (Expectation-Maximization-based detection of mosaicism), to analyze mosaicism in exome sequences derived primarily from blood DNA of 2530 CHD proband-parent trios. To optimize this method, we measured mosaic detection power as a function of sequencing depth. In parallel, we analyzed our cohort using MosaicHunter, a Bayesian genotyping algorithm-based mosaic detection tool, and compared the two methods. The accuracy of these mosaic variant detection algorithms was assessed using an independent resequencing method. We then applied both methods to detect mosaicism in cardiac tissue-derived exome sequences of 66 participants for which matched blood and heart tissue was available.

RESULTS: EM-mosaic detected 326 mosaic mutations in blood and/or cardiac tissue DNA. Of the 309 detected in blood DNA, 85/97 (88%) tested were independently confirmed, while 7/17 (41%) candidates of 17 detected in cardiac tissue were confirmed. MosaicHunter detected an additional 64 mosaics, of which 23/46 (50%) among 58 candidates from blood and 4/6 (67%) of 6 candidates from cardiac tissue confirmed. Twenty-five mosaic variants altered CHD-risk genes, affecting 1% of our cohort. Of these 25, 22/22 candidates tested were confirmed. Variants predicted as damaging had higher variant allele fraction than benign variants, suggesting a role in CHD. The estimated true frequency of mosaic variants above 10% mosaicism was 0.14/person in blood and 0.21/person in cardiac tissue. Analysis of 66 individuals with matched cardiac tissue available revealed both tissue-specific and shared mosaicism, with shared mosaics generally having higher allele fraction.

CONCLUSIONS: We estimate that   1% of CHD probands have a mosaic variant detectable in blood that could contribute to cardiac malformations, particularly those damaging variants with relatively higher allele fraction. Although blood is a readily available DNA source, cardiac tissues analyzed contributed   5% of somatic mosaic variants identified, indicating the value of tissue mosaicism analyses.

Morton, S. U., Yuen, J. C., Feldman, H. A., Hashim, E., Rudie, C., Lindamood, K. E., Caughey, D., Moline, M., Sims, J. K., Sola-Visner, M. C., & Leeman, K. T. (2020). Screening With Reticulocyte Hemoglobin Increased Iron Sufficiency Among NICU Patients.. Pediatric Quality & Safety, 5(2), e258. https://doi.org/10.1097/pq9.0000000000000258 (Original work published 2020)

INTRODUCTION: To increase the rate of iron sufficiency among neonatal intensive care unit (NICU) patients from 16% to >35% within 12 months of implementing standardized assessment of reticulocyte hemoglobin (retHE).

METHODS: We implemented a quality improvement (QI) study to improve iron sufficiency in our out-born level III/IV NICU. We screened 2,062 admissions, of which 622 were eligible based on feeding status at discharge. QI interventions included educational efforts and guideline implementation. Our primary outcome measure was the percentage of patients with their discharge retHE measure within the normal range. We also tracked the process measure of the number of retHE tests performed and a balancing measure of the incidence of elevated retHE among patients receiving iron supplementation. Statistical process control (SPC) charts assessed for special cause variation.

RESULTS: The percentage of patients with a retHe within the normal range was significantly increased from a mean of 20% to 39% on SPC chart analysis. We measured significantly more retHE values after guideline implementation (11/mo to 24/mo) and found no cases of elevated retHE among patients receiving iron supplementation.

CONCLUSIONS: After the implementation of a standardized guideline, a higher rate of iron sufficiency was found in NICU patients at discharge. This work is generalizable to neonatal populations with the potential for a significant impact on clinical practice.

Sharma, A., Wasson, L. K., Willcox, J. A., Morton, S. U., Gorham, J. M., DeLaughter, D. M., Neyazi, M., Schmid, M., Agarwal, R., Jang, M. Y., Toepfer, C. N., Ward, T., Kim, Y., Pereira, A. C., DePalma, S. R., Tai, A., Kim, S., Conner, D., Bernstein, D., … Consortium, P. C. G. (2020). GATA6 mutations in hiPSCs inform mechanisms for maldevelopment of the heart, pancreas, and diaphragm.. ELife, 9. https://doi.org/10.7554/eLife.53278 (Original work published 2020)

Damaging GATA6 variants cause cardiac outflow tract defects, sometimes with pancreatic and diaphragmic malformations. To define molecular mechanisms for these diverse developmental defects, we studied transcriptional and epigenetic responses to GATA6 loss of function (LoF) and missense variants during cardiomyocyte differentiation of isogenic human induced pluripotent stem cells. We show that GATA6 is a pioneer factor in cardiac development, regulating SMYD1 that activates HAND2, and KDR that with HAND2 orchestrates outflow tract formation. LoF variants perturbed cardiac genes and also endoderm lineage genes that direct PDX1 expression and pancreatic development. Remarkably, an exon 4 GATA6 missense variant, highly associated with extra-cardiac malformations, caused ectopic pioneer activities, profoundly diminishing GATA4, FOXA1/2, and PDX1 expression and increasing normal retinoic acid signaling that promotes diaphragm development. These aberrant epigenetic and transcriptional signatures illuminate the molecular mechanisms for cardiovascular malformations, pancreas and diaphragm dysgenesis that arise in patients with distinct GATA6 variants.

Richter, F., Morton, S. U., Kim, S. W., Kitaygorodsky, A., Wasson, L. K., Chen, K. M., Zhou, J., Qi, H., Patel, N., DePalma, S. R., Parfenov, M., Homsy, J., Gorham, J. M., Manheimer, K. B., Velinder, M., Farrell, A., Marth, G., Schadt, E. E., Kaltman, J. R., … Gelb, B. D. (2020). Genomic analyses implicate noncoding de novo variants in congenital heart disease.. Nature Genetics, 52(8), 769-777. https://doi.org/10.1038/s41588-020-0652-z (Original work published 2020)

A genetic etiology is identified for one-third of patients with congenital heart disease (CHD), with 8% of cases attributable to coding de novo variants (DNVs). To assess the contribution of noncoding DNVs to CHD, we compared genome sequences from 749 CHD probands and their parents with those from 1,611 unaffected trios. Neural network prediction of noncoding DNV transcriptional impact identified a burden of DNVs in individuals with CHD (n = 2,238 DNVs) compared to controls (n = 4,177; P = 8.7 × 10-4). Independent analyses of enhancers showed an excess of DNVs in associated genes (27 genes versus 3.7 expected, P = 1 × 10-5). We observed significant overlap between these transcription-based approaches (odds ratio (OR) = 2.5, 95% confidence interval (CI) 1.1-5.0, P = 5.4 × 10-3). CHD DNVs altered transcription levels in 5 of 31 enhancers assayed. Finally, we observed a DNV burden in RNA-binding-protein regulatory sites (OR = 1.13, 95% CI 1.1-1.2, P = 8.8 × 10-5). Our findings demonstrate an enrichment of potentially disruptive regulatory noncoding DNVs in a fraction of CHD at least as high as that observed for damaging coding DNVs.

Morton, S. U., Agarwal, R., Madden, J. A., Genetti, C. A., Brownstein, C. A., López-Giráldez, F., Choi, J., Seidman, C. E., Seidman, J. G., Lyon, G. J., & Agrawal, P. B. (2020). Congenital Heart Defects Due to TAF1 Missense Variants.. Circulation. Genomic and Precision Medicine, 13(3), e002843. https://doi.org/10.1161/CIRCGEN.119.002843 (Original work published 2020)

2019

O’Connell, A. E., Gerashchenko, M. , V, O’Donohue, M.-F., Rosen, S. M., Huntzinger, E., Gleeson, D., Galli, A., Ryder, E., Cao, S., Murphy, Q., Kazerounian, S., Morton, S. U., Schmitz-Abe, K., Gladyshev, V. N., Gleizes, P.-E., Séraphin, B., & Agrawal, P. B. (2019). Mammalian Hbs1L deficiency causes congenital anomalies and developmental delay associated with Pelota depletion and 80S monosome accumulation.. PLoS Genetics, 15(2), e1007917. https://doi.org/10.1371/journal.pgen.1007917 (Original work published 2019)

Hbs1 has been established as a central component of the cell's translational quality control pathways in both yeast and prokaryotic models; however, the functional characteristics of its human ortholog (Hbs1L) have not been well-defined. We recently reported a novel human phenotype resulting from a mutation in the critical coding region of the HBS1L gene characterized by facial dysmorphism, severe growth restriction, axial hypotonia, global developmental delay and retinal pigmentary deposits. Here we further characterize downstream effects of the human HBS1L mutation. HBS1L has three transcripts in humans, and RT-PCR demonstrated reduced mRNA levels corresponding with transcripts V1 and V2 whereas V3 expression was unchanged. Western blot analyses revealed Hbs1L protein was absent in the patient cells. Additionally, polysome profiling revealed an abnormal aggregation of 80S monosomes in patient cells under baseline conditions. RNA and ribosomal sequencing demonstrated an increased translation efficiency of ribosomal RNA in Hbs1L-deficient fibroblasts, suggesting that there may be a compensatory increase in ribosome translation to accommodate the increased 80S monosome levels. This enhanced translation was accompanied by upregulation of mTOR and 4-EBP protein expression, suggesting an mTOR-dependent phenomenon. Furthermore, lack of Hbs1L caused depletion of Pelota protein in both patient cells and mouse tissues, while PELO mRNA levels were unaffected. Inhibition of proteasomal function partially restored Pelota expression in human Hbs1L-deficient cells. We also describe a mouse model harboring a knockdown mutation in the murine Hbs1l gene that shared several of the phenotypic elements observed in the Hbs1L-deficient human including facial dysmorphism, growth restriction and retinal deposits. The Hbs1lKO mice similarly demonstrate diminished Pelota levels that were rescued by proteasome inhibition.

Minoche, A. E., Horvat, C., Johnson, R., Gayevskiy, V., Morton, S. U., Drew, A. P., Woo, K., Statham, A. L., Lundie, B., Bagnall, R. D., Ingles, J., Semsarian, C., Seidman, J. G., Seidman, C. E., Dinger, M. E., Cowley, M. J., & Fatkin, D. (2019). Genome sequencing as a first-line genetic test in familial dilated cardiomyopathy.. Genetics in Medicine : Official Journal of the American College of Medical Genetics, 21(3), 650-662. https://doi.org/10.1038/s41436-018-0084-7 (Original work published 2019)

PURPOSE: We evaluated genome sequencing (GS) as an alternative to multigene panel sequencing (PS) for genetic testing in dilated cardiomyopathy (DCM).

METHODS: Forty-two patients with familial DCM underwent PS and GS, and detection rates of rare single-nucleotide variants and small insertions/deletions in panel genes were compared. Loss-of-function variants in 406 cardiac-enriched genes were evaluated, and an assessment of structural variation was performed.

RESULTS: GS provided broader and more uniform coverage than PS, with high concordance for rare variant detection in panel genes. GS identified all PS-identified pathogenic or likely pathogenic variants as well as two additional likely pathogenic variants: one was missed by PS due to low coverage, the other was a known disease-causing variant in a gene not included on the panel. No loss-of-function variants in the extended gene set met clinical criteria for pathogenicity. One BAG3 structural variant was classified as pathogenic.

CONCLUSION: Our data support the use of GS for genetic testing in DCM, with high variant detection accuracy and a capacity to identify structural variants. GS provides an opportunity to go beyond suites of established disease genes, but the incremental yield of clinically actionable variants is limited by a paucity of genetic and functional evidence for DCM association.

Richter, F., Hoffman, G. E., Manheimer, K. B., Patel, N., Sharp, A. J., McKean, D., Morton, S. U., DePalma, S., Gorham, J., Kitaygorodksy, A., Porter, G. A., Giardini, A., Shen, Y., Chung, W. K., Seidman, J. G., Seidman, C. E., Schadt, E. E., & Gelb, B. D. (2019). ORE identifies extreme expression effects enriched for rare variants.. Bioinformatics (Oxford, England), 35(20), 3906-3912. https://doi.org/10.1093/bioinformatics/btz202 (Original work published 2019)

MOTIVATION: Non-coding rare variants (RVs) may contribute to Mendelian disorders but have been challenging to study due to small sample sizes, genetic heterogeneity and uncertainty about relevant non-coding features. Previous studies identified RVs associated with expression outliers, but varying outlier definitions were employed and no comprehensive open-source software was developed.

RESULTS: We developed Outlier-RV Enrichment (ORE) to identify biologically-meaningful non-coding RVs. We implemented ORE combining whole-genome sequencing and cardiac RNAseq from congenital heart defect patients from the Pediatric Cardiac Genomics Consortium and deceased adults from Genotype-Tissue Expression. Use of rank-based outliers maximized sensitivity while a most extreme outlier approach maximized specificity. Rarer variants had stronger associations, suggesting they are under negative selective pressure and providing a basis for investigating their contribution to Mendelian disorders.

AVAILABILITY AND IMPLEMENTATION: ORE, source code, and documentation are available at https://pypi.python.org/pypi/ore under the MIT license.

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.