Associate Professor of Public Health Sciences, Biomedical Engineering, Biochemistry and Molecular Genetics, and Data Science
Bio
B.S. Bioinformatics, Brigham Young University, 2008Ph.D. Computational Biology and Bioinformatics, Duke University, 2013Post-Doc Computational Epigenetics, CeMM, Vienna and Stanford University, 2016
The Sheffield Lab uses computation to ask and answer biological questions. We study the non-coding DNA that encodes gene regulatory networks and enables cellular differentiation, and how these networks break down in disease like cancer. To address these biological questions, we develop scientific computing infrastructure that enables large-scale computational biology research using cloud computing, software engineering, and maching learning.
Dr. Sheffield holds B.S. and Ph.D degrees in Bioinformatics and Computational Biology. He completed his PhD in 2013, working on large-scale analysis of chromatin accessibility in primates and among human cell-types. After his PhD, Dr. Sheffield was an HFSP Postdoctoral Fellow in Christoph Bock’s Lab at the Center for Molecular Medicine, Vienna, with the return phase of his fellowship in Howard Chang’s Lab at Stanford. Outside work, Dr. Sheffield enjoys spending time with his family, teaching his children, reading good books, writing, taking photos, and having fun. He is also the author of this popular Scientific Writing web resource.
Research Interests
Computational Biology and Bioinformatics
Biomedical Data Sciences
Machine Learning
Epigenomics
Gene Regulation and Chromatin
High-performance Computing
Selected Publications
Embeddings of genomic region sets capture rich biological associations in low dimensions. Bioinformatics (2021) ABSGharavi E, Gu A, Zheng G, Smith JP, Zhang A, Brown DE, and Sheffield NC
COCOA: coordinate covariation analysis of epigenetic heterogeneity. Genome Biology (2020) ABSLawson JT, Smith JP, Bekiranov S, Garrett-Bakelman FE, and Sheffield NC
Refgenie: a reference genome resource manager. GigaScience (2020) ABSStolarczyk M, Reuter VP, Smith JP, Magee NE, and Sheffield NC
The chromatin accessibility landscape of primary human cancers. Science (2018) ABSCorces MR, Granja JM, Shams S, Louie BH, Seoane JA, Zhou W, Silva TC, Groeneveld C, Wong CK, Cho SW, Satpathy AT, Mumbach MR, Hoadley KA, Robertson AG, Sheffield NC, Felau I, Castro MAA, Berman BP, Staudt LM, Zenklusen JC, Laird PW, Curtis C, Greenlea
DNA methylation heterogeneity defines a disease spectrum in Ewing sarcoma. Nature Medicine (2017) ABSSheffield NC, Pierron G, Klughammer J, Datlinger P, Schönegger A, Schuster M, Hadler J, Surdez D et al.
LOLA: enrichment analysis for genomic region sets and regulatory elements in R and Bioconductor. Bioinformatics (2016) ABSSheffield NC, and Bock C.
Multi-Omics of Single Cells: Strategies and Applications. Trends in Biotechnology (2016) ABSBock C, Farlik M, and Sheffield NC.
Single-cell DNA methylome sequencing and bioinformatic inference of epigenomic cell-state dynamics. Cell Reports (2015) ABSFarlik M, Sheffield NC, Nuzzo A, Datlinger P, Schönegger A, Klughammer J, and Bock C.
I am interested in understanding how cancers commandeer the normal regulatory machinery to create disease. As a model system, I use Ewing sarcoma, a pediatric tumor, which is a good model because it is almost always driven by a single, well-characterized mutagenic event: a chromosomal translocation leading to the fusion protein EWS-FLI1. To explore how this fusion protein re-wires the cells to proliferate uncontrollably, I am examining genome-wide epigenetic profiles of Ewing sarcoma. These types of questions lead to computational problems inherent in dealing with lots of data from different individuals, cancers, and data types.
Single-cell sequencing analysis
In the past, we have only been able to sequence populations of cells, leaving important cell-to-cell differences unexplored. New microfluidics and sequencing technology is making it possible to ask questions about single cells. Using this technology, I am interested in fundamental questions about how cells differentiate and respond to their environments at the single cell level.
Gene regulation and chromatin structure
I am interested in how cells fold their DNA to enable complex regulatory patterns. Humans are made up of many different cell-types. Though these cell-types share a single genome, they have very different phenotypes and functions, working together to enable multicellular life. The basis for these dynamics is regulatory DNA, which governs when and where different genes are expressed. I analyze data from high-throughput ChIP-seq, DNase-seq, and ATAC-seq experiments to understand how cells do this during development.