Recent advances in DNA sequencing technology are dramatically changing the scale and scope of modern molecular biology. Next-generation sequencing instruments can sequence the equivalent of a human genome in a few days and at low cost, compared to the years of effort and billions of dollars spent to sequence the first human genome. This dramatic increase in efficiency has spurred tremendous growth in applications for DNA sequencing. For example, whereas the Human Genome Project sought to sequence the genomes of a small group of individuals, the 1000 Genomes Project aims to catalog the genomes of more than 1000 individuals from all over the globe.

Our research is at the intersection of biology, biotechnology, and computer technology, developing novel software algorithms and systems for understanding the mechanisms of human diseases and plant biology. Our lab has pioneered the use of cloud-computing technologies, including Hadoop and Amazon EC2, as a platform for big data challenges in genomics, and we are at the leading edge of next-generation sequencing technologies, including those from Illumina, PacBio, and most recently Oxford Nanopore. Our expertise spans from low-level computer architecture, through sequencing, de novo assembly, variant identification, and transcriptome and other -omics data, up to machine learning approaches for building predictive models of diseases and treatment response.

