Recent advances in DNA sequencing technology are dramatically changing the scale and scope of modern molecular biology. Next-generation sequencing instruments can sequence the equivalent of the human genome in a few days and at low cost, compared to the years of effort and billions of dollars spent to sequence the first human genome. This dramatic increase in efficiency has spurred tremendous growth in applications for DNA sequencing. For example, whereas the Human Genome Project sought to sequence the genomes of a small group of individuals, the 1000 Genomes Project aims to catalog the genomes of more than 1000 individuals from all over the globe.

Our research is at the intersection of biology, biotechnology, and computer technology, developing novel software algorithms and systems for understanding the mechanisms of human diseases and plant biology. Our lab has pioneered the use of cloud-computing technologies, including Hadoop and Amazon EC2, as a platform for big data challenges in genomics, and we are at the leading edge of next-generation sequencing technologies, including those from Illumina, PacBio, and most recently Oxford Nanopore. Our expertise spans from low-level computer architecture, through sequencing, de novo assembly, variant identification, and transcriptome and other -omics data, up to machine learning approaches for building predictive models of diseases and treatment response.

Recent News
» High-coverage sequencing and annotated assemblies of the budgerigar genome
July 8, 2014
» Preprint on Error correction and assembly complexity of single molecule sequencing reads.
June 18, 2014
» Preprint on Reducing INDEL errors in whole-genome and exome sequencing.
June 18, 2014
» Schatz receives prestigious NSF CAREER award to develop Algorithms for single molecule sequencing analysis
June 7, 2014
» Accepted Preprint on the Algorithmic Complexity of the Biomolecular Sequence Assembly Problem
April 7, 2014
(past news)

Upcoming Events

» Big Data: How Biological Data Science can improve our health, foods and energy
CSHL, Cold Spring Harbor, NY. June 18, 2014
» HiTSeq / ISMB
Boston, MA. July 11-12, 2014
» Genome Informatics
Churchill College, Cambridge, UK. Sept 21 - 24, 2014
» Beyond the Genome
Boston, MA. Oct 8 - 10, 2014
» RECOMB Comparative Genomics
CSHL, Cold Spring Harbor NY. Oct 19 - 22, 2014
» Biological Data Science
CSHL, Cold Spring Harbor, NY. Nov 5 - 8, 2014
» CSHL Advanced Sequencing Technology & Applications
CSHL, Cold Spring Harbor, NY. Nov 11 - 23, 2014
» Personal Genomes & Pharmacogenomics
CSHL, Cold Spring Harbor NY. Nov 12 - 15, 2013
(presentation archive)

Michael Schatz
Cold Spring Harbor Laboratory
One Bungtown Road
Koch Building 1119
Cold Spring Harbor, NY 11724

Tel: (516) 367-5218
Cell: (703) 966-1987
Fax: (516) 367-8380
E-mail: mschatz <at> cshl.edu
Twitter: @mike_schatz