Recent advances in DNA sequencing technology are dramatically changing the scale and scope of modern molecular biology. Next generation sequencing instruments can sequence the equivalent of the human genome in a few days and at low cost, compared to the years of effort and billions of dollars spent to sequence the first human genome. This dramatic increase in efficiency has spurred tremendous growth in applications for DNA sequencing. For example, whereas the human genome project sought to sequence the genome of a small group of individuals, the 1000 genomes projects aims to catalog the genomes of more than 1000 individuals from all over the globe.

Our research focuses on the development of scalable algorithms and systems to analyze DNA sequences, concentrating on the assembly and alignment of next generation sequencing reads, and related downstream analyses. These systems have been used to reconstruct the genomes of previously unsequenced organisms, probe sequence variations, and to explore a host of biological features across the tree of life. Moving forward, one of the main challenges facing computational biologists is the creation of analysis systems whose efficiency can match the dramatic improvements in sequencing throughput. As such, we are particularly interested in capitalizing on the latest advances in distributed and parallel computing to advance the state of the art in bioinformatics and genomics.

Recent News
» Accepted Preprint on the Algorithmic Complexity of the Biomolecular Sequence Assembly Problem
April 7, 2014
» Preprint on SplitMEM: Graphical pan-genome analysis with suffix skips
April 6, 2014
» Preprint on New whole genome de novo assemblies of three divergent strains of rice (O. sativa) documents novel gene space of aus and indica
March 31, 2014
» Genome Sequencing and Assembly @ Genome Access Course
March 31, 2014
» The next 10 years of quantitative biology
Keystone Symposia on Big Data in Biology. March 25, 2013
(past news)

Upcoming Events

» IEEE Fellows Night Dinner [Flyer]
Syracuse, NY. April 8, 2014
» Biology of Genomes
CSHL, Cold Spring Harbor, NY. May 6 - 10, 2014
» CSHL Symposium: Cognition
CSHL, Cold Spring Harbor, NY. May 28 - June 2, 2014
» Big Data: How Biological Data Science can improve our health, foods and energy
CSHL, Cold Spring Harbor, NY. June 18, 2014
» Genome Informatics
Churchill College, Cambridge, UK. Sept 21 - 24, 2014
» Beyond the Genome
Boston, MA. Oct 8 - 10, 2014
» Biological Data Science
CSHL, Cold Spring Harbor, NY. Nov 5 - 8, 2014
» CSHL Advanced Sequencing Technology & Applications
CSHL, Cold Spring Harbor, NY. Nov 11 - 23, 2014
» Personal Genomes & Pharmacogenomics
CSHL, Cold Spring Harbor NY. Nov 12 - 15, 2013
(presentation archive)

Michael Schatz
Cold Spring Harbor Laboratory
One Bungtown Road
Koch Building 1119
Cold Spring Harbor, NY 11724

Tel: (516) 367-5218
Cel: (703) 966-1987
Fax: (516) 367-8380
E-mail: mschatz <at> cshl.edu
Twitter: @mike_schatz