From Genetics to Epigenetics to CRISPR Gene Editing with Machine Learning

Jennifer Listgarten, PhD
Senior Researcher, Microsoft Research New England, Cambridge, MA
Thursday, March 2, 2017 - 11:00am
CCBR Red Room
Special Seminar
Abstract: 
Molecular biology, healthcare and medicine have been slowly morphing into large-scale, data driven sciences dependent on machine learning and applied statistics. In this talk I will start by explaining some of the modelling challenges in finding the genetic underpinnings of disease, which is important for screening, treatment, drug development, and basic biological insight. Genome and epigenome-wide associations, wherein individual or sets of (epi)genetic markers are systematically scanned for association with disease are one window into disease processes. Naively, these associations can be found by use of a simple statistical test. However, a wide variety of structure and confounders lie hidden in the data, such as cell type heterogeneity and population structure, leading to both spurious and missed associations if not properly addressed. Much of this talk will focus on how to model these types of data. Once we uncover genetic causes, genome editing—which is about deleting or changing parts of the genetic code—will one day let us fix the genome in a bespoke manner. Editing will also help us to understand mechanisms of disease, enable precision medicine and drug development, to name just a few more important applications. I will close by discussing how we developed state-of-the-art machine learning approaches, now used worldwide, to enable more effective CRISPR guide design for gene editing.
Host: 
Quaid Morris, PhD, Professor, The Donnelly Centre, Department of Molecular Genetics, Department of Computer Science, ECE