Dr. Jeffrey A. Bilmes
The University of Washington
Thursday, November 1, 2018 - 11:00am
Bahen Centre Rm. 1180
Invited Speaker Seminar
Abstract:
Machine learning involves the extraction and aggregation of information from data. The ability to
extract useful information from increasingly larger datasets, however, is becoming decreasingly
cost-effective. This is because data is getting bigger at a rate that computational improvements are
becoming more expensive to continue to match. A common strategy to overcome such difficulties is
either to discard data or to randomly subsample, but this is not sustainable if machine learning is to
continue to improve by exploiting all useful information in available data. In this talk, we will discuss
how to be more efficient in representing information in data through the process of summarization. In
particular, we will see how submodular and supermodular functions can model information in data,
and how these can be used to produce theoretically justified but still practical algorithms for various
forms of data summarization. This will include approaches that summarize data before training takes
place, and also some new tactics that learn and summarize simultaneously.
Host:
Dr. Michael Hoffman
Distinguished Lecture Series
Poster: