Leveraging more accurate and flexible discourse structures in question-answering and summarization

Existing systems for critical NLP tasks like question-answering and summarization are still unable to accurately uncover and effectively leverage the discourse structure of text; i.e., how clauses and sentences are related to each other in a document. This is a serious limitation in that relationships between clauses and sentences carries important information, which allows the text to express a meaning as a whole, beyond the sum of its parts. The goal of discourse parsing is to automatically determine the coherence structure of text.

Awarded to:
Postdoc Fellows:

Large-scale Bayesian modelling of drug resistance and evolution in human cancers at single-cell resolution

Recent advances in next generation sequencing (NGS) technologies have led to the ability to measure gene expression and DNA mutations across thousands of cells in cancer tumors at the single-cell level. This allows us to quantify the effect of chemotherapeutic drugs on the way tumors mutate and answer questions about why particular groups of cells (known as clones) evade treatment and cause relapse. However, the vast quantities of data produced by such measurements combined with the low signal-to-noise ratio makes analysis and interpretation particularly difficult.

Awarded to:
Postdoc Fellows:

Automated diagnosis and prognostication of severity in COPD via deep learning frameworks using multi-modal data

Chronic Obstructive Pulmonary Disease (COPD) is a progressive, debilitating, chronic respiratory disease. It is currently the 4th leading cause of mortality and is responsible for 100,000 hospitalizations and 10,000 deaths annually in Canada, and 3 million deaths worldwide. Although our understanding of COPD pathogenesis has improved substantially over the past 20 years, there is a notable lack of treatments that can modify disease progression and reduce mortality. Furthermore, current methods to clinically diagnose COPD are non-specific and insufficient to advance knowledge.

Awarded to:
Postdoc Fellows:

User Modeling and Adaptive Support for MOOCSUser Modeling and Adaptive Support for MOOCS

Massive open on-Line courses (MOOCS) have great potential to innovate education, but suffer from one key limitation typical of many on-line learning environments: lack of personalization. Intelligent Tutoring Systems (ITS) is a field that leverages Artificial Intelligence and Machine Learning to devise educational tools that can provide instruction tailored to the needs of individual learners, as good teachers do. In this project, Drs. Conati and Roll aim to apply some of the concepts and technique from ITS research to MOOCS.

Awarded to:
Postdoc Fellows:

Using text analysis for chronic disease management

The diagnosis, management, and treatment of chronic diseases (e.g., diabetes, chronic obstructive pulmonary diseases, and heart failure) have traditionally been focused on longitudinal histories and physical examinations as primary tools of assessment, and augmented by laboratory testing and imaging. Equally important to history taking and physical examinations is the objective assessments and understanding of the contribution of the patients' states of mind to their disease states. This is historically only documented qualitatively but highly challenging to measure quantitatively.

Awarded to:
Postdoc Fellows:

Application of deep learning approaches in modelling cheminformatics data and discovery of novel therapeutic agents for prostate cancer

The recent explosion of chemical and biological information calls for fundamentally novel ways of dealing with big data in the life sciences. This problem can potentially be addressed by the latest technological breakthroughs on both software and hardware frontiers. In particular, the latest advances in artificial intelligence (AI) enable cognitive data processing at very large-scale by means of deep learning (DL).

Awarded to:
Postdoc Fellows:

A platform for interactive, collaborative, and repeatable genomic analysis

Computer systems – both hardware and software – currently represent an active barrier to the scientific investigation of genomic data. Answering even relatively simple questions requires assembling disparate software tools (for alignment, variant calling, and filtering) into an analytics pipeline, and then solving practical IT problems in order to get that pipeline to function stably and at scale. This project will employ a whole system approach for providing a framework for genomic analysis.

Awarded to:
Postdoc Fellows:

From heuristics to guarantees: the mathematical foundations of algorithms for data science

Many of the most successful approaches commonly used in data-science applications (e.g., machine learning) come with little or no guarantees. Notable examples include convolutional neural networks (CNNs) and data-fitting formulations based on non-convex loss functions. In both cases, the training procedures are based on optimizing over intractable problems.

Awarded to:
Postdoc Fellows:

Modeling multiple types of "omics" data to understand the biology of human exposure to pollution and allergens

Inhaled environmental and occupational exposures such as air pollution and allergens are known to have a profound effect on our respiratory and immunological health. This collaborative project seeks to better understand how the human body responds adversely to these perturbants by developing and applying new computational models for analyses of integrated molecular data sets, collectively known as 'omics profiling (e.g., genomics, proteomics, metabolomics, epigenomics, transcriptomics, and polymorphisms).

Awarded to:
Postdoc Fellows:

Data science over graphs, streams, and sequences: From the analysis of fake news to prediction and intervention

Fake news and misinformation have been a serious problem for a long time and the advent of social media has made it more acute, particularly in the context of the 2016 U.S. Presidential election. This illustrates how social networks and media have started playing a fundamental role in the lives of most people--they influence the way we receive, assimilate, and share information with others.

Awarded to:
Postdoc Fellows:

Musqueam First Nation land acknowledegement

We honour xwməθkwəy̓ əm (Musqueam) on whose ancestral, unceded territory UBC Vancouver is situated. UBC Science is committed to building meaningful relationships with Indigenous peoples so we can advance Reconciliation and ensure traditional ways of knowing enrich our teaching and research.

Learn more: Musqueam First Nation

Data Science Institute

EOS Main Building
6339 Stores Road, Room 113C
dsi.admin@science.ubc.ca

Faculty of Science

Office of the Dean, Earth Sciences Building
2178–2207 Main Mall
Vancouver, BC Canada
V6T 1Z4
UBC Crest The official logo of the University of British Columbia. Urgent Message An exclamation mark in a speech bubble. Arrow An arrow indicating direction. Arrow in Circle An arrow indicating direction. A bookmark An ribbon to indicate a special marker. Calendar A calendar. Caret An arrowhead indicating direction. Time A clock. Chats Two speech clouds. External link An arrow pointing up and to the right. Facebook The logo for the Facebook social media service. A Facemask The medical facemask. Information The letter 'i' in a circle. Instagram The logo for the Instagram social media service. Linkedin The logo for the LinkedIn social media service. Lock, closed A closed padlock. Lock, open An open padlock. Location Pin A map location pin. Mail An envelope. Mask A protective face mask. Menu Three horizontal lines indicating a menu. Minus A minus sign. Money A money bill. Telephone An antique telephone. Plus A plus symbol indicating more or the ability to add. RSS Curved lines indicating information transfer. Search A magnifying glass. Arrow indicating share action A directional arrow. Spotify The logo for the Spotify music streaming service. Twitter The logo for the Twitter social media service. Youtube The logo for the YouTube video sharing service.