Research Projects

  1. Using contact networks, administrative, and linked genomic data to understand tuberculosis transmission in BC

    Tuberculosis (TB) is still a problem in British Columbia, with approximately 250 cases diagnosed each year. In order to meet the WHO’s goal of achieving TB pre-elimination by 2030, TB rates in BC need to decline at a faster rate, and a change in how we manage TB prevention and care in the province is needed. Fortunately, all TB-related laboratory, epidemiology, clinical, and public health activities are centralized at the BC Centre for Disease Control (BCCDC).

  2. Leveraging eye-tracking data to improve reliable detection of Alzheimer’s Disease and related patient’s states

    Reliable detection of disease in the early stages of Alzheimer’s Disease (AD) continues to be a challenge. This project led by Drs. Conati and Field aims to investigate the value of eye-tracking data as one of the sources of information to build machine learning detectors of AD. In addition, the team will investigate eye-tracking based detectors of AD-related states such confusion and distress during naturally occurring tasks.

    Awarded to:
    Postdoc Fellows:
    Oswald Barral
  3. Knowledge Graphs – Mining, Cleaning and Maintenance

    Extraction of knowledge from information sources ranging from unstructured and semi-structured, to structured has gained significant interest both in academia and in the industry. This is fueled by applications such as question answering and computational fact checking. Knowledge graphs (KG) have lately emerged as a de facto standard for knowledge representation, whereby knowledge is expressed as a collection of “facts", represented in the form of (subject, predicate, object) triples where subject and object are entities and predicate is a relation between those entities.

    Awarded to:
    Postdoc Fellows:
  4. Computer vision and machine learning techniques for video and facial understanding

    In this project, Drs. Sigal and Schmidt are pursuing a number of research goals at the intersection of computer vision and machine learning. In part one, the team will advance automatic video summarization by exploring novel richer joint video-linguistic and graph-structured representations to facilitate video retrieval, summarization and--potentially--action recognition tasks.

    Awarded to:
    Postdoc Fellows:
  5. Leveraging more accurate and flexible discourse structures in question-answering and summarization

    Existing systems for critical NLP tasks like question-answering and summarization are still unable to accurately uncover and effectively leverage the discourse structure of text; i.e., how clauses and sentences are related to each other in a document. This is a serious limitation in that relationships between clauses and sentences carries important information, which allows the text to express a meaning as a whole, beyond the sum of its parts. The goal of discourse parsing is to automatically determine the coherence structure of text.

    Awarded to:
    Postdoc Fellows:
  6. Large-scale Bayesian modelling of drug resistance and evolution in human cancers at single-cell resolution

    Recent advances in next generation sequencing (NGS) technologies have led to the ability to measure gene expression and DNA mutations across thousands of cells in cancer tumors at the single-cell level. This allows us to quantify the effect of chemotherapeutic drugs on the way tumors mutate and answer questions about why particular groups of cells (known as clones) evade treatment and cause relapse. However, the vast quantities of data produced by such measurements combined with the low signal-to-noise ratio makes analysis and interpretation particularly difficult.

  7. Automated diagnosis and prognostication of severity in COPD via deep learning frameworks using multi-modal data

    Chronic Obstructive Pulmonary Disease (COPD) is a progressive, debilitating, chronic respiratory disease. It is currently the 4th leading cause of mortality and is responsible for 100,000 hospitalizations and 10,000 deaths annually in Canada, and 3 million deaths worldwide. Although our understanding of COPD pathogenesis has improved substantially over the past 20 years, there is a notable lack of treatments that can modify disease progression and reduce mortality. Furthermore, current methods to clinically diagnose COPD are non-specific and insufficient to advance knowledge.

    Awarded to:
    Postdoc Fellows:
    Lisa Tang
  8. User Modeling and Adaptive Support for MOOCSUser Modeling and Adaptive Support for MOOCS

    Massive open on-Line courses (MOOCS) have great potential to innovate education, but suffer from one key limitation typical of many on-line learning environments: lack of personalization. Intelligent Tutoring Systems (ITS) is a field that leverages Artificial Intelligence and Machine Learning to devise educational tools that can provide instruction tailored to the needs of individual learners, as good teachers do. In this project, Drs. Conati and Roll aim to apply some of the concepts and technique from ITS research to MOOCS.

    Awarded to:

    Cristina Conati, Ido Roll

    Postdoc Fellows:
    Sébastien Lallé
  9. Using text analysis for chronic disease management

    The diagnosis, management, and treatment of chronic diseases (e.g., diabetes, chronic obstructive pulmonary diseases, and heart failure) have traditionally been focused on longitudinal histories and physical examinations as primary tools of assessment, and augmented by laboratory testing and imaging. Equally important to history taking and physical examinations is the objective assessments and understanding of the contribution of the patients' states of mind to their disease states. This is historically only documented qualitatively but highly challenging to measure quantitatively.

    Awarded to:
    Postdoc Fellows:
    Hyeju Jang
  10. Application of deep learning approaches in modelling cheminformatics data and discovery of novel therapeutic agents for prostate cancer

    The recent explosion of chemical and biological information calls for fundamentally novel ways of dealing with big data in the life sciences. This problem can potentially be addressed by the latest technological breakthroughs on both software and hardware frontiers. In particular, the latest advances in artificial intelligence (AI) enable cognitive data processing at very large-scale by means of deep learning (DL).

  11. A platform for interactive, collaborative, and repeatable genomic analysis

    Computer systems – both hardware and software – currently represent an active barrier to the scientific investigation of genomic data. Answering even relatively simple questions requires assembling disparate software tools (for alignment, variant calling, and filtering) into an analytics pipeline, and then solving practical IT problems in order to get that pipeline to function stably and at scale. This project will employ a whole system approach for providing a framework for genomic analysis.

  12. From heuristics to guarantees: the mathematical foundations of algorithms for data science

    Many of the most successful approaches commonly used in data-science applications (e.g., machine learning) come with little or no guarantees. Notable examples include convolutional neural networks (CNNs) and data-fitting formulations based on non-convex loss functions. In both cases, the training procedures are based on optimizing over intractable problems.

    Awarded to:
    Postdoc Fellows:
    Halyun Jeong
  13. Modeling multiple types of "omics" data to understand the biology of human exposure to pollution and allergens

    Inhaled environmental and occupational exposures such as air pollution and allergens are known to have a profound effect on our respiratory and immunological health. This collaborative project seeks to better understand how the human body responds adversely to these perturbants by developing and applying new computational models for analyses of integrated molecular data sets, collectively known as 'omics profiling (e.g., genomics, proteomics, metabolomics, epigenomics, transcriptomics, and polymorphisms).

    Awarded to:

    Chris Carlsten, Sara Mostafavi

    Postdoc Fellows:
    Zahra Jalali
  14. Data science over graphs, streams, and sequences: From the analysis of fake news to prediction and intervention

    Fake news and misinformation have been a serious problem for a long time and the advent of social media has made it more acute, particularly in the context of the 2016 U.S. Presidential election. This illustrates how social networks and media have started playing a fundamental role in the lives of most people--they influence the way we receive, assimilate, and share information with others.

    Awarded to:
    Postdoc Fellows:
    Ezequiel Smucler
  15. Functional Capabilities of the Gut Microbiome in Immune Checkpoint Inhibitor-Associated Responses

    Immune checkpoint inhibitors, or ICIs, are a powerful new treatment option for a variety of tumors, but efficacy varies between patients and mild to life-threatening side effects can occur. The bacteria that reside in a patient's gut have been shown to impact ICI efficacy and to predict the development of colitis side-effects, but it is not clear specifically which bacteria impact ICI response or toxicity.

    Awarded to:
    Postdoc Fellows:
  16. Monitoring Breast Cancer: Bringing Single-cell and Liquid Biopsy Analysis to the Cloud

    Breast tumor genetics can change during disease progression, leading to distinct tumor cell populations that often contribute to therapeutic resistance. We propose to perform state-of-the-art, single-cell genomic sequencing on breast cancer biopsies and circulating tumor DNA (also known as liquid biopsies) to evaluate genetic changes during the course of a patient's therapy.

    Awarded to:
    Postdoc Fellows:
  17. Pathology AI for a Federated Quality Assurance Program: Ovarian Cancer Pilot

    As type-specific treatments are being developed for patients with epithelial ovarian cancer, it has become important to accurately diagnose the distinct cancer types. Our vision is to establish an international network for AI-based, privacy-protected pathology quality assurance. As proof of concept, we propose to develop and deploy a machine learning-based ovarian cancer histopathology classifier.

Musqueam First Nation land acknowledegement

We honour xwməθkwəy̓ əm (Musqueam) on whose ancestral, unceded territory UBC Vancouver is situated. UBC Science is committed to building meaningful relationships with Indigenous peoples so we can advance Reconciliation and ensure traditional ways of knowing enrich our teaching and research.

Learn more: Musqueam First Nation

Data Science Institute

EOS Main Building
6339 Stores Road, Room 113C
dsi.admin@science.ubc.ca

Faculty of Science

Office of the Dean, Earth Sciences Building
2178–2207 Main Mall
Vancouver, BC Canada
V6T 1Z4
UBC Crest The official logo of the University of British Columbia. Urgent Message An exclamation mark in a speech bubble. Arrow An arrow indicating direction. Arrow in Circle An arrow indicating direction. A bookmark An ribbon to indicate a special marker. Calendar A calendar. Caret An arrowhead indicating direction. Time A clock. Chats Two speech clouds. External link An arrow pointing up and to the right. Facebook The logo for the Facebook social media service. A Facemask The medical facemask. Information The letter 'i' in a circle. Instagram The logo for the Instagram social media service. Linkedin The logo for the LinkedIn social media service. Lock, closed A closed padlock. Lock, open An open padlock. Location Pin A map location pin. Mail An envelope. Mask A protective face mask. Menu Three horizontal lines indicating a menu. Minus A minus sign. Money A money bill. Telephone An antique telephone. Plus A plus symbol indicating more or the ability to add. RSS Curved lines indicating information transfer. Search A magnifying glass. Arrow indicating share action A directional arrow. Spotify The logo for the Spotify music streaming service. Twitter The logo for the Twitter social media service. Youtube The logo for the YouTube video sharing service.