Breast cancer prediction with survival analysis

working

Created by RAJALAKSHMI RAHUL on 28, Sep 23 & supervised by Priya Chetty

Breast cancer is a significant health concern worldwide. Its prognosis and survival rate are greatly dependent on timely detection and accurate prediction of the progression. Many prediction models have been developed which take into consideration genomics, racial disparities, and tumor characteristics. However most of them focus on short-term outcomes. Long-term follow-up studies that assess breast cancer recurrence, late-stage complications, and survival beyond the initial treatment phase are essential for providing a more comprehensive picture of patient outcomes.

This study first reviews critical research which has been conducted in the past on breast cancer prediction and identifies their shortcomings. It also identiies the distribution pattern and risk factors. Then it uses two existing breast cancer datasets with over 1000 observations each, containing important variables such as demographics, tumor size, omics data, mutation count, cancer type, duration of treatment, among others. Survival analysis is applied to identify independent predictors of breast cancer survival, considering factors such as tumor characteristics, treatment modalities, and patient demographics. Furthermore, machine learning algorithms are employed to enhance predictive accuracy. Python software is used.

Mindmap_breastcancer.pdf

Goal 1

Goal 1- To critically review existing research on breast cancer prediction using machine learning algorithms

Purpose: Healthcare datasets are different in nature than other statistical datasets. Understanding them is essential for creating a prediction model. In this goal we will identify key studies on breast cancer prediction models and study the properties, assumptions, methodologies and parameters of their datasets. We will look at them critically and systematically identify their shortcomings.

Method: Systematic and critical analysis of 50-60 existing studies on prediction modelling for breast cancer. Following elements will be reviewed:

Author
Aim
Study type
Dataset characteristics
Variables/ parameters
Data analysis method
Findings
Shortcomings

Requirement: Familiarity with healthcare datasets is a must. Must also possess knowledge of prediction modelling, empirical review, systematic review and literature review.

Milestones

To contribute and publish select a pending milestone.

Completed

Importance of analysing breast cancer data