HABERMAN'S SURVIVAL DATA

(EXPLORATORY DATA ANALYSIS)

Info about the dataset, from GitHub:

Title: Haberman's Survival Data

Sources:
(a) Donor: Tjen-Sien Lim (limt@stat.wisc.edu)
(b) Date: March 4, 1999

Past Usage:

Relevant Information: The dataset contains cases from a study that was conducted between 1958 and 1970 at the University of Chicago's Billings Hospital on the survival of patients who had undergone surgery for breast cancer.

Number of Instances: 306

Number of Attributes: 4 (including the class attribute)

Attribute Information:

1. Age of patient at time of operation (numerical)
2. Patient's year of operation (year - 1900, numerical)
3. Number of positive axillary nodes detected (numerical)
4. Survival status (class attribute)
   1 = the patient survived 5 years or longer
   2 = the patient died within 5 years

Missing Attribute Values: None

import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
import pandas as pd
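# D-Tale is a third-party package; install it first with: pip3 install dtale
import dtale
# The file has no header row, so header=None keeps the first record as data
df = pd.read_csv('haberman.csv', header=None)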
df.head(10)
0 1 2 3
0 30 64 1 1
1 30 62 3 1
2 30 65 0 1
3 31 59 2 1
4 31 65 4 1
5 33 58 10 1
6 33 60 0 1
7 34 59 0 2
8 34 66 9 2
9 34 58 30 1
# The dataset file provides no column names, so assign them here
# (Aux_Nodes holds the number of positive axillary nodes)
df.columns = ['Age', 'Year', 'Aux_Nodes', 'Survival_Stat']
df.head(10)
Age Year Aux_Nodes Survival_Stat
0 30 64 1 1
1 30 62 3 1
2 30 65 0 1
3 31 59 2 1
4 31 65 4 1
5 33 58 10 1
6 33 60 0 1
7 34 59 0 2
8 34 66 9 2
9 34 58 30 1
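As a small aside, the column names can also be passed to read_csv through its names parameter, and the result checked against the dataset description above. The sketch below is only illustrative: it embeds the ten rows shown above as a string (SAMPLE_CSV, an assumption made so the example runs without haberman.csv, and assuming the file is comma-separated), and the names STATUS_LABELS and Survival_Label are hypothetical helpers that are not part of the original notebook.

```python
import io

import pandas as pd

# The ten rows printed above, embedded so the sketch runs without haberman.csv
# (assumption: the real file is comma-separated with no header row)
SAMPLE_CSV = """30,64,1,1
30,62,3,1
30,65,0,1
31,59,2,1
31,65,4,1
33,58,10,1
33,60,0,1
34,59,0,2
34,66,9,2
34,58,30,1
"""

COLUMNS = ['Age', 'Year', 'Aux_Nodes', 'Survival_Stat']

# Passing names= assigns the column names while reading the file
sample = pd.read_csv(io.StringIO(SAMPLE_CSV), names=COLUMNS)

# Checks against the dataset description: 4 attributes, no missing values,
# and a class attribute that only takes the codes 1 and 2
n_attributes = sample.shape[1]
missing_per_column = sample.isnull().sum()
codes_are_valid = set(sample['Survival_Stat'].unique()) <= {1, 2}

# Hypothetical helper column: readable labels for the two class codes
STATUS_LABELS = {1: 'survived 5 years or longer', 2: 'died within 5 years'}
sample['Survival_Label'] = sample['Survival_Stat'].map(STATUS_LABELS)

# Number of patients per class in this ten-row sample
class_counts = sample['Survival_Stat'].value_counts()

print(n_attributes, codes_are_valid)
print(missing_per_column)
print(class_counts)
```

On the full file the same checks would be run on df; the counts printed here describe only the ten sample rows, not the 306 instances.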
 

You can use the dtale.show() function to view the dataframe in the "dtale" format.
I commented that part out, as its output won't show here because it runs on a server.
If you are running this notebook on Colab, add this code:

import dtale.app as dtale_app
dtale_app.USE_COLAB = True
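The dtale.show() call mentioned above could look roughly like the following. This is a minimal sketch, not the notebook's original cell: show_in_dtale is a hypothetical wrapper name, and the usage lines stay commented out because they start a local server.

```python
def show_in_dtale(data):
    """Open a dataframe in D-Tale and return the running instance."""
    # Imported lazily so the rest of the notebook works without dtale installed
    import dtale

    # dtale.show() starts a local web server that serves the dataframe
    return dtale.show(data)


# Usage (left commented out because it starts a server):
# d = show_in_dtale(df)
# d.open_browser()  # open the D-Tale grid in a new browser tab
# d.kill()          # shut the server down when finished
```

Keeping the import inside the function is a design choice for this sketch only, so the remaining cells still run where D-Tale is not installed.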

from IPython.display import Image, display

# Saved screenshots of the D-Tale views; display() renders each image in turn
display(Image(filename='C:/Users/Chinmay/Desktop/dtale1.png'))
display(Image(filename='C:/Users/Chinmay/Desktop/dtale2.png'))
display(Image(filename='C:/Users/Chinmay/Desktop/df_charts1_dtale.png'))
display(Image(filename='C:/Users/Chinmay/Desktop/df_charts2_dtale.png'))
display(Image(filename='C:/Users/Chinmay/Desktop/df_charts3_dtale.png'))
display(Image(filename='C:/Users/Chinmay/Desktop/df_corelations_dtale.png'))
display(Image(filename='C:/Users/Chinmay/Desktop/df_describe_dtale.png'))
display(Image(filename='C:/Users/Chinmay/Desktop/df_describe_dtale_histogram.png'))