Using ML to Determine Infant Heart Rate Tracings - Python Portion (Part 2)
Background:
This project is divided into two parts. The first was an R-based exploratory data analysis; this second part applies Python machine learning algorithms.
Neonatal mortality rates in the United States have remained steady for the last several years and continue to be a concern; despite medical advances, there has been little progress in improving neonatal outcomes. This trend is worrisome. One way of assessing fetal well-being during labor is external fetal cardiotocography, which measures the infant's heart rate. Certain findings during labor are reassuring, while others indicate possible fetal distress.
This project sought to develop an accurate and precise machine learning model, using data obtained from the UCI Machine Learning Repository composed of various measurements of fetal heart rate tracings, to predict normal, suspect, or pathologic findings. The goal is to use a database of specific technical characteristics of fetal heart rate monitoring to build a predictive model that better identifies worrisome decreases in fetal heart rate. The measurements were quantified by an automated system (SisPorto 2.0).
The data comes from the University of California, Irvine Machine Learning Repository. The dataset can be found at the following web address: https://archive.ics.uci.edu/ml/datasets/Cardiotocography.
According to the website, over 2000 fetal heart rate tracings (cardiotocograms) were collected and interpreted by three expert obstetricians. The technical measurements include heart rate accelerations, decelerations, maximum and minimum heart rates, and the heart rate baseline; the target variable is whether the tracing was normal, suspect, or pathologic.
The dataset includes two different target variables. The first is a set of 10 categorical pattern indicators that, taken together, can be interpreted as suspect or not. The second is a three-class categorical variable labeling each tracing as normal, suspect, or pathologic.
The purpose of this portion of the project is model fitting and prediction.
Data Cleaning:
The data was loaded into Python and rows containing null values were removed.
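A minimal sketch of this step, assuming the UCI spreadsheet's raw-data sheet has been exported to CTG.csv (the file name is a placeholder) and that the identifier columns are dropped at the same time:

#Importing Packages
import pandas as pd
import numpy as np
#Loading the Raw Data (file name is a placeholder for the UCI export)
df = pd.read_csv('CTG.csv')
#Dropping Personal Identifiers and the Measurement Date
df = df.drop(columns = ['FileName', 'SegFile', 'Date'])
#Removing Rows with Null Values
df = df.dropna()

A preview of the cleaned dataframe is shown below, followed by a description of the variables.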
 | b | e | LBE | LB | AC | FM | UC | ASTV | MSTV | ALTV | MLTV | DL | DS | DP | DR | Width | Min | Max | Nmax | Nzeros | Mode | Mean | Median | Variance | Tendency | A | B | C | D | E | AD | DE | LD | FS | SUSP | CLASS | NSP
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | 240.0 | 357.0 | 120.0 | 120.0 | 0.0 | 0.0 | 0.0 | 73.0 | 0.5 | 43.0 | 2.4 | 0.0 | 0.0 | 0.0 | 0.0 | 64.0 | 62.0 | 126.0 | 2.0 | 0.0 | 120.0 | 137.0 | 121.0 | 73.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 9.0 | 2.0 |
2 | 5.0 | 632.0 | 132.0 | 132.0 | 4.0 | 0.0 | 4.0 | 17.0 | 2.1 | 0.0 | 10.4 | 2.0 | 0.0 | 0.0 | 0.0 | 130.0 | 68.0 | 198.0 | 6.0 | 1.0 | 141.0 | 136.0 | 140.0 | 12.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 6.0 | 1.0 |
3 | 177.0 | 779.0 | 133.0 | 133.0 | 2.0 | 0.0 | 5.0 | 16.0 | 2.1 | 0.0 | 13.4 | 2.0 | 0.0 | 0.0 | 0.0 | 130.0 | 68.0 | 198.0 | 5.0 | 1.0 | 141.0 | 135.0 | 138.0 | 13.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 6.0 | 1.0 |
4 | 411.0 | 1192.0 | 134.0 | 134.0 | 2.0 | 0.0 | 6.0 | 16.0 | 2.4 | 0.0 | 23.0 | 2.0 | 0.0 | 0.0 | 0.0 | 117.0 | 53.0 | 170.0 | 11.0 | 0.0 | 137.0 | 134.0 | 137.0 | 13.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 6.0 | 1.0 |
5 | 533.0 | 1147.0 | 132.0 | 132.0 | 4.0 | 0.0 | 5.0 | 16.0 | 2.4 | 0.0 | 19.9 | 0.0 | 0.0 | 0.0 | 0.0 | 117.0 | 53.0 | 170.0 | 9.0 | 0.0 | 137.0 | 136.0 | 138.0 | 11.0 | 1.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | 1.0 |
... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
2122 | 2059.0 | 2867.0 | 140.0 | 140.0 | 0.0 | 0.0 | 6.0 | 79.0 | 0.2 | 25.0 | 7.2 | 0.0 | 0.0 | 0.0 | 0.0 | 40.0 | 137.0 | 177.0 | 4.0 | 0.0 | 153.0 | 150.0 | 152.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 5.0 | 2.0 |
2123 | 1576.0 | 2867.0 | 140.0 | 140.0 | 1.0 | 0.0 | 9.0 | 78.0 | 0.4 | 22.0 | 7.1 | 0.0 | 0.0 | 0.0 | 0.0 | 66.0 | 103.0 | 169.0 | 6.0 | 0.0 | 152.0 | 148.0 | 151.0 | 3.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 5.0 | 2.0 |
2124 | 1576.0 | 2596.0 | 140.0 | 140.0 | 1.0 | 0.0 | 7.0 | 79.0 | 0.4 | 20.0 | 6.1 | 0.0 | 0.0 | 0.0 | 0.0 | 67.0 | 103.0 | 170.0 | 5.0 | 0.0 | 153.0 | 148.0 | 152.0 | 4.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 5.0 | 2.0 |
2125 | 1576.0 | 3049.0 | 140.0 | 140.0 | 1.0 | 0.0 | 9.0 | 78.0 | 0.4 | 27.0 | 7.0 | 0.0 | 0.0 | 0.0 | 0.0 | 66.0 | 103.0 | 169.0 | 6.0 | 0.0 | 152.0 | 147.0 | 151.0 | 4.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 5.0 | 2.0 |
2126 | 2796.0 | 3415.0 | 142.0 | 142.0 | 1.0 | 1.0 | 5.0 | 74.0 | 0.4 | 36.0 | 5.0 | 0.0 | 0.0 | 0.0 | 0.0 | 42.0 | 117.0 | 159.0 | 2.0 | 1.0 | 145.0 | 143.0 | 145.0 | 1.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 |
2126 rows × 37 columns
Description of Variables:
FileName and SegFile = Personal Identifiers
Date = Individual Date of Measurement
b = Starting Point of Measurement
e = Ending Point of Measurement
LBE = Baseline value (By Medical Expert)
LB = Fetal Heart Rate Baseline (beats/minute - Automated)
AC = # Accelerations/second
FM = # Fetal Movements/second
UC = # Uterine Contractions/second
DL = # Light Decelerations/second
DS = # Severe Decelerations/second
DP = # Prolonged Decelerations/second
DR = # Repetitive Decelerations (All 0s)
ASTV = % of time with abnormal short term variability
MSTV = Mean Value of Short Term Variability
ALTV = % of time with abnormal long term variability
MLTV = Mean Value of Long Term Variability
Width = Width of Fetal Heart Rate Histogram
Min = Minimum of Fetal Heart Rate Histogram
Max = Maximum of Fetal Heart Rate Histogram
Nmax = # Histogram Peaks
Nzeros = # Histogram Zeros
Mode = Histogram Mode
Mean = Histogram Mean
Median = Histogram Median
Variance = Histogram Variance
Tendency = Histogram Tendency: -1 = Left Asymmetric, 0 = Symmetric, 1 = Right Asymmetric
A = Calm Sleep
B = REM Sleep
C = Calm Vigilance
D = Active Vigilance
E = Shift Pattern (the SH class in the UCI documentation)
AD = Accelerative/Decelerative Pattern (Stress Situation)
DE = Decelerative Pattern (Vagal Stimulation)
LD = Largely decelerative pattern
FS = Flat Sinusoidal Pattern (Pathologic State)
SUSP = Suspect Pattern
CLASS = Pattern Class Code (1 to 10, corresponding to Classes A through SUSP above)
NSP = Fetal State Class Code (Normal = 1, Suspect = 2, Pathologic = 3)
The datatypes were encoded so that the categorical variables in the dataset are treated appropriately.
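A sketch of that encoding; the exact column list is an assumption based on the variable descriptions above:

#Casting the Pattern Indicators, Tendency, and the Targets to the Category Dtype
cat_cols = ['Tendency', 'A', 'B', 'C', 'D', 'E', 'AD', 'DE', 'LD', 'FS', 'SUSP', 'CLASS', 'NSP']
for col in cat_cols:
    df[col] = df[col].astype('category')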
In the R portion of this project, high correlations were found between the b and e variables, between the expert (LBE) and automated (LB) determinations of the fetal heart rate baseline, and among the mode, mean, and median variables.
This dataset contains both a 10-label multi-class target and a 3-label multi-class target. I will use the 3-label target, NSP, which classifies the FHR as Normal, Suspect, or Pathologic.
I will first include the 10-label target variables as predictors to see whether they increase model performance after hyperparameter tuning, and then repeat the analysis without them to see whether performance is better or worse.
I plan to evaluate K-Nearest Neighbors, Decision Tree, and Random Forest algorithms to determine the best model.
#Checking Features for High Correlations (> 0.90)
corr_matrix = df.corr(numeric_only = True).abs()
#Selecting Upper Triangle of Correlation Matrix
upper = corr_matrix.where(np.triu(np.ones(corr_matrix.shape), k=1).astype(bool))
#Finding Feature Columns with Correlation > 0.90
to_drop = [column for column in upper.columns if any(upper[column] > 0.90)]
to_drop
['e', 'LB', 'Median']
The e, LB, and Median variables were all highly correlated with other features and were dropped from the dataframe.
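The drop itself is not shown in the output above; it is a one-line operation:

#Dropping the Highly Correlated Columns Identified Above
df = df.drop(columns = to_drop)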
 | b | LBE | AC | FM | UC | ASTV | MSTV | ALTV | MLTV | DL | DS | DP | DR | Width | Min | Max | Nmax | Nzeros | Mode | Mean | Variance | Tendency | A | B | C | D | E | AD | DE | LD | FS | SUSP | CLASS | NSP
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | 240.0 | 120.0 | 0.0 | 0.0 | 0.0 | 73.0 | 0.5 | 43.0 | 2.4 | 0.0 | 0.0 | 0.0 | 0.0 | 64.0 | 62.0 | 126.0 | 2.0 | 0.0 | 120.0 | 137.0 | 73.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 9.0 | 2.0 |
2 | 5.0 | 132.0 | 4.0 | 0.0 | 4.0 | 17.0 | 2.1 | 0.0 | 10.4 | 2.0 | 0.0 | 0.0 | 0.0 | 130.0 | 68.0 | 198.0 | 6.0 | 1.0 | 141.0 | 136.0 | 12.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 6.0 | 1.0 |
3 | 177.0 | 133.0 | 2.0 | 0.0 | 5.0 | 16.0 | 2.1 | 0.0 | 13.4 | 2.0 | 0.0 | 0.0 | 0.0 | 130.0 | 68.0 | 198.0 | 5.0 | 1.0 | 141.0 | 135.0 | 13.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 6.0 | 1.0 |
4 | 411.0 | 134.0 | 2.0 | 0.0 | 6.0 | 16.0 | 2.4 | 0.0 | 23.0 | 2.0 | 0.0 | 0.0 | 0.0 | 117.0 | 53.0 | 170.0 | 11.0 | 0.0 | 137.0 | 134.0 | 13.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 6.0 | 1.0 |
5 | 533.0 | 132.0 | 4.0 | 0.0 | 5.0 | 16.0 | 2.4 | 0.0 | 19.9 | 0.0 | 0.0 | 0.0 | 0.0 | 117.0 | 53.0 | 170.0 | 9.0 | 0.0 | 137.0 | 136.0 | 11.0 | 1.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | 1.0 |
... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
2122 | 2059.0 | 140.0 | 0.0 | 0.0 | 6.0 | 79.0 | 0.2 | 25.0 | 7.2 | 0.0 | 0.0 | 0.0 | 0.0 | 40.0 | 137.0 | 177.0 | 4.0 | 0.0 | 153.0 | 150.0 | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 5.0 | 2.0 |
2123 | 1576.0 | 140.0 | 1.0 | 0.0 | 9.0 | 78.0 | 0.4 | 22.0 | 7.1 | 0.0 | 0.0 | 0.0 | 0.0 | 66.0 | 103.0 | 169.0 | 6.0 | 0.0 | 152.0 | 148.0 | 3.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 5.0 | 2.0 |
2124 | 1576.0 | 140.0 | 1.0 | 0.0 | 7.0 | 79.0 | 0.4 | 20.0 | 6.1 | 0.0 | 0.0 | 0.0 | 0.0 | 67.0 | 103.0 | 170.0 | 5.0 | 0.0 | 153.0 | 148.0 | 4.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 5.0 | 2.0 |
2125 | 1576.0 | 140.0 | 1.0 | 0.0 | 9.0 | 78.0 | 0.4 | 27.0 | 7.0 | 0.0 | 0.0 | 0.0 | 0.0 | 66.0 | 103.0 | 169.0 | 6.0 | 0.0 | 152.0 | 147.0 | 4.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 5.0 | 2.0 |
2126 | 2796.0 | 142.0 | 1.0 | 1.0 | 5.0 | 74.0 | 0.4 | 36.0 | 5.0 | 0.0 | 0.0 | 0.0 | 0.0 | 42.0 | 117.0 | 159.0 | 2.0 | 1.0 | 145.0 | 143.0 | 1.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 |
Predictive Model Training and Fitting
KNN
First, the KNN model was trained and evaluated.
#Setting Up Features and Target Variables
target = df['NSP']
features = df.loc[:, df.columns != 'NSP']
#Getting Dummy Variables for the Categorical Variables
#(One-hot encoding the target means the classifier predicts indicator columns;
#argmax below recovers the 0/1/2 class labels)
target = pd.get_dummies(target)
features = pd.get_dummies(features)
#Importing Packages for KNN
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.preprocessing import StandardScaler
from sklearn.metrics import classification_report
from sklearn.metrics import confusion_matrix
from sklearn.model_selection import GridSearchCV, RandomizedSearchCV
from sklearn.metrics import roc_auc_score
#Create Standardizer
standardizer = StandardScaler()
#Standardize Features
features_standardized = standardizer.fit_transform(features)
#Train/Test 80/20 Split
features_train, features_test, target_train, target_test = train_test_split(features_standardized, target, test_size = 0.2, random_state = 1)
#Creating Classifier with K of 3
knn = KNeighborsClassifier(n_neighbors = 3, n_jobs = -1)
#Fitting Classifier on Training Data
knn.fit(features_train, target_train)
print(knn.fit)
#Generating Confusion Matrix
target_pred = knn.predict(features_test)
test0 = np.array(target_test).argmax(axis = 1)
predictions0 = np.array(target_pred).argmax(axis = 1)
print(confusion_matrix(test0, predictions0))
#Printing Classification Report for KNN with N-Neighbors of 3
print(classification_report(test0, predictions0))
<bound method SupervisedIntegerMixin.fit of KNeighborsClassifier(algorithm='auto', leaf_size=30, metric='minkowski',
metric_params=None, n_jobs=-1, n_neighbors=3, p=2,
weights='uniform')>
[[326 0 0]
[ 1 67 0]
[ 0 1 31]]
precision recall f1-score support
0 1.00 1.00 1.00 326
1 0.99 0.99 0.99 68
2 1.00 0.97 0.98 32
accuracy 1.00 426
macro avg 0.99 0.98 0.99 426
weighted avg 1.00 1.00 1.00 426
The hyperparameters were then tuned for best accuracy using a grid search.
#Hyperparameter Tuning for KNN Using Grid Search CV
#Creating Hyperparameter Grid
param_dist1 = {"leaf_size": list(range(1,50)),
"n_neighbors": list(range(1,30)),
"p": [1,2]}
#Create New KNN Object
knn_2 = KNeighborsClassifier()
#Use Gridsearch
clf = GridSearchCV(knn_2, param_dist1, cv=10, n_jobs=-1)
#Fit Model
best_model = clf.fit(features_standardized, target)
print('Best Leaf Size:', best_model.best_estimator_.get_params()['leaf_size'])
print('Best p:', best_model.best_estimator_.get_params()['p'])
print('Best n_neighbors:', best_model.best_estimator_.get_params()['n_neighbors'])
print('Best Metric:', best_model.best_estimator_.get_params()['metric'])
print('Best Weights:', best_model.best_estimator_.get_params()['weights'])
Best Leaf Size: 1
Best p: 1
Best n_neighbors: 3
Best Metric: minkowski
Best Weights: uniform
The best leaf size was 1, the best p was 1, and the best n_neighbors was 3, using the Minkowski metric and uniform weights.
The model was re-run using these hyperparameters and a classification report was obtained.
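The re-run code is not reproduced here; a minimal sketch, reusing the earlier train/test split and the tuned values reported by the grid search:

#Re-Fitting KNN with the Tuned Hyperparameters
knn_best = KNeighborsClassifier(n_neighbors = 3, leaf_size = 1, p = 1, weights = 'uniform', n_jobs = -1)
knn_best.fit(features_train, target_train)
#Confusion Matrix and Classification Report on the Test Set
target_pred = knn_best.predict(features_test)
test0 = np.array(target_test).argmax(axis = 1)
predictions0 = np.array(target_pred).argmax(axis = 1)
print(confusion_matrix(test0, predictions0))
print(classification_report(test0, predictions0))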
[[326 0 0]
[ 2 66 0]
[ 0 1 31]]
precision recall f1-score support
0 0.99 1.00 1.00 326
1 0.99 0.97 0.98 68
2 1.00 0.97 0.98 32
accuracy 0.99 426
macro avg 0.99 0.98 0.99 426
weighted avg 0.99 0.99 0.99 426
Decision Tree Classifier
The data was then trained and fit using a Decision Tree classifier.
#Fitting Data On Decision Tree Algorithm
#Importing Packages
from sklearn.tree import DecisionTreeClassifier
#Setting Up Features and Target Variables
target = df['NSP']
features = df.loc[:, df.columns != 'NSP']
#Train/Test Splitting with 80/20 Split
features_train1, features_test1, target_train1, target_test1 = train_test_split(features, target, test_size = 0.2, random_state = 1)
#Instantiating Decision Tree Classifier
dt = DecisionTreeClassifier(random_state = 1)
print(dt.fit(features_train1, target_train1))
#Setting Target Prediction Variable Based on Features Test
target_pred = dt.predict(features_test1)
#Generating Confusion Matrix
test1 = np.array(target_test1)
predictions1 = np.array(target_pred)
print(confusion_matrix(test1, predictions1))
#Classification Report for Decision Tree
print(classification_report(test1, predictions1))
DecisionTreeClassifier(class_weight=None, criterion='gini', max_depth=None,
max_features=None, max_leaf_nodes=None,
min_impurity_decrease=0.0, min_impurity_split=None,
min_samples_leaf=1, min_samples_split=2,
min_weight_fraction_leaf=0.0, presort=False,
random_state=1, splitter='best')
[[324 2 0]
[ 1 67 0]
[ 0 1 31]]
precision recall f1-score support
1.0 1.00 0.99 1.00 326
2.0 0.96 0.99 0.97 68
3.0 1.00 0.97 0.98 32
accuracy 0.99 426
macro avg 0.98 0.98 0.98 426
weighted avg 0.99 0.99 0.99 426
The hyperparameters were optimized for best accuracy and the model was re-run using these parameters.
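The tuning code for the tree is not reproduced here; a sketch of one plausible grid search, where the grid values are assumptions rather than the exact grid used (the selected criterion='entropy' matches the re-run below):

#Hyperparameter Tuning for the Decision Tree Using Grid Search CV
param_dist2 = {"criterion": ["gini", "entropy"],
               "max_depth": [None, 5, 10, 20],
               "min_samples_split": [2, 5, 10]}
dt_2 = DecisionTreeClassifier(random_state = 1)
clf_dt = GridSearchCV(dt_2, param_dist2, cv = 10, n_jobs = -1)
clf_dt.fit(features_train1, target_train1)
print(clf_dt.best_params_)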
#Fitting New DT Algorithm on Tuned Hyperparameters
#Setting Up Features and Target Variables
target = df['NSP']
features = df.loc[:, df.columns != 'NSP']
#Train/Test Splitting with 80/20 Split
features_train1, features_test1, target_train1, target_test1 = train_test_split(features, target, test_size = 0.2, random_state = 1)
#Instantiating Decision Tree Classifier
#Tuned Decision Tree: only criterion='entropy' differs from the defaults used above
dt = DecisionTreeClassifier(criterion = 'entropy', random_state = 1)
dt.fit(features_train1, target_train1)
#Setting Target Prediction Variable Based on Features Test
target_pred = dt.predict(features_test1)
#Generating Confusion Matrix
test1 = np.array(target_test1)
predictions1 = np.array(target_pred)
print(confusion_matrix(test1, predictions1))
#Classification Report for Decision Tree
print(classification_report(test1, predictions1))
[[321 5 0]
[ 2 66 0]
[ 0 1 31]]
precision recall f1-score support
1.0 0.99 0.98 0.99 326
2.0 0.92 0.97 0.94 68
3.0 1.00 0.97 0.98 32
accuracy 0.98 426
macro avg 0.97 0.97 0.97 426
weighted avg 0.98 0.98 0.98 426
Random Forest Classifier
Finally, the data was trained and evaluated based on a Random Forest Classifier.
#Fitting Data on Random Forest Algorithm
#Importing Packages
from sklearn.ensemble import RandomForestClassifier
#Setting Up Features and Target Variables
target = df['NSP']
features = df.loc[:, df.columns != 'NSP']
#Train/Test Splitting with 80/20 Split
features_train2, features_test2, target_train2, target_test2 = train_test_split(features, target, test_size = 0.2, random_state = 1)
#Creating RFC Model
model = RandomForestClassifier(n_estimators = 100, random_state = 1)
#Fitting Training Data
print(model.fit(features_train2, target_train2))
#Checking Predictions
rf_predictions = model.predict(features_test2)
#Generating Confusion Matrix
test2 = np.array(target_test2)
predictions2 = np.array(rf_predictions)
print(confusion_matrix(test2, predictions2))
#Classification Report
print(classification_report(test2, predictions2))
RandomForestClassifier(bootstrap=True, class_weight=None, criterion='gini',
max_depth=None, max_features='auto', max_leaf_nodes=None,
min_impurity_decrease=0.0, min_impurity_split=None,
min_samples_leaf=1, min_samples_split=2,
min_weight_fraction_leaf=0.0, n_estimators=100,
n_jobs=None, oob_score=False, random_state=1, verbose=0,
warm_start=False)
[[326 0 0]
[ 2 66 0]
[ 0 1 31]]
precision recall f1-score support
1.0 0.99 1.00 1.00 326
2.0 0.99 0.97 0.98 68
3.0 1.00 0.97 0.98 32
accuracy 0.99 426
macro avg 0.99 0.98 0.99 426
weighted avg 0.99 0.99 0.99 426
The hyperparameters were once again tuned for best accuracy and the model was re-run.
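The tuning code is again not reproduced here; a sketch of a plausible grid search over common random forest hyperparameters (the grid values are assumptions):

#Hyperparameter Tuning for the Random Forest Using Grid Search CV
param_dist3 = {"n_estimators": [100, 300, 500],
               "max_features": ["sqrt", "log2"],
               "max_depth": [None, 10, 20]}
rf_2 = RandomForestClassifier(random_state = 1)
clf_rf = GridSearchCV(rf_2, param_dist3, cv = 10, n_jobs = -1)
clf_rf.fit(features_train2, target_train2)
print(clf_rf.best_params_)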
[[326 0 0]
[ 2 66 0]
[ 0 1 31]]
precision recall f1-score support
1.0 0.99 1.00 1.00 326
2.0 0.99 0.97 0.98 68
3.0 1.00 0.97 0.98 32
accuracy 0.99 426
macro avg 0.99 0.98 0.99 426
weighted avg 0.99 0.99 0.99 426
Removing the 10-Class Predictors Using KNN, Decision Tree, and Random Forest Classifiers
The three algorithms were then re-evaluated with the 10-class categorical predictors removed to see whether this hindered or helped accuracy.
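A sketch of the removal, assuming the ten pattern indicators plus the CLASS code are the columns dropped:

#Dropping the 10-Class Pattern Indicators and the CLASS Code
pattern_cols = ['A', 'B', 'C', 'D', 'E', 'AD', 'DE', 'LD', 'FS', 'SUSP', 'CLASS']
df_reduced = df.drop(columns = pattern_cols)
#Rebuilding the Features and Target from the Reduced Dataframe
target = df_reduced['NSP']
features = df_reduced.loc[:, df_reduced.columns != 'NSP']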
KNN Results
Before Hyperparameter Tuning:
[[319 6 1]
[ 23 40 5]
[ 7 6 19]]
precision recall f1-score support
0 0.91 0.98 0.95 326
1 0.77 0.59 0.67 68
2 0.76 0.59 0.67 32
accuracy 0.89 426
macro avg 0.81 0.72 0.76 426
weighted avg 0.88 0.89 0.88 426
After Hyper-Parameter Tuning:
[[321 5 0]
[ 32 35 1]
[ 10 6 16]]
precision recall f1-score support
0 0.88 0.98 0.93 326
1 0.76 0.51 0.61 68
2 0.94 0.50 0.65 32
accuracy 0.87 426
macro avg 0.86 0.67 0.73 426
weighted avg 0.87 0.87 0.86 426
Based on these results, the KNN model actually performed worse by F1-score after the 10-class predictors were removed, even with hyperparameter tuning.
Decision Tree Classifier
Before Hyperparameter Tuning:
[[308 17 1]
[ 16 46 6]
[ 1 2 29]]
precision recall f1-score support
1.0 0.95 0.94 0.95 326
2.0 0.71 0.68 0.69 68
3.0 0.81 0.91 0.85 32
accuracy 0.90 426
macro avg 0.82 0.84 0.83 426
weighted avg 0.90 0.90 0.90 426
After Hyperparameter Tuning:
[[314 10 2]
[ 16 47 5]
[ 1 3 28]]
precision recall f1-score support
1.0 0.95 0.96 0.96 326
2.0 0.78 0.69 0.73 68
3.0 0.80 0.88 0.84 32
accuracy 0.91 426
macro avg 0.84 0.84 0.84 426
weighted avg 0.91 0.91 0.91 426
With the 10-class indicators removed as predictors, the Decision Tree also performed worse. The model still produced fairly good results, except for the Suspect class, which had a lower F1-score than the Normal and Pathologic categories.
Random Forest Classifier
Before Hyperparameter Tuning:
[[325 0 1]
[ 20 45 3]
[ 4 6 22]]
precision recall f1-score support
0 0.96 0.99 0.97 326
1 0.88 0.66 0.76 68
2 0.85 0.69 0.76 32
micro avg 0.94 0.92 0.93 426
macro avg 0.89 0.78 0.83 426
weighted avg 0.94 0.92 0.92 426
samples avg 0.92 0.92 0.92 426
After Hyperparameter Tuning:
[[324 2 0]
[ 23 42 3]
[ 5 1 26]]
precision recall f1-score support
0 0.94 0.99 0.97 326
1 0.93 0.62 0.74 68
2 0.90 0.81 0.85 32
micro avg 0.94 0.92 0.93 426
macro avg 0.92 0.81 0.85 426
weighted avg 0.94 0.92 0.92 426
samples avg 0.92 0.92 0.92 426
Likewise, the Random Forest model performed worse without the 10-class indicators that were previously in the model.
Using the results above, the Random Forest and KNN models performed similarly after hyperparameter tuning and after dropping the redundant e, LB, and Median variables. Furthermore, including the 10-class pattern variables as predictors improved the F1, precision, and recall scores, so those variables should be kept in the model. One possible explanation is that the 10-class patterns carry information that points toward a suspect or pathologic tracing, giving the algorithm more signal on which to base its judgment.
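As a consolidated check on that comparison (not part of the original notebook), a sketch of 10-fold cross-validated macro-F1 scoring for the three tuned classifiers, assuming df still holds the full cleaned feature set:

#Comparing the Three Classifiers with Cross-Validated Macro F1
from sklearn.pipeline import make_pipeline
from sklearn.model_selection import cross_val_score
X = df.loc[:, df.columns != 'NSP']
y = df['NSP']
models = {'KNN': make_pipeline(StandardScaler(), KNeighborsClassifier(n_neighbors = 3, leaf_size = 1, p = 1)),
          'Decision Tree': DecisionTreeClassifier(criterion = 'entropy', random_state = 1),
          'Random Forest': RandomForestClassifier(n_estimators = 100, random_state = 1)}
for name, model in models.items():
    scores = cross_val_score(model, X, y, cv = 10, scoring = 'f1_macro')
    print(name, round(scores.mean(), 3))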
Further model deployment would require a much larger study population, as well as expert obstetrician oversight to compare the algorithm's determinations against known clinical experience. Even so, these results suggest that machine learning algorithms could assist clinicians in identifying potentially suspect or pathologic patterns.
To view more specifics on the coding and project, please refer to the GitHub repository.