Words

2017-08-08 20:09:20 +01:00
parent a84a969225
commit 46e3cc97db
1 changed files with 29 additions and 28 deletions
@@ -19,7 +19,7 @@
 \usepackage{booktabs}
 \usepackage{tabulary}

-\usepackage[pass]{geometry}
+\usepackage[margin=1.0in]{geometry}
 \usepackage{pdflscape}
 \usepackage{graphicx}

@@ -399,11 +399,11 @@ as a classifier to provide a final accuracy score of 73\%.\\
 \begin{tabulary}{\linewidth}{LLLLLL}
 \dtoprule
 Author                   & Pre-processing/segmentation                                                                                                               & Features                                                                                                        & Classification Method & Dataset                                                                                                                 & Reported Accuracy                                  \\ \hline
-Maglogiannis et.\ al     & Wavelet decomposition, Shannon energy peak picking                                                                                        & Features derived from wavelet decomposition and PCG segmentations                                               & SVM                   & 198 recordings, 38 normal, 41 AS systolic murmur, 43 MR systolic murmur, 38 AR diastolic murmur, 38 MS diastolic murmur & $91.43\%\;Ac$                                      \\
-Ari et.\ al              & Amplitude envelope peak picking~\parencite{Ari2007}                                                                                       & Wavelet based features                                                                                          & LSSVM                 & 64 patients, 64 recordings, 512 cycles                                                                                  & $88.750-100\%\;Ac$ (dependant on abnormality type) \\
-Quiceno-Manrique et.\ al & Downsampled to 4KHz, Normalised to maximum of signal, ECG assisted QRS complex detection algorithm used for segmentation                  & Spectral features derived from STFT, Wavelet decomposition and quadratic energy distributions                   & $k$-NN                & 22 patients, 16 normal, 6 abnormal, 8 recordings (12s) per patient                                                      & $98\%\;Ac$                                         \\
-Schmidt et.\ al          & Signal filtered into frequency bands, Segmented by HMM based method+hand corrected, removal of high variance sub-segments to remove noise & Parametric spectral features (AR, ARMA and Music), Instantaneous frequency and amplitude, Power in octave bands & QDA                   & 435 Recordings, 133 patients, 70 normal, 63 abnormal                                                                    & $73\%\;Ac$                                         \\
-Reed et.\ al             & ---                                                                                                                                       & Wavelet decomposition coefficients, Manual feature reduction                                                    & ANN                   & 5 patients, 4 cycles per patient                                                                                        & $100\%\;Ac$                                        \\
+Maglogiannis et.~al \citeyearpar{Maglogiannis2009}     & Wavelet decomposition, Shannon energy peak picking                                                                                        & Features derived from wavelet decomposition and PCG segmentations                                               & SVM                   & 198 recordings, 38 normal, 41 AS systolic murmur, 43 MR systolic murmur, 38 AR diastolic murmur, 38 MS diastolic murmur & $91.43\%\;Ac$                                      \\
+Ari et.~al \citeyearpar{Ari2010}              & Amplitude envelope peak picking~\parencite{Ari2007}                                                                                       & Wavelet based features                                                                                          & LSSVM                 & 64 patients, 64 recordings, 512 cycles                                                                                  & $88.750-100\%\;Ac$ (dependant on abnormality type) \\
+Quiceno-Manrique et.~al \citeyearpar{Quiceno-Manrique2010a}& Downsampled to 4KHz, Normalised to maximum of signal, ECG assisted QRS complex detection algorithm used for segmentation                  & Spectral features derived from STFT, Wavelet decomposition and quadratic energy distributions                   & $k$-NN                & 22 patients, 16 normal, 6 abnormal, 8 recordings (12s) per patient                                                      & $98\%\;Ac$                                         \\
+Schmidt et.~al \citeyearpar{Schmidt2015}          & Signal filtered into frequency bands, Segmented by HMM based method+hand corrected, removal of high variance sub-segments to remove noise & Parametric spectral features (AR, ARMA and Music), Instantaneous frequency and amplitude, Power in octave bands & QDA                   & 435 Recordings, 133 patients, 70 normal, 63 abnormal                                                                    & $73\%\;Ac$                                         \\
+Reed et.~al \citeyearpar{Reed2004}             & ---                                                                                                                                       & Wavelet decomposition coefficients, Manual feature reduction                                                    & ANN                   & 5 patients, 4 cycles per patient                                                                                        & $100\%\;Ac$                                        \\
 \dbottomrule\\
 % TODO: Add footnote explanation for Ac = Accuracy
 % TODO: Add citeyearpar references to authors
@@ -491,13 +491,13 @@ commonly used features in previous PCG related literature such as wavelet
 decomposition based features, MFCCs and Shannon Energy. The system also uses a
 total of 40 classifiers, 20 for signals labeled to be `standard' and 20 for
 thos labeled as `atypical'. A mixture of Random Forrest, LogitBoost and
-Cost-Sensitive Classifiers are used to classify signals in parallel. Final
+Cost-Sensitive Classifiers (CSC) are used to classify signals in parallel. Final
 results are combined using a rule based decision, designed through manual
 experimentation.\\
 % TODO: Read into accuracy results for this method more closely

 Potes et.\ al present a similar approach to that of Homsi et.\
-al~\citeyearpar{Potes2016}. 124 similar time-frequency features are extracted
+al~\citeyearpar{Potes2016}. 124 similar time-frequency ($t-f$) features are extracted
 and used as vectors for an AdaBoost classifier. This was combined with a deep
 learning approach using a Convolutional Neural Network (CNN) classifier. The
 signal was decomposed into 4 frequency bands and segmented, to provide input to
@@ -508,8 +508,8 @@ set for the challenge at 86.02\%.\\
 Zabihi et.\ al take an alternative approach by choosing not to segment PCG data
 in the pre-processing stage~\citeyearpar{Zabihi2016}. This is with the intention of reducing
 computational complexity of the resulting algorithm. In addition, the proposed
-method utilizes a wrapper method for feature selection (sequential forward
-feature selection) and Linear Predictive Coefficients (LPC) for the reduction
+method utilizes a wrapper sequential forward
+feature selection (SFS) and Linear Predictive Coefficients (LPC) for the reduction
 of features used for classification. This benefits the system by removing correlated and irrelevant
 features, Thus reducing computational complexity and removing irellevant noise
 from feature vectors prior to training.
@@ -528,40 +528,41 @@ estimated classification of a new data point. From the 228 extracted features,
 scores using generated histograms. This allowed for the training scores to be
 automatically optimized by the algorithm.\\

- Wavelet, MFCC and inter-beat neural network classifier~\parencite{Kay2017}
+Kay et.\ al present a method using ANNs, a wide variety of features and PCA for
+feature reduction. The algorithm scores well on the test set. However, this
+work is most noteable for it's rigurous evaluation by authors, using leave on
+out cross validation for a clearer understanding of  the generalisation of the
+algorithm, as well as highlighting issues with the underlying dataset that are
+discussed in Section~\ref{Dataset}
+
 - Large number of features, tensor based feature reduction and
 K-NN~\parencite{Bobillo2016}
 - Convolutional neural networks, MFCCs~\parencite{Rubin2016}



-\newgeometry{margin=1cm} % modify this if you need even more space
-\begin{landscape}
-\begin{table}[htbp]
-    \captionof{table}{Summary of research prior to the Physionet Challenge 2016} \label{PriorWorkTable}
+\begin{table}[H]
+    \captionof{table}{Summary of Physionet Challenge 2016 entries} \label{PriorWorkTable}
 \scriptsize
 %\centering
 \rowcolors{1}{gray!15}{white}
 \doublespacing
-\begin{tabulary}{\linewidth}{LLLLL}
+\begin{tabulary}{\linewidth}{CCCCC}
 \dtoprule
-Author                 & Method                                                                                         & Datasets                                                                                       & \mbox{Reported} Results         & Notes                                                                                            \\ \bottomrule
-Springer et.\ al \citeyearpar{Springer2016} & HSMM, Logistic regression                                                                       & 10,172s of recordings from 112 patients. 12,181 first and 11,627 second heart sounds. & $95.63\pm0.85\%$                             & Supervised algorithm.                                                                                                                                                            \\
-Huiying et.\ al \citeyearpar{Liang1997b} & Normalised average Shannon energy envelope, peak picking                                        & 37 recordings, 14 pathological murmurs and 23 physiological murmurs. 515 cycles       & $91.03\%\;Ac$                                          & Unsupervised Algorithm.  Dataset consists entirely of child recording. Optimized on full dataset                                                                                 \\
-Vepa et.\ al \citeyearpar{Vepa2008}     & Wavelet decomposition, energy and simplicity measurement                                       & 160 heart cycles collected from a variety of sources (training CDs, web resources)    & $84\%\;Ac$                                             & Unsupervised Algorithm, Optimized on full dataset                                                                                                                                \\
-Sun et.\ al \citeyearpar{Sun2014}             & Viola integral envelope extraction, short-time modified Hilbert transform, peak picking        & 6949s of recordings, from 121 patients                                                & $97.37\%\;Ac$                                          & Supervised algorithm. Tolerance for segmentation accuracy not specified                                                                                                          \\
-Sepehri et.\ al \citeyearpar{Sepehri2010}        & Spectral density estimation, auto-regressive parameters, multi-layer perceptron neural network & 120 recording, from 60 patients                                                       & $93.6\%\;Ac$                                           & Supervised algorithm                                                                                                                                                             \\
-Ricke et.\ al \citeyearpar{Ricke2005}    & Shannon energy (and related features), HMM                                                     & 9 recordings, from 9 patients                                                         & $98\%\;Ac$                                             & Supervised algorithm                                                                                                                                                             \\
-Schmidt et.\ al \citeyearpar{Schmidt2015}  & DHMM, Auto-correlation duration features, Homomorphic envelogram                               & 113 recordings, from 113 patients. 8s per recording. 15 abnormal recordings           & $98.8\;Se,\;98.6\;P_+$ on test set                         & All data recorded ``lateral to the sternum in the fourth intercostal space on the left side''. Mix of noisy and clean recordings. 40 recording used for training, 73 for testing \\
-Gill et.\ al \citeyearpar{Gill2005}         & Homomorphic envelogram, Embedded HMMs                                                          & 44 recording, 17 subjects. 30-60s per recording                                       & $98.6\%\;Ac, 96.9\;P_+$ for S1. $98.3\;Ac,\;96.5\;P_+$ for S2 & Recording taken in sub-optimal environments (noisy hospitals, offices etc...)                                                                                                    \\
-Gupta et.\ al \citeyearpar{Gupta2007}    & Homomorphic filtering, $k$-means clustering                                                       & 41 patients, 340 heart cycles. 110 normal,  124 systolic murmur, 106 diastolic murmur & $90.29\%\;Ac$                                          & Unsupervised Algorithm.                                                                                                                                                          \\ \hline
+Author            & Features                                                            & Classification Method                                    & Reported Scores & Challenge Score \\ \midrule
+Potes et.~al     & 124 $t-f$ features                                                  & Combined AdaBoost/ANN                                    &                 & 86.02\%         \\
+Zabihi et.~al    & 40 temporal, spectral and $t-f$ features, reduced using SFS and LPC & 2 ensembles of neural networks                           &                 & 85.90\%         \\
+Kay et.~al       & CWT, MFCCs, complexity measures, Inter-beat features, PCA           & ANNs                                                     &                 & 85.20\%         \\
+Bobillo           & MFCCs and WPD, reduced using tensor decomposition                   & $k$-NN                                                   &                 & 84.54\%         \\
+Homsi et.~al     & 131 time domain, STFT based andwavelet based features               & Combined ensembles of LogitBoost, Random Forrest and CSC &                 & 84.48\%         \\
+Maknickas et.~al &                                                                     &                                                          &                 & 84.15\%         \\
+Plesinger et.~al &                                                                     &                                                          &                 & 84.11\%         \\
+Rubin et.~al     &                                                                     &                                                          &                 & 83.99\%         \\
 \dbottomrule\\
 % TODO: Add footnote explanation for Ac = Accuracy
 % TODO: Add citeyearpar references to authors
 \end{tabulary}
 \end{table}
-\end{landscape}
-\restoregeometry

 % TODO: Insert table of previous research methods, datasets and results