end of day commit

This commit is contained in:
Sam Perry
2017-08-07 23:04:07 +01:00
parent e144f7079f
commit 730d41307c
+73 -34
View File
@@ -247,7 +247,7 @@ for the duration of each state in the HMM~\citeyearpar{Gill2005}. This is
handled through the extraction of 6 duration features based primarily on peaks,
which are then used as feature vectors for the HMM. Results of 98.6\%
sensitivity, 96.9\% positive predictivity for S1 sounds and 98.3\% sensitivity,
96.5\% positive predictivity were reported.
96.5\% positive predictivity for S2 sounds were reported.
The issue of state duration is further addressed by Schmidt et.\ al through use
of a duration-dependent hidden Markov (DHMM)~\citeyearpar{Schmidt2015}. The
DHMM is a modified HMM that considers the duration of the current state when
@@ -325,15 +325,79 @@ been produced with regards to general abnormality detection, with many projects
choosing to focus on specific conditions such as murmurs, atrial fibrillation
and flutter, and heart valve disease. This section outlines some key research
into these areas, alongside initial research into general abnormality
detection.
detection.\\
- SVM classifier for heart valve diseas~\parencite{Maglogiannis2009}
- Threshold classifier for atrial fibrillation and flutter~\parencite{Dash2009}
- k-NN Classifier for murmur detection~\parencite{Quiceno-Manrique2010a}
- Feature analysis specifically for coronary artery
diseas~\parencite{Schmidt2015}
- GDA and MLP Neural-net classification of general abnormalities~\parencite{Yaghouby2009}
- SVM, k-NN and Bayesian classifier of general abnormalities~\parencite{Lubaib2016}
Maglogiannis et.\ al present a classifier for discrimination of heart valve
disease from regular heart sounds using an SVM
classifier~\citeyearpar{Maglogiannis2009}.
Roughly 100 features were extracted from the signal, based on direct analysis
of each heart cycle component (S1, Systole, S2, Diastole) and the average
shannon energy envelope of these components.
A database of 198 heart sounds was curated for the project, acquired from 8
sources, such as medical CDs and pre-existing databases.
An accuracy of 91.43\% is reported using 10-fold stratified cross-validation.
In addition, the project aimed to classify individual abnormalities in a 3 step
process, by distinguishing between systolic or diastolic murmurs, and then
distinguishing between aortic or mitral diseases. The classifier achieved
accuracy between 90-97\% for these classifications.\\
Ari et.\ al also propose an SVM based method for abnormality
classification~\citeyearpar{Ari2010}.\\
Quiceno-Manrique et.\ al demonstrate the use of various time frequency
representations (TFR) such as short-time fourier transform, wavelet transforms,
Wigner-Ville distribution etc\ldots, with a $k$-nearest neighbour classifier
(k-NN) for systolic murmur detection~\citeyearpar{Quiceno-Manrique2010a}. This
work highlights the effectiveness of alternative TFRs to traditional fourier
methods. This method also employs Principle Component Analysis (PCA) for the
mapping of a high dimensional feature space to a lower dimension, for the
benefit of computational performance. Features were evaluated using a dataset
of of 22 patients, 6 of which were labeled as having a systolic murmur. The
highest reported accuracy was achieved using MFCCs as the primary feature
vector achieving a 98\% accuracy on 10-fold cross validation.\\
Schmidt et.\ al aim to find features that can be used for classification of
coronary artery disease through detection of small
murmurs~\citeyearpar{Schmidt2015}. A large number of features are then
calculated to provide vectors for classification. Parametric spectral features
such as ARMA are used, alongside instantaneous frequency and octave power
measurements. These are combined with complexity features such as sample
entropy and simplicity. Complexity features are chosen in an attempt to exploit
the likely stochastic nature of murmurs, when compared to normal heart sounds.
Given the large number of features calculated, PCA is used to retain only the
most relevant information. Quadratic discriminant analysis (QDA) is then used
as a classifier to provide a final accuracy score of 73\%.\\
General abnormality detection algorithms are significantly less common prior to
the challenge. Reed et.\ al implement a simple classification using artificial
neural networks (ANNs) and wavelet decomposition~\citeyearpar{Reed2004}.
However, due to the comparitively small sample size used for training (1
patient per abnormality, 4 cycles per patient), a reported accuracy of 100\%
would likely generalise poorly.
\newgeometry{margin=1cm} % modify this if you need even more space
\begin{landscape}
\begin{table}[htbp]
\captionof{table}{Summary of research prior to the Physionet Challenge 2016}\label{PriorWorkTable}
\scriptsize
%\centering
\rowcolors{1}{gray!15}{white}
\doublespacing
\begin{tabulary}{\linewidth}{LLLLLL}
\dtoprule
Author & Pre-processing/segmentation & Features & Classification Method & Dataset & Reported Accuracy \\ \midrule
Maglogiannis et.\ al & Wavelet decomposition, Shannon energy peak picking & Features derived from wavelet decomposition and PCG segmentations & SVM & 198 recordings, 38 normal, 41 AS systolic murmur, 43 MR systolic murmur, 38 AR diastolic murmur, 38 MS diastolic murmur & $91.43\%\;Ac$ \\
Ari et.\ al & Amplitude envelope peak picking~\parencite{Ari2007} & Wavelet based features & LSSVM & 64 patients, 64 recordings, 512 cycles & $88.750-100\%\;Ac$ (dependant on abnormality type) \\
Quiceno-Manrique et.\ al & Downsampled to 4KHz, Normalised to maximum of signal, ECG assisted QRS complex detection algorithm used for segmentation & Spectral features derived from STFT, Wavelet decomposition and quadratic energy distributions & $k$-NN & 22 patients, 16 normal, 6 abnormal, 8 recordings (12s) per patient & $98\%\;Ac$ \\
Schmidt et.\ al & Signal filtered into frequency bands, Segmented by HMM based method+hand corrected, removal of high variance sub-segments to remove noise & Parametric spectral features (AR, ARMA and Music), Instantaneous frequency and amplitude, Power in octave bands & QDA & 435 Recordings, 133 patients, 70 normal, 63 abnormal & $73\%\;Ac$ \\
Reed et.\ al & & Wavelet decomposition coefficients, PCA feature reduction & ANN & 5 patients, 4 cycles per patient & $100\%\;Ac$ \\
\dbottomrule\\
% TODO: Add footnote explanation for Ac = Accuracy
% TODO: Add citeyearpar references to authors
\end{tabulary}
\end{table}
\end{landscape}
\restoregeometry
\subsubsection{Physionet challenge entries}
scoring method
@@ -348,31 +412,6 @@ K-NN~\parencite{Bobillo2016}
- Convolutional neural networks, MFCCs~\parencite{Rubin2016}
\newgeometry{margin=1cm} % modify this if you need even more space
\begin{table}[htbp]
\captionof{table}{Summary of research prior to the Physionet Challenge 2016} \label{PriorWorkTable}
\scriptsize
%\centering
\rowcolors{1}{gray!15}{white}
\doublespacing
\begin{tabulary}{\linewidth}{LLLLL}
\dtoprule
Author & Method & Datasets & \mbox{Reported} Results & Notes \\ \bottomrule
Springer et.\ al \citeyearpar{Springer2016} & HSMM, Logistic regression & 10,172s of recordings from 112 patients. 12,181 first and 11,627 second heart sounds. & $95.63\pm0.85\%$ & Supervised algorithm. \\
Huiying et.\ al \citeyearpar{Liang1997b} & Normalised average Shannon energy envelope, peak picking & 37 recordings, 14 pathological murmurs and 23 physiological murmurs. 515 cycles & $91.03\%\;Ac$ & Unsupervised Algorithm. Dataset consists entirely of child recording. Optimized on full dataset \\
Vepa et.\ al \citeyearpar{Vepa2008} & Wavelet decomposition, energy and simplicity measurement & 160 heart cycles collected from a variety of sources (training CDs, web resources) & $84\%\;Ac$ & Unsupervised Algorithm, Optimized on full dataset \\
Sun et.\ al \citeyearpar{Sun2014} & Viola integral envelope extraction, short-time modified Hilbert transform, peak picking & 6949s of recordings, from 121 patients & $97.37\%\;Ac$ & Supervised algorithm. Tolerance for segmentation accuracy not specified \\
Sepehri et.\ al \citeyearpar{Sepehri2010} & Spectral density estimation, auto-regressive parameters, multi-layer perceptron neural network & 120 recording, from 60 patients & $93.6\%\;Ac$ & Supervised algorithm \\
Ricke et.\ al \citeyearpar{Ricke2005} & Shannon energy (and related features), HMM & 9 recordings, from 9 patients & $98\%\;Ac$ & Supervised algorithm \\
Schmidt et.\ al \citeyearpar{Schmidt2015} & DHMM, Auto-correlation duration features, Homomorphic envelogram & 113 recordings, from 113 patients. 8s per recording. 15 abnormal recordings & $98.8\;Se,\;98.6\;P_+$ on test set & All data recorded ``lateral to the sternum in the fourth intercostal space on the left side''. Mix of noisy and clean recordings. 40 recording used for training, 73 for testing \\
Gill et.\ al \citeyearpar{Gill2005} & Homomorphic envelogram, Embedded HMMs & 44 recording, 17 subjects. 30-60s per recording & $98.6\%\;Ac, 96.9\;P_+$ for S1. $98.3\;Ac,\;96.5\;P_+$ for S2 & Recording taken in sub-optimal environments (noisy hospitals, offices etc...) \\
Gupta et.\ al \citeyearpar{Gupta2007} & Homomorphic filtering, $k$-means clustering & 41 patients, 340 heart cycles. 110 normal, 124 systolic murmur, 106 diastolic murmur & $90.29\%\;Ac$ & Unsupervised Algorithm. \\ \hline
\dbottomrule\\
% TODO: Add footnote explanation for Ac = Accuracy
% TODO: Add citeyearpar references to authors
\end{tabulary}
\end{table}
\restoregeometry
\newgeometry{margin=1cm} % modify this if you need even more space
\begin{landscape}