FEW-SHOT ELECTROCARDIOGRAM (ECG) SIGNAL CLASSIFICATION METHOD BASED ON IMPROVED SIAMESE NETWORK

Description

CROSS-REFERENCE TO THE RELATED APPLICATIONS

This application is based upon and claims priority to Chinese Patent Application No. 202311498055.5, filed on Nov. 13, 2023, the entire contents of which are incorporated herein by reference.

TECHNICAL FIELD

The present disclosure relates to the technical field of electrocardiogram (ECG) signal classification, and in particular to a few-shot ECG signal classification method based on an improved Siamese network.

BACKGROUND

In recent years, deep learning (DL)-based algorithm models have achieved unprecedented success in big data (BD) processing in the field of artificial intelligence (AI). However, due to the rarity and large individual differences of certain types of arrhythmias the acquired data is limited, which limits the generalization ability and accuracy of existing models. Few-shot learning is mainly used for neural network classifiers, which only requires a small number of samples for learning and training, and can achieve efficient recognition and classification of electrocardiogram (ECG) signals.

SUMMARY

In order to overcome the above-mentioned shortcomings in the prior art, the present disclosure provides a few-shot electrocardiogram (ECG) signal classification method based on an improved Siamese network, which can improve the classification accuracy.

In order to solve the technical problem, the present disclosure adopts the following technical solution.

The few-shot ECG signal classification method based on an improved Siamese network includes the following steps:

- a) acquiring n original ECG signals to form an original ECG signal set D, D={(x₁, y₁), (x₂, y₂), . . . , (x_i, y_i), . . . , (x_n, y_n)}, where x_idenotes an i-th original ECG signal, and y_idenotes a class label corresponding to the i-th original ECG signal x_i, i∈{1, . . . , n};
- b) preprocessing the original ECG signal set D to remove noise in the original ECG signals, thereby acquiring a clean ECG signal set D′, D′={(x′₁, y₁), (x′₂, y₂), . . . , (x′_i, y_i), . . . , (x′_n, y_n)}, where x′_idenotes an i-th clean ECG signal;
- c) normalizing the i-th clean ECG signal x′_ito acquire a normalized ECG signal x″_i; and performing zero-padding in the end of a sequence of the normalized ECG signal x_i″ if a length of the sequence of the normalized ECG signal x_i″ is less than L_max, such that the length of the sequence of the normalized ECG signal x″ is equal to L_max, thereby acquiring a normalized ECG signal set D″, D″={(x″₁, y″₁), (x″₂, y₂), . . . , (x″_i, y_i), . . . , (x″_n, y_n)}
- d) creating a sample pair set P based on the normalized ECG signal set D″,

$P = {((x_{1}^{″}, x_{2}^{″}), Y^{'}), ((x_{2}^{″}, x_{3}^{″}), Y^{'}), \dots, ((x_{i - 1}^{″}, x_{i}^{″}), Y^{'}), ((x_{i}^{″}, x_{i + 1}^{″}), Y^{'}), \dots, ((x_{n - 2}^{″}, x_{n - 1}^{″}), Y^{'}), ((x_{n - 1}^{″}, x_{n}^{″}), Y^{'})}, where$

$Y^{'} = {\begin{matrix} 1 & y_{i - 1} = y_{i} \\ 0 & y_{i - 1} \neq y_{i} \end{matrix};$

y_i−1denotes a class label corresponding to the (i−1)-th original ECG signal x_i−1; and there are M sample pairs in the sample pair set P,

$M = \frac{n \times (n - 1)}{2};$

- e) constructing a few-shot classification model, and inputting a sample pair ((x″₁, x″_i+1), Y′) from the sample pair set P into the few-shot classification model to acquire a similarity score E_w(x″_i, x″_i+1);
- f) training, by an adaptive moment estimation (Adam) optimizer, the few-shot classification model through a loss function L to acquire an optimized few-shot classification model;
- g) randomly sampling K ECG signals from each of N classes in a Massachusetts Institute of Technology-Beth Israel Hospital (MIT-BIH) dataset to form a support set s_support, s_support={(s₁, a₁), (s₂, a₂), . . . , (s_i, a_i), . . . , (s_NK, a_NK)}, where s_idenotes an i-th ECG signal, and a_idenotes a class label corresponding to the i-th ECG signal s_i, i∈{1, . . . , NK};
- h) randomly sampling Q ECG signals from each of the N classes in the MIT-BIH dataset to form a query set s_query, s_query={(q₁, b₁), (q₂, b₂), . . . , (q_i, b_i), . . . , (q_NQ, b_NQ)}, where q_idenotes an i-th ECG signal, and b; denotes a class label corresponding to the i-th ECG signal q_i, i∈{1, . . . , NQ};
- i) replacing the i-th original ECG signal x_iwith the i-th ECG signal s_i, and repeating the steps b) and c) to acquire an i-th normalized ECG signal s″_i, thereby acquiring a normalized support set s′_support, s″_support={(s″₁, a₁), (s″₂, a₂), . . . , (s″_i, a_i), . . . , (s′_NK, a_NK)}; and replacing the i-th original ECG signal x_iwith the i-th ECG signal q_i, and repeating the steps b) and c) to acquire an i-th normalized ECG signal q″_i, thereby acquiring a normalized query set s″_query, s″_query={(q″₁, b₁), (q″₂, b₂), . . . , (q″_i, b_i), . . . , (q″_NQ, b_NQ)}; and
- j) inputting the i-th normalized ECG signal s″_iand the i-th normalized ECG signal q″_iinto the optimized few-shot classification model to acquire a classification result.

Further, the step a) includes: acquiring the n original ECG signals from a University of California Riverside (UCR) dataset.

Further, the step b) includes: denoising, by a first median filter and a second median filter in sequence, the i-th original ECG signal x_ito acquire the i-th clean ECG signal x′_i.

Preferably, the first median filter has a width of 300 ms, and the second median filter has a width of 600 ms.

Preferably, L_max=187.

Further, the step e) includes:

- e-1) constructing the few-shot classification model, including an embedding module and a metric module;
- e-2) constructing the embedding module of the few-shot classification model, where the embedding module includes a Siamese network formed by a first CMP module and a second CMP module; the first CMP module includes a convolutional layer, a first rectified linear unit (ReLU) activation function layer, a primary capsule layer of a capsule network, a digital capsule layer of the capsule network, a first fully connected layer, a second ReLU activation function layer, and a second fully connected layer; and the second CMP module includes a convolutional layer, a first ReLU activation function layer, a primary capsule layer of a capsule network, a digital capsule layer of the capsule network, a first fully connected layer, a second ReLU activation function layer, and a second fully connected layer;
- e-3) inputting the i-th normalized ECG signal x″₁into the convolutional layer and the first ReLU activation function layer of the first CMP module in sequence to acquire a feature f₁¹; inputting the feature f₁¹into the primary capsule layer of the capsule network in the first CMP module to acquire a vector f₁²; inputting the vector f₁²into the digital capsule layer of the capsule network in the first CMP module to acquire a feature f₁³; inputting the feature f₁³into the first fully connected layer and the second ReLU activation function layer of the first CMP module in sequence to acquire a feature f₁⁴; and inputting the feature f₁⁴into the second fully connected layer of the first CMP module to acquire a feature f(x″_i);
- e-4) inputting the (i+1)-th normalized ECG signal x″_i+1into the convolutional layer and the first ReLU activation function layer of the first CMP module in sequence to acquire a feature f₂¹; inputting the feature f₂¹into the primary capsule layer of the capsule network in the first CMP module to acquire a vector f₂²; inputting the vector f₂²into the digital capsule layer of the capsule network in the first CMP module to acquire a feature f₂³; inputting the feature f₂³into the first fully connected layer and the second ReLU activation function layer of the first CMP module in sequence to acquire a feature f₂⁴; and inputting the feature f₂⁴into the second fully connected layer of the first CMP module to acquire a feature f(x″₁₊₁); and
- e-5) inputting the feature f(x″_i) and the feature f(x″_i+1) into the metric module of the few-shot classification model, and calculating the similarity score E_w(x″₁, x″_i+1) by E_w(x″_i, x″_i+1)=∥f(x″_i)−f(x″_i+1)∥, where ∥●∥ denotes a Euclidean distance (ED) calculation.

Preferably, in the step e-2), the convolutional layer of the first CMP module includes a 3×3 convolution kernel, and the convolutional layer of the second CMP module includes a 3×3 convolution kernel.

Further, the step f) includes: calculating the loss function

$L by L = L_{1} + α L_{2},$

$where L_{1} = Y^{'} \frac{1}{2} {(E_{w} (x_{i}^{″}, x_{i + 1}^{″}))}^{2} + (1 - Y^{'}) {\max (0, m - E_{w} (x_{i}^{″}, x_{i + 1}^{″}))}^{2};$

m denotes a hyperparameter, α denotes a hyperparameter; and L₂denotes a cross entropy loss function.

Further, the step j) includes:

- j-1) inputting the i-th normalized ECG signal s″_iof a u-th class into the convolutional layer and the first ReLU activation function layer of the first CMP module in sequence to acquire a feature f₃¹, u∈{1, . . . , N}; inputting the feature f₃¹into the primary capsule layer of the capsule network in the first CMP module to acquire a vector f₃²; inputting the vector f₃²into the primary capsule layer of the capsule network in the first CMP module to acquire a feature f₃³; inputting the feature f₃³into the first fully connected layer and the second ReLU activation function layer of the first CMP module in sequence to acquire a feature f₃⁴; inputting the feature f₃⁴into the second fully connected layer of the first CMP module to acquire a feature f(s″_i)_u; and calculating, by a mean( ) function in Python, an average of all K features f(s″₁)_u, f(s″₂)_u, . . . , f(s″_i)_u, . . . , f(s″_K)_u, of the u-th class to acquire a feature vector μ_u;
- j-2) inputting the i-th normalized ECG signal q″_iinto the convolutional layer and the first ReLU activation function layer of the first CMP module in sequence to acquire a feature f₄¹; inputting the feature f₄¹into the primary capsule layer of the capsule network in the first CMP module to acquire a vector f₄²; inputting the vector f₄²into the primary capsule layer of the capsule network in the first CMP module to acquire a feature f₄³; inputting the feature f₄³into the first fully connected layer and the second ReLU activation function layer of the first CMP module in sequence to acquire a feature f₄⁴; and inputting the feature f₄⁴into the second fully connected layer of the first CMP module to acquire a feature f(q″_i);
- j-3) inputting the feature vector μ_uand the feature f(q″_i) into the metric module of the few-shot classification model, and calculating the similarity score E_w(μ_u, f(q″_i)) by E_w(μ_u, f(q″_i))=∥μ_u−f(q″_i)∥; and
- j-4) calculating a class label ŷ_iof the i-th normalized ECG signal q″_iby {right arrow over (y)}_i=arg max {E_w(μ₁, f(q″_i)), E_w(μ₂, f(q″_i)), . . . , E_w(μ_u, f(q″_i)), . . . , E_w(μ_N, f(q″_i))}, and combining class labels of all NQ normalized ECG signals to form the classification result.

The present disclosure has the following beneficial effects. The present disclosure constructs the CMP module as a sub-network of the Siamese network, and combines the extracted local and global features to better analyze peak information such as position, amplitude, and offset, making the transformed feature vector more robust. In this way, the present disclosure improves the accuracy and stability of few-shot ECG signal classification.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a flowchart of a few-shot ECG signal classification method based on an improved Siamese network according to the present disclosure;

FIG. 2 is a structural diagram of a CMP module according to the present disclosure;

FIG. 3 shows a comparison of average accuracy and K for different models according to the present disclosure;

FIGS. 4A-4B show a comparison of confusion matrices for models in 3-way 10-shot according to the present disclosure; and

FIGS. 5A-5F show a comparison between true and predict labels in 3-way 10-shot.

Table 1 Average accuracy comparison results of models in the present disclosure

Table 2 Average precision, average recall, and average F1 score comparison results of different models in the present disclosure

DETAILED DESCRIPTION OF THE EMBODIMENTS

The present disclosure is further described with reference to FIG. 1 and FIG. 2.

The few-shot ECG signal classification method based on an improved Siamese network includes the following steps:

- a) n original ECG signals are acquired to form original ECG signal set D, D={(x₁, y₁), (x₂, y₂), . . . , (x_i, y_i), . . . , (x_n, y_n)}, where x_idenotes i-th original ECG signal, and y_idenotes a class label corresponding to the i-th original ECG signal x_i, i∈{1, . . . , n}.
- b) The original ECG signal set D is preprocessed to remove noise in the original ECG signals, thereby acquiring clean ECG signal set D′, D′={(x′₁, y₁), (x′₂, y₂), . . . , (x′_i, y_i), . . . , (x′_n, y_n)}, where x′_idenotes i-th clean ECG signal.
- c) The i-th clean ECG signal x′_iis normalized to acquire normalized ECG signal x″_i; and performing zero-padding is performed in the end of a sequence of the normalized ECG signal x″_iif a length of the sequence of the normalized ECG signal x″_iis less than L_max, such that the length of the sequence of the normalized ECG signal x″_iis equal to L_max, thereby acquiring normalized ECG signal set D″, D″={x″₁, y₁), (x″₂, y₂), . . . , (x″_i, y_i), . . . , (x″_n, y_n)}.
- d) Sample pair set P is created based on the normalized ECG signal set D″,

y_i−1denotes a class label corresponding to the (i−1)-th original ECG signal x_1-1; and there are M sample pairs in the sample pair set P,

$M = \frac{n \times (n - 1)}{2};$

- e) A few-shot classification model is constructed, and sample pair (x″_i, x″_1-1),Y″) from the sample pair set P is input into the few-shot classification model to acquire similarity score E_w(x″_i, x″_i+1)
- f) The few-shot classification model is trained by an adaptive moment estimation (Adam) optimizer through loss function L to acquire an optimized few-shot classification model.
- g) K ECG signals are randomly sampled from each of N classes in a Massachusetts Institute of Technology-Beth Israel Hospital (MIT-BIH) dataset to form support set s_support, s_support={(s₁, a₁), (s₂, a₂), . . . , (s_i, a_i), . . . , (s_NK, a_NK)}, where s_idenotes i-th ECG signal, and a_idenotes a class label corresponding to the i-th ECG signal s_i, i∈{1, . . . , NK}.
- h) Q ECG signals are randomly sampled from each of the N classes in the MIT-BIH dataset to form query set s_queryfor the purpose of accurately classifying NQ queries based on given NK samples, s_query={(q₁, b₁), (q₂, b₂), . . . , (q_i, b_i), . . . , (q_NQ, b_NQ)}, where q_idenotes i-th ECG signal, and b; denotes a class label corresponding to the i-th ECG signal q_i, i∈{1, . . . , NQ}.

i) The i-th original ECG signal x_iis replaced with the i-th ECG signal s_i, and the steps b) and c) are repeated to acquire i-th normalized ECG signal s″_i, thereby acquiring normalized support set s″_support, s″_support={(s″₁, a₁), (s″₂, a₂), . . . , (s″_i, a_i), . . . , (s″_NK, a_NK)}. The i-th original ECG signal x_iis replaced with the i-th ECG signal q_i, and the steps b) and c) are repeated to acquire i-th normalized ECG signal q″_i, thereby acquiring normalized query set query s″_query, s″_query={(q″₁, b₁), (q″₂, b₂), . . . , (q″_i, b_i), . . . , (q″_NQ, B_NQ)}.

- j) The i-th normalized ECG signal s″_iand the i-th normalized ECG signal q″_iare input into the optimized few-shot classification model to acquire classification result.

The present disclosure provides a brand new CMP module to establish the Siamese network for few-shot ECG signal classification, which improves classification accuracy.

In an embodiment of the present disclosure, in the step a), the n original ECG signals are acquired from a University of California Riverside (UCR) dataset.

In an embodiment of the present disclosure, in the step b), the i-th original ECG signal x_iis denoised by a first median filter and a second median filter in sequence to acquire the i-th clean ECG signal x′_i. In the embodiment, preferably, the first median filter has a width of 300 ms, and the second median filter has a width of 600 ms.

In an embodiment of the present disclosure, L_max=187.

In an embodiment of the present application, the step e) is as follows.

- e-1) The few-shot classification model is constructed, including an embedding module and a metric module.
- e-2) The embedding module of the few-shot classification model is constructed, where the embedding module includes a Siamese network formed by a first CMP module and a second CMP module; the first CMP module includes a convolutional layer, a first rectified linear unit (ReLU) activation function layer, a primary capsule layer of a capsule network, a digital capsule layer of the capsule network, a first fully connected layer, a second ReLU activation function layer, and a second fully connected layer; and the second CMP module includes a convolutional layer, a first ReLU activation function layer, a primary capsule layer of a capsule network, a digital capsule layer of the capsule network, a first fully connected layer, a second ReLU activation function layer, and a second fully connected layer.
- e-3) The i-th normalized ECG signal x″_iis input into the convolutional layer and the first ReLU activation function layer of the first CMP module in sequence to extract low-level feature of the ECG signal x″_i, thereby acquiring feature f₁¹. The feature f₁¹is input into the primary capsule layer of the capsule network in the first CMP module for a feature-to-vector transformation, thereby acquiring vector f₁². The vector f₁²is input into the digital capsule layer of the capsule network in the first CMP module, and the vector f₁²is subjected to matrix transformation, input weighting, summation, and non-linear transformation to acquire feature f₁³. The feature f₃¹is input into the zero-neuron first fully connected layer and second ReLU activation function layer of the first CMP module in sequence for nonlinear mapping to acquire feature f₁⁴. The feature f₁⁴is input into the second fully connected layer of the first CMP module, and an embedding vector mapped from the first fully connected layer to a 0-dimensional space outputs an embedding vector with a same dimension as an input dimension to acquire feature f(x″_i).
- e-4) The (i+1)-th normalized ECG signal x″_i+1is input into the convolutional layer and the first ReLU activation function layer of the first CMP module in sequence to extract low-level feature of the ECG signal x″_i+1, thereby acquiring feature f₂¹. The feature f₂¹is input into the primary capsule layer of the capsule network in the first CMP module for a feature-to-vector transformation, thereby acquiring vector f₂². The vector f₂²is input into the digital capsule layer of the capsule network in the first CMP module, and the vector f₂²is subjected to matrix transformation, input weighting, summation, and non-linear transformation to acquire feature f₂³. The feature f₂³is input into the zero-neuron first fully connected layer and second ReLU activation function layer of the first CMP module in sequence for nonlinear mapping to acquire feature f₂⁴. The feature f₂⁴is input into the second fully connected layer of the first CMP module, and an embedding vector mapped from the first fully connected layer to a 0-dimensional space outputs an embedding vector with a same dimension as an input dimension to acquire feature f(x″_i+1).
- e-5) The feature f(x″_i) and the feature f(x″_i+1) are input into the metric module of the few-shot classification model, and the similarity score is E_w(x″_i, x″_i+1) calculated by E_w(x″₁, x″_i+1)=∥f(x″_i)−f(x″_i+1)∥, where ∥●∥ denotes a Euclidean distance (ED) calculation.

In the embodiment, in the step e-2), the convolutional layer of the first CMP module includes a 3×3 convolution kernel, and the convolutional layer of the second CMP module includes a 3×3 convolution kernel.

In the step f), the loss function L is calculated by L=L₁+αL₂, where L₁is designed to adjust the loss function of the Siamese network.

$L_{1} = Y^{'} \frac{1}{2} {(E_{w} (x_{i}^{″}, x_{i + 1}^{″}))}^{2} + (1 - Y^{'}) {\max (0, m - E_{w} (x_{i}^{″}, x_{i + 1}^{″}))}^{2},$

where m denotes a hyperparameter; α denotes a hyperparameter; and L₂denotes a cross entropy loss function. Further, α=5, m=5. The total loss L takes into account both sample distance and feature classification.

The step j) is as follows.

- j-1) The i-th normalized ECG signal s″_iof a W-th class is input into the convolutional layer and the first ReLU activation function layer of the first CMP module in sequence to acquire feature f₃¹, u∈{1, . . . , N}. The feature f₃¹is input into the primary capsule layer of the capsule network in the first CMP module to acquire vector f₃². The vector f₃²is input into the primary capsule layer of the capsule network in the first CMP module to acquire feature f₃³. The feature f₃³is input into the first fully connected layer and the second ReLU activation function layer of the first CMP module in sequence to acquire feature f₃⁴. The feature f₃⁴is input into the second fully connected layer of the first CMP module to acquire feature f(s″_i)_u. An average of all K features f(s″₁)_u, f(s″₂)_u, . . . , f(s″_i), . . . , f(s″_K)_uof the W-th class is calculated by a mean( ) function in Python to acquire feature vector μ_u.
- j-2) The i-th normalized ECG signal q″_iis input into the convolutional layer and the first ReLU activation function layer of the first CMP module in sequence to acquire feature f₄¹. The feature f₄¹is input into the primary capsule layer of the capsule network in the first CMP module to acquire vector f₄². The vector f₄²is input into the primary capsule layer of the capsule network in the first CMP module to acquire feature f₄³. The feature f₄³is input into the first fully connected layer and the second ReLU activation function layer of the first CMP module in sequence to acquire feature f₄⁴. The feature f₄⁴is input into the second fully connected layer of the first CMP module to acquire feature f(q″_i).
- j-3) The feature vector μ_uand the feature f(q″_i) are input into the metric module of the few-shot classification model, and the similarity score is E_w(μ_u, f(q″_i)) is calculated by E_w(μ_u, f(q″_i))=∥μ_u−f(q″_i)∥.
- j-4) Class label ŷ_iof the i-th normalized ECG signal q″_iis calculated by {right arrow over (y)}_i=arg max {E_w(μ₁, f(q″_i)), E_w(μ₂, f(q″_i)), . . . , E_w(μ_u, f(q″_i)), . . . , E_w(μ_N, f(q″_i))}, and class labels of all NQ normalized ECG signals are combined in to the classification result.

Taking the publicly available MIT-BIH dataset as an example, the implementation of the present disclosure is explained in detail below.

The model proposed by the present disclosure is compared with mainstream classification task models (ED, dynamic time warping (DTW), long short-term memory-fully connected network (LSTM-FCN)) and a Siamese convolutional neural network (SCNN) model, and the final accuracy is the average of 20 tasks. Accuracy, precision, recall, and F1 score are used as evaluation indicators.

The training is performed based on UCR ECG200 and ECG5000 datasets, the validation is performed based on UCR TwoLeadECG and ECGFiveDays datasets, and the model testing is performed based on the MIT-BIH dataset. FIG. 3 shows a comparison of the relationship between the average accuracy and K for different models. It can be seen from the figure that as K increases, ED almost monotonically increases, and the precision, recall, and F1 score also increase. DTW does not follow such a smooth behavior and offers poorer performance than ED at a smaller K value. However, DTW outperforms ED at a value close to 50 and may perform better at a larger value. Unlike ED and DTW, FCN-LSTM exhibits an extremely irregular behavior during training, with a significant fluctuation in accuracy in certain areas, which can be attributed to the randomness of neural network optimization and the lack of labeled data for training. The comparison between the model of the present disclosure and the SCNN model shows that the accuracy does not increase sharply from K=1 to K=50, but tends to stabilize around 0.93, and the recall, precision, and F1 score also tend to stabilize around 0.93.

FIGS. 4A-4B show a confusion matrix of the CMP model in 3-way 10-shot on the MIT-BIH dataset. It can be seen from the figure that the model of the present disclosure has better comprehensive performance and lower misdiagnosis rate during the evaluation process. FIGS. 5A-5F show changes in true and predict labels of 6 randomly selected signals during 3-way 10-shot (N, S and V are represented by 0, 1 and 2, respectively). Table 1 shows comparison results of accuracy acquired by different models under different K values on the MIT-BIH dataset, while Table 2 shows comparison results of average precision, average recall, and average F1 score of different models on the MIT-BIH dataset. In summary, from the perspective of model performance, the model of the present disclosure can effectively distinguish between acceptable and unacceptable ECG signals in practical environments.

Finally, it should be noted that the above descriptions are only preferred embodiments of the present disclosure, and are not intended to limit the present disclosure. Although the present disclosure has been described in detail with reference to the foregoing embodiments, those skilled in the art may still modify the technical solutions described in the foregoing embodiments, or equivalently substitute some technical features thereof. Any modification, equivalent substitution, improvement, etc. within the spirit and principles of the present disclosure shall fall within the scope of protection of the present disclosure.

Claims

1. A few-shot electrocardiogram (ECG) signal classification method based on an improved Siamese network, comprising the following steps: a) acquiring n original ECG signals to form an original ECG signal set D, D={(x1, y1), (x2, y2), . . . , (xi, yi), . . . , (xn, yn)}, wherein xi denotes an i-th original ECG signal, and yi denotes a class label corresponding to the i-th original ECG signal xi, i∈{1, . . . , n};b) preprocessing the original ECG signal set D to remove noise in the n original ECG signals, thereby acquiring a clean ECG signal set D′, D′={(x′1, y1), (x′2, y2), . . . , (x′i, yi), . . . , (x′n, yn)}, wherein x′i denotes an i-th clean ECG signal;c) normalizing the i-th clean ECG signal x′i to acquire a normalized ECG signal x″i; and performing zero-padding in an end of a sequence of the normalized ECG signal x″i if a length of the sequence of the normalized ECG signal x″i is less than Lmax, wherein the length of the sequence of the normalized ECG signal x″i is equal to Lmax, and a normalized ECG signal set D″ is acquired, D″={(x″1, y1), (x″2, y2), . . . , (x″i, yi), . . . , (x″i, yi)};d) creating a sample pair set P based on the normalized ECG signal set D″,
2. The few-shot ECG signal classification method based on the improved Siamese network according to claim 1, wherein the step a) comprises: acquiring the n original ECG signals from a University of California Riverside (UCR) dataset.
3. The few-shot ECG signal classification method based on the improved Siamese network according to claim 1, wherein the step b) comprises: denoising, by a first median filter and a second median filter in sequence, the i-th original ECG signal xi to acquire the i-th clean ECG signal x′i.
4. The few-shot ECG signal classification method based on the improved Siamese network according to claim 3, wherein the first median filter has a width of 300 ms, and the second median filter has a width of 600 ms.
5. The few-shot ECG signal classification method based on the improved Siamese network according to claim 1, wherein Lmax=187.
6. The few-shot ECG signal classification method based on the improved Siamese network according to claim 1, wherein the step e) comprises: e-1) constructing the few-shot classification model, comprising an embedding module and a metric module;e-2) constructing the embedding module of the few-shot classification model, wherein the embedding module comprises a Siamese network formed by a first CMP module and a second CMP module; the first CMP module comprises a convolutional layer, a first rectified linear unit (ReLU) activation function layer, a primary capsule layer of a capsule network, a digital capsule layer of the capsule network, a first fully connected layer, a second ReLU activation function layer, and a second fully connected layer; and the second CMP module comprises a convolutional layer, a first ReLU activation function layer, a primary capsule layer of a capsule network, a digital capsule layer of the capsule network, a first fully connected layer, a second ReLU activation function layer, and a second fully connected layer;e-3) inputting the i-th normalized ECG signal x″i into the convolutional layer and the first ReLU activation function layer of the first CMP module in sequence to acquire a feature f11; inputting the feature f11 into the primary capsule layer of the capsule network in the first CMP module to acquire a vector f12; inputting the vector f12 into the digital capsule layer of the capsule network in the first CMP module to acquire a feature f13; inputting the feature f13 into the first fully connected layer and the second ReLU activation function layer of the first CMP module in sequence to acquire a feature f14; and inputting the feature f14 into the second fully connected layer of the first CMP module to acquire a feature f(x″i);e-4) inputting an (i+1)-th normalized ECG signal x″i+1 into the convolutional layer and the first ReLU activation function layer of the first CMP module in sequence to acquire a feature f21; inputting the feature f21 into the primary capsule layer of the capsule network in the first CMP module to acquire a vector f22; inputting the vector f22 into the digital capsule layer of the capsule network in the first CMP module to acquire a feature inputting the feature f23 into the first fully connected layer and the second ReLU activation function layer of the first CMP module in sequence to acquire a feature f24; and inputting the feature f24 into the second fully connected layer of the first CMP module to acquire a feature f(x″i+1); ande-5) inputting the feature f(x″i) and the feature f(x″i+1) into the metric module of the few-shot classification model, and calculating the similarity score Ew(x″1, x″i+1) by Ew(x″i, x″i+1)=∥f(x″i)−f(x″i+1)∥, wherein ∥●∥ denotes a Euclidean distance (ED) calculation.
7. The few-shot ECG signal classification method based on the improved Siamese network according to claim 6, wherein in the step e-2), the convolutional layer of the first CMP module comprises a 3×3 convolution kernel, and the convolutional layer of the second CMP module comprises a 3×3 convolution kernel.
8. The few-shot ECG signal classification method based on the improved Siamese network according to claim 1, wherein the step f) comprises: calculating the loss function L by, L=L1+αL2, wherein
9. The few-shot ECG signal classification method based on the improved Siamese network according to claim 6, wherein the step j) comprises: j-1) inputting the i-th normalized ECG signal s″i of a u-th class into the convolutional layer and the first ReLU activation function layer of the first CMP module in sequence to acquire a feature f31, u∈{1, . . . , N}; inputting the feature f31 into the primary capsule layer of the capsule network in the first CMP module to acquire a vector f32; inputting the vector f32 into the primary capsule layer of the capsule network in the first CMP module to acquire a feature f33; inputting the feature f33 into the first fully connected layer and the second ReLU activation function layer of the first CMP module in sequence to acquire a feature f34; inputting the feature f34 into the second fully connected layer of the first CMP module to acquire a feature f(s″i)u; and calculating, by a mean( ) function in Python, an average of all K features f(s″1)u, f(s″2)u, . . . , f(s″i)u, . . . , f(s″K)u, of the u-th class to acquire a feature vector μu;j-2) inputting the i-th normalized ECG signal q″i into the convolutional layer and the first ReLU activation function layer of the first CMP module in sequence to acquire a feature f41; inputting the feature f41 into the primary capsule layer of the capsule network in the first CMP module to acquire a vector f42; inputting the vector f42 into the primary capsule layer of the capsule network in the first CMP module to acquire a feature f43; inputting the feature f43 into the first fully connected layer and the second ReLU activation function layer of the first CMP module in sequence to acquire a feature f44; and inputting the feature f44 into the second fully connected layer of the first CMP module to acquire a feature f(q″i);j-3) inputting the feature vector μu and the feature f(q″i) into the metric module of the few-shot classification model, and calculating the similarity score Ew(μu, f(q″i)) by Ew(μu, f(q″i))=∥μu−f(q″i)∥; andj-4) calculating a class label ŷi of the i-th normalized ECG signal q″i by {right arrow over (y)}i=arg max {Ew(μ1, f(q″i)), Ew(μ2, f(q″i)), . . . , Ew(μu, f(q″i)), . . . , Ew(μN, f(q″i))}, and combining class labels of all NQ normalized ECG signals to form the classification result.

Priority Claims (1)

Number	Date	Country	Kind
2023114980555	Nov 2023	CN	national

FEW-SHOT ELECTROCARDIOGRAM (ECG) SIGNAL CLASSIFICATION METHOD BASED ON IMPROVED SIAMESE NETWORK

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

International Classifications

Abstract

Description

Claims

Priority Claims (1)