This patent application claims the benefit and priority of Chinese Patent Application No. 202410087933.2 filed with the China National Intellectual Property Administration on Jan. 22, 2024, the disclosure of which is incorporated by reference herein in its entirety as part of the present application.
The present disclosure belongs to the technical field of biometric identification based on electroencephalogram signals, and in particular relates to a cross-period brain fingerprint identification method with paradigm adaptive decoupling and a system thereof.
As a highly identifiable physiological feature, the electroencephalogram signal has great potential in the field of biometric identification. With the continuous development and popularization of consumer wearable devices, users can acquire electroencephalogram data through sensors in the devices to realize contactless identity identification. Compared with traditional biometric technologies (such as face identification and fingerprint identification), electroencephalogram-based identification is more confidential, harder to steal, and revocable and revisable. On the other hand, compared with conventional biological features, electroencephalogram-based identification has obvious advantages in specific scenarios, such as highly confidential and security-critical occasions, or in new technology usage scenarios such as virtual reality (VR) and augmented reality (AR).
However, different methods of collecting electroencephalogram signals lead to different features and manifestations of the signals, which challenges the generalization and identification ability of a model. In addition, electroencephalogram signals are very sensitive to internal and external noise, such as physiological state, psychological state and environmental noise, which makes it difficult to guarantee the stability and reliability of a model across different periods. Therefore, most existing work collects electroencephalogram signals under a single paradigm or within a single period, which is inconsistent with actual application scenarios.
In order to overcome these challenges, the present disclosure provides a cross-period brain fingerprint identification method with paradigm adaptive decoupling. The method can effectively separate identity-related features and paradigm task-related features from highly coupled electroencephalogram information; and further learn domain invariant features with identity identification ability through domain adversarial training. The present disclosure can be adaptive to electroencephalogram signals collected by various paradigms under a cross-period condition, which has a high practical value in practical application scenarios and provides innovative solutions for the fields of biometric identification, identity verification and the like.
According to the shortcomings of the prior art, the present disclosure provides a cross-period brain fingerprint identification method with paradigm adaptive decoupling and a system thereof. First, a feature extractor extracts a feature representation from original electroencephalogram data; thereafter, a decoupling module effectively separates identity-related features and paradigm task-related features from the highly coupled electroencephalogram information; and finally, domain invariant features with identity identification ability are learned through domain adversarial training.
In a first aspect, the present disclosure provides a cross-period brain fingerprint identification method with paradigm adaptive decoupling, including the following steps:
Step 1: collecting electroencephalogram data;
Step 2: preprocessing the collected electroencephalogram data, and dividing the electroencephalogram data into a source domain and a target domain according to the sequence of collecting periods;
Step 3: constructing a feature extraction module with paradigm adaptive decoupling;
Step 4: constructing a decoupling module for decoupling features and respective classifiers, and training and testing the decoupling module and the classifiers;
Step 5: training the network model.
Preferably, in Step 2, preprocessing the electroencephalogram data includes: filtering and down-sampling the electroencephalogram data collected in Step 1, and then fragmenting the electroencephalogram data to obtain a plurality of fragments with a sample length of L.
Preferably, in Step 2, dividing the electroencephalogram data into the source domain and the target domain according to the sequence of collecting periods means that: in a time sequence, the period data collected first is taken as the source domain with identity labels, which is denoted as $S=\{X_S, Y_S\}=\{(x_S^1, y_S^1),\cdots,(x_S^{n_s}, y_S^{n_s})\}$, where $x_S^{i}|_{i=1}^{n_s}\in\mathbb{R}^{d\times L}$ represents an electroencephalogram data fragment in a period of time, $y_S^{i}|_{i=1}^{n_s}\in\{1,\cdots,C\}$ represents the identity label of the $i$-th sample in the source domain, $C$ represents the number of the subjects, and $n_s$ represents the number of samples in the source domain; the period data collected later is taken as the unlabeled target domain of identities to be predicted, which is denoted as $T=\{X_T\}=\{x_T^1,\cdots,x_T^{n_t}\}$, where $x_T^{i}|_{i=1}^{n_t}\in\mathbb{R}^{d\times L}$ represents an electroencephalogram data fragment in a period of time, and $n_t$ represents the number of samples in the target domain.
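The domain split described above can be sketched in a few lines of NumPy. This is an illustrative sketch only; the function names, the sample length, and the single-subject labeling are hypothetical choices, not part of the disclosure:

```python
import numpy as np

def fragment(eeg, L):
    # Cut a (d, T) continuous recording into (n, d, L) non-overlapping fragments.
    d, T = eeg.shape
    n = T // L
    return eeg[:, :n * L].reshape(d, n, L).transpose(1, 0, 2)

def split_by_period(session_early, session_late, subject_id, L):
    # Earlier-period session -> labeled source domain S = {X_S, Y_S};
    # later-period session -> unlabeled target domain T = {X_T}.
    X_s = fragment(session_early, L)
    y_s = np.full(len(X_s), subject_id)   # identity labels exist only for S
    X_t = fragment(session_late, L)       # identities here are to be predicted
    return (X_s, y_s), X_t
```

In practice one session per subject would be fragmented this way, and the per-subject fragment sets stacked to form the full source and target domains.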
Preferably, in Step 3, the kernel size of each type of one-dimensional convolution layer is determined by the proportion coefficient $a_k|_{k=1}^{K}\in\mathbb{R}^{K}$ and the sample length $L$, and the proportion coefficient is artificially set, where $k$ represents the $k$-th type of one-dimensional convolution layers.
In the present disclosure, $a_k=[0.1, 0.2, 0.5]$, that is, there are three types of one-dimensional convolution kernels with different sizes.
The scale of the time convolution kernel of the $k$-th type is denoted as $S_T^k$ and defined as $S_T^k=a_k\cdot L$.
Preferably, in Step 3, the multi-scale one-dimensional convolution layer is specifically expressed as:
wherein $X_i\in\mathbb{R}^{C\times L}\sim\tilde{D}_{S/T},\ i\in[1,\cdots,N]$ represents the input of the multi-scale one-dimensional convolution layer, that is, the preprocessed electroencephalogram data, $N$ is the number of sample fragments of the preprocessed electroencephalogram data, $C$ is the number of channels of the electroencephalogram collecting device, and $L$ is the length of the set electroencephalogram data in the time dimension; $X_{T\_out}^{k}\in\mathbb{R}^{t\times C\times f_k}$ represents the output of the $k$-th type of time convolution; $\mathrm{Conv2D}(X_i, S_T^k)$ represents the convolution operation of $X_i$ using a time convolution kernel with a size of $S_T^k$; $\Phi_{square}(\cdot)$ is a square function; $\mathrm{AP}(\cdot)$ represents the average pooling operation, and $\Phi_{log}(\cdot)$ represents a logarithmic function.
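The square, average-pooling and logarithm chain above can be illustrated concretely with NumPy. This is an assumed sketch, not the disclosed network: the averaging kernels, the pooling width, and the epsilon term are illustrative choices, with only the kernel scales $a_k\cdot L$ taken from the text:

```python
import numpy as np

def log_power_conv(x, kernel, pool):
    # x: (d, L) EEG fragment; kernel: 1-D temporal kernel of size a_k * L.
    conv = np.stack([np.convolve(ch, kernel, mode='valid') for ch in x])
    sq = conv ** 2                                           # Phi_square
    d, T = sq.shape
    n = T // pool
    ap = sq[:, :n * pool].reshape(d, n, pool).mean(axis=2)   # AP, average pooling
    return np.log(ap + 1e-8)                                 # Phi_log (eps for stability)

L, d = 1000, 8
a = [0.1, 0.2, 0.5]                                  # proportion coefficients a_k
kernels = [np.ones(int(ak * L)) / int(ak * L) for ak in a]
x = np.random.randn(d, L)
feats = [log_power_conv(x, k, pool=25) for k in kernels]  # one feature map per scale
```

The log of pooled squared activations approximates band power, which is why this chain is used to learn power features of the dynamic time representation.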
The splicing layer is specifically expressed as:
where $\Gamma(\cdot)$ represents the serial operation along the feature dimension, and $X_{cat}^{i}\in\mathbb{R}^{t\times C\times\Sigma_k f_k}$ represents the spliced output.
The fusion layer uses a 1*1 convolution layer to fuse the features learned by different convolution kernels; the number of convolution kernels in the 1*1 convolution layer is set to t. Preferably, Leaky-ReLU is used as the activation function, and the average pooling layer is used to down-sample the learned representation. After batch standardization, the fused representation from different 1*1 convolution kernels will be flattened and become the attribute of each channel node in the graph representation; therefore, the attention fusion representation is obtained as:
where bn(·) is a batch normalization function,
dropout(·) is a random dropout layer preventing over-fitting,
fuse(·) is a 1*1 convolution function, ΦL-ReLU(·) is a Leaky-ReLU activation function;
AP(·) represents the average pooling operation.
$X_i^{fuse}$ is reshaped as $X_i^{fuse}\in\mathbb{R}^{C\times(t\cdot 0.5\cdot\Sigma_k f_k)}$.
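A minimal NumPy sketch of this fusion step may help: each of the $t$ output maps is a learned linear combination of the concatenated multi-scale maps (the 1*1 convolution), followed by Leaky-ReLU, pooling by 2 (the 0.5 factor), and flattening into per-channel node attributes. The weight shapes and the Leaky-ReLU slope are assumptions:

```python
import numpy as np

def fuse_1x1(x_cat, W):
    # x_cat: (K, C, F) concatenated multi-scale features; W: (t, K) 1*1 weights.
    fused = np.einsum('tk,kcf->tcf', W, x_cat)         # 1*1 convolution = mixing maps
    fused = np.where(fused > 0, fused, 0.01 * fused)   # Leaky-ReLU activation
    t, C, F = fused.shape
    pooled = fused[:, :, :F // 2 * 2].reshape(t, C, F // 2, 2).mean(-1)  # AP by 2
    # Flatten to (C, t * 0.5 * F): one attribute vector per channel node.
    return pooled.transpose(1, 0, 2).reshape(C, -1)
```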
The graph convolution module includes a plurality of graph convolution layers connected in series. Specifically, a message-passing framework is used to realize the graph convolution layers (GNN):
where $A=X_i^{fuse}(X_i^{fuse})^{T}\in\mathbb{R}^{d\times d}$, and $d$ represents the number of channels of the electroencephalogram features;
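One message-passing layer with the adjacency built from feature similarity, $A=X_i^{fuse}(X_i^{fuse})^T$, can be sketched as follows. The row normalization and ReLU nonlinearity are common choices assumed here, not specified in the disclosure:

```python
import numpy as np

def graph_conv_layer(X, W):
    # X: (d, f) channel-node attributes; W: (f, f_out) learnable weights.
    A = X @ X.T                                              # similarity adjacency A = X X^T
    A = A / (np.abs(A).sum(axis=1, keepdims=True) + 1e-8)    # row normalization
    return np.maximum(A @ X @ W, 0.0)                        # aggregate, transform, ReLU
```

Stacking several such layers in series lets each channel node aggregate information from increasingly distant channels.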
The attention embedding module obtains the embedded vector representation of the features through the attention mechanism. Specifically, first, the global mean representation
is calculated, and then the inner product of each channel representation $x_{out}^{i}|_{i=1}^{d}$ with $h_{mean}$ is computed to obtain a similarity representation, which can be regarded as an expression of the importance degree of each channel. Finally, the final embedded vector representation is obtained by weighted summation:
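The attention embedding described above can be sketched as follows; the softmax normalization of the similarity scores is an assumption, as the disclosure only specifies inner-product similarity followed by weighted summation:

```python
import numpy as np

def attention_embed(X_out):
    # X_out: (d, f) per-channel outputs of the graph convolution module.
    h_mean = X_out.mean(axis=0)               # global mean representation
    scores = X_out @ h_mean                   # inner-product similarity per channel
    alpha = np.exp(scores - scores.max())
    alpha /= alpha.sum()                      # softmax weighting (an assumption)
    return alpha @ X_out                      # weighted summation -> embedded vector
```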
Preferably, in the training process, the decouplers use adversarial training to achieve feature decoupling; specifically, the intra-domain specific identity information decoupler, the inter-domain invariant identity information decoupler and the paradigm task information decoupler decouple $h_{out}$ into the domain specific identity feature representation $h_{spec\text{-}id}$, the domain invariant identity feature representation $h_{inv\text{-}id}$ and the paradigm task-related feature representation $h_{task}$, respectively.
Preferably, the intra-domain specific identity feature $h_{spec\text{-}id}$ and the inter-domain invariant identity feature $h_{inv\text{-}id}$ are constrained by the L1 norm.
Thereafter, the identity classifier $C_{id}(\cdot)$ and the paradigm task classifier $C_{task}(\cdot)$ are trained to realize correct classification, and are iteratively optimized through a cross entropy loss, where $(x, y)\sim\tilde{D}_S$ represents a labeled sample drawn from the source domain distribution.
In the training process, the domain classifier C-domain(·) realizes the distribution alignment of the source domain and the target domain in a domain adversarial manner, specifically, in an identity-related feature subspace, the source domain and the target domain are aligned through domain adversarial training:
where $\mathbb{E}_{x\sim\tilde{D}_S}[\log C_{domain}(h_{inv\text{-}id})]$ represents an encouragement that the domain classifier $C_{domain}(\cdot)$ correctly predicts the source domain data, and $\mathbb{E}_{x\sim\tilde{D}_T}[\log(1-C_{domain}(h_{inv\text{-}id}))]$ represents an encouragement that the features decoupled by the inter-domain invariant identity information decoupler deceive $C_{domain}(\cdot)$, so as to obtain the domain invariant identity feature.
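The two expectations above can be computed concretely as a binary adversarial loss. This is a hedged sketch: in practice the decoupler would be trained against this objective (a gradient reversal layer is one common implementation, assumed here rather than stated in the disclosure):

```python
import numpy as np

def domain_adv_loss(p_src, p_tgt):
    # p_src / p_tgt: domain-classifier probabilities that h_inv_id came from
    # the source domain, evaluated on source and target samples respectively.
    eps = 1e-8
    return -(np.log(p_src + eps).mean() + np.log(1.0 - p_tgt + eps).mean())
```

The domain classifier minimizes this loss; the inter-domain invariant identity decoupler maximizes it, so at equilibrium the classifier cannot tell which domain $h_{inv\text{-}id}$ came from.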
Preferably, a better decoupling effect is obtained by updating the network through the mutual information loss. That is, the first mutual information loss $\mathcal{L}_{MI}^{1}(h_{inv\text{-}id}, h_{task})$ between the domain invariant identity feature representation $h_{inv\text{-}id}$ and the paradigm task-related feature $h_{task}$ is calculated through the first mutual information network, and the second mutual information loss $\mathcal{L}_{MI}^{2}(h_{inv\text{-}id}, h_{spec\text{-}id})$ between the domain invariant identity feature representation $h_{inv\text{-}id}$ and the domain specific identity feature representation $h_{spec\text{-}id}$ is calculated through the second mutual information network, which are expressed as:
where $\bar{h}_{task}$ and $\bar{h}_{spec\text{-}id}$ represent samples drawn from the marginal distributions of $h_{task}$ and $h_{spec\text{-}id}$, respectively; $M_{1/2}(\cdot)$ represent the first mutual information network and the second mutual information network, respectively; $\theta$ is a learnable parameter.
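A mutual information network of this kind is commonly trained with a MINE-style (Donsker-Varadhan) lower bound; the sketch below assumes that form, and stands in for the learned statistics network with a fixed linear projection purely for illustration:

```python
import numpy as np

def mi_lower_bound(h_a, h_b, M):
    # Donsker-Varadhan bound: E_joint[M] - log E_marginal[exp(M)]; shuffling
    # h_b across the batch approximates sampling from its marginal distribution.
    joint = M(np.concatenate([h_a, h_b], axis=1))
    shuffled = h_b[np.random.permutation(len(h_b))]
    marginal = M(np.concatenate([h_a, shuffled], axis=1))
    return joint.mean() - np.log(np.exp(marginal).mean() + 1e-8)

# Stand-in statistics network M: a fixed random projection (illustrative only).
rng = np.random.default_rng(0)
w = rng.standard_normal(16) * 0.1
mi = mi_lower_bound(rng.standard_normal((32, 8)),
                    rng.standard_normal((32, 8)),
                    lambda z: z @ w)
```

Minimizing this estimate between $h_{inv\text{-}id}$ and the other two representations pushes the decoupled features toward statistical independence.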
In a second aspect, the present disclosure provides a cross-period brain fingerprint identification system, including:
In a third aspect, the present disclosure provides a computer-readable storage medium having a computer program stored thereon, which, when executed in a computer, causes the computer to execute the method.
In a fourth aspect, the present disclosure provides a computing device, including a memory and a processor, wherein executable codes are stored in the memory, and the processor, when executing the executable codes, implements the method.
Through these devices and apparatuses, the present disclosure realizes an efficient and accurate cross-period brain fingerprint identification solution, and brings an innovative technical support for the fields of biometric identification, identity verification and the like.
The present disclosure has the following features and beneficial effects.
In order to more clearly explain the technical scheme in the embodiments of the present disclosure or the prior art, the drawings required in the description of the embodiments or the prior art will be briefly introduced below. Obviously, the drawings described below represent only some embodiments of the present disclosure, and for those skilled in the art, other related drawings can be obtained from these drawings without creative labor.
In the case of no contradiction, the embodiments in the present disclosure and the features in the embodiments can be combined with each other. The specific implementation steps of the cross-period brain fingerprint identification method with paradigm adaptive decoupling of the present disclosure are as follows.
3. A feature extraction module with paradigm adaptive decoupling is constructed: the feature extraction module with paradigm adaptive decoupling mainly consists of a multi-scale convolution module, a graph convolution module and an attention embedding module. The multi-scale convolution module includes more than one one-dimensional convolution layer, which has convolution kernels with different sizes. The multi-scale convolution module further includes a splicing layer and a fusion layer. The graph convolution module consists of a plurality of parallel graph convolution networks. The attention embedding module acts on an output of graph convolution of different levels, and transforms a graph structure into an embedding vector through an attention mechanism.
4. A decoupling module for decoupling features and respective classifiers thereof are constructed: the decoupling module consists of a plurality of decouplers and mutual information networks. The decouplers include an intra-domain specific identity information decoupler, an inter-domain invariant identity information decoupler and a paradigm task information decoupler, and decouple the extracted feature representation into a domain specific identity feature representation, a domain invariant identity feature representation and a paradigm task-related feature representation, respectively. The decouplers and the mutual information networks all consist of fully connected layer networks, and the outputs of the decouplers are used as the inputs of the classifiers. The mutual information networks aim at reducing the mutual information between features to promote better decoupling. The classifiers include: a domain classifier, which is configured to perform domain adversarial training to reduce distribution differences between different domains and obtain domain invariant features; an identity classifier, which aims at obtaining accurate classification information; and a paradigm task classifier, which aims at promoting the decoupling effect and reducing the influence of interference information on the identity identification task. Each classifier takes the output of the respective decoupler as an input, and consists of a fully connected layer and a Softmax activation function.
5. The network model is trained: the classifiers are trained by using decoupled domain invariant identity information, and model parameters are constantly updated through an optimization algorithm, so that the model can obtain better identification performance on training data.
Through above specific implementation steps, the present disclosure realizes a cross-period brain fingerprint identification method with paradigm adaptive decoupling, which can effectively improve the accuracy and robustness of brain fingerprint identification in practical application scenarios.
Specifically, in Step 1, electroencephalogram cap leads are connected to the corresponding brain regions of a subject to collect electroencephalogram data.
In Step 2, the electroencephalogram data collected in Step 1 is preprocessed. The original electroencephalogram signal contains noise at various frequencies. In order to remove the power frequency interference resulting from the electroencephalogram collecting device and the electromyogram interference of the subject, the electroencephalogram data is down-sampled to 200 Hz, and the original electroencephalogram data is filtered by a Butterworth filter at 1 to 75 Hz.
The obtained electroencephalogram data is divided into fragments with a predetermined time window size of L. The specific window size of the present disclosure is 5 s, and each fragment is labeled with the label of its subject. The fragments are divided into a source domain and a target domain according to the sequence of collecting periods. The period data collected first is taken as the source domain with identity labels, and the period data collected later is taken as the target domain whose identity labels are to be predicted.
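The preprocessing of Step 2 can be sketched with SciPy using the parameters given in the text (200 Hz sampling, 1 to 75 Hz Butterworth band-pass, 5 s windows); the filter order and the use of zero-phase filtering are assumptions not stated in the disclosure:

```python
import numpy as np
from scipy.signal import butter, filtfilt, resample

def preprocess(eeg, fs_in, fs_out=200, band=(1.0, 75.0), win_s=5):
    # Down-sample to 200 Hz, band-pass filter 1-75 Hz, cut into 5 s fragments.
    n_out = int(eeg.shape[1] * fs_out / fs_in)
    eeg = resample(eeg, n_out, axis=1)                    # down-sampling
    b, a = butter(4, band, btype='bandpass', fs=fs_out)   # Butterworth, 1-75 Hz
    eeg = filtfilt(b, a, eeg, axis=1)                     # zero-phase filtering
    L = win_s * fs_out                                    # 5 s -> 1000 samples
    d, T = eeg.shape
    n = T // L
    return eeg[:, :n * L].reshape(d, n, L).transpose(1, 0, 2)
```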
According to the further setting of the present disclosure, in Step 3, a feature extraction module with paradigm adaptive decoupling is constructed. Specifically, the feature extraction module mainly consists of a multi-scale convolution module, a graph convolution module and an attention embedding module. The multi-scale convolution module includes more than one one-dimensional convolution layer with convolution kernels of different sizes, and further includes a splicing layer, a fusion layer and a transition layer. A size of a one-dimensional convolution kernel is determined according to different proportions of the sample duration $L$, and the proportion coefficient is denoted as $a_k|_{k=1}^{K}\in\mathbb{R}^{K}$, where $k$ represents a convolution kernel of a $k$-th type of time convolution layer. In the present disclosure, $a_k=[0.1, 0.2, 0.5]$, that is, there are three types of one-dimensional convolution kernels with different sizes. The scale of the $k$-th type of the time convolution kernel is denoted as $S_T^k$ and defined as $S_T^k=a_k\cdot L$.
For a given electroencephalogram data $x_i|_{i=1}^{n}\in\mathbb{R}^{d\times L}\sim\tilde{D}_{S/T}$, $n$ is the number of samples of the electroencephalogram data, and $d$ is the number of channels for collecting the electroencephalogram data. First, a plurality of multi-scale one-dimensional time convolution kernels are used at the same time to learn the dynamic time representation of the electroencephalogram data. The representation obtained after the multi-scale time convolution layer is processed by the logarithm of the average pooling of the square, in order to learn the power features of the dynamic time representation of the electroencephalogram signals. The time convolution output of the $k$-th type is denoted as $X_{out}^{(i,k)}\in\mathbb{R}^{t\times d\times f_k}$:
where Conv2D(·) represents the convolution operation, Φsquare(·) is a square function;
AP(·) represents the average pooling operation, and Φlog(·) represents a logarithmic function.
The splicing layer connects the time convolution kernel outputs of all levels in series in the feature dimension. Therefore, for the input $x_i|_{i=1}^{n}\in\mathbb{R}^{d\times L}$, the output is $X_{cat}^{i}\in\mathbb{R}^{t\times d\times\Sigma_k f_k}$: where $\Gamma(\cdot)$ represents the serial operation along the feature dimension.
Preferably, the fusion layer uses a 1*1 convolution layer as the attention fusion layer to fuse the features learned by different convolution kernels. The number of convolution kernels in the 1*1 convolution layer is set to t. Preferably, Leaky-ReLU is used as the activation function, and the average pooling layer is used to down-sample the learned representation. After batch standardization, the fused representation from different 1*1 convolution kernels will be flattened and become the attribute of each channel node in the graph representation. Therefore, the attention fusion representation
where bn(·) is a batch normalization function, dropout(·) is a random dropout layer preventing over-fitting, fuse(·) is a 1*1 convolution function, and $\Phi_{L\text{-}ReLU}(\cdot)$ is a Leaky-ReLU activation function. The pooled representation of shape $t\times C\times 0.5\cdot\Sigma_k f_k$ is flattened and reshaped as $X_i^{fuse}\in\mathbb{R}^{C\times(t\cdot 0.5\cdot\Sigma_k f_k)}$.
The graph convolution module is realized by a graph convolution network using a general message-passing framework:
where $A=X_i^{fuse}(X_i^{fuse})^{T}\in\mathbb{R}^{d\times d}$.
The attention embedding module obtains the importance of each channel through a similarity calculation, and then carries out a weighted summation based on this. Specifically, first, the global mean representation is calculated:
Thereafter, the inner product of each channel representation $x_{out}^{i}|_{i=1}^{d}$ with $h_{mean}$ is computed to obtain a similarity representation, which can be regarded as an expression of the importance degree of each channel. Finally, the final embedded vector representation is obtained by weighted summation:
According to the further setting of the present disclosure, in Step 4, a decoupling module for decoupling features and respective classifiers thereof are constructed, and are trained and tested.
Specifically, the adversarial training method is used to realize feature decoupling and classification. First, $h_{out}$ is decoupled into an identity-related feature $h_{id}$ and a task-related feature $h_{task}$ by a decoupler $D(\cdot)$, and then the classifier $C_{id}(\cdot)$ is trained to realize correct classification, which is iteratively optimized through a cross entropy loss.
Thereafter, the classifiers are fixed, and the decouplers are trained so that the two decoupled features deceive each other's classifiers. The present disclosure minimizes a negative entropy to encourage such a deception effect:
The first item indicates deceiving the identity classifier with task features, and the second item indicates deceiving the task classifier with identity features.
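The minimized negative entropy term can be written out concretely; the sketch below is illustrative of the objective only, applied to the softmax output of whichever classifier is being deceived:

```python
import numpy as np

def neg_entropy(p):
    # p: (n, C) softmax outputs of the crossed classifier; minimizing this
    # negative entropy drives predictions toward uniform, i.e. the feature
    # carries no usable information for the wrong classifier.
    eps = 1e-8
    return (p * np.log(p + eps)).sum(axis=1).mean()
```

A uniform prediction gives the smallest possible value, $-\log C$, while a confident one-hot prediction gives roughly zero, so minimizing this loss rewards deception.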
The present disclosure realizes the distribution alignment of the source domain and the target domain through adversarial training, and specifically, the training loss function can be expressed as:
Preferably, the present disclosure further decouples the identity information into intra-domain specific identity information and inter-domain invariant identity information, and reduces the difference of prediction results therebetween through L1 norm constraint, specifically, which can be expressed as:
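The L1 constraint between the two sets of prediction results can be sketched as follows; the function name and the per-sample averaging are illustrative assumptions:

```python
import numpy as np

def l1_consistency(p_spec, p_inv):
    # p_spec / p_inv: (n, C) identity predictions made from the intra-domain
    # specific and inter-domain invariant identity features; the L1 norm
    # penalizes disagreement between the two predictions.
    return np.abs(p_spec - p_inv).sum(axis=1).mean()
```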
Preferably, a better decoupling effect is obtained by updating the network through the mutual information loss. That is, the mutual information loss between the inter-domain invariant identity feature $h_{inv\text{-}id}$ and the paradigm task-related feature $h_{task}$ is calculated through the first mutual information network, and the mutual information loss between the inter-domain invariant identity feature $h_{inv\text{-}id}$ and the intra-domain specific identity feature $h_{spec\text{-}id}$ is calculated through the second mutual information network, which are specifically expressed as:
where $\bar{h}_{task}$ and $\bar{h}_{spec\text{-}id}$ represent samples drawn from the marginal distributions of $h_{task}$ and $h_{spec\text{-}id}$, respectively; $M_{1/2}(\cdot)$ represent the first mutual information network and the second mutual information network, respectively, that is, the fully connected layer network modules; and $\theta$ is a learnable parameter.
According to the further setting of the present disclosure, in Step 5, the network model is trained. In order to optimize the network parameters, the present disclosure uses a back propagation method, and updates the network parameters through iteration until the required standard is reached. The training update mode is as shown in the following table 1.
The training takes as inputs the labeled source domain data $\tilde{D}_S$, the unlabeled target domain data $\tilde{D}_T$, and the feature extraction and decoupling networks.
The cross-period brain fingerprint identification method with paradigm adaptive decoupling according to the present disclosure is compared, under the domain adaptive framework, with classic and recently proposed methods in the field of brain-computer interfaces on two cross-period data sets: SEED-V and an electroencephalogram data set collected based on an RSVP protocol. The obtained identification accuracies (%) are shown in the following table:
The embodiments of the present disclosure have been described in detail above with reference to the drawing, but the present disclosure is not limited to the described embodiments. It will be obvious to those skilled in the art that many changes, modifications, substitutions and variations can be made to these embodiments, including components, without departing from the principle and spirit of the present disclosure, which still fall within the scope of protection of the present disclosure.
Number | Date | Country | Kind |
---|---|---|---|
202410087933.2 | Jan 2024 | CN | national |