The present disclosure belongs to the field of noninvasive load monitoring (NILM), and relates to a feature extraction method for a current signal, in particular to a current signal segmentation algorithm.
In recent years, due to the continuous development of smart grid technologies, the application of lean and intelligent analysis of electrical signals has become increasingly extensive. For example, NILM technology analyzes integrated voltage and current through feature extraction and machine learning (ML) algorithms and monitors appliance usage without submeters. Many feature extraction methods were proposed to extract wavelet features, voltage-current (V-I) trajectories, current harmonics, active/reactive power and automatically learned deep features for NILM. According to Non-Intrusive Load Monitoring Using Semi-Supervised Machine Learning and Wavelet Design, signal features are extracted by matching a set of wavelets to load types through wavelet designing and the Procrustes analysis. The wavelets used according to different load types need to be selected from a large number of wavelets, which limits the use of the method for unknown electrical appliances. Different V-I trajectory family features are extracted from a V-I trajectory picture to describe the working states of different electrical appliances. The V-I trajectory features include looping direction, enclosed area, nonlinearity of mean curve, number of self-intersections, slope of middle segment and area of right and left segments, etc. However, the noise signal has a great influence on the V-I trajectory, which can easily lead to the mutation of the V-I trajectory features. The current harmonic features such as mode and mode ratio are extracted from high-frequency current harmonics to distinguish different electrical appliances. However, in order to improve the extraction effect, the sampling frequency of the signal is generally required to reach 1 MHZ or above, which brings a huge obstacle to the application of current harmonic features. According to Non-Intrusive Load Monitoring by Novel Neuro-Fuzzy Classification Considering Uncertainties, cluster analysis is performed on the active/reactive power data of various household appliances, and various centers are regarded as the power load features of each appliance for training and identification. However, it is hard to distinguish electrical appliances with similar active/reactive power features. Deep learning (DL) methods require a large amount of labeled data for supervised or semi-supervised learning, but labeled data is often costly and scarce in practical scenarios, which greatly limits the development of this method. Although the power feature is found in many works, for NILM, the current feature is more effective than the power feature.
An objective of the present disclosure is to provide a current signal segmentation algorithm. The present disclosure accurately depicts current signals of different electrical appliances in different states by extracting distinguishing features of the current signals, so as to facilitate subsequent processing.
The objective of the present disclosure is achieved by the following technical solution. A current signal segmentation algorithm, accurately depicting current signals of different electrical appliances in different states by extracting distinguishing features of the current signals, includes the following steps:
1) segmenting a current signal to separate a standby current and an overshoot current, only leaving a working current;
2) extracting a shape distribution feature of a working current segment;
3) extracting a statistical feature of the working current segment;
4) extracting a harmonic feature of the working current segment;
5) calculating a similarity between a segment pair; and
6) deriving a maximum clique set in a similarity graph through a maximum clique search algorithm as a class from automatic segmentation.
In the present disclosure, the segmenting a current signal to separate a standby current and an overshoot current, only leaving a working current includes:
step 1: sampling all vertices:
where, an original signal is denoted as S, which has a length of LS; sampled points are denoted as P; and every Nv points are sampled to obtain a vertex Vk;
Vk=max{Pi}−min{Pj}, where (k−1)*Nv+1≤i, j≤k*Nv, k*N≤Ls;
step 2: starting recording when the vertex obtained by down-sampling satisfies a condition C1: (Vk>Vk−1 and Vk>Vk+1)or(Vk>Vk−1 and Vk<Vk+1);
step 3: merging two neighboring segments that satisfy C2: |
step 4: detecting a standby current and an overshoot current:
determining a current of an i-th segment to be in a standby state when the current of the i-th segment satisfies the following condition:
determining the i-th segment Seci to be an overshoot current segment when the i-th segment satisfies the following condition:
The step of extracting a shape distribution feature of a working current segment includes:
step 1: extracting a value distribution class feature:
where, #( ) denotes a number of points in the set in parentheses, and k denotes a k-th effective segment; and
step 2: extracting a spatial distribution class feature:
where, median(Seck) denotes a median of values of all points in the k-th effective segment Seck.
The step of extracting a statistical feature of the working current segment includes:
step 1: labeling all vertices in a k-th effective segment Seck; and
step 2: calculating a mean, a median and a standard deviation of values of all the vertices.
The step of extracting a harmonic feature of the working current segment includes:
step 1: harmonically transforming all points in a k-th effective segment Seck; and
step 2: using first 16 harmonics;
where, F(i) denotes a value of an i-th harmonic.
The step of calculating a similarity between a segment pair includes:
assigning two segments each as A and B:
step 1: calculating 6 sub similarities between the two segments:
and
step 2: calculating a combined similarity:
The step of deriving a maximum clique set in a similarity graph through a maximum clique search algorithm as a class from automatic segmentation includes:
denoting the similarity graph as G=(V,E), where V denotes a segment set of the current signal, and E denotes a similarity matrix, which is a symmetrical similarity matrix; and denoting a similarity threshold as Sth and a class set as C, ci∈C,vk∈ci, k∈{1,2, . . . |V|};
step 1: initializing Ci=i and ei, i=0;
step 2: when the class set C is updated:
setting a similarity between all vertexes belonging to a current class and the current class to 0;
setting a similarity ei,j between all vertexes belonging to the current class in E to 0;
calculating a minimum similarity min_simii,clique between all the vertexes and the vertexes in the current class;
taking a maximum value max(min_simii,clique) of all min_simii,clique; if max(min_simii,clique)<Sth and max(E)>Sth, then:
creating a new class composed of initial members that two vertices corresponding to max(E);
setting a class label to a smaller value of the two vertices; and
deleting an origin class of the two vertices; or if max(min_simii,clique)≥Sth, then:
joining a vertex with the maximum value into the current class, and deleting an origin class thereof; and
step 3: renumbering the class label, and updating a class number.
The present disclosure classifies a current signal of an electrical appliance, extracts spatial, temporal, and frequency domain features of each segment, calculates a similarity between a segment pair, and derives a maximum clique set in a similarity graph through a maximum clique search algorithm. The present disclosure can quickly and accurately classify current signals generated by different electrical appliances in different working states, so as to facilitate subsequent processing.
A current signal segmentation algorithm accurately depicts current signals of different electrical appliances in different states by extracting distinguishing features of the current signals, and includes the following steps:
1) Segment a current signal to separate a standby current and an overshoot current, only leaving a working current.
2) Extract a shape distribution feature of a working current segment.
3) Extract a statistical feature of the working current segment.
4) Extract a harmonic feature of the working current segment.
5) Calculate a similarity between a segment pair.
6) Derive a maximum clique set in a similarity graph through a maximum clique search algorithm as a class from automatic segmentation.
The step of segmenting a current signal to separate a standby current and an overshoot current, only leaving a working current includes:
Step 1: Sample all vertices:
where, an original signal is denoted as S, which has a length of LS; and every NV points are sampled to obtain a vertex Vk:
Vk=max{Pi}−min{Pj}, where, (k−1)*Nv+1≤i, j≤k*Nv,k*N≤Ls.
Step 2: Start recording when the vertex obtained by down-sampling satisfies a condition C1: (Vk>Vk−1 and Vk>Vk+1)or(Vk>Vk−1 and Vk<Vk+1).
Step 3: Merge two neighboring segments that satisfy C2: |
where,
Step 4: Detect a standby current and an overshoot current:
Determine a current of an i-th segment to be in a standby state when the current of the i-th segment satisfies the following condition:
Determine the i-th segment Seci to be an overshoot current segment when the i-th segment satisfies the following condition:
The step of extracting a shape distribution feature of a working current segment includes:
Step 1: Extract a value distribution class feature:
where, #( ) denotes a number of points in the set in parentheses, and k denotes a k-th effective segment.
Step 2: Extract a spatial distribution class feature:
where, median(Seck) denotes a median of values of all points in the k-th effective segment Seck.
The step of extracting a statistical feature of the working current segment includes:
Step 1: Label all vertices in the k-th effective segment Seck.
Step 2: Calculate a mean, a median and a standard deviation of values of all the vertices. The step of extracting a harmonic feature of the working current segment includes:
Step 1: Harmonically transform all points in a k-th effective segment Seck.
Step 2: Use first 16 harmonics:
where, F(i) denotes a value of an i-th harmonic.
The step of calculating a similarity between a segment pair includes:
Let two segments be A and B:
Step 1: Calculate 6 subsimilarities between the two segments:
Step 2: Calculate a combined similarity:
The step of deriving a maximum clique set in a similarity graph through a maximum clique search algorithm as a class from automatic segmentation includes:
Denote the similarity graph as G=(V,E), where V denotes a segment set of the current signal, and E denotes a similarity matrix, which is a symmetrical similarity matrix; and denote a similarity threshold as Sth and a class set as C, ci∈C,vk∈ci,k∈{1,2, . . . |V|}.
Step 1: Initialize ci=i and ei,i=0;
Step 2: When the class set C is updated:
Set a similarity between all vertexes belonging to a current class and the current class to 0;
Set a similarity ei,j between all vertexes belonging to the current class in E to 0.
Calculate a minimum similarity between all the vertexes and the vertexes in the current class.
Take a maximum value max (min_simii,clique) of all min_simii,clique.
If max(min_simii,clique)<Sth and max(E)>Sth, then:
If max(min_simii,clique)≥Sth, then:
Step 3: Renumber the class label, and updating a class number.
The present disclosure can quickly and accurately classify current signals generated by different electrical appliances in different working states, so as to facilitate subsequent processing.
Number | Date | Country | Kind |
---|---|---|---|
202010417871.9 | May 2020 | CN | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CN2020/098506 | 6/28/2020 | WO |