The invention provides methods and apparatus for processing of measurement data related to an electrical power grid or other electrical apparatus by using machine learning techniques and providing anomalous event detection from the electrical measurement data.
Stopping climate change motivates implementation of renewable energy sources such as wind and solar with much smaller carbon footprints than non-renewable sources. However, the behaviour of renewable sources may be irregular and can bring challenges for consistent operation in power distribution systems. Utility-scale (>1 MW) solar farm owners may suffer from significant plant failure rates, reduced equipment life, unplanned outages, and replacement overheads. These problems can be countered through better condition monitoring data collection and knowledge discovery to automatically understand issues and predict problems before they occur.
Monitoring tools can provide more granular and higher accuracy data capture together with precise timing information. However, there can be problems in the capability to process the data and detect anomalous behaviour, faults and failure modes.
According to a first aspect, the present invention provides a method of processing high resolution electrical measurement data according to claim 1. According to a second aspect, the present invention provides a system for processing high resolution electrical measurement data according to appended claim 15. Further optional features are provided in the appended dependent claims.
The inventors of the present invention have determined that high resolution data that may have been received from a sensor such as micro-synchrophasor measurement unit (μPMU) data can be collected and analysed alongside power quality measurement data that may have been received from another sensor such as a power quality monitor (PQM), both the μPMU and PQM located proximal to a power generating array such as a photovoltaic cell (solar) farm, and one or more machine learning techniques can be used for processing of the data for remote and automatic anomaly detection. Resolution can determine the precision of a measurement. High resolution data can be measured and will provide high precision. The variation to the resolution can affect the performance and processing (time and computational complexity) but the examples described herein may be adapted to work with different resolution data and the invention has been found to provide particularly effective results at high resolutions where information may be obtained, for example, from a micro-synchrophasor measurement unit.
The micro-synchrophasor measurement unit and power quality monitor may be integrated into a grid data unit that can provide high-resolution, high-precision, time-stamped data. The micro-synchrophasor may be configured to operate in the frequency-domain to process an electrical signal and collect a first set of data points; and the power quality monitor may be configured to operate in the time-domain to process the electrical signal and collect a second set of data points. The grid data unit may be configured to apply the same synchronised timestamp to the collected first and second sets of data points. A time-series database may receive the high resolution data, preferably via a secure telemetry. Time-series databases are better suited to such high volume measurements than relational databases. The database may be a data lake that can contain structured or unstructured data. This more efficient and effective grid data unit apparatus matches the volume (prospectively quadrillions of data-points amounting to petabytes) and rate of data collection with suitable handling and storage capacities. While the grid data hardware and database/data lake could operate independent of the other, together they provide an integrated solution offering superior performance with a reduced amount of equipment required.
The machine learning techniques can include data-driven unsupervised learning of data received from a micro-syncrophasor unit to detect an anomalous electrical event from the solar farm which may be connected to a power distribution system.
The machine learning techniques may use a partitioning based clustering method such as Clustering Large Applications based upon Randomised Search (CLARANS) to process the large amount of data that may be generated by the grid data unit. The clustering can identify clusters of data and distinctly separate data patterns occurring in the data. Abnormalities or outlier events can be identifiable from the clustered data which may be graphically represented and subsequently analysed automatically. Power quality monitor data from the power quality monitor device which may have been collected less frequently than the data from the micro-synchrophasor measurement unit can be used to validate the detected outlier events occurring in the clustered data as the time stamping of the data is consistent between the μPMU data and the PQM. The voltage and current phasor data may then be analyzed further to determine more information relating to the outlier event such as whether there was a fault event such as a voltage dip event in a particular window of time.
Systems and methods are described in detail below, by way of example only, with reference to the accompanying drawings in which:
Renewable power sources such as solar panels that work by absorbing sunlight with photovoltaic cells, generating direct current (DC) energy and then converting it to usable alternating current (AC) energy with the help of inverter technology. AC energy then flows through the electrical busbar and is distributed accordingly. Electrical characteristics or parameters of generated power from the power source and load may be collected and processed.
Referring to
The outlier data can be related to an anomalous event. Validation of the outlier data may be carried out by identifying time and magnitude information of the outlier data from the vector representation and mapping with time series data previously obtained from the grid data unit to precisely identify the outlier data and detect a related anomalous event. The user interface 140 provides a means for user interaction with the various units 110,120,130 and each of the units may be provided with its own user interface or a single user interface may interact with one or more of the units 110,120,130. The user interface may interact with a user, for example, by conveying data to a user such as by displaying visual images, graphs, results, and by receiving user input. The system may be part of a power generation system and network and can provide highly accurate information on the state of the generating equipment and power network. Such information can be used to make operational decisions to maximise utilization of the power generation plant. In some examples, the system is used to optimise the utilization of solar farms, wind turbines, electrical loads, transmission & distribution systems, or energy storage plants, or other electrical facilities.
Referring to
The μPMU 111 operates in the frequency domain and is used to collect voltage and current phasor measurement data for each half cycle at 10 milliseconds reporting periods (100 Hz in the UK). It may be configured to process an electrical signal and collect a first set of data points. It records the measurement of data with a time stamp and the measurement data may be one or more of: three phase voltages, three phase voltage angles, three phase current, three phase current angles, centre frequency offset, c37 frequency, fundamental power, fundamental apparent power, fundamental of reactive power global positioning system, latitude, longitude. The PQM 112 may be configured to operate in the time-domain to process the electrical signal and collect a second set of data points. It may provide a power quality function including an array of high-accuracy measurements according to the IEC 61000-4-30 Ed 3 Class A standard plus supra-harmonics in the 2-150 kHz range. The PQM 112 may have a lower time resolution of data being collected, for example, every 1 minute, and can be configured to send an alert to a user when an anomalous event such as a voltage sag or flicker is detected. The GDU 110 may be configured to apply the same synchronised timestamp to the collected first and second sets of data points. One-day data from the μPMU 111 can include each minute files that consist of six thousand rows representing each 10 milliseconds of data. Therefore, the measurement data will be a high time resolution and can include 8.64 million data points in a single day for each parameter. International patent application No. PCT/GB2019/051413 describes the use of a grid data unit including μPMUs and PQMs for sensing, monitoring and collecting electrical data and the subject matter is incorporated herein.
The amount of solar data collected by the grid data unit and particularly the μPMU 111 can be so large, fast, complex (provides several power parameters related to solar farm) that it may be difficult to process using traditional methods or manual analysis. However, irregular (or, abnormal) electrical characteristics of generated power from a PV brings challenges relating to consistent operation in power distribution systems, significant plant failure, reduction in equipment life, unplanned outages, and increase in the replacement overheads. In addition to the increasing velocities and varieties of electrical parameters, solar and wind farm data stream flows are unpredictable due to the sudden environmental changes (occur often) where magnitudes vary greatly, and can damage associated electrical equipment. Also, it can be difficult to link, match, cleanse and transform solar data across systems to find correlation between events and hierarchies. Hence, it is challenging to understand electrical power (and related parameter) trends and how to manage daily or regularly, seasonal and event-triggered peak data loads for maintaining solar farms, wind turbines, electrical loads, transmission & distribution systems, energy storage plants, or other electrical facilities.
The characteristic of the electrical signal is monitored and data relating to the characteristic is collected at reporting periods. In an example, and as mentioned in relation to the μPMU in
At 202, raw data may be sent to a server that is located remote to the grid data unit for storage. The high resolution data from the μPMU may then be further processed. The processing of data can include: (a) handling the data and its storage; and (b) automatic data processing for electrical trend finding. As a voltage source converter and its behavior is the heart of PV solar farm, the voltage has been considered for identifying the data irregularity (or abnormality) processing the solar data in an example for ease of explanation although it will be appreciated that the processing can be carried out also for other characteristics that are collected.
(a) Handling the Data and its Storage:
At 203, the raw data may be cleaned in that any time stamped data that is missing from the raw data is removed from the data set. This can reduce the size of the data set by removing missing data that would not provide any useful information from subsequent processing. The raw data from the μPMU can be large in volume and unstructured in nature. At 204, lossless data compression can be carried out. In an example a column based storage format is used as oppose to a row based storage format (such as CSV). An example of a column based storage format is the Apache Parquet (an open source file) format which can be used to create a data lake or data warehouse which offers compressed, an efficient columnar solar data representation for further processing. A substantial reduction in data set size can be obtained using Apache Parquet and in one example an 83% reduction in file size may be achieved compared to a row based storage format such as CSV. It will be appreciated that the cleaning and compressing of the raw data can provide advantages but may not be needed for automatic electrical trend finding.
(b) Automatic Data Processing for Electrical Trend Finding
Manual division and annotation of the data is too resource-intensive and difficult if not impossible while looking for anomalies (or irregular data behavior). Thus, the goal specifies here to separate two groups: regular/normal electrical trend, and irregular electrical trends. The best practice to achieve this goal is to create and map statistical algorithms like, clustering. By keeping in mind the fast processing and decision making, a clustering approach can be used based on CLARANS (Clustering Large Applications based on RANdomized Search). Compared to other clustering approaches, it has been found that the randomized search and the randomized selection of samples from the input data that is a property of CLARANS provides an effective and efficient technique where there is a large amount of data such as from a phasor measurement unit (PMU).
At 205, dataset transformation or conversion can be executed and the selection of the extent of the transformation will then affect the CLARANS clustering as this data will be analysed using CLARANS. An example of the transformation is shown in
At 206, a clustering technique such as CLARANS can be carried out on the transformed dataset. Referring to
An example of the application of the CLARANS clustering technique is described in more detail below in relation to the data set shown in
=(0.001%×86400)=|86.4|=86
Processing:
With the clustering technique, the high resolution electrical measurement data can be separated into clustered data that include a first cluster representing a first electrical trend and a second cluster representing an another electrical trend. In other examples, there may be more clusters to segregate the high resolution electrical measurement data.
Referring back to
Validation may then occur given that the electrical signal received by the μPMU during a day was also collected by a PQM. The detected outlier events from the clustered data can be validated by mapping or correlating the PQM data which would have generated an alert when an anomalous event had been detected. The alert may include graphical representation of the PQM data at a high resolution showing the anomalous event.
The clustering approach is now described according to an example experiment. In the example, the clustering approach has been experimented on three-phase voltage phasor data for 10 consecutive days (1May-10May 2020) to categorize its functional behavior and detect anomalies on the power distribution system. Each day, 8.64 million voltage phasor data points are gathered per phase. The results have been shown in
Number | Date | Country | Kind |
---|---|---|---|
2016025.5 | Oct 2020 | GB | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/GB2021/052613 | 10/8/2021 | WO |