The present subject matter described herein, in general, relates to applying signal processing and pre-trained deep learning model to extract features for a forecasting application, and more particularly, it relates to a method and a system for forecasting non-stationary time-series.
Many economic and financial time-series such as asset prices, interest rates, exchange rates, many operational time-series like product component inventories, or sales time-series of fashion items which show occasional spurts are non-stationary in nature. Classical forecasting models like ARMA (Autoregressive Moving Average), and ARIMA (Autoregressive Integrated Moving Average) assume the dynamics to be stationary and linear. More advanced models like ARCH (Autoregressive Conditional Heteroskedastic), GARCH (Generalized Autoregressive Conditional Heteroskedastic) and their variants model time-varying conditional variance of a non-stationary time-series data but cannot fully capture highly irregular phenomena generally observed in many practical time-series. Models from econometrics like Regime Shift non-stationary series build forecasts by segregating the series into different “regimes” or “states” and apply regime-specific forecasting models. However, these regime-shift models are generally hand-crafted, can work when the number of regimes is known a priori and are not fast and flexible enough to accommodate any newly observed regimes.
On the other hand, various machine learning techniques such as Artificial Neural Networks (ANN) and Support Vector Regression (SVR) try to capture and model non-stationary and non-linear time-series data but often suffer from the problem of overfitting, which makes it hard to discern if the model is capturing noise or some probabilistic properties of the time-series.
For some state-of-the-art technology, reference is made to CN 109767043 A which discloses a power load time-series big data intelligent modelling and prediction method. Electric load time-series big data intelligent modelling and prediction techniques, wavelet decomposition is carried out to electric load time-series historical data, electric load time-series historical data is decomposed into electric load high-frequency time-series historical data and frequency temporal sequence history data, then integrated approach is carried out to all time-series, time-series is clustered, the time-series for clustering classification to every kind is based on Elman neural network load forecasting model, finally the electric load after the decomposition of prediction is reconstructed, complete the intelligent modelling to electric load time-series, to realize that the electric load to different characteristic carries out efficiently, intelligent predicting.
Reference is also made to CN 109711383 A which discloses a convolutional neural network motor imagery electroencephalogram recognition method based on a time-frequency domain. The method comprises the steps: original right-hand man's Mental imagery EEG signals are converted to two-dimensional time-frequency figure using Short Time Fourier Transform by S1; S2 takes one-dimensional convolution mode and carries out feature extraction to a kind of 5 layers of convolutional neural networks structure of obtained two-dimensional time-frequency G-Design in order to avoid mixing for Time And Frequency information; S3 utilizes the entire CNN network of back-propagation algorithm training; Support vector machines is replaced the output layer in CNN using support vector machines as the classifier of entire model by S4. The present invention can guarantee to concentrate in EEG data, right-hand man's Mental imagery EEG signals feature discrimination with higher of extraction, and robustness is good.
Reference is also made to CN 108846261 A which discloses a gene expression time-series data classification method based on visibility graph algorithm. The method comprises the steps of: 1) constructing a basic network, selecting data strips according to the pre-processed gene expression time sequence data, constructing a visual image and a connection image through a visual image algorithm, and determining the basic structure of the co-expression network; 2) extracting relevant traditional characteristics according to the obtained basic network; 3) obtaining the characteristic vector of each gene node in the basic network by utilizing second-order random walk and neural network model learning; 4) and integrating the characteristics of the basic network, and finishing the classification of the gene expression time sequence data by using different strategies based on the obtained characteristics of the basic network through a density clustering algorithm. The invention provides a method for realizing gene expression time sequence data classification by adopting visual graph basic network construction, node feature vector extraction and density clustering algorithm, which has good precision and practicability.
Reference is also made to CN 110553839 A which discloses a gear box single and composite fault diagnosis method, equipment and system, and belongs to the field of mechanical equipment state monitoring and fault diagnosis. The diagnostic method comprises the following steps: (1) acquiring a vibration signal of the gear box; (2) dividing the acquired vibration signal into a plurality of data segments, wherein two adjacent data segments have coincident data, and calculating to obtain a wavelet time-frequency image corresponding to each data segment; (3) dividing the wavelet time-frequency image into a training set and a test set, and normalizing; (4) training a multi-label convolutional neural network by using a training set; (5) testing the trained multi-label convolutional neural network by using the test set; (6) and testing the qualified multi-label convolutional neural network as a fault diagnosis model. The method fully utilizes the excellent feature extraction capability of wavelet transformation, the excellent pattern recognition capability of the multi-label convolutional neural network and the applicability to the composite fault diagnosis problem, and can effectively realize the single and composite fault diagnosis of the gearbox.
Reference is also made to CN 109034277 A which discloses a power quality disturbance classification method and system based on multi-feature fusion. The invention describes a methodology that extracts the time-frequency characteristics, fundamental frequency signal, and signal noise intensity to classify Power Quality Disturbance. The method carries out S-transformation, Fourier transformation, and noise intensity measurement on a normalized sampled signal respectively to obtain time-frequency matrix, fundamental frequency signal and signal noise intensity. Through an integrated analysis of the time-frequency image and the matrix, and the fundamental frequency signal, three critical features are obtained. If the signal to noise ratio exceeds a threshold, a decision tree is applied to the identified three features to generate quality disturbance classification; else a probabilistic neural network is applied for classification.
Reference is also made to CN 106909784 A which discloses a recognition method of epileptic electroencephalograph based on two-dimensional time-frequency image depth convolution. The method comprising the steps of: (a) pre-process raw electroencephalograph signals; (b) extract effective frequency bands of the electroencephalograph signals; (c) build time-frequency diagram of the electroencephalograph signals; (d) train a deep convolution neural network LeNet-5 structure on time-frequency diagrams; (e) extract features of images and carrying out data dimensionality reduction through a fully connected network; (f) finally outputting two-dimensional vectors used to represent classification results; (g) selective optimal channels which are specific to a patient and obtain classification results; (h) utilize a weighted sum method on the outputs of the five optimal channels to identify epilepsy.
Reference is also made to CN 108510113 A which discloses an application of XGBoost to short-term load prediction. The invention designs an information entropy clustering and ATTENTION mechanism-based recurrent neural network short-term load prediction method. The method comprises the steps of: a) analyze features influencing a power load; b) calculate information entropies of all the features for the load by using an xgboost algorithm; c) performing cluster analysis based on the feature information entropies as weights for historical data of a prediction area; d) select a cluster with a shortest prediction day weight distance in the clustering result; e) consider prediction time from long to short and form a time sequence T; f) take the time sequence T as an encoder of an ATTENTION recurrent neural network; g) obtain the prediction result using a decoder.
Reference is made to non-patent literature by S Jayalakshmi and G. N. Sudha, titled Scalogram-based prediction model for respiratory disorders using optimized convolution neural networks, Artificial Intelligence in Medicine, 103, 2020 The paper splits respiratory sound signals using Empirical Mode Decomposition and creates scalograms of each intrinsic mode functions thus generated using a continuous wavelet transform. The scalograms are fed into a pretrained optimized Alexnet Convolutional Neural Network (CNN) architecture to predict to four classes of lung sounds-normal, crackles, wheezes and low-pitched wheezes.
Reference is made to non-patent literature by Y Lukic, C Vogt, O. Durr, and T Stadelmann, titled Speaker identification and clustering using convolution neural networks, IEEE International Workshop on Machine Learning for Signal Processing, Salerno, Italy, 2016. The paper uses spectrograms of speech signals as input to a CNN and study the optimal design of CNN networks for speaker identification and clustering. The work determines optimal convolutional filter dimension for effective speaker identification using different training experiments and then uses one of the post-convolutional layers as feature representation needed for clustering. The work demonstrates that using the output of the high level dense layers instead of the final soft-max layer offers improved clustering performance.
Reference is made to non-patent literature by D Verstracte et al, titled Deep learning enabled fault diagnosis using time-frequency image analysis of rolling element bearing, Shock and Vibration, 2017. This paper designs a custom CNN architecture to detect Normal, Inner race fault, Outer race fault condition in rolling element bearings in electric motors and evaluate the architecture against other architectures and machine learning models and establish the architecture robustness under different time frequency methods, Short-term Fourier Transform, Wavelet Transform, and Hilbert-Huang Transform.
Reference is made to non-patent literature by H Abbasi, A Gunn, L Bennet, and C P Unsworth, titled Deep convolutional neural networks and reverse biorthogonal wavelet scalograms for automatic identification of high-frequency micro-scale spike transients in Post-Hypoxic-Ischemic EEG, Conference of the IEEE Engineering in Medicine and Biology Society (EMBC′20), Montreal, Canada, Jul. 2020. This paper employs reverse biorthogonal wavelet-scalograms of ECoG segments to train a 17-layer deep CNN classifier for the precise identification of high-frequency micro-scale spike transient in post-H1 recordings of preterm fetal sheep.
Reference is made to non-patent literature by Y Zhanga, L Leib, Y Weic, titled Forecasting the Chinese stock market volatility with international market volatilities: The role of regime switching, North American Journal of Economics and Finance, 52, 2020. This paper investigates the role of Markov regime switching applied to the base heterogeneous autoregressive model for realized variance (HAR-RV) in the prediction of the Chinese stock market volatility with relevant indices from international markets incorporated. Regime switching model is empirically demonstrated to outperform the model with time varying parameters in predicting the volatility.
Reference is made to non-patent literature by J L Kirkby, D Nguyen, titled Efficient Asian option pricing under regime switching jump diffusions and stochastic volatility models, North American Journal of Economics and Finance, 52, 2020. The work reduces the problem of pricing Asian options in stochastic volatility models to that of a regime switching jump diffusion model and demonstrate its stability and robustness through numerical experiments.
The drawbacks associated with these conventional/known techniques is that the classical models currently employed tend to fit a single model to cover the entire time-series, thereby implying that they are only capable of handling a known regularity that appears throughout the series—like seasonal, cyclical behaviours. Thus, there is a need to identify regimes that are not known a priori.
The above-described need for forecasting non-stationary time-series is merely intended to provide an overview of some of the shortcomings of conventional systems/mechanism/techniques, and is not intended to be exhaustive. Other problems/shortcomings with conventional systems/mechanism/techniques and corresponding benefits of the various non-limiting embodiments described herein may become further apparent upon review of the following description.
This summary is provided to introduce concepts related to a method and a system for forecasting non-stationary time-series, and the same are further described below in the detailed description. This summary is not intended to identify essential features of the claimed subject matter, nor is it intended for use in determining or limiting the scope of the claimed subject matter.
The objective of the present invention is to provide a hierarchical scheme for forecasting non-stationary time-series based on machine learning methodologies and classical time-series analysis.
In particular, the present invention discloses a method and a system for a two-stage scheme for forecasting in nonstationary time-series that combines diverse methodologies in order to overcome the drawbacks associated with the prior art.
According to first aspect of the invention, there is provided a method for forecasting in a non-stationary time-series. The method comprising: generating a plurality of time-frequency images for a plurality of overlapping time-series segments of a user-specified window size, wherein the time-frequency images are generated by employing a continuous wavelet and obtaining the continuous wavelet transform at different scales; applying a pre-trained deep convolutional neural network trained on a plurality of images to the time-frequency images to output a high dimensional numerical vector for each time-frequency image; obtaining a 2D representation for each output numerical vector by applying the Uniform Manifold Approximation and Projection (UMAP) on the collection of numerical vectors; partitioning automatically the 2D representations into clusters by applying a clustering algorithm and an objective criterion such as Bayesian Information Criterion (BIC) to select the optimal number of clusters; mapping each sample of the time-series to the cluster corresponding to the time-frequency image of the time-series segment that the sample belongs to, for collecting time-series segments from stretches identified within the time-series samples; assembling a cluster-specific Auto-Regressive Moving Average (ARMA) forecast model considering all the segments of the time-series that belong to the cluster; and maintaining all cluster specific ARMA models in a repository.
According to second aspect of the invention, there is provided a system for forecasting in a non-stationary time-series. The system comprising: a generating unit, an applying unit, an obtaining unit, a partitioning unit, a mapping unit, an assembling unit and a maintaining unit. The generating unit is configured for generating a plurality of time-frequency images for a plurality of overlapping time-series segments of a user-specified window size, wherein the time-frequency images are generated by employing a continuous wavelet and obtaining the continuous wavelet transform at different scales. The applying unit is configured for applying a pre-trained deep convolutional neural network trained on a plurality of images to the time-frequency images to output a high dimensional numerical vector for each time-frequency image. The obtaining unit is configured for obtaining 2D representation for each output numerical vector by applying the Uniform Manifold Approximation and Projection (UMAP) on the collection of numerical vectors. The partitioning unit is configured for automatically partitioning the 2D representations into clusters by applying a clustering algorithm and an objective criterion such as Bayesian Information Criterion (BIC) to select the optimal number of clusters. The mapping unit (1610) for mapping each sample of the time-series to the cluster corresponding to the time-frequency image of the time-series segment that the sample belongs to, for collecting time-series segments from stretches identified within the time-series samples. The assembling unit is configured for assembling a cluster-specific Auto-Regressive Moving Average (ARMA) forecast model considering all the segments of the time-series that belong to the cluster. The maintaining unit is configured for maintaining all cluster specific ARMA models in a repository.
Other aspects, advantages, and salient features of the invention will become apparent to those skilled in the art from the following detailed description, which, taken in conjunction with the annexed drawings, discloses exemplary embodiments of the invention.
The detailed description is provided with reference to the accompanying figures. In the figures, the digit(s) of a reference number identifies the figure in which the reference number first appears. The same numbers are used throughout the drawings to refer to similar features and components.
It is to be understood that the attached drawings are for purposes of illustrating the concepts of the invention and may not be to scale.
The following clearly describes the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Apparently, the described embodiments are merely a part rather than all of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative efforts shall fall within the protection scope of the present invention.
The present invention can be implemented in numerous ways, as a process, an apparatus, a system, a composition of matter, a computer readable medium such as a computer readable storage medium or a computer network wherein program instructions are sent over optical or electronic communication links. In this specification, these implementations, or any other form that the invention may take, may be referred to as techniques. In general, the order of the steps of disclosed processes may be altered within the scope of the invention.
A detailed description of one or more embodiments of the invention is provided below along with accompanying figures that illustrate the principles of the invention. The invention is described in connection with such embodiments, but the invention is not limited to any embodiment. The scope of the invention is limited only by the claims and the invention encompasses numerous alternatives, modifications, and equivalents. Numerous specific details are set forth in the following description in order to provide a thorough understanding of the invention. These details are provided for the purpose of example and the invention may be practiced according to the claims without some or all of these specific details. For the purpose of clarity, technical material that is known in the technical fields related to the invention has not been described in detail so that the invention is not unnecessarily obscured.
In the following detailed description, numerous specific details are set forth in order to provide a thorough understanding of the invention. However, it will be understood by those skilled in the art that the present invention may be practiced without these specific details. In other instances, well-known methods, procedures, and components, modules, units and/or circuits have not been described in detail so as not to obscure the invention.
Although embodiments of the invention are not limited in this regard, discussions utilizing terms such as, for example, “processing,” “computing,” “calculating,” “determining,” “establishing”, “analyzing”, “checking”, or the like, may refer to operation(s) and/or process(es) of a computer, a computing platform, a computing system, or other electronic computing device, that manipulates and/or transforms data represented as physical (e.g., electronic) quantities within the computer's registers and/or memories into other data similarly represented as physical quantities within the computer's registers and/or memories or other information in a non-transitory storage medium that may store instructions to perform operations and/or processes.
Although embodiments of the invention are not limited in this regard, the terms “plurality” and “a plurality” as used herein may include, for example, “multiple” or “two or more”. The terms “plurality” or “a plurality” may be used throughout the specification to describe two or more components, devices, elements, units, parameters, or the like. Unless explicitly stated, the method embodiments described herein are not constrained to a particular order or sequence. Additionally, some of the described method embodiments or elements thereof can occur or be performed simultaneously, at the same point in time, or concurrently.
A method and a system for forecasting non-stationary time-series are disclosed. While aspects are described for forecasting in non-stationary time-series, the present invention may be implemented in any number of different computing systems, environments, and/or configurations, the embodiments are described in the context of the following exemplary systems, devices/nodes/apparatus, and methods.
Henceforth, embodiments of the present disclosure are explained with the help of exemplary diagrams and one or more examples. However, such exemplary diagrams and examples are provided for the illustration purpose for better understanding of the present disclosure and should not be construed as limitation on scope of the present disclosure.
The patent CN109767043A performs wavelets-based time-frequency decomposition of power load series to identify low frequency and high frequency segments in the series. Non-overlapping segments at each frequency level are grouped into clusters and group-specific neural networks are developed for forecasting. Predictions from neural networks corresponding to different clusters at each frequency levels are combined together to make the final forecast. The overall procedure involves multiple decomposition and super-positions and is expected to work only on a sufficiently long time series. Further, it is to be ensured that each cluster has a sufficient number of segments for its associated neural network to yield accurate forecasts. The present invention, in contrast builds on time-frequency images instead of numerical coefficients and exploits sophisticated pre-trained deep network architectures developed for images to extract features in order to group the segments into clusters or regimes. Further, the regime-specific forecasting models of the present invention are built on linear models that are easy to explain to the user, in contrast, to a black-box approach based on neural networks. The patent CN106909784 relies on the design of hand-crafted features on power load time-series and on a non-explicable encoder-decoder mechanism to generate forecasts.
The scope of the patent documents and the non-patent literature described in the background portion, are confined to classification-type of problems where each time-segment is bucketed into one of a known number of classes or categories. In the present invention, the number of regimes (which can be viewed as equivalent to classes/categories mentioned in these descriptions) is not known in advance; rather are automatically detected. Moreover, the scope of these existing patent/non-patent literature does not cover forecasting which involves predicting the future behaviour of a time-series. The non-patent literatures describe the classical regime-switching models for forecasting and assume that the underlying time-series has a known number of regimes.
The present invention is primarily intended to address forecasting in a complex form of non-stationarity in time-series that is characterized by regime-switches. However, the methodology and the steps involved herein are applicable to even other milder forms of non-stationarity generally tackled by popular, advanced models like ARIMA, ARCH, GARCH etc. Thus, unlike other existing models which can only forecast the time-series of the type that they are originally designed for, the present invention can be applied to a larger variety of time-series that occur in practice. Further, the model can automatically identify and adapt to any new structural changes that may emerge in the time-series as the series evolves. Hence the invention can help automate the overall process of forecasting and generate reliable forecasts at a higher frequency than is possible through the current models and software.
Some of the problems associated with the conventional techniques are:
The present invention relies on a blend of wavelet transforms and deep learning towards automatic identification of different types of regimes that may exist in a non-stationary time-series. Motivated by the limitations of the existing works, the present invention proposes a two-step framework for non-stationary time-series forecasting, where, in the first step, it uses the wavelet theory approach for capturing both high and low frequency components present in the time-series process during different time intervals. Later, it employs various deep learning models and machine learning algorithms such as RESNET50 (Residual Networks 50), UMAP, and k-means clustering to automatically identify the regime structure present in the time-series. In the second step, it estimates the classical ARMA model in an altered fashion, using all the time-series segments belonging to regime of interest—regime which consists of the most recent time-series segment. In this step, it is by construction of the framework that the same ARMA model will govern the evolution of all the time-series segments in the regime of interest. Accordingly, the scheme to model non-stationary data is different from those of the prior art in a way that it incorporates the regime structure while simultaneously focusing on capturing various localized frequency components present in localized time-series segments. The experiments on a financial time-series data set demonstrate efficacy of the methodology in yielding high forecast accuracy even in the presence of irregular patterns.
Many time-series data, such as stock prices, surface torque in oil-well drilling, sales of fashion items and so on, follow different dynamics in different time periods. A regime implies a characteristic dynamical behaviour of the time-series over a period and a shift suggests an abrupt change from one characteristic behaviour to another. The present invention pertains to forecasting in a complex, more generic form of non-stationary time-series characterized by such regime-switching behaviour. However, the methodology proposed in the present invention is applicable to even simpler stationary time-series.
Input Parameter Specification Module: (Module 1): This module collects all the input parameters required for the forecasting methodology of the present invention. Default values used in the current embodiment are shown in parentheses. This module collects user inputs for following parameters required for execution of all the subsequent modules:
Time-series segmentation module (Module 2): carries out the following steps and outputs a collection of time-series segments.
Wavelet Scalogram Module (Module 3) generates scalogram image from a time-series segment.
where the mother ψ wavelet is assumed to have zero mean, and s is varied over the scale range selected in the parameter setting module.
Numerical Vector Extraction Module (Module 4): extracts a numerical vector from each scalogram images using a pre-trained neural network.
Embedding module (Module 5): Uses UMAP (Uniform Manifold Approximation and Projection) to reduce the dimensionality of a given numerical vector to two dimensions without distorting the global structure of the collection of the numerical vectors.
Clustering and regime mapping module (Module 6):
BIC(m)=L(θ)−½m log(n)
where L(θ) is the loglikelihood function according to a selected model, m is the total number of clusters, and n is the data size. Use the following formula for BIC derived under identical spherical Gaussian distribution assumption of points within clusters:
where m is the number of clusters, ni is the number of points in cluster i, and d is the dimension of the data set. Further,
where ni is the size of ith cluster, xj is jth point in cluster i, and Ci is the center of the cluster Ci.
Model data preparation module (Module 7)
Model estimation module (Module 8): The following model estimation procedure is described for a regime, say k, and is repeated for each regime to build regime-specific forecasting models:
Yt=ƒ(t)+∈t,
approximate ƒ from the T observations (y0, . . . , yt, . . yT) by
where K(. , .) is a kernel and h is the bandwidth that determines the smoothing effect. In this embodiment, the Gaussian kernel is used:
Forecasting module (Module 9):
Forecast evaluation module (Module 10) to evaluate accuracy for forecasts against actuals.
where y and ŷ refer to actuals and forecasts respectively. Evaluated accuracy is used for adjusting the input parameters, if necessary, for improved training and test accuracy.
Any continuous wavelet is used to generate time-frequency images for each time-series segment; and wherein the default wavelet is set to Morlet wavelet. The pre-trained deep convolutional neural network is applied to each time-frequency image to output a high dimensional numerical vector; and wherein the neural network is set to ResNet50.
The step of partitioning automatically (1508) the 2D representation points into clusters further comprises: applying the BIC over a plurality of different clusters; and selecting a plurality of clusters that correspond to the maximal BIC.
The step of assembling (S1512) a cluster-specific forecast model further comprises: collecting all the non-overlapping time-series segments belonging to the cluster; removing trend from each window-size segment to convert each window-size segment of the time-series into a stationary time-series segment; and applying an ARMA model over the collection of stationarized segments of each partition, wherein the model is identified by selecting the number of autoregressive terms, p, and the number of moving average terms, q according to the AIC criterion.
The method further comprises: performing (S1516) forecasts on the time-series at any time t for a future horizon, h, at every new instant.
The step of performing forecasts on the time-series at any time t for a future horizon, h further comprises: considering a window length segment of the time-series backwards from t; generating the time-frequency image corresponding to the window length segment of the time-series by applying the continuous wavelet transform using the user-selected wavelet; passing the generated time-frequency image through the deep convolutional neural network to extract a numerical vector from the said time-frequency image; identifying the matching cluster for the time-frequency image by applying the UMAP; selecting the corresponding cluster specific model in the repository using the index of the identified cluster; and forecasting the h future values using the selected cluster-based model.
The method further comprises: performing (S1518) the method (1500) periodically whenever the time-series collects a fixed number of new points; and storing (S1520) the cluster—specific refined models in the repository.
The step for mapping (S1510) each sample of the time-series to the cluster corresponding to the time-frequency image of the time-series segment that the sample belongs to, further comprises the steps of: identifying different contiguous stretches of the time-series within the time-series samples mapped to the same cluster; dividing each stretch into non-overlapping window-size segments; and discarding the last segment of the stretch and collecting the remaining segments from each stretch.
The assembling unit (1612) is configured for assembling a cluster-specific Auto-Regressive Moving Average (ARMA) forecast model considering all the non-overlapping segments of the time-series that belong to the cluster. The maintaining unit (1614) is configured for maintaining all cluster specific ARMA models in a repository.
The partitioning unit (1608) for automatically partitioning the 2D representation points into clusters further comprises: an applying unit (16081) for applying the BIC over a plurality of different clusters; and a selecting unit (16082) for selecting a plurality of clusters that correspond to the maximal BIC.
The assembling unit (1612) for assembling a cluster-specific forecast model further comprises: a collecting unit to collect all non-overlapping time-series segments of the cluster; a removing unit (16122) for removing trend from each window-size segment of the time-series into a stationary time-series segment; and an applying unit (16123) for applying an ARMA model over the collection of stationarized partitions of the cluster, wherein the model is identified by selecting the number of autoregressive terms, p, and the number of moving average terms, q using the AIC criterion.
The mapping unit (1610) further comprises: a collecting unit (16101) for identifying different contiguous stretches of the time-series within the time-series samples mapped to a cluster; a dividing unit (16102) for dividing each stretch into contiguous non-overlapping window size segments; and a discarding unit (16103) for discarding the last segment of the stretch and collecting the remaining segments from each stretch.
The system also comprises: a performing unit (1616) for performing forecasts on the time-series at any time t for a future horizon, h, at every new instant.
The performing unit (1616) for performing forecasts on the time-series at any time t for a future horizon, h further comprises: a window length segment unit (16161) for considering a window length segment of the time-series backwards from t; a generating unit (16162) for generating the time-frequency image corresponding to the window length segment of the time-series by applying the continuous wavelet transform using the user-selected wavelet; a passing unit (16163) for passing the generated time-frequency image through the deep convolutional neural network to extract a numerical vector from the said time-frequency image; an identification unit (16164) for identifying the matching cluster for the time-frequency image by applying the UMAP; a selecting unit (16165) for selecting the corresponding cluster specific model in the repository using the index of the identified cluster; and a forecasting unit (16166) for forecasting the h future values using the selected cluster-based model.
The present invention highlights integrating sophisticated techniques from multiple domains—wavelet-based time-frequency analysis from signal processing to automatically visualize local behaviours along time and frequency axes, extracting important features from the time-frequency plots using a pre-trained deep convolutional neural network, applying dimensionality reduction to easily discern clusters or groupings in features, and automating the process of grouping the features, and finally, applying classical time-series forecasting models separately for each group. Unlike other existing techniques, no assumptions are made regarding the nature or structure of the time-series to generate forecasts.
The present invention finds its application in the field of engineering, economics, and finance domains, for example, stock forecasts, sales forecasting of fashion items, surface torque prediction in oil and gas well drilling, power consumption forecasts on grids and the like. The technical impact offered by the present invention is that it allows to make forecasts on a complex non-stationary time-series with high degree of accuracy compared with the classical models. The wavelet-based approach blended with deep learning is capable of detecting potential changes in the structure of the time-series such as changes in the cyclical components and abrupt short-length changes with high fidelity. The machine learning scheme used in the invention uses transfer learning from a pre-trained deep convolutional neural network reducing the need for large data sets to train the models.
Some of the non-limiting advantages of the present invention are indicated herein below:
Although implementations for the method and a system for forecasting non-stationary time-series has been described in a language specific to structural features and/or methods, it is to be understood that the appended claims are not necessarily limited to the specific features or methods described. Rather, the specific features and methods are disclosed as examples of implementations of a hierarchical scheme for forecasting non-stationary time-series based on machine learning methodologies and classical time-series analysis.
Number | Date | Country | Kind |
---|---|---|---|
202121045286 | Oct 2021 | IN | national |