The present disclosure relates to a method and a device capable of mechanically estimating a factor causing an error as a method for analyzing optical communication using constellation data.
Constellation data enables data transmitted through digital coherent communication to be expressed on a complex plane, and its diagram is a discrete diagram in which an amplitude and a phase are represented by a polar coordinate diagram. The confirmation of a deviation degree from a theoretical value makes it possible to quantitatively and visually confirm quality. The analysis of constellation data makes it possible to timely specify a factor causing the degradation of performance and find countermeasures for solving these problems and improving transmission quality. However, in a conventional constellation analysis, an expert has estimated a factor causing an error and has specified a correction target and taken countermeasures therefor.
In order to cope with this problem, a method using higher-order statistics to recognize a modulation format and a method using principal component analysis (PCA) have been proposed. However, these statistical approaches require the acquisition of a large amount of constellation data. Further, designing a method based on these statistical approaches requires high-level expert knowledge.
Meanwhile, the application of data-driven algorithms, as represented by deep learning, to such classic statistical models has advanced. NPL 1 has proposed a method for estimating a modulation format and an OSNR using a convolutional neural network (CNN). Similarly, methods for detecting the bending states of optical fibers using a CNN have been proposed (see, for example, NPL 2 and NPL 3), and it has been reported that the bending states of optical fibers can be estimated with high accuracy. Deep learning shows extremely high estimation performance but disadvantageously requires a large amount of constellation data for learning. Further, the computing amount required for learning increases.
Constellation data is obtained from a function block inside a digital signal processing circuit (DSP). Each point has a data amount of one byte, and several thousand points must be acquired in order to obtain a constellation map that can be clearly observed visually. When a PCIe interface enabling high-speed communication is mounted, data can be acquired at a relatively high speed. However, when a low-speed interface such as I2C or a serial interface such as MDIO is used, acquisition takes a few minutes or more depending on the desired data amount. As a result, a problem may occur when the DSP is incorporated into a transceiver module. The transceiver module uses only the MDIO described above as an access interface to a device and cannot use a PCIe interface. Therefore, it takes a long time to acquire constellation data. Thus, constellation data acquired by the transceiver module has a poor real-time property.
From the above circumstances, it is desirable that learning or estimation can be performed with a small amount of constellation data. However, deep learning, which is known for its extremely high estimation performance, disadvantageously requires a large amount of constellation data for learning. Further, the computing amount required for learning increases.
The present invention has been made in view of the problem and has an object of proposing a method and a device for estimating the state of a transmission path or an optical transmitter capable of mechanically estimating a factor causing an error with a small amount of constellation data and a low computing amount.
A device for estimating a state of optical communication according to the present disclosure includes:
a data preprocessing unit that reduces the number of data using random sampling with respect to constellation data in which an amplitude and a phase of optical communication data are represented by a polar coordinate diagram and performs distribution calculation and a dimension reduction;
a learning unit that learns a dictionary matrix in sparse coding using learning constellation data processed by the data preprocessing unit; and
a recognition unit that calculates a sparse coefficient using recognition constellation data processed by the data preprocessing unit and the dictionary matrix learned by the learning unit and estimates a factor causing degradation of the optical communication using the calculated sparse coefficient.
A method for estimating a state of optical communication according to the present disclosure includes:
reducing the number of data using random sampling with respect to constellation data in which an amplitude and a phase of optical communication data are represented by a polar coordinate diagram and performing distribution calculation and a dimension reduction by a data preprocessing unit;
learning a dictionary matrix in sparse coding using learning constellation data processed by the data preprocessing unit by a learning unit; and
calculating a sparse coefficient using recognition constellation data processed by the data preprocessing unit and the dictionary matrix learned by the learning unit and estimating a factor causing degradation of the optical communication using the calculated sparse coefficient by a recognition unit.
A program for estimating a state of optical communication according to the present disclosure causes a computer to perform:
a data preprocessing step of reducing the number of data using random sampling with respect to constellation data in which an amplitude and a phase of optical communication data are represented by a polar coordinate diagram and performing distribution calculation and a dimension reduction;
a learning step of learning a dictionary matrix in sparse coding using learning constellation data processed in the data preprocessing step; and
a recognition step of calculating a sparse coefficient using recognition constellation data processed in the data preprocessing step and the dictionary matrix generated in the learning step and estimating a factor causing degradation of the optical communication using the calculated sparse coefficient.
According to the present disclosure, it is possible to specify a factor causing the degradation of quality in optical communication with a small computing amount and with a high speed and high accuracy from a smaller amount of constellation data compared with a conventional technology.
Hereinafter, embodiments of the present disclosure will be described in detail with reference to the drawings. Note that the present disclosure is not limited to the following embodiments. These embodiments are given only for exemplification, and the present disclosure can be carried out in modes in which various modifications and improvements are made on the basis of the knowledge of persons skilled in the art. Note that constituting elements having the same symbols in the present specification and the drawings show the same elements.
Sparse coding has attracted attention as an information processing model that extracts useful information hidden in a large amount of data (see, for example, NPL 4). By considering most elements of a huge amount of data to be sparse and paying attention to the non-zero elements, the essence of the data is extracted. As opposed to deep learning, which is a non-linear model, sparse coding is a linear model and exhibits high performance for applications in which the data to be analyzed has a sparse property. In comparison with deep learning such as a CNN, sparse coding has received attention because 1) learning is possible with a small amount of data, 2) the computing amount is low, and 3) it is an explainable AI. In the present disclosure, a method for estimating the state of a transmission path or an optical transmitter using Label Consistent K-SVD (LC K-SVD) (see, for example, NPL 5), which is a sparse dictionary learning method, will be proposed.
Hereinafter, state estimation using constellation data will be described before the description of specific examples.
[State Estimation Using Constellation Data]
Constellation data enables data transmitted through digital coherent communication to be expressed on a complex plane. The phase and amplitude information of a coherent communication signal can be visually expressed.
Since the constellation data expresses the phase state and the amplitude state of a signal, the state of a transmission path or an optical transmitter can be estimated according to its shape. Here, with respect to the state estimation of an optical transmitter, specific examples of state estimation will be given for three errors: the phase error of a master bias in a modulator, the imbalance state of an I/Q gain, and the imbalance state of an I/Q skew.
(a) Phase Error of Master Bias in Modulator
As shown in
In this case, there is a possibility that the biases of phase modulators 13X and 13Y are deviated in the optical IQ modulation module shown in
(b) Imbalance State of I/Q Gain
As shown in
In this case, there is a possibility that the amplitude of an I or Q driving signal in optical I/Q modulators 12X and 12Y is deviated in the optical IQ modulation module shown in
(c) Imbalance State of I/Q Skew
As shown in
In this case, there is a possibility that a correction value for a signal delay to a synthesizer 14 that synthesizes polarized waves together is deviated in the optical IQ modulation module shown in
As a method for enabling highly accurate estimation even with a small amount of data and a low computing amount, a state estimation method using sparse coding will be proposed. First, a general outline will be described. Then, preprocessing, sparse dictionary learning, and recognition, which are the element technologies, will be described.
1. General Outline
The device according to the present disclosure performs the two steps of “sparse dictionary learning” (S101) and “recognition” (S102). As processing common to these two steps, the data preprocessing units 111 and 121 first perform preprocessing to reduce the number of data and a dimension on constellation data. Next, in step S101 of the sparse dictionary learning, the learning unit 112 performs learning using learning constellation data in which the normality and the error state of optical communication are specified to determine a parameter such as a sparse dictionary. Thus, the device according to the present disclosure acquires a dictionary matrix in which a sparse coefficient corresponding to the constellation data has been learned. Then, in step S102 of the recognition, the recognition unit 122 discriminates the normality or the error state of the optical communication with respect to recognition constellation data using the learned parameter such as the sparse dictionary.
2. Preprocessing
S121: Random Sampling
In the first step, the number of sample data is reduced by random sampling.
S122: Distribution Calculation
The number of constellation data belonging to each coordinate (s, t) (s=1, . . . , S; t=1, . . . , T), that is, a histogram I(s, t), is calculated.
S123: Pooling
A dimension (resolution) is reduced by pooling. In the pooling, the dimension is reduced using sum pooling so as not to collapse the shape of the constellation data. The pooling is performed on a small block basis of scale×scale. The dimension after the pooling is as follows.
[Math. 01]
$\bar{S} \times \bar{T} = \dfrac{S}{\mathrm{scale}} \times \dfrac{T}{\mathrm{scale}}$ (0-1)
In the present disclosure, this dimension is sometimes expressed as S×T.
The number of the constellation data or the histogram
[Math. 03]
$\bar{I}(s, t)$ (0-3)
finally belonging to the respective coordinates
[Math. 02]
$s, t \ (s = 1, \ldots, \bar{S},\ t = 1, \ldots, \bar{T})$ (0-2)
is arranged as a column vector $y_i$, which is used as an observation signal for the sparse dictionary learning and the recognition. Here, i represents the index of the constellation data.
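The three preprocessing steps (S121 to S123) can be sketched as follows. This is only an illustrative sketch in Python with NumPy, not the implementation of the present disclosure: the function name `preprocess`, the assumption that the I and Q components are normalized to the range [-1, 1], and the default values (taken from the evaluation parameters described later) are all assumptions made for the example.

```python
import numpy as np

def preprocess(points, S=140, T=140, scale=7, n_samples=4000, rng=None):
    """Turn complex constellation points into an observation vector y_i.

    Assumption for this sketch: I and Q values lie within [-1, 1].
    """
    if S % scale or T % scale:
        raise ValueError("S and T must be multiples of scale")
    rng = np.random.default_rng(rng)

    # S121: random sampling (without replacement) to reduce the number of data
    n = min(n_samples, points.size)
    sampled = rng.choice(points, size=n, replace=False)

    # S122: distribution calculation, i.e. histogram I(s, t) on an S x T grid
    hist, _, _ = np.histogram2d(
        sampled.real, sampled.imag, bins=[S, T], range=[[-1, 1], [-1, 1]]
    )

    # S123: sum pooling over scale x scale blocks to reduce the dimension
    pooled = hist.reshape(S // scale, scale, T // scale, scale).sum(axis=(1, 3))

    # Arrange the pooled histogram as a vector of length (S/scale) * (T/scale)
    return pooled.reshape(-1)
```

Sum pooling keeps the total count unchanged, so the elements of the returned vector still add up to the number of sampled points that fall inside the grid.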
3. Sparse Dictionary Learning: K-SVD
A set of observation signals $y_i$ is expressed by the following Formula.
[Math. 04]
$Y = \{y_i\}_{i=1}^{N}$ (0-4)
An observation signal $y_i$ is an M-dimensional column vector, where M is the dimension after the pooling and is expressed by the following Formula.
[Math. 05]
$M = \bar{S} \times \bar{T}$ (0-5)
At this time, it is assumed that Y can be expressed by a linear combination of K bases as shown in
[Math. 1]
$Y = DX$ (1)
Here,
[Math. 11]
$D = \{d_1, \ldots, d_K\} \in \mathbb{R}^{M \times K}$ (1-1)
is a dictionary matrix having a base $d_k$ (M-dimensional column vector) as an element, and
[Math. 12]
$X = \{x_i\}_{i=1}^{N}$ (1-2)
is a matrix having a sparse coefficient $x_i$ (K-dimensional column vector) as an element.
Generally, K>M (the number of bases is greater than the dimension of an observation signal) is established, and an overcomplete dictionary matrix is used. Since the uniqueness of X cannot be assured in the expression Y=DX when the number of bases is greater than the dimension of a signal, the bases used to express an observation signal Y are generally limited to a part of D. That is, a constraint is imposed that only a small number T0 of coefficients take a non-zero value and the remaining coefficients take a zero value. A state in which the number of non-zero elements is small with respect to the whole, as described above, is called sparse. An optimization problem having the sparsity constraint is formulated as
[Math. 2]
$\min_{D,X} \|Y - DX\|_F^2 \quad \text{subject to} \quad \forall i,\ \|x_i\|_0 \le T_0$ (2)
that minimizes a reconfiguration error. Here,
[Math. 21]
$\|\cdot\|_0$ (2-1)
expresses an L0 norm (the number of non-zero elements in a vector), and
[Math. 22]
$\|\cdot\|_F$ (2-2)
expresses the Frobenius norm and is defined by the following Formula.
[Math. 23]
$\|A\|_F = \sqrt{\sum_{i,j} A_{ij}^2}$ (2-3)
In dictionary learning, two steps are generally alternately performed to solve the optimization problem of Formula (2). The learning unit 112 shown in
Step S131 is the problem of fixing a dictionary matrix D and calculating a sparse coefficient $x_i$, and can be rewritten as the following Formula.
[Math. 3]
$\min_{x_i} \|y_i - D x_i\|_2^2 \quad \text{subject to} \quad \|x_i\|_0 \le T_0$ (3)
However, it is known that this problem is a combinatorial optimization problem for which an optimum solution cannot be obtained unless all combinations of the bases are tried, and that it is NP-hard. As a solution for this problem, many algorithms have been proposed, such as methods based on a greedy approach and methods in which the problem is solved after the ℓ0 constraint is relaxed to an ℓ1 constraint. As an example, an orthogonal matching pursuit (OMP) method, which is an approximate solution method based on the ℓ0 constraint, is used in the present disclosure.
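The greedy idea behind OMP can be sketched as follows. This is a minimal textbook-style sketch, not the implementation used in the present disclosure; the function name `omp` and the assumption that every base (column of D) has unit L2 norm are made for the example.

```python
import numpy as np

def omp(D, y, T0):
    """Greedy approximation of: min ||y - D x||_2  subject to ||x||_0 <= T0.

    Assumption for this sketch: the columns (bases) of D have unit L2 norm.
    """
    x = np.zeros(D.shape[1])
    support = []                       # indices of the bases selected so far
    coef = np.zeros(0)
    residual = y.astype(float).copy()
    for _ in range(T0):
        # Select the base most correlated with the current residual
        k = int(np.argmax(np.abs(D.T @ residual)))
        if k in support:
            break                      # no new base improves the fit
        support.append(k)
        # Refit all selected coefficients by least squares
        coef, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ coef
        if np.linalg.norm(residual) < 1e-10:
            break                      # y is already represented exactly
    if support:
        x[support] = coef
    return x
```

The least-squares refit at every iteration is what distinguishes OMP from plain matching pursuit: the residual stays orthogonal to all bases selected so far.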
(b) Step S132: Update Dictionary Matrix
In step S132, X calculated in step S131 is fixed, and the dictionary matrix D is updated. The K-SVD can be regarded as a generalization of the k-means method. In the k-means method, each sample corresponds to exactly one cluster. Meanwhile, in the K-SVD, a sample is expressed as a linear combination of cluster centroids (bases in the K-SVD).
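In the K-SVD, a residual from a linear prediction value in which a base $d_k$ is excluded from a set Y of observation signals
[Math. 4]
$E_k = Y - \sum_{j \neq k} d_j x_T^j$ (4)
is subjected to singular value decomposition (SVD) to calculate $d_k$ and $x_T^k$, where $x_T^k$ denotes the k-th row of X. However, since an obtained solution does not necessarily satisfy the sparsity constraint, only the non-zero elements in $x_T^k$ calculated in step S131 are updated in the K-SVD. The residual restricted to the columns corresponding to these non-zero elements is denoted by $E_k^R$.
When SVD is applied to the error $E_k^R$ at this time and the error is decomposed into orthogonal matrices U and V and a diagonal matrix Δ, the following Formula is obtained.
[Math. 5]
$E_k^R = U \Delta V^T = \sum_i \sigma_i u_i v_i^T$ (5)
$u_i$ and $v_i$ are the i-th column vectors of U and V, and $\sigma_i$ is the i-th diagonal component of Δ.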
In the K-SVD, the approximate solution of the column vector of a base and a sparse coefficient is obtained using the components $u_1$ and $\sigma_1 v_1^T$ relating to the first singular value, as shown in the following Formulas.
[Math. 6]
Base: $d_k = u_1$ (6)
[Math. 7]
Sparse coefficient: $x_R^k = \sigma_1 v_1^T$ (7)
By repeatedly calculating a sparse coefficient (S131) and updating a dictionary matrix (S132) as described above, a dictionary matrix D can be generated. By performing sparse dictionary learning using learning constellation data, a dictionary matrix D in which the constellation data of optical communication put in a normal state or the constellation data of the optical communication put in various error states has been learned can be generated.
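The update of a single base in step S132 can be sketched as follows. This is an illustrative sketch of the standard K-SVD update under the description above, not the implementation of the present disclosure; the function name `ksvd_update_atom` and the choice to leave an unused base unchanged are assumptions made for the example.

```python
import numpy as np

def ksvd_update_atom(Y, D, X, k):
    """Update base d_k and its non-zero coefficients in place (step S132).

    Y: M x N observation signals, D: M x K dictionary, X: K x N sparse
    coefficients (float array) obtained in step S131.
    """
    # Signals whose sparse coefficient for base d_k is non-zero
    omega = np.flatnonzero(X[k, :])
    if omega.size == 0:
        return  # assumption of this sketch: an unused base is left unchanged

    # Residual with the contribution of d_k removed, restricted to omega
    X[k, omega] = 0.0
    E_R = Y[:, omega] - D @ X[:, omega]

    # The best rank-1 approximation of the residual gives the new base and
    # the new coefficients (Formulas (6) and (7))
    U, s, Vt = np.linalg.svd(E_R, full_matrices=False)
    D[:, k] = U[:, 0]
    X[k, omega] = s[0] * Vt[0, :]
```

Because only the entries indexed by `omega` are rewritten, the sparsity pattern found in step S131 is preserved, and the reconfiguration error cannot increase after the update.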
4. Learning for Class Classification: LC K-SVD
In the K-SVD, sparse representation is calculated so as to minimize a reconfiguration error. Meanwhile, in the LC-KSVD, sparse representation is learned with a cost function set as a weighted sum of
(a) a reconfiguration error (first term),
(b) a recognition sparse code error (second term), and
(c) a recognition error with respect to class classification (third term).
[Math. 8]
$\min_{D,W,A,X} \|Y - DX\|_F^2 + \alpha \|Q - AX\|_F^2 + \beta \|H - WX\|_F^2 \quad \text{subject to} \quad \forall i,\ \|x_i\|_0 \le T_0$ (8)
The first term is the same reconfiguration error as that of the K-SVD.
In the second term,
[Math. 8-1]
$Q = [q_1, \ldots, q_N] \in \mathbb{R}^{K \times N}$ (8-1)
is a recognition sparse code for the class classification of observation signals yi, and the constraint that the observation signals yi classified into the same class share the same base dk is imposed. Here, “A” is a linear transformation matrix for mapping sparse representation X to the space of a recognition sparse code Q.
The third term is a recognition error with respect to class classification. W is a matrix for class classification, and
[Math. 8-2]
$H = [h_1, \ldots, h_N] \in \mathbb{R}^{m \times N}$ (8-2)
is the class label of an input Y.
[Math. 8-3]
$h_i = [0, 0, \ldots, 1, \ldots, 0, 0]^T \in \mathbb{R}^{m}$ (8-3)
is the label vector of a class corresponding to an observation signal yi. In the label vector, 1 expresses a corresponding class, and m expresses the number of classes. Each of α and β is a parameter for adjusting a contribution ratio. For example, when there are three error states, the number of classes including a normal state m becomes 4 in total.
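Note that Formula (8) can be rewritten as the following Formula.
$\min_{D,W,A,X} \left\| \begin{pmatrix} Y \\ \sqrt{\alpha}\,Q \\ \sqrt{\beta}\,H \end{pmatrix} - \begin{pmatrix} D \\ \sqrt{\alpha}\,A \\ \sqrt{\beta}\,W \end{pmatrix} X \right\|_F^2 \quad \text{subject to} \quad \forall i,\ \|x_i\|_0 \le T_0$
This Formula has the same format as that of Formula (2), and a dictionary can be learned by the same algorithm as that of the K-SVD.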
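Why the LC-KSVD cost can be handled by the K-SVD algorithm can be checked numerically. The sketch below assumes that the cost is the weighted sum ‖Y−DX‖² + α‖Q−AX‖² + β‖H−WX‖² (Frobenius norms) and that each base is assigned in advance to one class; the function names `build_targets` and `stack` and the `atom_labels` argument are hypothetical and introduced only for illustration.

```python
import numpy as np

def build_targets(labels, atom_labels, m):
    """Build H (m x N class labels) and Q (K x N recognition sparse codes).

    labels[i] is the class of observation signal y_i; atom_labels[k] is the
    class to which base d_k is assigned (an assumption of this sketch).
    """
    labels = np.asarray(labels)
    atom_labels = np.asarray(atom_labels)
    N = labels.size
    H = np.zeros((m, N))
    H[labels, np.arange(N)] = 1.0  # one-hot label vector h_i per column
    # q_i has a 1 for every base that shares the class of y_i
    Q = (atom_labels[:, None] == labels[None, :]).astype(float)
    return H, Q

def stack(Y, Q, H, D, A, W, alpha, beta):
    """Stack the three terms into one problem of the form ||Y_ext - D_ext X||_F^2."""
    sa, sb = np.sqrt(alpha), np.sqrt(beta)
    Y_ext = np.vstack([Y, sa * Q, sb * H])
    D_ext = np.vstack([D, sa * A, sb * W])
    return Y_ext, D_ext
```

For any X, the squared Frobenius norm of `Y_ext - D_ext @ X` equals the weighted sum of the three terms, which is why the stacked problem has the same format as Formula (2).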
5. Recognition
In step S102 of the recognition, the following Formula is solved using a dictionary D learned by the LC-KSVD with respect to an observation signal $y_i$ shaped from recognition constellation data to calculate a sparse coefficient $x_i$.
$x_i = \arg\min_{x} \|y_i - D x\|_2^2 \quad \text{subject to} \quad \|x\|_0 \le T_0$
For the derivation of the sparse coefficient $x_i$, OMP can be used, for example.
Next, the calculated sparse coefficient xi is projected using a matrix W according to the following Formula.
[Math. 12]
$\hat{h}_i = W x_i$ (12)
Here, $\hat{h}_i$ denotes the projection of the sparse coefficient $x_i$.
On the basis of the estimation value $\hat{h}_i$ after the projection, the class to which the recognition constellation data belongs is discriminated from among the m classes. Specifically, the class corresponding to the element of $\hat{h}_i$ closest to 1 is selected. That is, a recognition can be made as to whether the recognition constellation data is classified into a normal state or any of the three error states. Accordingly, a factor causing the degradation of optical communication can be estimated as described above with reference to
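The class decision of the recognition step can be sketched as follows, assuming that a sparse coefficient has already been calculated (for example, by OMP). The function name `classify` and the ordering of the entries in `CLASS_NAMES` are assumptions made for illustration; the present disclosure does not specify how the classes are ordered.

```python
import numpy as np

# Hypothetical class ordering, chosen only for this sketch
CLASS_NAMES = (
    "normal",
    "phase error of master bias",
    "I/Q gain imbalance",
    "I/Q skew imbalance",
)

def classify(x, W):
    """Project a sparse coefficient x with W and pick the class whose
    element of the projection is closest to 1."""
    h_hat = W @ x                                  # Formula (12)
    label = int(np.argmin(np.abs(h_hat - 1.0)))    # element closest to 1
    return label, h_hat
```

For example, with m = 4 classes, a projection of (0.1, 0.0, 0.9, 0.0) would be assigned to the third class, since 0.9 is the element closest to 1.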
[Evaluation Results]
In order to verify the validity of the proposed method, a simulation was performed on 16 AQAM constellation data.
The dimension of the acquired constellation data is 19600 (S×T=140×140), and the number of data per one constellation data is 8192. A normal state and three error states (the phase error of a master bias in a modulator, the imbalance state of an I/Q gain, and the imbalance state of an I/Q skew) were artificially generated to acquire the constellation data.
Ten sets of constellation data were randomly selected from each of the classes of a normal state and three error states and used as learning constellation data. Further, 200 sets of constellation data were randomly selected from each class as recognition constellation data. Basic parameters were set as follows:
The number of sample data: 4000
Dimension S×T: 20×20=400 (scale=7)
Dictionary size K: 60
Sparse ratio: 20%
where the sparse ratio is defined as
Windows 7 was used as the OS, and MATLAB was used as the software language. With the above settings, the recognition ratio of the proposed method reached 100%. The data amount was about 1/100 of the original data, the learning time (training time) was about 1.6 seconds, and the recognition time was less than 0.1 seconds. As a result, it was confirmed that high recognition performance is achieved with a small amount of data and a low computing amount.
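The figure of about 1/100 is consistent with the parameters given above if it is read as the product of the reduction in the number of sample data (8192 to 4000) and the reduction in dimension (19600 to 400). This reading is an assumption, since the calculation is not stated explicitly:

```latex
\frac{4000}{8192} \times \frac{400}{19600} \approx 0.488 \times 0.0204 \approx 0.00996 \approx \frac{1}{100}
```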
Next, a recognition ratio (recognition accuracy) and a computing amount (computing time) in a case in which data amounts (the number of sample data and a dimension) were made variable were evaluated.
According to a state estimation method using sparse coding relating to the present invention, it is possible to estimate the state of a transmission path or an optical transmitter in optical communication with a small amount of constellation data and a low computing amount.
The present disclosure is applicable to information communication industry.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2019/048509 | 12/11/2019 | WO |