The present invention relates to a state estimation system, a concealment signal generation device, a state estimation method, and a concealment signal generation program.
In digital coherent communication, communication quality is confirmed by generating constellation data in which transmission data is expressed by polar coordinate diagrams with an amplitude and a phase and analyzing a deviation of the constellation data from a theoretical value. By checking the communication quality, it is possible to quickly identify a cause of a deterioration in the communication quality and take measures for improving the communication quality.
For example, Non Patent Literature 1 discloses an optical communication state estimation method using sparse coding.
Non Patent Literature 1: Takayuki Nakachi, Yitu Wang, Tetsuro Inui, Takafumi Tanaka, Takahiro Yamaguchi, and Katsuhiro Shimano, “Intelligent Monitoring of Optical Fiber Transmission Using Sparse Coding”, Technical report of the institute of Electronics, Information and Communication Engineers (IEICE) Optical Communication System Study Group, vol. 119, no. 229, OCS2019-42, pp. 77-82.
However, analysis of constellation data strongly depends on the experience of experts. In order to estimate a cause of quality degradation of optical communication by a statistical approach or deep learning, it is necessary to acquire a large amount of constellation data, and there is a problem that a calculation amount becomes enormous. However, in the technology disclosed in Non Patent Literature 1 described above, a reduction in the amount of data is not described, and the problem that an amount of calculation is enormous has not been solved.
The present invention has been made in view of the foregoing circumstances, and an objective of the present invention is to provide a state estimation system, a concealment signal generation device, a state estimation method, and a concealment signal generation program capable of estimating a cause of a degradation in quality of optical communication without an increase in a calculation amount.
According to an aspect of the present invention, a state estimation system includes: a concealment signal generation unit configured to acquire learning constellation data and identification constellation data output from a signal processing circuit for optical communication, reduce and conceal the number of pieces of data from each of the pieces of constellation data through random projection, and generate a learning concealment signal and an identification concealment signal based on each piece of constellation data after the reduction and concealment of the number of pieces of data: a sparse dictionary learning unit configured to learn a concealment sparse dictionary using a sparse dictionary learning algorithm based on the learning concealment signal; and an identification unit configured to estimate a state of the optical communication using the concealment sparse dictionary based on the identification concealment signal.
According to another aspect of the present invention, a concealment signal generation device includes: a data acquisition unit configured to acquire learning constellation data and identification constellation data output from a signal processing circuit for optical communication; and a concealment signal generation unit configured to reduce and conceal the number of pieces of data from each of piece of constellation data through random projection and generate a learning concealment signal and an identification concealment signal based on each of constellation data after the reduction and concealment of the number of pieces of data.
According to another aspect of the present invention, a state estimation method includes: a step of acquiring learning constellation data output from a signal processing circuit for optical communication, reducing and concealing the number of pieces of data from the learning constellation data through random projection, and generating a learning concealment signal based on the learning constellation data after the reduction and concealment of the number of pieces of data; a step of learning a concealment sparse dictionary using a sparse dictionary learning algorithm based on the learning concealment signal; a step of acquiring identification constellation data output from a signal processing circuit of the optical communication, reducing and concealing the number of pieces of data from the identification constellation data through the random projection, and generating an identification concealment signal based on the identification constellation data after the reduction and concealment of the number of pieces of data; and a step of estimating a state of the optical communication using the concealment sparse dictionary based on the identification concealment signal.
According to another aspect of the present invention, a concealment signal generation program causes a computer to fun or as the concealment signal generation device.
According to the present invention, it is possible to estimate a cause of a degradation in quality of optical communication without an increase in the amount of computation.
Hereinafter, a state estimation system according to the present embodiment will be described. In the present embodiment, a data amount of constellation data is reduced and concealed through random projection, and a state of optical communication is estimated using the constellation data after the data amount is reduced and concealed. Hereinafter, a “state estimation method using constellation data” and a “state estimation method based on concealment calculation in which random projection is used” will be described.
As the constellation data, data transmitted in digital coherent communication can be expressed on a complex number plane. Phase and amplitude information of a coherent communication signal can be visually expressed by expressing the phase and amplitude information on the complex plane.
For example, in the case of a 16-QAM signal, 16 points (=4 bits) of information can be transmitted with one symbol. The constellation data represents an integrated state for a certain time, and a signal state indicates any one among 16 points at a certain time.
When the constellation data indicates a phase state and an amplitude state of a signal, it is possible to estimate a state of a transmission path and an optical transmitter according to the shape. Here, regarding state estimation of the optical transmitter, a specific example of the state estimation for three errors of a “modulator parent bias phase error”, an “I/Q gain imbalance state”, and an. “I/Q skew imbalance state” will be described. Each of the foregoing errors occurs mainly due to insufficient adjustment of an optical IQ modulation module of an optical communication device. Hereinafter, specific factors of the assumed error occurrence will be described the following (a) to (c).
As described above, the communication quality of the digital coherent communication can be estimated by using the constellation data. In the present embodiment, when random projection is adopted, the constellation data is concealed and a data amount and a calculation load are reduced to estimate a state of optical communication.
Hereinafter, a specific example of a state estimation method based on concealment calculation using random projection will be described. Hereinafter, [1. Overview of State Estimation System], [2. Generation of Concealment Signal Using Random Projection], [3. General Sparse Dictionary Learning and identification], and [4. Concealment Sparse Dictionary Learning and Concealment Identification] will be described.
An overview of a state estimation system that executes state estimation of optical communication using concealment sparse coding will be described.
As illustrated in
Each of the concealment signal generation devices 1 (1-1 to 1-N) includes a data acquisition unit 11 and a concealment signal generation unit 12. The calculation device 2 includes a sparse dictionary learning unit 21 and an identification unit 22.
The data acquisition unit 11 is connected to a digital coherent signal processing circuit 10 (hereinafter referred to as a “DSP 10”).
The DSP 10 processes signals transmitted and received in digital coherent communication. The data acquisition unit 11 acquires constellation data from the DSP 10.
The concealment signal generation unit 12 generates a concealment signal (a learning concealment signal) based on the learning constellation data acquired by the data acquisition unit 11. The concealment signal generation unit 12 also generates a concealment signal (an identification concealment signal) based on the identification constellation data acquired by the data acquisition unit 11. The details of the concealment signal generation unit 12 will be described below with reference to
The sparse dictionary learning unit 21 learns a concealment sparse dictionary using the “label consistent K-SVD algorithm” (hereinafter referred to as an “LC K-SVD algorithm”) of a sparse dictionary learning method.
The identification unit 22 calculates a sparse coefficient (of which the details will be described below) by using the concealment sparse dictionary learned by the sparse dictionary learning unit 21 and an orthogonal matching pursuit (OMP) which is an example of a greedy algorithm.
In the state estimation system 100 according to the present embodiment, the concealment signal generation device 1 and the calculation device 2 execute a step of the concealment sparse dictionary learning and a step of the concealment identification to be described below, calculate the sparse coefficient, and estimate the concealment state of optical communication.
In the step of the concealment sparse dictionary learning, learning is executed based on information regarding whether the learning constellation data is normal or an error, and what the error state is when the learning constellation data is the error, and a parameter such as a sparse dictionary is determined.
In the step of the concealment sparse dictionary learning, the concealment signal generation device 1 acquires the constellation data from the DSP 10 and generates the concealment signal. Thereafter, the generated concealment signal is transmitted to the calculation device 2 via the network 3.
The sparse dictionary learning unit 21 of the calculation device 2 learns the concealment sparse dictionary using the “LC K-SVD” algorithm of the sparse dictionary learning method.
In the step of the concealment identification, the concealment signal generation device 1 (1-1 to 1-N) installed in each base acquires the identification constellation data from the DSP 10 and generates a concealment signal. Subsequently, a process of identifying whether the identification constellation data is normal or in an error state is executed using the concealment sparse dictionary estimated in the step of the above-described concealment sparse dictionary learning.
Next, generation of the concealment signal using the random projection will be described in detail. The generation of the concealment signal is common to the step of the concealment sparse dictionary learning and the step of the concealment identification described above.
The random sampling unit 121 executes a process of reducing the number of pieces of sample data through the random projection. In the random projection, original d-dimensional data “Xd×N” is projected to a k-dimensional (where k<<d) subspace using a random matrix “Rk×d” of “k×d” in which a unit length of a column is random. That is, the data “XRPk×N” after the random projection is calculated with the following Expression (1).
[Math. 1]
X
k×N
RP
=R
k×d
X
d×N (1)
When each element of the randomization matrix “R” is “rij”, each constituent “rij” is set as in the following Expression (2). Accordingly, the number of pieces of data can be reduced by executing random projection. Expression (2) is an example of the randomization matrix “R”. A randomization matrix “R” for general random projection can be used.
The distribution calculation unit 122 calculates the number of pieces of constellation data or a histogram “I (s, t)” belonging to coordinates s, t (s=1, . . . , S, and t=1, . . . , T) in the constellation data acquired from the DSP 10.
The random projection unit 123 reduces the number of pieces of data using random projection in which both dimension reduction and a concealment process can simultaneously be implemented based on the number of pieces of constellation data calculated by the distribution calculation unit 122 or the histogram. “I(s, t)”.
[Math. 3]
y
i∈M (3)
[Math. 4]
y
i
=[I(1,1), . . . , I(S,1), I(1,2), . . . , I(S,2), . . . , I(1,T), . . . , I(S,T)]T (4)
In Expression (3), “M” is a non-negative integer defined by “S×T”, and “i” indicates a sample index of learning data or identification data and is “i=1, . . . , N”. “N” indicates the number of pieces of data.
The random projection is linear transformation by a random matrix and can be used to reduce dimensionality of high-dimensional data. In the random projection, a matrix R expressed in the following Expression (5) using a random number as an element is multiplied by an M-dimensional vector “yi” to be converted into a low-dimensional vector “yi({circumflex over ( )})” that has an “M({circumflex over ( )})” dimension (where M({circumflex over ( )})<M). The vector “yi({circumflex over ( )})” can be expressed by the following Expression (6).
[Math. 5]
R∈
{circumflex over (M)}×M (5)
[Math. 6]
ŷ
i
=Ry
i (6)
If elements of the random matrix R are random numbers of which an average is “0” and a variance “1/M({circumflex over ( )})”, the data is approximately stored with a high probability of a distance between the data before and after the random projection, as expressed in the following Expression (8) at the time of execution of random projection of any N pieces of learning data or identification data “yi (where i=1, . . . , N)” to the dimension of the following Expression (7).
[Math. 7]
{circumflex over (M)}=O(∈−2logM (7)
[Math. 8]
(1−∈)∥yi−yj∥2≤∥ŷi−ŷj∥2≤(1−∈)∥yi−yj∥2 (8)
In Expression (8), “ϵ” (where 0<ϵ<1) is a coefficient. This theorem indicates that at the time of mapping from an “M”-dimensional space to a space of a lower dimension “M({circumflex over ( )})”, a Euclidean distance be two certain points is stored with a considerably high probability. Further, it is known that this random projection is obtained by any random value. Through the above-described process, a concealment signal through the random projection is generated.
Next, sparse dictionary learning and identification when the vector “yi” is not concealed will be described. The sparse dictionary learning is executed by supervised sparse dictionary learning “LC K-SVD”. The unsupervised sparse dictionary learning “K-SVD” is required for a process of “LC K-SVD”. Therefore, “K-SVD” will be described first.
A set “Y” of vectors “yi” is expressed by the following Expression (9).
[Math. 9]
Y={y
i}i=1N (9)
[Math. 10]
Y=DX (10)
Here, “D” expressed in Expression (10) can be expressed as the following Expression (11). “X” expressed in Expression (10) can be expressed as the following Expression (12).
[Math. 11]
D={d
1
, . . . , d
K}∈M×K (11)
[Math. 12]
X={x
i}i=1N (12)
In Expressions (11) and (12), “D” is a dictionary matrix that has a base “dk” (M-dimensional column vector) as an element, and “X” is a matrix that has a sparse coefficient “xi” (K-dimensional column vector) as an element.
In general, as the dictionary matrix, an over complete dictionary matrix in which the number of bases is larger than the dimensions of an observation signal (that is, “K>M”) is used. In an expression “Y=DX” (the above-described Expression (10)) by bases greater than the dimensions of the observation signal, uniqueness of “X” cannot be guaranteed.
Therefore, the base usually used to express the observation signal “Y” is limited to some of “D”. That is, a constraint which only a small number “T0” of coefficients take non-zero values, and most of the remaining coefficients take zero values is provided. In this way, a state in which the number of non-zero elements is small with respect to all the elements is called sparse. An optimization problem that has the sparse constraint is formulated as the following Expression (13) that minimizes the reconfiguration error.
However, the following Expression (14) indicated by Expression (13) represents the L0 norm (the number of non-zero elements in the vectors).
[Math. 14]
∥·∥0 (14)
The following Expression (15) indicates the norm of Frobenius and is defined as in the following Expression (16).
[Math. 15]
∥·∥F (15)
[Math. 16]
∥A∥F=√{square root over (ΣijAij2)} (16)
In general, dictionary learning solves the optimization problem of the above-described Expression (13) by alternately repeating the following two steps S1 and S2. The sparse coefficient is calculated in step S1 to be described below, and the dictionary is updated in step S2.
In step S1, there is a problem of fixing the dictionary D and obtaining the sparse coefficient xi, which can be rewritten as the following Expression (17).
However, this problem is a combination optimization problem in which an optimum solution cannot be obtained unless all combinations of the bases are tested, and it is known that this problem is computationally difficult. (NP-hard). As a solution to this problem, many algorithms, such as a method based on a greedy method and a method of solving the problem after relaxing 10 constraints with 11 constraints, have been proposed. As an example, in the present invention, orthogonal matching pursuit (OMP) which is an approximation solution based on 10 constraints is used.
In step S2, the dictionary “D” is update by fixing “X (a matrix that has the sparse coefficient xi as an element)” obtained in step S1. “K-SVD” is placed by generalizing a “k-means” method. While clusters and samples have a one-to-one correspondence in the “k-means” method, samples are represented as a primary combination of cluster centroids (the bases in the K-SVD) in the “K-SVD” method. In “K-SVD”, a residual “Ek” with a linear prediction value obtained by removing a base “dk” from a set “Y” of observation signals is subjected to singular value decomposition (SVD) to obtain “dk” and “xkT”. The residual “Ek” is expressed in the following Expression (18).
However, since the obtained solution does not always satisfy the sparse constraint, only the non-zero element in “xkT” obtained in step S1 is updated in “K-SVD”. By applying “SVD” to an error “ERk” at that time and decomposing the error into orthogonal matrices “U” and “V” and a diagonal matrix “Σ”, the following Expression (19) is obtained.
In Expression (19), “ui” and “vj” are i-th column vectors of “U” and “V”, respectively, and “σi” is an i-th diagonal component of “Δ”. In “K-SVD”, an approximate solution of a row vector of the base and the sparse coefficient is obtained as in the following Expression (20) using a component “u1” related to the first singular value and “σ1vT1”.
The supervised sparse dictionary learning is executed by the sparse dictionary learning unit 21 illustrated in
The first term on the right side of Expression (21) is the same reconfiguration error as “K-SVD”. “Q” expressed in the following Expression (22) of the second term on the right side of the Expression (21) is an identification sparse code for classification of the vector “yi” of the observation signal, and the vector “yi” of the observation signal classified into the same class imposes a constraint that the same base “dk” is shared.
[Math. 22]
Q=[q
1
, . . . , q
n]∈L×N (22)
The third term on the right side of the Expression (21) is an identification error or classification. “W” is a projection matrix for classification, and “H” expressed in the following expression (23) is a class label of an input “Y”.
[Math. 23]
H=[h
1
, . . . , h
N]∈m×N (23)
“hi” expressed in Expression (23) can be expressed as the following Expression (24). “hi” is a label vector of a class corresponding to the vector “yi” of the observation signal, “l” indicates a corresponding class, and “m” indicates the number of classes.
[Math. 24]
h
i=[0, 0 . . . 1 . . . 0, 0]T∈m (24)
“α” and “β” expressed in the above-described Expression (21) are parameters for adjusting a contribution rate. The Expression (21) can be rewritten to the following Expression (25). This has the same form as the above-described Expression (13), and the dictionary can be learned by an algorithm similar to “K-SVD”.
The identification process is executed by the identification unit 22 illustrated in
Subsequently, the calculated sparse coefficient “xi” is projected using a matrix “W” with the following Expression (27).
[Math. 27]
ĥ
i
=Wx
i (27)
It is identified whether the vector “yi({circumflex over ( )})” of the concealment signal belongs to any one of the “m” classes based on the estimated value “hi ({circumflex over ( )})” after the projection. The vector “yi({circumflex over ( )})” is identified in a class corresponding to an element closest to “l” in “hi({circumflex over ( )})”.
Processes for the concealment sparse dictionary learning and the concealment identification are executed by the sparse dictionary learning unit 21 and the identification unit 22 illustrated in
Hereinafter, supervised concealment sparse dictionary learning be described. A set of vectors “yi({circumflex over ( )})” of the concealment signal is expressed as in the following Expression (28).
[Math. 28]
Ŷ={ŷ
i}i=1N (28)
At this time, the concealment dictionary “D({circumflex over ( )})” and the projection matrix “W” expressed by the following Expression (29) are obtained using “LC K-SVD”.
[Math. 29]
{circumflex over (D)}∈
{circumflex over (M)}×K (29)
A cost function of the concealment sparse dictionary learning in which “LC K-SVD” is used can be expressed with the following Expression (30) and can be solved by an algorithm similar to “K-SVD”.
Here, “Q” expressed in the following Expression (31) is an identification sparse code for classification of the vectors “yi” of the observation signal, and “W” expressed in Expression (30) is a projection matrix for classification. “H” expressed in the following Expression (32) is a class label of the input “Y”.
[Math. 31]
Q=[q
1
, . . . ,q
n]∈K×N (31)
[Math. 32]
H=[h
1
, . . . , h
N]∈m×N (32)
“hi” expressed in the following Expression (33) is a label vector of a class corresponding to the vector “yi” of the observation signal, “l” indicates a corresponding class, and “m” indicates the number of classes.
[Math. 33]
h
i=[0, 0 . . . 1 . . . 0, 0]T∈m (33)
“α” and “β” expressed in the Expression (30) are parameters for adjusting a contribution rate. It is assumed that the dictionary “D” is concealed through the random projection according to the relationship of “D({circumflex over ( )})=RD({circumflex over ( )})”.
Here, as expressed in the above-described Expression (8), a distance between the data is approximately stored at a high probability before and after the random projection. Therefore, an optimum solution of the concealment sparse dictionary learning of the above-described Expression (30) is calculated as a value close to the optimum solution of Expression (25) when the dictionary is not concealed.
At this time, the concealment dictionary “D({circumflex over ( )})” can be updated and obtained in a region of the concealment signal by the “LC K-SVD” algorithm. As described above,
The concealment identification process is executed by the identification unit 22 illustrated in
Next, the calculated sparse coefficient “xi” is projected with the following Expression (35) using the projection matrix “W” estimated in the concealment sparse dictionary learning.
[Math. 35]
ĥ
i
=Wx
i (35)
It is identified to which class the vector “yi({circumflex over ( )})” of the concealment signal belongs among the “m” classes based on the estimated value “h(?)i” after the projection. Then, the vector “yi({circumflex over ( )})” of the concealment signal can be identified as the class corresponding to the element closest to “l” of “h({circumflex over ( )})i”.
As described above, the state estimation system 100 according to the present embodiment includes: the concealment signal generation unit 12 configured to acquire learning constellation data and identification constellation data output from a signal processing circuit for optical communication, reduce and conceal the number of pieces of data from each of the pieces of constellation data through random projection, and generate a learning concealment signal and an identification concealment signal based on each piece of constellation data after the reduction and concealment of the number of pieces of data: the sparse dictionary learning unit 21 configured to learn a concealment sparse dictionary using a sparse dictionary learning algorithm based on the learning concealment signal; and the identification unit 22 configured to estimate a state of the optical communication using the concealment sparse dictionary based on the identification concealment signal.
In the present embodiment, it is possible to reduce the constellation data when the constellation data is concealed. Therefore, it is possible to estimate a state of the transmission path or the optical transmitter in optical communication with a small calculation amount while keeping high concealment. As a result, it is possible to estimate a cause of a degradation in quality of optical communication without an increase in the amount of computation.
Even when data is leaked due to an artificial mistake, it is possible to prevent data leakage.
The concealment signal generation unit 12 reduces the number of pieces of constellation data in the random sampling unit. Further, the distribution calculation is executed on the constellation data after the reduction in the number of pieces of data, and the random projection in which the dimension reduction and the concealment process are simultaneously implemented is executed to generate the learning concealment signal and the identification concealment signal. Therefore, the learning concealment signal and the identification concealment signal can be generated with high accuracy.
Further, the sparse dictionary learning unit 21 updates the sparse coefficient “xi”, the concealment dictionary “D({circumflex over ( )})”, and the projection matrix “W” by learning the concealment sparse dictionary. Therefore, the sparse dictionary learning can be always executed by adopting new data.
Since the identification unit 22 updates the sparse coefficient “xi”, the concealment dictionary “D({circumflex over ( )})”, and the projection matrix “W” estimated by the sparse dictionary learning unit 21, it is possible to execute state estimation of optical communication with high accuracy.
Further, as illustrated in
As illustrated in
The concealment signal generation device 1 may be implemented by one computer or may be implemented by a plurality of computers. Alternatively, the concealment signal generation device 1 may be a virtual machine mounted on a computer.
A program for the concealment signal generation device 1 can be stored in a computer-readable recording medium such as an HDD, an SSD, a universal serial bus (USB) memory, a compact disc (CD), or a digital versatile disc (DVD) or can be distributed via a network.
The present invention is not limited to the foregoing embodiment, and various modifications can be made within the scope of the gist of the present, invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2021/007876 | 3/2/2021 | WO |