The present invention relates to an optical signal processing device capable of reservoir computing in a complex number domain.
Attention has been attracted to machine learning with neural networks (NN) modeled after brain information processing. NN is a large-scale nonlinear network in which many neurons with nonlinear responses are coupled by synapses, and in particular, deep learning with a hierarchical NN in which neurons are arranged in a multilayer form has widely been employed. To handle time series data, NN typically requires a recursive network structure capable of referring to past information. Such NN is called recurrent neural network (RNN), which typically uses a network structure having feedback coupling between hierarchical NN layers. Although RNN has widely been employed for learning and processing of time series data such as sound recognition and sensing data, such RNN is disadvantageous in that, since synaptic coupling explodes with increasing the numbers of layers and neurons, more time is required for computation.
As a method for solving such a problem, an optical computing technique modeled after cerebellar information processing has been proposed in recent years, which is called reservoir computing (RC) (see Non-Patent Literatures 1 and 2).
Note that N is the number of neurons, xi(n) is the state of the i-th neuron at the time step n, Ωij is a coefficient representing mutual coupling between the neurons, mi is a coefficient representing coupling of the input signal with the neurons, and ωi is a coefficient representing coupling intensity of each neuron to the output. Furthermore, f(⋅) represents the nonlinear response at each neuron, in which, for example, tan h(⋅) is frequently used.
RC significantly differs from a typical RNN in that networks of the input layer 101 and the intermediate layer 102 are fixed, and a variable used for learning is only a weight coefficient of the output layer 103, that is, coupling intensity ωi of each neuron to the output. This method can greatly reduce variables to be learned, which thus has a great advantage in time series learning in which data is huge and high-speed processing is required.
Furthermore, this method is also advantageous in terms of storage method of past information. A signal entered into RC continues to drift for some time between neurons present in the intermediate layer 102. This means that RC itself retains short-term storage capacity and information interchange capacity. Accordingly, RC does not require operations of a typical RNN such as storing signals at previous time steps to an external memory and again referring to the data stored in the memory.
RC has been reported on its simple implementation using time delay as in
However, RC by a conventional optical implementation method generates the input signal u(n) and the output signal y(n) by signal processing using an intensity-modulation/direct-detection (IM/DD) method, which causes a problem in which only intensity information is used for information processing and phase information is lost. Therefore, information expression capacity inherent in optical waves has not been utilized sufficiently.
The present invention has been made in view of such a problem, and it is an object of the present invention to provide an optical signal processing device capable of RC in a complex space using optical intensity and phase information.
To solve the above problem, according to the present invention, there is provided an optical signal processing device including: a light source generating an optical signal; first optical modulation means for modulating at least one of intensity and phase of the optical signal at a first modulation period to generate a complex input signal; second optical modulation means for modulating the complex input signal in a time domain at a second modulation period that is shorter than the first modulation period; an optical circulation unit in which the modulated complex input signal circulates at a predetermined delay length; optical multiplex means for joining the modulated complex input signal in the optical circulation unit; a nonlinear response element giving nonlinearity to the optical signal circulating in the optical circulation unit; variable optical modulation means for modulating the optical signal circulating in the optical circulation unit; optical branch means for branching part of the optical signal circulating in the optical circulation unit; optical reception means for demodulating branched light output from the optical branch means to obtain a complex intermediate signal; and a signal processing circuit for weighting each of real and imaginary parts of the complex intermediate signal with any coupling weight and taking a sum to obtain a complex output signal, wherein the signal processing circuit changes the coupling weight so as to reduce an error between the complex output signal and a teacher signal.
In another aspect, the modulated complex input signal is a product of a complex vector having a period identical to the first modulation period and the complex input signal.
In another aspect, the predetermined delay length is 10 times or more the second modulation period.
In another aspect, optical pulse shaping means for optionally shaping an optical pulse of the optical signal circulating in the optical circulation unit, is further included.
In another aspect, the optical pulse shaping means includes: second optical branch means for N-branching (N is an integer of 2 or more) the optical signal circulating in the optical circulation unit; N delay lines being connected to each of N branches of the second optical branch means and having different delay lengths; control means for individually controlling intensity or phase of the optical signal passing through the N delay lines; and optical multiplex means for joining again the optical signal controlled by the control means.
The present invention is capable of RC in a complex space using optical intensity and phase information, enabling to double the effective number of neurons as compared to the conventional one.
Hereinafter, embodiments of the present invention will be explained in detail.
The converted input signal u′(t) passes through an optical transmission path 213 and enters an optical circulation circuit 215 via an optical coupler 214. The optical circulation unit 215 is loaded with, in addition to the optical coupler 214, a variable attenuator 216, a nonlinear response element 217, and an optical coupler 218. By the optical coupler 218, part of the circulating light is branched into two. One branched light enters the optical coupler 214 via the variable attenuator 216 and circulates in the optical circulation circuit 215. The other branched light is converted into a complex intermediate signal x(t) at a coherent optical receiver 219. This complex intermediate signal x(t) demodulated at the coherent optical receiver 219 is computed by Formula (2) at an electric signal processing circuit 220. Thereby, the operation as RC can be performed.
The input signal u(t) is described by the following formula using a real part term ur(t) and an imaginary part term ui(t).
Formula 3
u(t)=ur(t)+jui(t) (3)
Note that j=(−1)1/2. The signal light u(t) is modulated by some method at the modulation period T2 (T2<T1) in the time domain so as to be u′(t) as in the following formula.
Formula 4
u′(t)=m(t)u(t) (4)
Note that m(t) is a complex number generated by, for example, the optical modulator. Furthermore, u′(t) may be precomputed in the electric domain to cause the optical modulator 212 to directly modulate u′(t).
Formula 5
m(t)=m(t+T1) (5)
Within a range of satisfying the restriction of Formula (5), m(t) can be any value. However, for further excellent learning performance, m(t) is preferred to take a variety of values and, for example, is generated by various pseudo-random number generation algorithms. Furthermore, to prevent divergence of responses, the range which can be taken by m(t) is desired to be restricted to |m(t)|≤1.
Note that, for the optical transmission path 213 and the optical circulation unit 215, for example, optical fibers and optical waveguides can be used. For the optical attenuator 216, a variable attenuator using a Mach-Zehnder interference system or an MEMS mirror can be used to adjust the input light amount. Furthermore, for the nonlinear response element 217, an optical amplifier such as an Er-doped fiber amplifier (EDFA) or a semiconductor optical amplifier (SOA) can be used. The selection of the nonlinear element does not limit the scope of the present invention, which may use, for example, a method that utilizes a laser chaotic oscillation disclosed in Non-Patent Literature 2. Furthermore, in a specific problem, a linear circuit as disclosed in Non-Patent Literature 3 may be configured, without using the nonlinear element 217.
Learning generalization performance is determined by the diversity of the response of x(t). For securing this diversity, the circulation length T3 of the circulation unit is desired to be set so as to satisfy the relationship of T2<<T3. More specifically, it is desired to be set to T3≥10T2.
The complex intermediate signal x(t) obtained at the coherent optical receiver 219 is given as a solution of the following evolution formula.
Note that α is the product of the gain of the nonlinear response element 217 and the attenuation amount of the optical attenuator 216, and β and γ are the branch losses of the optical couplers 214 and 218. Here, where T3=T1 for simplicity, x(t) is described by a time discretized by the sampling time T1 as follows.
Formula 7
xi(n)=f{αxi(n−1)+miu(n−1)} (7)
Note that n represents the discretized time step. The subscript i means the i-th response of a signal within the sampling time T1 and further divided by the time T2. From the relationship described above, i ranges from 1 to N=T2/T3. The dynamics of Formula (7), from a comparison with Formula (1), correspond to those of reservoir computing in the case of having a diagonal matrix where all diagonal components of the coupling matrix Ωij are jΦ and having the number of neurons being N. That is, the electric signal processing circuit 220 computes Formula (2), and thereby the operation as RC can be performed. Furthermore, the electric signal processing circuit 220 may have an A/D conversion function that converts an analog input into a digital value, and in such a case of having the AD conversion function, computation of signals may be performed in the digital domain. Here, since this configuration handles input and output signals in a complex space, xi(n), y(n), and ωi are all complex numbers.
When the input signal u(t) modulated at the modulation period T from the optical system as described above is entered into the optical pulse shaper, the optical signal branched by the optical coupler 518 and going to the optical pulse shaper has a time response waveform x(t) described by the following formula.
Here, μj is the weight amount of the j-th (j=1, 2, . . . , M) delay line of the optical pulse shaper 521. M≤T3/T1 is desired. Here, where T3=T1 for simplicity, consider M≤T1/2T3. The following is a case where x(t) is described by a time discretized by the sampling time T1.
Formula 9
xi(n)=f{Σj=1MαΩijxi(n−1)+miu(n−1)} (9)
Here, Ωij is as follows.
It can be understood from the symmetry with Formula (1) that this configuration performs the coupling of the intermediate layer in the RC circuit. The number of neurons at this time corresponds to N. Each element of the coupling constant can be set by the weight amount μi of each delay line. As compared to the first embodiment, this configuration can set the matrix Ωij in a relatively optional manner, which thus has a high capacity to express RC. The operation of the output layer is the same as that in the first embodiment.
A specific implementation method of the FIR filter in the optical domain as described above will be explained.
Although an optical waveguide is used here to form the FIR filter, a spatial optical system can also be used to obtain a configuration equivalent to that in
Learning Method
In RC, a variable to be learned is only ωi, and several methods are available for determining the variable. As an example, a least mean square (LMS) method described by Formulas (11) and (12) will be explained here, but the present invention is not limited thereto, and the effect of the present invention can be obtained regardless of the algorithm of learning.
Formula 11
ωir(n+1)=ωir(n)+k(dr(n)−yr(n))xir(n) (11)
Formula 12
ωii(n+1)=ωii(n)+k(di(n)−yi(n))xii(n) (12)
Here, d(n) is a teacher value, and k is a coefficient for determining how much to move in the slope direction. The superscripts r and i indicate the real and imaginary parts for each variable. Since this method merely reduces the energy (error from the learning value) toward the neighboring local minimum, the global search is difficult in this state. Methods for giving an approximation to the global minimum solution include an annealing method. For this too, various methods are proposed. For example, as a function for the time step n, k may be given as follows.
Formula 13
k(n+1)=kmin+h(k(n)−kmin) (13)
Here, kmin and h are constants.
As a learning example according to the present invention, time series data approximation learning of a complex input and output signal will be shown. NARMA10 task, which is normally used as a benchmark for nonlinear time series learning, is performed to examine whether a teacher signal can be reproduced. The optical system of the optical signal processing device according to the first embodiment of the present invention is reproduced in the simulation to compute whether the output of NARMA10 described by Formula (14) can be approximated.
Here, y(n) is a time series signal to be predicted, and u(n) is an input signal. For the nonlinear element, the input signal u(n) is generated by Formula (15) below.
Here, f1, f2, and f3 are 2.11, 3.73, and 4.33, respectively. The modulation period T2 of the mask function m(t) is set to T2=T1/100, and the circulation time T3 is set to T3=4T1=400T2. For the nonlinear element, an SOA is used and its nonlinear dynamics are computed by a method disclosed in Non-Patent Literature 4. The initial values of the weight vector ωi in the output layer to be learned are all set to 1. Furthermore, α, which is the constant for determining the mutual coupling matrix in the intermediate layer of the network, is selected to be 1.2. As α increases, the dynamics that constitute the reservoir become chaotic. Accordingly, α=1.2 is set so as to maximize the reservoir network within a range of showing no chaotic property. Setting in this manner increases storage capacity of the reservoir network, exhibiting an excellent function of improving learning performance for tasks including past information as in NARMA.
The learning is performed using an LSM method. A teacher signal of 1000 symbols is learned and then 1000 symbols are estimated.
Number | Date | Country | Kind |
---|---|---|---|
2018-033746 | Feb 2018 | JP | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2019/006353 | 2/20/2019 | WO |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2019/167759 | 9/6/2019 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
20120141067 | Sakamaki | Jun 2012 | A1 |
Number | Date | Country |
---|---|---|
2018-200391 | Dec 2018 | JP |
Entry |
---|
Herbert Jaeger et al., Harnessing Nonlinearity: Predicting Chaotic Systems and Saving Energy in Wireless Communication, Science, vol. 304, No. 5667, 2004, pp. 78-80. |
Daniel Bruner et al., Parallel Photonic Information Processing at Gigabyte per Second Data Rates Using Transient States, Nature Communications, Jan. 15, 2013, vol. 4, No. 1364, pp. 1-7. |
Quentin Vinckier et al., High-Performance Photonic Reservoir Computer Based on a Coherently Driven Passive Cavity, Optica, vol. 2, No. 5, May 2015, pp. 438-446. |
Govind P. Agrawal, Self-Phase Modulation and Spectral Broadening of Optical Pulses in Semiconductor Laser Amplifier, IEEE Journal of Quantum Electronics, vol. 25, No. 11, Nov. 1989, pp. 2297-2306. |
International Search Report and Written Opinion dated Apr. 23, 2019, issued in PCT Application No. PCT/JP2019/006353, filed Feb. 20, 2019. |
Number | Date | Country | |
---|---|---|---|
20210026220 A1 | Jan 2021 | US |