Illustrative embodiments relate to a method for providing an artificial neural network. Furthermore, illustrative embodiments relate to a system for this purpose.
Disclosed embodiments are described in detail with reference to the drawings. The features mentioned in the claims and in the description can be essential individually or in any combination. In the figures:
There are many possible applications for the use of artificial neural networks, which are dependent on rapid processing of the items of information. For example, for automatic driving the most secure possible surroundings perception in real time is of great importance. The surroundings of a transportation vehicle can be acquired with the aid of sensors, such as radar, lidar, and camera. Integrated 360° 3D acquisition of the environment can often also be provided, so that all static and dynamic objects are acquired and classified.
In particular, the camera has a leading role in a redundant, robust surroundings acquisition, since this sensor type can measure angles precisely in the surroundings acquisition and can be used for classification of the surroundings. However, the processing and classification of the camera images is computing-intensive and architectonically complex. In particular, the 360° 3D surroundings acquisition is problematic, since many individual images have to be classified and processed and therefore the computing effort increases.
Conventional high-performance artificial neural networks (called NN or ANN) offer the possibility of classifying camera images or data of other sensors having image repetition rates of less than 10 Hz. For many applications, such as in secure surroundings acquisition in real-time, this image repetition rate is often inadequate, because modern camera systems operate at 30 Hz image repetition rate. In addition, the data load increases with increasing resolution of the camera images.
The often limiting factor is the processor speed or GPU speed of modern high-performance computers, which can, even with GPU acceleration, not be adequate to classify the images completely in real-time.
Disclosed embodiments at least partially remedy the above-described drawbacks. Disclosed embodiments propose an alternative to conventional ANNs.
The above is achieved by a method and by a system disclosed in the following description, and the drawings. Features and details which are described in conjunction with the disclosed method obviously also apply in conjunction with the disclosed system, and vice versa in each case, so that reference is always mutually made or can be made with respect to the disclosure of the individual facets of the disclosure.
Disclosed is a method for providing an artificial neural network, in particular, an optical, dispersive neural network. It is provided in this case that the following operations are carried out, optionally in succession in the specified sequence and/or repeatedly:
In this way, optical processing of the signal can be enabled, and can thus provide significant speed benefits over solely electronic processing. The processing and filtering can be predefined here, thus the extent of the change of the property for the processing (for example, in the context of a training phase) can be permanently specified during the creation of the network. This can be carried out, for example, structurally by the selection and/or adaptation of optical elements of the network for the use of the property and/or filtering of the optical signal.
The use of the property of the optical signal which is specific for the (spectral and/or temporal) phase of the optical signal to provide the neuron and/or the weight can take place, for example, in that the dispersion of the optical signal during the propagation in material is used as the neuron and/or weight. The at least one neuron of the artificial neural network (designated in short as ANN) can be embodied in each case, for example, as an optical and/or dispersive neuron and/or the at least one weight of the ANN can be embodied in each case as an optical and/or dispersive weight. In this way, processing of items of information (in particular, data) at light speed is possible.
For example, the optical signal comprises an optical input signal, which is input into the network. The items of information to be processed can be represented by the input signal, which are processed by the processing of the optical input signal, to obtain the output (for example, as an optical output signal, thus of the processed optical input signal) as a result of the processing.
One possibility for providing the optical signal is to use laser pulses as the optical signal (i.e., light signal). Nevertheless, alternative exemplary embodiments of the optical signal are also conceivable, for example, as a continuous laser beam or the like. The processing can be carried out by a change of the (spectral and/or temporal) phase profile, as will be described by way of example hereinafter. For this purpose, an optical, dispersive element can be used, which is suitable for adapting the (spectral and/or temporal) phase profile. The change of the (spectral and/or temporal) phase profile enables the property of the optical signal, which is specific for the (spectral and/or temporal) phase of the optical signal, such as the (spectral and/or temporal) phase itself or its derivative or its Fourier transform, to be able to be used as the optical neuron and/or as the optical weight.
Furthermore, it is conceivable in the scope of the disclosure that the provision of the optical signal takes place in that a light signal is transmitted into at least one of the at least one network component, wherein the light signal can be a carrier of an item of information which is processed by the processing of the network to obtain an assessment, in particular, a classification, of the information as the output. The optical signal can thus initially be input as an input signal into the network. In this way, items of information can be input into the network, for example, sensor data or the like. The items of information can also be transmitted here directly as the input signal (for example, light via an optical unit or LiDAR radiation) and/or the items of information can be read in electronically and then transmitted on an optical carrier signal as the input signal. (LiDAR is in this case an abbreviation for light detection and ranging, and can be used, for example, in a transportation vehicle for acquiring the surroundings and/or for distance and velocity measurement.)
It can be provided in the disclosed embodiments that the use of the property specific for the (spectral and/or temporal) phase of the optical signal takes place in that a (spectral and/or temporal) phase profile of the optical signal is changed nonlinearly and/or linearly to change the property (in particular, in a predefined manner), in particular, to execute a nonlinear function of the neuron (such as an activation function) and/or a linear weighting of the weight. Generally formulated, the use of the property of the optical signal is carried out, for example, by the use of the (spectral and/or temporal) phase or a property dependent thereon. The property (specific for the spectral and/or temporal phase of the optical signal) can accordingly be the spectral and/or temporal phase itself or also a group delay (abbreviated GD) or a group delay dispersion (abbreviated GDD) or a group velocity dispersion (abbreviated GVD) or a third order (abbreviated TOD) or higher order dispersion of the derivative of the spectral phase. To carry out the processing by way of the network and specifically by way of the network component, thus, for example, to execute a weighting by the weight and/or a function of the neuron (for example, an activation function), the above-mentioned property can be changed (for example, by using dispersive materials). The optical, dispersive elements described hereinafter are used for this purpose, for example, which can be structurally assembled to form the network to provide the processing. In the weighting, for example, a linear change of the dispersion and/or the spectral and/or temporal phase and/or the GD and/or the GDD and/or the TOD takes place. A frequency modulation of the optical signal can possibly also be carried out for the weighting. Furthermore, it is conceivable that to execute the (nonlinear) function of the neuron, a nonlinear change of the property, for example, of the spectral phase and/or the GD and/or the GDD and/or the TOD, is carried out. The function of the neuron can thus be provided, for example, as an activation function such as a Heaviside function or sigmoid function or RELU function.
It can be provided in the scope of the disclosure that the output and/or an output signal of the at least one network component is evaluated in that the property of the optical signal specific for the (spectral and/or temporal) phase of the optical signal is evaluated, in particular, measured. To carry out the processing by way of multiple weights and/or neurons, thus to combine the network components with one another (i.e., optionally to couple them with one another), a coherent or incoherent spectral combination of the optical (output) signal of multiple neurons can be provided, and the combined (output) signals can be passed on via optical weights to adjoining optical neurons of the next layer. Finally, the property specific for the (spectral and/or temporal) phase of the optical signal can be measured to evaluate the output of the network. Accordingly, the dispersion profile can be detected at the output of the network as the desired output information, for example, class information in the case of a classification, and the output can be passed on, for example, to a CPU (central processing unit) or GPU (graphics processing unit) for further processing such as a formation of a surroundings model.
The disclosed method can provide the benefit that the computing speed for the processing is increased by the ANN, and the creation of deeper optical neural ANNs is also enabled. The classification can take place at light speed here. Furthermore, lower optical performances can be necessary in relation to alternative solutions, to trigger a nonlinear response function, since no multiphoton processes have to participate or only material properties can be used. Furthermore, a ring line can be implementable easily. The entire ANN can be able to be physically manufactured as an optical neural network (abbreviated ONN) on a semiconductor. Furthermore, an integration of the ONN on a semiconductor chip can be possible in CMOS, SiN-CMOS, Bi-CMOS, hybrid Bi-CMOS processes on photonic-electronic cointegrated chips.
Furthermore, it can be provided in the scope of the disclosure that the optical signal is provided as an optical input signal which is specific for an item of input information, and is processed by the at least one network component to obtain an optical output signal specific for the output. The output signal can therefore, in contrast to the input signal, comprise an additional item of information about the input signal, for example, an item of class information of a classification.
It can further be provided that multiple network components are provided which comprise neurons and/or weights which are provided with one another in different levels of the network and are optically connected to one another. The network can typically be composed of multiple network components which are connected to one another in accordance with the network structure. The network components can be provided here in different levels of the network, and the output signals of the network components of one level can be transmitted to the network components of a following level as an input signal via connections such as optical fibers.
It can optionally be provided that the property specific for the (spectral and/or temporal) phase of the optical signal is the spectral and/or temporal phase itself or a group delay or a group delay dispersion or a group velocity dispersion or a third order or higher order dispersion of the derivative of the spectral phase. This enables reliable processing and evaluation of the optical signal. It can furthermore be possible that the neuron is embodied as a dispersive neuron which provides the nonlinear function of the neuron by way of a spectral phase modulation of the optical signal.
According to a further benefit, it can be provided that the network is used in a transportation vehicle, wherein optionally the optical signal is provided as an optical input signal which is specific for an item of input information about the surroundings of the transportation vehicle, and is processed by the at least one network component to optionally obtain the output as a classification of the input information. The transportation vehicle can be designed, for example, as a motor vehicle and/or passenger motor vehicle (or utility motor vehicle) and/or autonomous vehicle. The input information is, for example, a signal having the sensor data of a camera or the like.
A system for providing an artificial neural network, optionally for a transportation vehicle, is also the subject matter of the disclosure, including:
It is provided in this case that the at least one network component comprises at least one neuron and/or one weight of the network. The disclosed method can therefore provide an ANN in an optical, dispersive manner.
The disclosed system is thus accompanied by the same benefits as have been described in detail with reference to a disclosed method. Moreover, the system can be suitable for executing a disclosed method.
A plurality of the elements can be provided in the disclosed system and can be structurally coupled with one another to form the network. For example, for this purpose an integration of the elements into a semiconductor and/or a coupling by optical fibers takes place to transmit the optical signal between the network components. The dispersive element can include, for example, a dispersive medium to carry out the use, in particular, the change, of the property of the optical signal.
In addition, it can be beneficial in the scope of the disclosure that multiple optical elements are provided and are each embodied to change a spectral and/or temporal phase profile of the optical signal at least nonlinearly (or alternatively or additionally linearly) for the use of the property, in each case to provide an output signal of a neuron (alternatively or additionally a weight) of the network, wherein optionally at least one spectral combiner of the network is provided to spectrally combine the output signals of the neurons (or weights), and optionally a spectral phase analyzer is provided to evaluate a phase of the combined output signal for the provision of the output.
In the following figures, identical reference signs are used for the same technical features, even of different exemplary embodiments.
The operations of a disclosed method for providing an artificial neural network 200 are schematically visualized in
As is further illustrated in
The output signal of the neurons 251, and thus the signal passed on to the neurons 251 of the following layer, can be given here by a sigmoid function of the sum of weighted response functions
α11=σ(Σωiαi0) (1)
wherein ωi represents the weights 252, αi0 represents the neurons 251, and a represents the sigmoid function (see
f(α0, . . . ,αn)=(y0, . . . ,yk) (2)
with k, n∈, and wherein the function values yi can be output as the class information of the output 210.
Artificial neural networks are typically implemented on conventional computer architectures, which have the drawback of the slow processing of large amounts of data, however. In contrast, a significant speed benefit can be achieved by the optical design of the ANN 200, which also enables the use of the ANN 200 for driving functions, for example, also of autonomous driving.
For example, continuous laser beams (i.e., continuous wave, thus a wave emitted continuously over time) or laser pulses come into consideration as the optical signal 100, which may be transmitted in a technically reliable manner via optical fibers 283 to the at least one network component 250.
τ·Δω=const. (3)
It can be inferred from equation (3) that the pulse has a spectral bandwidth and accordingly is a superposition of monochromatic waves of various frequencies. As shown in
The velocity of the movement of the envelope is called group velocity υg (or also designated in short as GV) and is defined via the derivative of the wave number k (of the wave vector):
The index of refraction is given here by n(ω). The propagation velocity of the individual monochromatic waves is designated as the phase velocity υp:
wherein the wave number is given with
If the pulse propagates in a dispersion-free manner, υg and υp are thus identical since
(see
By solving the Helmholtz equation, the electrical field of a laser pulse E(t) can be described as a function of time t by
wherein ω0 describes the carrier frequency, ψ(t) describes the temporal phase, and I(t) describes the intensity. For Gaussian pulses, for example, I(t) is given by
I(t)∝|E0|2·e−2.76·t/τ (8)
with the amplitude of the electrical field E 0 and the full width at half maximum of the pulse τ, which defines the pulse duration. It is apparent from equations (7) and (8) that not only amplitude, frequency, and pulse duration are sufficient for the complete characterization of the pulse, but rather the temporal phase also has to be considered. A spectral observation of the pulse suggests itself for more insight into the dispersive dynamics within the pulse.
By way of Fourier transform of E(t) in the frequency space, the following results from equation (7) with centering of the pulse around its central frequency (ω−ω0)→ω:
{tilde over (E)}(ω)∝√{square root over (S(ω))}·eik
Therein, S(ω) represents the spectral power density and ϕ(ω) represents the spectral phase, which can be expressed by
The spectral phase defines the phase relationship of the individual monochromatic waves under the envelope.
Different oscillating frequencies ωi are schematically shown in illustration (a) in
It is clear from equations (9) and (10) that dispersion by the index of refraction has significant influence on the spectral phase and thus the chronological intensity curve of the pulse. However, the pulse can be completely characterized by measuring the spectrum and the spectral phase. The derivatives of the spectral phase, such as group delay (GD):
group delay dispersion (GDD):
third order dispersion (TOD):
and the like have great influence on the pulse dynamics and are variables that can be metrologically acquired to describe the interaction of laser pulses with material. During the propagation of a laser pulse through material, the dispersion can be described by the accumulation of a spectral phase. Short pulses may thus be time-chirped by accumulation of a spectral phase profile. Vice versa, however, chirped pulses can also be compressed in time by the accumulation of a negative phase contribution.
A possibility for using the spectral phase ϕ(ω) as a neuron 251 for the ANN 200 is to be described by way of example hereinafter. For artificial neurons 251, the nonlinear response function is essential. The sigmoid function is often used here in conventional ANNs (see
Effects of the spectral light-material interaction, such as propagation of laser pulses in dispersive media, offer the possibility of modulating an incident light wave nonlinearly in its spectral phase, comparable to the nonlinear modulation of the electric current by an artificial neuron. A dispersive material can act as an optical neuron 251, which outputs an item of nonlinear phase information.
To form an ANN 200, the optical signal 100 of the individual neurons 251 has to be passed on via a weighting 252 to all neurons 251 of the next layer. One possibility for spectrally joining together the output signals of the neurons 251 is formed by optical spectral combiners 284, in particular phase combiners 284. Optical gratings and/or prism sequences and/or dielectric layers and/or optical nonlinear media and/or polarization optics or the like can be used as such a component, for example. The spectral phase can be combined coherently or incoherently here.
To carry out a classification, it can be necessary for the spectral phase or its derivatives to be measured. Diverse established methods are available for this purpose.
For example, the interference signal of two optical pulses can be measured by spectral interferometry by the use of a spectrometer, wherein one of the two pulses is delayed by a time τ and the spectral phase of the pulse is known. The combined signal of both pulses at the spectrometer can be described by
{tilde over (E)}(ω)={tilde over (E)}1(ω)+{tilde over (E)}2(ω)e−iωτ (14)
For the oscillating term, the following phase relationship results
ϕ(ω)=ωτ+ϕ1(ω)−ϕ2(ω). (15)
With known spectral phase of the reference pulse, the spectral phase can be reconstructed from the spectral interferogram and the spectrometer can thus be used for a spectral phase analyzer 285.
According to a further option, a heterodyne detector for characterizing the spectral phase can be used as a spectral phase analyzer 285.
A further option is the use of a spectral shearing by a spectral phase analyzer 285. In this case, two time-delayed pulses are overlaid with a chirped replica pulse in a nonlinear crystal 260 of the spectral phase analyzer 285 (see
GD=ϕ(ω+Ω)−ϕ(ω) (17)
with the spectral shearing Ω, which is proportional to the time delay of the pulses τ. The spectral phase results as:
In a FROG structure of a spectral phase analyzer 285, two pulses time-delayed in relation to one another can be overlaid in a nonlinear crystal 260. The frequencies newly generated in this case are recorded by a spectrometer of the spectral phase analyzer 285. The interferogram results as a function of the frequency and the time delay of both fundamental pulses and the spectral phase can be reconstructed.
A complete dONN can be implemented, for example, by joining together at least one of the above-described elements 230. This is visualized by way of example in
Further exemplary embodiments of, in particular, optical, elements 230 for providing the network components 250 are described hereinafter. A temporal phase can thus be applied to the pulse by an electro-optical modulator as the element 230, such as a Mach-Zehnder modulator (MZM). This is equivalent to a spectral phase in the frequency space. Optical weights 252 can thus be synthesized by linear modulation by the MZM and optical neurons 251 can be synthesized by nonlinear modulation of the MZM.
Furthermore, it can be possible that so-called pulse shapers are used as the element 230, which can apply a temporal phase on the basis of LCDs or electronic index of refraction change.
A further possibility for providing a network component 250 is the use of the GD as a response function of the neuron (see
To provide a network component 250, it is also possible to use the GDD as a nonlinear function of the neuron 251 and weight 252 or to use higher orders of the dispersion, such as TOD or the like.
A further possibility for providing a network component 250 is to focus an intensive pulse of lower bandwidth in an optical nonlinear medium as the element 230. The pulse can experience an enlargement of the spectral bandwidth by self-phase modulation (SPM) and is passed on into the optical neuron 251. The spectral phase experiences a nonlinear modulation due to the increased bandwidth. The SPM is used here, for example, as an optical weight. Vice versa, spectral filters can prevent the triggering of the neuron.
Furthermore, the integration of the above-described components on a semiconductor 280, in particular, an electronic-photonic cointegrated chip in CMOS, bi-CMOS, bybrid bi-CMOS, Si—N CMOS process or the like, is conceivable to provide the at least one network component 250 or the network 200.
The above explanation of the embodiments describes the present disclosure exclusively in the context of examples. Of course, individual features of the embodiments can be freely combined with one another, if technically reasonable, without leaving the scope of the present disclosure.
Number | Date | Country | Kind |
---|---|---|---|
10 2020 214 907.0 | Nov 2020 | DE | national |
This patent application is a U.S. National Phase of International Patent Application No. PCT/EP2021/080287, filed 1 Nov. 2021, which claims priority to German Patent Application No. 10 2020 214 907.0, filed 27 Nov. 2020, the disclosures of which are incorporated herein by reference in their entireties.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2021/080287 | 11/1/2021 | WO |