The present disclosure generally pertains to the field of image and video processing, in particular to devices, methods and systems for image upscaling.
In many applications, images or video data is captured with undesirable properties, like a resolution that is too low. This can be due to sensor imperfections—like lens errors—or price restrictions on the sensors, or sometimes due to losses during transmission (e.g. if the video bandwidth mandates the use of compression). For example, in many cases, images captured by cameras or by other means (e.g. NMR, CT, X-ray and the like) do not have the required properties with respect to resolution or aberrations, e.g. due to lens errors.
There exist upscaling techniques for image improvement. For example, it is known to provide a high-resolution image from a number of overlapping low resolution frames of the same scene. At the displaying device, an improved version of the image(s) is restored or displayed, e.g. a higher resolution image, an undistorted image, or the like. In video technology, for example, the magnification of digital images is known as upscaling or resolution enhancement. By enhancement, a clearer image with higher resolution is produced.
It is also known to use pre-trained Deep Neural Networks for image enhancement or upscaling. The network is trained with a low quality image at the input, and a high quality image at its output, and learns the mapping between the two images. Typically, this is done offline on a large database of image pairs. As much data is typically needed to achieve a high level of robustness this process takes substantial time to process.
Although there exist image upscaling techniques for image improvement, it is desirable to provide devices, methods and computer programs which provide an improved quality in image upscaling.
It is generally desirable to provide devices, methods and computer programs which provide an improved quality in image up scaling.
According to a first aspect the disclosure provides a computer-implemented method comprising a pre-trained artificial neural network using higher-quality reference data together with lower quality data to obtain an adapted artificial neural network.
According to a further aspect the disclosure provides an electronic device comprising circuitry configured to create an improved image from a degraded image by mapping the degraded image to the improved image with an adapted artificial neural network, wherein the adapted artificial neural network is obtained by training a pre-trained artificial neural network using degraded data together with higher-quality reference data
Further aspects are set forth in the dependent claims, the following description and the drawings.
Embodiments are explained by way of example with respect to the accompanying drawings, in which:
The embodiments described below in more detail disclose a method comprising adapting a pre-trained artificial neural network using degraded data together with higher-quality reference data to obtain an adapted artificial neural network.
The pre-trained artificial neural network may in particular be adapted by performing a training process based on training data. This training data may comprise the degraded data. Adapting, respectively training the artificial neural network may for example comprise adapting weights related to the nodes of the artificial neural network. This adapting may for example be performed using a stochastic gradient descent method, or similar techniques. The adaptation may for example be similar to a standard gradient decent step in DNN training, where backpropagation is used to calculate the partial deviates.
The pre-trained artificial neural network, respectively the adapted artificial neural network obtained from the pre-trained artificial neural network may for example be any computing framework for machine learning algorithms to work together and process complex data inputs. For example, the pre-trained artificial neural network, respectively the adapted artificial neural network may be a deep neural network (DNN).
The embodiments disclose a process which creates an improved image from a distorted or low resolution original image. The mapping between the two is derived by adaptation of a pre-trained Deep Neural Network using data from the specific instance of the imager and the application, together with high-quality reference data that is supplied during a limited time period, called adaption process. As a result, a very high quality of the output image can be achieved, higher than with standard methods.
The method may comprise using the adapted artificial neural network to create an improved image from a degraded image by mapping the degraded image to the improved image. In the case that a pre-trained artificial neural network is trained using degraded data together with higher-quality reference data to obtain an adapted artificial neural network, the quality of the improved images is enhanced over upscaling with upscaling technology known from the prior art.
The lower quality (e.g. degraded) data is for example obtained under conditions related to the intended usage of the adapted artificial neural network. Intended usage may for example refer to the particular application in which the adapted artificial neural network is finally used for image enhancement. If the artificial neural network is trained based on degraded training data that is obtained under conditions from the specific instance of an imager and the application, other than a pre-trained static network, the artificial neural network according to the embodiments is not generic and static. The intended usage may also be referred to as “operational” usage.
The training may take into account any special characteristics (particular application) of the camera, lens, sensor, and/or compression scheme that is used during intended usage of the adapted artificial neural network.
If the adaptation takes into account, for example, any special property of the very camera, lens, or compression scheme that is being used in this particular application, as opposed to offline factory DNN training which is done using a generic training set, the artificial neural network can learn the specific image mapping necessary in the particular application (intended usage of the adapted neural network).
Other than a pre-trained static network, the adapted network according to the embodiments is not generic and static. Its properties do not only depend on the type of data that is captured in a static training image database, but it also takes into account the specific properties of the specific sensor at hand and in particular the specific type of input images that need improvement. Therefore, the quality of the improved images may be enhanced over upscaling with upscaling technology known from the prior art.
The lower-quality (degraded) data may for example take into account the specific type of degraded data that need improvement in the particular application. For example, if the adaptation is done using actual data from the particular application, for example liver data, the mapping does not need to learn how to map, say, images of a low resolution grassy meadow or images of the brain to high resolution images of the same, but can fully focus on liver cells. This also leads to higher quality images.
The degraded training data may be degraded data that relates to the high-quality reference data.
For example, the lower-quality data may result from the high-quality reference data by transmitting the high-quality reference data over a data link that does not support the full bandwidth necessary for transmitting the high-quality reference data
Alternatively, or in addition, the lower-quality data may result from the high-quality reference data by data compression. For example, compression might introduce artifacts that are highly undesirable in this problem setting and that should be mitigated by image enhancement. In the case of, for example, an operating room, where there is a higher-quality original signal but there are bandwidth limitations, it is possible to use the original signal supplied by a camera (e.g. of an endoscope) as higher-quality reference data. In other cases, a higher-quality camera can be temporarily used to generate the reference data, and after adaptation, it is no longer needed and can be used elsewhere.
The higher-quality reference data may for example be reference data that is generated on-the-fly during the adaption process using the hardware and the image content of the particular application. For the higher-quality reference, several methods can be employed.
For example, the higher-quality reference data is obtained with a higher-quality reference camera that is used along with degraded data that is captured side by side with a lower-quality camera.
According to some embodiments, the adaptation process happens during intended usage of the artificial neural network.
The adaption process may for example be performed during a limited time period at the beginning of intended usage of the neural network.
The method may further comprise pre-training an artificial neural network with generic training data to obtain the pre-trained artificial neural network. The pre-trained artificial neural network may for example depend on the type of data that is captured in a static training image database.
The degraded data may for example comprise a distorted or low resolution image. For example, the degraded data may be video data that comprises a sequence of video images (frames).
According to an embodiment, the adaptation process is done as a calibration step when devices are manufactured.
Adapting the pre-trained artificial neural network comprises updating the weights of the pre-trained artificial neural network using gradient descent and/or error backpropagation. The partial derivative of each of this pixel error signals with respect to each of the parameters of the Deep Neural Network is computed and, after one or several such images have been collected, the weights are updated by the accumulated partial derivatives multiplied by a small constant (the learning rate). This is the adaptation step, which is very similar to a standard backpropagation step in DNN training.
The degraded training data may for example comprise degraded images and the higher-quality reference data comprises higher-quality target images.
Adapting the pre-trained artificial neural network may comprise mapping a degraded image to an improved image (I1).
Still further, adapting the pre-trained artificial neural network may comprise aligning the improved image to a respective higher-quality target image.
Still further, adapting the pre-trained artificial neural network may comprise generating a difference image based on the improved image and the respective higher-quality target image.
The embodiments further disclose a method comprising: obtaining high quality reference data; obtaining lower quality data; and adapting a pre-trained artificial neural network using the higher-quality reference data together with the lower quality data to obtain an adapted artificial neural network.
The embodiments also disclose an electronic device comprising circuitry configured to create an improved image from a degraded image by mapping the degraded image to the improved image with an adapted artificial neural network, wherein the adapted artificial neural network is obtained by training a pre-trained artificial neural network using degraded data together with higher-quality reference data.
The circuitry may be configured to perform all or some of the processes described above and in the following details description of embodiments.
Circuitry may include a processor, a memory (RAM, ROM or the like), a storage, input means (mouse, keyboard, camera, etc.), output means (display (e.g. liquid crystal, (organic) light emitting diode, etc.), loudspeakers, etc., a (wireless) interface, etc., as it is generally known for electronic devices (computers, smartphones, etc.). Moreover, it may include sensors for sensing still image or video image data (image sensor, camera sensor, video sensor, etc.), for sensing a fingerprint, for sensing environmental parameters (e.g. radar, humidity, light, temperature), etc. In particular, the circuitry may comprise a DNN unit that may for example be a neural network on one or more GPUs or any other hardware specialized for the purpose of implementing an artificial neural network. Still alternatively, the circuitry may be configured to implement an artificial neural network by means of software. The circuitry may also be configured to run training algorithms such a stochastic gradient descent on the artificial neural network to adapt the neural network.
The embodiments also disclose a computer-implemented method comprising training a pre-trained artificial neural network using degraded data together with higher-quality reference data to obtain an adapted artificial neural network.
The embodiments also disclose a machine readable storage medium comprising instructions which when executed on a processor cause the processor to perform training a pre-trained artificial neural network using degraded data together with higher-quality reference data to obtain an adapted artificial neural network.
Embodiments are now described by reference to the drawings.
One example of the application of the disclosure of this application is an operating room in a hospital, in which video data needs to be transmitted from various image-capturing devices (endoscopes, high quality cameras, CT, pre-captured NMR etc) to multiple displays. Some or all of the data links might not support the full bandwidth of video data, and compression needs to be applied. Decompression might introduce artifacts that are highly undesirable in this problem setting. The inventive method provides a way how the quality of the displayed images and videos can be improved.
In the embodiments described here in more detail, a PowerLAN connection is used as an example for a data connection which provides low quality data transmission. The embodiments are, however, not restricted to this type of data connection. The same principle applies to other low quality transmission channels, e.g. bandwidth limited connections such as Bluetooth or low bandwidth Ethernet.
The adapted pre-trained DNN (adapted DNN 210 in
In the embodiment above the DNN is described by two distinguished functional units, i.e. the training DNN and the adapted DNN. Note, that nevertheless both distinguished functional units may be realized as one hardware component or as software component implemented on one electronic device.
The actual adaptation stage which is performed in the embodiment of
In the operating room 501 an endoscope 502 is used to perform a medical procedure on a patient and capture video data with high quality. The high quality video data is sent from the endoscope 502 to an image processing device 504. The image processing device 504 displays the video data in its original quality on a display screen 505 so that a surgeon may control the endoscope 502 based on the feedback provided by display screen 505. Furthermore, the image processing device 504 sends the high quality video data via a PowerLAN/WAN interface 503 using PowerLAN transmission to a PowerLAN interface 510 in the operation surveillance room 506. The image presentation device 508 receives the video data submitted from the image processing device 504 via the PowerLAN interface 510.
Due to interference factors from other devices or services using the same power lines and/or bandwidth restrictions the original video data of high quality may be received at the image presentation device 508 in the operation surveillance room 506 as video data of lower quality. Using an adapted DNN 507 the image presentation device 508 is able to improve the received low quality video data and display an improved video data at a screen 509. The adapted DNN receives regular updates from a training DNN and is therefore perfectly suited to improve low quality video data specialized on the errors and distortions specific to this exact setting. Furthermore, the image processing device 504 sends the high quality video data via WAN (for example DSL or Ethernet) using the PowerLAN/WAN interface 503 to the cloud computing systems WAN Interface 512. The high quality video data is used in the cloud computing system 511 to train the training DNN 513.
An advantage of the adaptation as described above lies in the specificity as opposed to the offline factory DNN training (see pre-training 801 in
Implementation
The electronic device 1100 further comprises a data storage 1102 and a data memory 1103 (here a RAM). The data memory 1103 is arranged to temporarily store or cache data or computer instructions for processing by the processor 1101. The data storage 1102 is arranged as a long term storage, e.g., for recording video data obtained from the graphical input unit 1109. The data storage 1102 may also store data obtained from the DNN 1107.
It should be noted that the description above is only an example configuration. Alternative configurations may be implemented with additional or other sensors, storage devices, interfaces, or the like.
In the embodiments of
It should be recognized that the embodiments describe methods with an exemplary ordering of method steps. The specific ordering of method steps is, however, given for illustrative purposes only and should not be construed as binding. For example steps 402 and 403 in
It should also be noted that the division of the electronic device of
All units and entities described in this specification and claimed in the appended claims can, if not stated otherwise, be implemented as integrated circuit logic, for example, on a chip, and functionality provided by such units and entities can, if not stated otherwise, be implemented by software.
In so far as the embodiments of the disclosure described above are implemented, at least in part, using software-controlled data processing apparatus, it will be appreciated that a computer program providing such software control and a transmission, storage or other medium by which such a computer program is provided are envisaged as aspects of the present disclosure.
Note that the present technology can also be configured as described below:
(1) A method comprising adapting a pre-trained artificial neural network (207; 513) using higher-quality reference data together with lower quality data to obtain an adapted artificial neural network (210; 507).
(2) The method of (1) further comprising using the adapted artificial neural network (210; 507) to create an improved image (I1) from a degraded image (I0) by mapping the degraded image (I0) to the improved image (I1).
(3) The method of (1) or (2), wherein the degraded data is obtained under conditions related to the intended usage of the adapted artificial neural network (210; 507).
(4) The method of anyone of (1) to (3), wherein the training takes into account any characteristics of the camera (202; 502; 803), lens, sensor, and/or compression scheme that is used during intended usage of the adapted artificial neural network (210; 507).
(5) The method anyone of (1) to (4), wherein the degraded data takes into account the specific type of degraded data that need improvement in the particular application.
(6) The method anyone of (1) to (5), wherein the degraded data results from the high-quality reference data by transmitting the high-quality reference data over a data link (204, 212; 510, 503) that does not support the full bandwidth necessary for transmitting the high-quality reference data.
(7) The method of anyone of (1) to (6), wherein the degraded training data results from the high-quality reference data by data compression.
(8) The method of anyone of (1) to (7), wherein the higher-quality reference data is reference data that is generated on-the-fly using the hardware and the image content of a particular application.
(9) The method of anyone of (1) to (8), wherein the higher-quality reference data is obtained with a higher-quality reference camera (804) that is used along with degraded data that is captured side by side with a lower-quality camera (803).
(10) The method of anyone of (1) to (9), wherein the adaptation process happens during intended usage of the artificial neural network (210; 507).
(11) The method of anyone of (1) to (10), wherein the adaption process is performed during a limited time period at the beginning of intended usage of the adapted neural network.
(12) The method of anyone of (1) to (11), further comprising pre-training an artificial neural network with generic training data (802) to obtain the pre-trained artificial neural network.
(13) The method of anyone of (1) to (12), wherein the degraded data comprises a distorted or low resolution image (I0).
(14) The method of anyone of (1) to (13), wherein the adaptation process is done as a calibration step when devices are manufactured.
(15) The method of anyone of (1) to (14), wherein adapting the pre-trained artificial neural network (210; 507) comprises updating the weights of the pre-trained artificial neural network using gradient descent and/or error backpropagation.
(16) The method of anyone of (1) to (15), wherein the degraded training data comprises degraded images (I0) and the higher-quality reference data comprises higher-quality target images (I2).
(17) The method of anyone of (1) to (16), wherein adapting the pre-trained artificial neural network comprises mapping a degraded image (I0) to an improved image (I1).
(18) The method of anyone of (1) to (17), wherein adapting the pre-trained artificial neural network comprises aligning the improved image (I1) to a respective higher-quality target image (I2).
(19) The method of anyone of (1) to (18), wherein adapting the pre-trained artificial neural network comprises generating a difference image (D) based on the improved image (I1) and the respective higher-quality target image (I2).
(20) An electronic device (210; 1100) comprising circuitry configured to create an improved image from a degraded image by mapping the degraded image to the improved image with an adapted artificial neural network, wherein the adapted artificial neural network is obtained by training a pre-trained artificial neural network using degraded data together with higher-quality reference data.
(21) A method comprising:
Number | Date | Country | Kind |
---|---|---|---|
19172785.8 | May 2019 | EP | regional |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2020/062428 | 5/5/2020 | WO | 00 |