This application claims priority to European Patent Application No. 23 201 829.1 filed Oct. 5, 2023, the disclosure of which is incorporated herein by reference.
The present disclosed subject matter relates to a method, a system and a computing device for automatic license plate recognition (ALPR).
Recognising license plates of vehicles is an important task in Intelligent Transportation Systems (ITS) and vehicle tolling applications to identify vehicles for surveillance, routing and tolling. In common ALPR schemes, a camera device is mounted at a surveillance location, e.g., next to or above a road, at the entrance to or within a parking lot, garage, etc., to record images which each include the license plate number (LPN) of a vehicle. Each of the recorded images is then machine-read by a computing device to recognise the LPN included therein. To this end, the computing device usually crops each image to an outer boundary of the license plate, identifies a bounding frame for each character of the LPN within that boundary, feeds those bounding frames, one after the other, to an optical character recognition (OCR) algorithm which reads them, and merges the OCR-read characters, one after the other, into the resulting LPN of each image. Based on the recognised LPN, the computing device may initiate a charging process, an opening of a gate or barrier, a routing of the vehicle, etc.
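Purely for illustration, such a conventional character-by-character pipeline could be sketched in Python as follows; the helper functions crop_to_plate, find_character_frames and ocr_read_character are hypothetical placeholders for the plate detector, character segmenter and OCR algorithm mentioned above and are not part of the disclosure:

```python
def read_lpn_conventional(image):
    """Illustrative sketch of the conventional character-by-character ALPR pipeline.

    crop_to_plate, find_character_frames and ocr_read_character are hypothetical
    placeholders for the plate detector, character segmenter and OCR algorithm.
    """
    plate = crop_to_plate(image)                  # crop to the outer boundary of the license plate
    frames = find_character_frames(plate)         # one bounding frame per character of the LPN
    characters = [ocr_read_character(frame) for frame in frames]  # OCR-read each frame separately
    return "".join(characters)                    # merge the read characters into the resulting LPN
```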
While algorithms for bounding frame identification and for OCR reading have evolved over the years, the error probabilities of wrongly identified bounding frames and wrongly OCR-read characters are still high, in particular when images are recorded under bad lighting conditions, at oblique angles, for dirty license plates, concealed LPNs, etc. Whenever an LPN is recognised by the computing device with a high error probability (low confidence), surveillance staff needs to review the corresponding recorded image to guarantee a correct recognition of the LPN. Surveillance staff, however, is expensive and the manual review of images takes a long time, such that time-critical applications like barrier or gate opening and routing are impeded by the high error probabilities of current ALPR systems.
It is an object of the disclosed subject matter to provide a method, a system and a computing device for ALPR which make it possible to recognise LPNs with low error probabilities and, thus, more reliably as well as in a cost- and time-saving manner.
To this end, the disclosed subject matter provides in a first aspect a method for automatic license plate recognition, comprising:
The method of the disclosed subject matter is based on the surprising finding that images which each include a complete LPN, i.e. not just one character at a time but all the characters of the LPN, can be classified with low error rates by one single artificial neural network (NN) having one output node for each LPN, i.e. by means of a one-to-one mapping between output nodes and LPNs.
In the holistic approach of the disclosed subject matter, the LPNs are extracted from the images of the first set to determine the different LPNs included in the first set, to generate the NN with the appropriate number N1 of output nodes, and to train the NN with the images and the extracted LPNs. After training, the NN is able to reliably recognise the LPNs it has been trained on, i.e. when the sample image is fed into the NN, the output node for the LPN that is included in the image “fires” reliably, i.e. outputs a distinguished highest value. In this way, the correct LPN is recognised by the trained NN with a low error rate (a high confidence) and without the necessity to identify and OCR-read each of the characters of the license plate number separately. As the trained NN is less error prone, fewer or no surveillance staff needs to be employed.
Summing up, generating and training the NN with one output node per different LPN to be recognised achieves low error rates and allows for a more reliable, cost- and time-saving ALPR.
The first set may comprise the images recorded and stored within a certain time interval, e.g., of one hour or less, one day, one week, one year or more. However, due to storage space restrictions or data privacy requirements, a storage of the recorded images for a long time interval is often impossible, for instance prohibited by the General Data Protection Regulation (Regulation (EU) 2016/679, abbreviated GDPR). To overcome this issue, in an optional embodiment the method of the disclosed subject matter further comprises:
In this embodiment the NN is generated, extended and trained charge-wise, always with a current one of (at least) two charges while the previous charge/s are deleted, i.e. with a first charge comprising the images and LPNs of the first set recorded and stored within a first time interval, and—after deleting the first charge—a second charge comprising the images and LPNs of the second set recorded and stored within a second time interval, and optionally—after deleting the second charge—with a further (third, fourth, etc.) charge.
To adapt the NN for a reliable recognition of the “new” LPNs, which are included in the images of the current (e.g., the second) set and not in the images of the previous (e.g., the first) set, the different new LPNs are determined from the extracted LPNs of the current set and the NN is extended by one further output node for each different new LPN.
While the current training of the NN is performed on the current set of images—as the images of the previous set/s is/are deleted—applicant's research has shown that the extended NN is still capable of recognising the new as well as the “old” LPNs which are included in the images of the previous set/s and not in the images of the current set. Hence, no “catastrophic forgetting” problem has been observed. Therefore, the sample image may include either a “new” or an “old” LPN, both being reliably recognised by the extended NN.
Seen from another perspective, the extended NN with its one-to-one mapping between output nodes and LPNs is well suited for the charge-wise generating/extending and training such that the images of the previous set/s may be deleted to save storage space and achieve a high standard of data privacy.
In a favourable variant of this embodiment the images of the second set are fed into the artificial neural network and the different license plate numbers that are included in the second set and not in the first set are determined as the different license plate numbers included in those images of the second set for which all of the output nodes output a respective value below a predetermined threshold value. In this way, the NN itself is employed to find out whether an image of the current (e.g., second) set fed into the NN includes an LPN that is old (has one high value of the output nodes) or that is new (only has low values of the output nodes). This allows for a fast determination of the different new LPNs. Optionally, the old LPNs may thereby simultaneously be extracted by the NN.
Advantageously, the mapping between the output nodes and the corresponding license plate numbers is stored in a mapping table. Utilising a mapping table makes it possible to encode/decode each LPN by its corresponding output node, e.g., to increase data privacy when the mapping table may only be accessed by authorised users/personnel. Moreover, the mapping table may be accessed to quickly determine whether an LPN extracted from an image already has a corresponding output node or whether a new output node is to be added to the NN for that LPN.
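A non-limiting sketch of such a mapping table, here assumed to be implemented as a plain Python dictionary, could look as follows:

```python
# Non-limiting sketch: the mapping table as a plain Python dictionary
# (any persistent, access-controlled key-value store could be used instead).
mapping_table = {}  # LPN string -> index of the corresponding output node

def output_node_for(lpn):
    """Return the output node index for an LPN, adding a new node index if the LPN is novel."""
    if lpn not in mapping_table:
        mapping_table[lpn] = len(mapping_table)  # next free output node
    return mapping_table[lpn]

def lpn_for(node_index):
    """Decode an output node index back to its LPN (inverse lookup)."""
    return next(lpn for lpn, idx in mapping_table.items() if idx == node_index)
```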
In a favourable embodiment said step of extracting the license plate number comprises OCR reading the recorded images of the first and/or second sets of images character by character. In this embodiment the disclosed method utilises—often pre-existing—OCR reading capabilities to extract all or a part of the LPNs for the subsequent generating, extending and training of the NN.
The NN may be trained on and recognise LPNs in the images as recorded. However, to allow for a less complex NN, e.g., having fewer nodes and/or layers, and for further reducing the ALPR error rates, the recorded images may optionally be pre-processed, in particular by at least one of resizing, converting to grayscale, blur filtering, rotating, cropping to an outer boundary of the license plate, and image sharpening.
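Merely as an example, such a pre-processing could be sketched with the OpenCV library as follows; the chosen target size and filter parameters are assumptions for illustration only:

```python
import cv2

def preprocess(image, target_size=(128, 64)):
    """Illustrative pre-processing sketch using OpenCV; the target size and
    filter parameters are assumptions, not values taken from the disclosure."""
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)   # convert to grayscale
    blurred = cv2.GaussianBlur(gray, (3, 3), 0)      # blur filtering
    resized = cv2.resize(blurred, target_size)       # resize to a fixed input size
    # rotating, cropping to the plate boundary and sharpening could be added analogously
    return resized
```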
In a further beneficial embodiment, in said step/s of training, each image of the first and/or second set is fed into the artificial neural network P times, P being in a range from 2 to 50, in particular from 5 to 20, e.g., from 7 to 13. Thereby, the NN is sufficiently trained to recognise the LPN in the recorded images of the first and/or second sets. Moreover, with these numbers of repeated image feeding no catastrophic forgetting of old LPNs has been observed.
The method of the disclosed subject matter may be carried out with many types of NNs suitable for image recognition. In an optional embodiment the artificial neural network is a convolutional neural network (CNN). A CNN is particularly suited for ALPR and achieves particularly low error rates. Even with a low-complexity CNN, which is fast in training and evaluation, low error rates have been achieved.
In a second aspect the disclosed subject matter provides a system for automatic license plate recognition, comprising:
The system utilises the disclosed method in order to recognise LPNs. To this end the system may utilise any of the above-mentioned embodiments to achieve the above-mentioned advantages.
In a third aspect the disclosed subject matter provides for a computing device configured to
The computing device may as well utilise any of the above-mentioned embodiments to achieve the above-mentioned advantages.
The disclosed subject matter will now be described by means of exemplary embodiments thereof with reference to the enclosed drawings, in which:
The camera device 2 has one or more (here: six) cameras 4 mounted at one or more surveillance locations where license plate numbers (LPNs) Lj are to be recognised for tolling, surveillance, routing, gate opening, etc., here: at a road 5 which is traversed by vehicles 6-8, alternatively or additionally: at a parking lot, garage, gate, barrier, etc. By means of the cameras 4, the camera device 2 records images 91, 92, . . . , generally 9i, of LPNs Lj of license plates 10-14 carried by the vehicles 6-8. Each recorded image 9i, thus, includes one LPN Lj. Each LPN Lj, in turn, includes several (at least two, typically four or more) characters which may each be a number, a letter and/or a logogram such as a seal, escutcheon, state emblem, etc., a Chinese, Japanese or Korean character, or the like.
The computing device 3 may comprise one or more computers, servers, mobile devices, tablets, etc., which may be located in the vicinity of the camera device 2, e.g., at the road 5, and/or remote therefrom, e.g., at a back office. The computing device 3 recognises LPNs Lj in the recorded images 9i and, based thereon, initiates a tolling or charging process, opens a gate or barrier, routes the vehicles 6-8, etc.
Within a first time interval, e.g., one hour, day, week, month, year or more, the camera device 2 records and transmits a first set S1 of images 9i (
The computing device 3 extracts the LPN Lj in each image 9i of the first set S1. The extracting may be carried out in many ways and optionally employ already existing infrastructure. In the example of
The computing device 3 generates the NN 15 with one output node O1, O2, . . . , generally Oj, for each of the N1 different LPNs L1-LN1 in the first set S1. The NN 15 may be generated as any neural network suitable for image recognition such as a convolutional neural network (CNN), a capsule neural network (CapsNet), etc. and with many suitable structures, e.g., with a variety of types and numbers of layers 17-21 and nodes 22, a variety of node connections 23 and node activation functions. However, the NN 15 has one respective output node Oj for each of the N1 different LPNs Lj in the first set S1, i.e. N1 output nodes O1-ON1 in the output layer 21.
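By way of a non-limiting example, a low-complexity CNN with one output node per different LPN could be sketched in PyTorch as follows; the layer sizes and the assumed 64x128 grayscale input are illustrative assumptions, not requirements of the disclosure:

```python
import torch.nn as nn

class LpnClassifier(nn.Module):
    """Sketch of a low-complexity CNN with one output node per different LPN;
    the layer sizes and the 64x128 grayscale input are illustrative assumptions."""

    def __init__(self, num_lpns):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
        )
        self.output_layer = nn.Linear(32 * 16 * 32, num_lpns)  # one output node Oj per LPN Lj

    def forward(self, x):                        # x: batch of 1 x 64 x 128 grayscale images
        x = self.features(x)
        return self.output_layer(x.flatten(1))   # one value 27j per output node Oj

# e.g., model = LpnClassifier(num_lpns=N1) with N1 different LPNs in the first set S1
```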
The computing device 3 may generate the NN 15 at once, e.g., after extracting the LPNs Lj, by counting the N1 different extracted LPNs L1-LN1 and creating the NN 15 with N1 output nodes O1-ON1. Alternatively, the computing device 3 may generate the NN 15 successively, e.g., during extracting the LPNs Lj, by checking for each extracted LPN Lj whether it is novel (has no corresponding output node Oj yet) and, if so, adding one output node Oj to the NN 15 for that novel LPN Lj.
The computing device 3 optionally stores the mapping between each different LPN Lj and its corresponding output node Oj, e.g., the mapping indicated by arrows 24 between O1 and ‘123’, O2 and ‘456’, etc., in a mapping table 25. The computing device 3, thus, may access the mapping table 25 to determine the number N1 and/or to check whether an LPN Lj extracted from an image 9i is novel. Alternatively to the mapping table 25, the computing device 3 may indicate the mapping by labelling the respective output node Oj with the corresponding LPN Lj, e.g., by labelling the first node O‘123’, the second node O‘456’, etc. (not shown), to preserve the mapping.
The computing device 3 trains the NN 15 on the images 9i of the first set S1 and the extracted LPNs Lj of the first set S1. To this end, the computing device 3 feeds the images 9i of the first set S1 into the NN 15 (indicated by arrow 26), compares the values 27j output by the output nodes Oj with the correct values 27j/LPN Lj/output node Oj for the respective image 9i (indicated by arrow 28) and adapts the parameters, e.g., the weights, biases and/or structure, of the NN 15 based on the comparison. For instance, when the first image 91 is fed into the NN 15, the target value 271 of the first output node O1 shall be ‘1’ and the target values 27j,j≠1 of the other output nodes Oj,j≠1 shall be ‘0’, since the first LPN L1 ‘123’ extracted from the first image 91 corresponds to the first output node O1.
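For illustration only, such a training step could be sketched as follows, assuming the classifier of the earlier sketch, a dataset of pairs of image tensors and target output node indices, and a cross-entropy loss with the Adam optimiser (the optimiser, learning rate and batch size being assumptions):

```python
import torch
from torch.utils.data import DataLoader

def train(model, dataset, num_epochs, lr=1e-3):
    """Sketch of the training step: 'dataset' holds pairs of image tensors and the
    indices of their target output nodes; optimiser and learning rate are assumptions."""
    loss_fn = torch.nn.CrossEntropyLoss()                 # compares the output values 27j with the target node
    optimizer = torch.optim.Adam(model.parameters(), lr=lr)
    for _ in range(num_epochs):                           # e.g., P epochs, P in a range from 2 to 50
        for images, target_nodes in DataLoader(dataset, batch_size=32, shuffle=True):
            optimizer.zero_grad()
            loss = loss_fn(model(images), target_nodes)   # compare and ...
            loss.backward()                               # ... backpropagate the error
            optimizer.step()                              # adapt weights and biases
```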
For adapting the parameters the computing device 3 may use any known neural network training algorithm such as backpropagation, difference target propagation, etc.
The training is carried out at least until for each image 9i of the first set S1 the output node Oj for the LPN Lj included in that image 9i outputs the highest value 27j of all output nodes Oj. For instance, after the training, when the second image 92 including the second LPN L2 ‘456’ is fed into the NN 15, the second output node O2 for the LPN L2 ‘456’ included in that image 92 should output the highest value 272 of all output nodes Oj. To this end, the computing device 3 may feed each image 9i of the first set S1 into the NN 15 P times, wherein the number P is in a range from 2 to 50, in particular from 5 to 20, e.g., from 7 to 13.
Said recording, extracting, generating and training steps may be carried out for the whole first set S1 at once or for successive subsets of the first set S1, e.g., batch-wise or image-wise. In the latter case, the NN 15 may be generated by repeatedly adding one output node Oj for each novel LPN Lj extracted from a newly recorded image 9i or batch of the first set S1 and be trained by repeatedly adding the newly recorded image 9i or batch of the first set S1 to a training set.
Once the NN 15 has been trained, the computing device 3 utilises the NN 15 for recognising an LPN Lj in at least one fresh (“sample”) image 9s it has not been trained on. To this end, the camera device 2 records the sample image 9s, e.g., of the vehicle 6 when it traverses the camera device 2 a second time, indicated by dashed lines in
As can be seen in
Alternatively to the shown ALPR pipeline 30 the NN 15 may be included in any other type of ALPR pipeline, e.g., be downstream of or parallel to the OCR unit 16, or may be used stand-alone, etc.
As indicated by the feedback stream 38 in
In one embodiment the computing device 3 carries out the further training and optional extending with the first and second sets S1, S2, which may be seen as an extension of the first set S1 by the images 9i of the second set S2.
In another embodiment the computing device 3 deletes the first set S1 of images 9i after the first time interval and carries out the further training and optional extending with the second set S2 only. With reference to
The computing device 3 extracts the LPN Lj in each image 9i of the second set S2, e.g., as described above with respect to the first set S1. Alternatively or in addition to the above-mentioned extractions, the computing device 3 may extract old LPNs Lj of the second set S2 (which were included in the first set S1) by feeding the images 9i of the second set S2 into the NN 15 (following the dashed arrow 39) and recognising the LPN Lj of each image 9i for which one output node Oj outputs a respective highest value 27j above a given threshold as the LPN Lj of that output node Oj (following the dashed arrow 40). Those images 9i for which no output node Oj outputs a respective highest value 27j above a given threshold may be input to the OCR unit 16 (following the dashed arrow 41) and, when OCR-reading fails, reviewed by surveillance staff.
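A non-limiting sketch of this NN-assisted extraction could look as follows; ocr_read_plate is a hypothetical placeholder for the OCR unit 16, lpn_for is the inverse lookup of the mapping table sketched above, and the threshold value is an assumption:

```python
import torch

def extract_lpn(model, image_tensor, threshold=0.9):
    """Sketch of the NN-assisted extraction for the second set S2: if one output node
    clearly fires, the (old) LPN is taken from the NN; otherwise the image is handed
    to OCR. ocr_read_plate is a hypothetical placeholder for the OCR unit 16 and the
    threshold value is an assumption."""
    with torch.no_grad():
        values = torch.softmax(model(image_tensor.unsqueeze(0)), dim=1)[0]  # values 27j
    best_value, best_node = values.max(dim=0)
    if best_value.item() >= threshold:
        return lpn_for(best_node.item()), False   # old LPN, already has an output node
    return ocr_read_plate(image_tensor), True     # presumably novel LPN, extracted by OCR
```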
When the second set S2 includes at least one new LPN Lj, i.e. when N2′ is greater than 0, the computing device 3 extends the NN 15 by one output node ON1+1, ON1+2, . . . , generally Oj, for each of the N2′ different LPNs LN1+1-LN1+N2′ that are included in the second set S2 and not in the first set S1, i.e. for each novel LPN Lj. Hence, the extended NN 15 has one respective output node Oj for each of the N1+N2′ different LPNs Lj in the first and second sets S1, S2, i.e. N1+N2′ output nodes O1-ON1+N2′ in the output layer 21.
Similar to the above-mentioned generation, the computing device 3 may extend the NN 15 at once for all the extracted LPNs LN1+1-LN1+N2′ of the second set S2 by counting the N2′ different LPNs Lj that were not included in the first set S1 and extending the NN 15 by N2′ output nodes ON1+1-ON1+N2′; or successively by incrementally adding one output node Oj to the NN 15 whenever a novel LPN Lj is extracted.
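Assuming the classifier of the earlier sketch with a final fully connected output layer, the extension could, for instance, be sketched as follows, with the trained weights and biases of the existing output nodes being retained:

```python
import torch
import torch.nn as nn

def extend_output_layer(model, num_new_lpns):
    """Sketch of extending the NN by N2' further output nodes while retaining the
    trained weights and biases of the existing output nodes (assumes the classifier
    of the earlier sketch with a final nn.Linear output layer)."""
    old = model.output_layer
    new = nn.Linear(old.in_features, old.out_features + num_new_lpns)
    with torch.no_grad():
        new.weight[:old.out_features] = old.weight  # keep the weights of the old output nodes
        new.bias[:old.out_features] = old.bias      # keep the biases of the old output nodes
    model.output_layer = new
    return model
```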
To determine the novel LPNs Lj in the images 9i of the second set S2 for extending, the computing device 3 may optionally utilise the NN 15 trained on the first set S1. Thereby, the computing device 3 feeds the images 9i of the second set S2 into the NN 15 and determines the different LPNs LN1+1-LN1+N2′ that are included in the second set S2 and not in the first set S1 (the novel LPNs) as the different LPNs LN1+1-LN1+N2′ included in those images 9i of the second set S2 for which all of the output nodes Oj output a respective value 27j below a predetermined threshold value, e.g., in the images 9i fed into the OCR unit 16 via arrow 41. Alternatively, the computing device 3 may access the mapping table 25 (if present) to determine whether an extracted LPN Lj of the second set S2 is not already included therein and, thus, is novel.
The computing device 3 may optionally add the mapping between each novel LPN Lj and its corresponding output node Oj to the mapping table 25 (if present).
The computing device 3 trains the extended NN 15 on the images 9i of the second set S2 and the extracted LPNs Lj of the second set S2 as detailed above with reference to
Once the NN 15 has been trained on the second set S2, the computing device 3 may utilise the NN 15 to recognise (again or for the first time) the LPN Lj in a sample image 9s recorded by the camera device 2. Therefor, the computing device 3 feeds the recorded sample image 9s into the extended NN 15 and recognises the LPN Lj included in the sample image 9s as detailed above. As shown in
The streams 34 and 37 (and optionally also stream 32) recorded within the third time interval may be used as a third set S3 of recorded images 9i to further train the NN 15 and to optionally extend the NN 15 further by adding further output nodes Oj thereto for each novel LPN Lj, as described above. The above-mentioned training and optional extending of the NN 15 may, thus, be continued for further sets S3, S4, . . . , generally Sk, wherein any old set Sk−1 of images 9i may optionally be deleted before or while a new set Sk of images 9i is recorded and stored within the respective time interval, LPN-extracted and used to train and optionally extend the NN 15.
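Merely for illustration, one such charge-wise iteration could be sketched as follows, reusing the hypothetical helpers of the earlier sketches; the control flow shown is an assumption, not a prescription of the disclosure:

```python
def process_charge(model, images, mapping_table, num_epochs):
    """Sketch of one charge-wise iteration for a new set Sk, reusing the hypothetical
    helpers of the earlier sketches; the control flow is an illustrative assumption."""
    labelled = []
    new_lpns = 0
    for image in images:
        lpn, _ = extract_lpn(model, image)            # NN-assisted and/or OCR extraction
        if lpn not in mapping_table:
            mapping_table[lpn] = len(mapping_table)   # reserve a new output node for a novel LPN
            new_lpns += 1
        labelled.append((image, mapping_table[lpn]))
    if new_lpns:
        model = extend_output_layer(model, new_lpns)  # add one output node per novel LPN
    train(model, labelled, num_epochs)                # train on the current set Sk only
    # afterwards the images of the previous set may be deleted from storage
    return model
```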
Optionally, before said extracting, training and/or recognising steps, the computing device 3 may pre-process the recorded images 9i. In particular the computing device 3 may, e.g., depending on the recording angles and conditions of the respective image 9i, convert any image 9i to grayscale, filter out a blur therein, rotate it, crop it to an outer boundary of the license plate 10-14 and/or sharpen it. All of these steps may, of course, be applied in any order.
With reference to
In a first step 43 the first set S1 of images 9i is recorded, e.g., by means of the camera device 2 within the first time interval. Each recorded image 9i includes one of N1 different LPNs L1-LN1 and each LPN Lj, in turn, comprises two or more characters.
In a second step 44 the LPN Lj in each image 9i of the first set S1 is extracted. Thereby, the recorded images 9i of the first set S1 may be OCR-read character by character by means of the OCR-unit 16 and/or may be read out by surveillance staff.
In a third step 45 the NN 15 is generated with one output node Oj for each of the N1 different LPNs L1-LN1 in the first set S1. The NN 15 may be generated as any neural network suitable for image recognition and with any suitable structure, be it at once for all the extracted LPNs L1-LN1, counting the N1 different LPNs and creating the NN 15 with N1 output nodes Oj, or successively by incrementally adding one output node Oj to the NN 15 whenever a novel LPN Lj is extracted, as detailed above. Optionally the mapping between each different LPN Lj and its corresponding output node Oj may be stored in the mapping table 25.
In a fourth step 46 the NN 15 is trained on the images 9i of the first set S1 and the extracted LPNs L1-LN1 of the first set S1. In this step, the images 9i of the first set S1 are fed into the NN 15, the values 27j output by the output nodes Oj are compared with the correct values 27j indicated by the extracted LPNs L1-LN1 of the images 9i and the parameters of the NN 15 are adapted based on the comparison according to any known neural network training algorithm such as backpropagation, difference target propagation, etc. The training is carried out at least until for each image 9i of the first set S1 the output node Oj for the LPN Lj included in that image 9i outputs the highest value 27j of all output nodes Oj.
Optionally, each image 9i of the first set S1 may be fed into the NN 15 P times in step 46, the number P being in a range from 2 to 50, in particular from 5 to 20, e.g., from 7 to 13. For example, the NN 15 may be trained for a number P of epochs, wherein each image 9i is fed into the NN 15 in each epoch.
The steps 43-46 of recording, extracting, generating and training may be carried out one after the other for the whole first set S1 at once or in an overlapping manner with successive parts of the first set S1 being recorded and used for said extracting, generating and training. For example, the NN 15 may be generated by repeatedly adding one output node Oj for each novel LPN Lj extracted from a recently recorded image 9i or batch of the first set S1 and be trained by repeatedly adding a recently recorded image 9i or batch of the first set S1 to a training set.
In a fifth step 47 a sample image 9s is recorded. The sample image 9s includes an LPN Lj that was also included in the images 9i the NN 15 had been trained on so far.
In a sixth step 48 the sample image 9s is fed into the NN 15 and in a seventh step 49 the LPN Lj of the sample image 9s is recognised as the LPN Lj of that output node Oj which outputs the highest value 27j. To retrieve the LPN Lj of the output node Oj that outputs the highest value 27j, the mapping table 25 (if present) may be accessed, the output node Oj be read (if labelled), etc. as detailed above.
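A non-limiting sketch of steps 48 and 49 could look as follows, with lpn_for denoting the inverse lookup of the mapping table 25 sketched above:

```python
import torch

def recognise(model, sample_image_tensor):
    """Sketch of steps 48 and 49: feed the sample image 9s into the NN and return the
    LPN of the output node with the highest value 27j; lpn_for is the inverse lookup
    of the mapping table 25 sketched above."""
    with torch.no_grad():
        values = model(sample_image_tensor.unsqueeze(0))[0]  # one value 27j per output node Oj
    best_node = int(values.argmax())                          # the output node that "fires"
    return lpn_for(best_node)
```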
Optionally, the NN 15 may be trained and extended by means of the second set S2 of images 9i either in addition to the first set S1 or after deleting, in an optional eighth step 50, the first set S1. To this end, the following optional steps 51-54 (and further optionally step 55) may be carried out within a second time interval after said step 46 of training.
In a ninth step 51 the second set S2 of images 9i is recorded, e.g., by means of the camera device 2 within a second time interval. Each recorded image 9i of the second set S2 includes one of N2 different LPNs, N2′ of which, namely the LPNs LN1+1-LN1+N2′, were not included in the first set S1.
In a tenth step 52 the LPN Lj of each image 9i of the second set S2 is extracted either automatically or manually as detailed above for step 44. In addition, the images 9i of the second set S2 may optionally be fed into the NN 15 to recognise the LPNs L1-LN1 that were already included in the first set S1.
In an eleventh step 53 the NN 15 is extended by one output node ON1+1, ON1+2, . . . , generally Oj, for each of the N2′ different LPNs LN1+1-LN1+N2′ that are included in the second set S2 but were not included in the first set S1, i.e. for each novel LPN Lj of the second set S2.
To determine the novel LPNs Lj of the second set S2 the images 9i of the second set S2 may optionally be fed into the NN 15 and the different LPNs LN1+1-LN1+N2′ that are included in the second set S2 and not in the first set S1 (the novel LPNs Lj of the second set S2) may be determined as the different LPNs LN1+1-LN1+N2′ included in those images 9i of the second set S2 for which all of the output nodes Oj output a respective value 27j below a predetermined threshold value. Alternatively, the mapping table 25 (if present) may be accessed to check whether an extracted LPN Lj of the second set S2 is not already included therein, and, thus, novel.
Optionally the mapping between each novel LPN Lj of the second set S2 and its corresponding output node Oj may be stored in the mapping table 25 (if present).
In a twelfth step 54 the extended NN 15 is trained on the images 9i of the second set S2 and the extracted LPNs Lj of the second set S2 at least until for each image 9i of the second set S2 the output node Oj for the LPN Lj included in that image 9i outputs the highest value 27j of all output nodes Oj. Thereby, each image 9i of the second set S2 may optionally be fed into the NN 15 P times, P being in a range from 2 to 50, in particular from 5 to 20, e.g., from 7 to 13.
In a thirteenth (optional) step 55 the images 9i of the second set S2 may be deleted after the second time interval.
As shown in
The steps 47-49 may be carried out after the first time interval with the NN 15 trained on the first set S1, after the second time interval with the extended NN 15 trained on the first and second sets S1, S2, and/or after any further time interval with the extended NN 15 trained on the first, second, third, etc. sets S1, S2, . . . , Sk.
The steps 44, 46, 52, 54, 59 of extracting, training and recognising may be carried out on the images 9i as recorded. Alternatively, before any of these steps, the recorded images 9i may be pre-processed. For example, the recorded images 9i may be resized, converted to grayscale, blur-filtered, rotated, cropped to an outer boundary of the license plate and/or sharpened. These pre-processing steps may be carried out in any order.
The steps 43-55 may be carried out in any order, some even simultaneously or in parallel, so far as one step does not depend on the result of another step. For example, the deletion of an old set Sk−1 in steps 50 and 55 may be carried out before, during or after carrying out any of the recording, extracting, extending and training steps 51-54 for a new set Sk.
The disclosed subject matter is not restricted to the specific embodiments described above but encompasses all variants, modifications and combinations thereof that fall within the scope of the appended claims.