This application is based upon and claims the benefit of priority from Japanese Patent Application No. 2011-230091, filed Oct. 19, 2011, the entire contents of which are incorporated herein by reference.
Embodiments described herein relate to an information processing apparatus and a method.
Formerly, a generic object recognition technology extracted the characteristic quantity of a target from the image data of a captured article and recognized (detected) the category and the like of the article by comparing the characteristic quantity with previously prepared contrast data (characteristic quantity). Moreover, a store system was provided, the generic object recognition technology was used for the recognition of a commodity such as a vegetable, a fruit and the like, and a sales registration was carried out on the recognized commodity.
According to one embodiment, an information processing apparatus includes an acquirement section, a detection section, a selection section, a display control section and a commodity recognition section. The acquirement section acquires an image captured by an image capturing section. The detection section detects all or part of targets included in the image acquired by the acquirement section. The selection section selects any one target in the condition that the detection section detects a plurality of targets. The display control section displays the target selected by the selection section in the plurality of targets on the image acquired by the acquirement section. The commodity recognition section recognizes a commodity captured by the image capturing section based on a similarity showing a degree with which all or part of the images of the target selected by the selection section are similar to the reference image of each commodity.
The POS terminal 11 is placed on the upper surface of a cash drawer 21 on a checkout platform 41. The opening operations of a cash drawer 21 is controlled by the POS terminal 11. The upper surface of the POS terminal 11 is equipped with a keyboard 22 pressed down and operated by an operator (salesclerk). Observed from one side of an operator operating the keyboard 22, a display 23 displaying information towards an operator is installed at the more inner side of the keyboard 22. The display 23 displays the information on its display surface 23a. A touch panel 26 is laminated on the display surface 23a. A rotatable display 24 for customer is vertically installed at the innermost side of the display 23. The display 24 for customer displays the information on its display surface 24a. In addition, according to the display 24 for customer shown in
A table-shaped counter 151 having a wide-width is arranged to form an L shape with the checkout counter 41 bearing the POS terminal 11 is placed. A placing surface 152 is formed on the upper surface of the counter 151. A shopping basket 153 containing a commodity G is placed on the placing surface 152. The shopping basket 153 may be distinguished in use to a first shopping basket 153a held by the hands of the customer and a second shopping basket 153b placed at a position opposite to the first shopping basket 153a through the commodity reading apparatus 101. In addition, the shopping basket 153 is not limited to the shape of a basket and also may be a tray and the like. Moreover, the shopping basket 153 (the second shopping basket 153b) is also not limited to the shape of an ordinary basket and further can be box-shaped, bag-shaped and the like.
The commodity reading apparatus 101 connected with the POS terminal 11 in the way of being transmitting data is installed on the placing surface 152 of the counter 151. The commodity reading apparatus 101 comprises a rectangular housing 102 having a relatively thin length. A reading window 103 is arranged at the front surface of the housing 102. A display/operation section 104 is mounted on the upper part of the housing 102. The display/operation section 104 is provided with a display 106, on the surface of which a touch panel 105 is laminated. A keyboard 107 is installed at the right side of the display 106. A card reading slot 108 which is not shown in figures and reads a card is installed on the right side of the keyboard 107. A display 109 for providing the information for the customer is installed at near the left inner side of the back surface of the display/operation section 104 at a position at which the operator operates.
The commodity reading apparatus 101 comprises a commodity reading section 110 (refer to
The commodity G of one transition is contained in the first shopping basket 153a held by the hands of the customer. The commodity G in the first shopping basket 153a is moved into the second shopping basket 153b by the operator operating the commodity reading apparatus 101. In the movement process, the commodity G is enabled to face to the reading window 103 of the commodity reading apparatus 101. At the moment, the image capturing section 164 (refer to
In the commodity reading apparatus 101, a picture for appointing whether or not the commodity G included in the image captured by the image capturing section 164 corresponds to the commodity registered in the following FLU file F1 (refer to
The CPU 61 of the POS terminal 11 is connected with the cash drawer 21, the keyboard 22, the display 23, the touch panel 26 and the display 24 for customer through various input and output circuits (all not shown in the figures). These components are controlled by the CPU 61.
The keyboard 22 includes a numerical keypad 22d, the upper surface of which displays a numeral such as “1”, “2”, “3” . . . and the like and a multiplication operational character such as “x”, a temporary closing key 22e and a closing key 22f.
The CPU 61 of the POS terminal 11 is connected with an HDD 64 (Hard Disk Drive) in which programs and various files are stored in the HDD 64. All or part of the program and the various files stored in the HDD 64 are copied to the RAM 63 and be sequentially executed by the CPU 61 when the POS terminal 11 is activated. An example of the program stored in the HDD 64 is a program PR for processing the sales data of the commodity. An example of the files stored in the HDD 64 is the PLU file F1 which is transmitted from a store computer SC to the POS terminal and stored in the HDD 64.
The PLU file F1 is a commodity file setting the relevancy of the information relevant with the sales registration of the commodity G and the image of the commodity G aiming at each commodity G exhibited and sold in a store.
The data structure of the PLU file F1 is not limited to the example in
Returning to
The CPU 61 of the POS terminal 11 is connected with the connection interface 65 which enables data transmission/reception with the commodity reading apparatus 101. The connection interface 65 is connected with the commodity reading apparatus 101. Moreover, the CPU 61 of the POS terminal 11 is connected with a printer 66 for printing receipts and the like. The POS terminal 11 prints the content of one transition on a receipt under the control of the CPU 61.
The commodity reading apparatus 101 further includes a microcomputer 160. The microcomputer 160 is formed by connecting an ROM 162 and an RAM 163 onto a CPU 161 by a bus line. The programs executed by the CPU 161 are stored in the ROM 162. The CPU 161 is connected with the image capturing section 164 and a sound output section 165 via various input and output circuits (all not shown in the figures). The operations of the image capturing section 164 and the sound output section 165 are controlled by the CPU 161. The display/operation section 104 is connected to the commodity reading section 110 and the POS terminal 11 through a connection interface 176. The operation of the display/operation section 104 is controlled by the CPU 161 of the commodity reading section 110 and the CPU 61 of the POS terminal 11.
The image capturing section 164 is a color CCD image sensor, a color CMOS image sensor and the like, and is the image capturing section carrying out capturing from the reading window 103 under the control of the CPU 161. For instance, the image capturing section 164 carries out the capturing of a 30 fps dynamic image. Frame images (captured images) captured with a fixed frame per second in sequence by the capturing section are stored in the RAM 163.
The sound output section 165 is a sound circuit, a loudspeaker and the like for radiating a preset warning sound and the like. The sound output section 165 informs events by utilizing the warning tone and a sound under the control of the CPU 161.
The CPU 161 is connected with a connection interface 175 which is connected with the connection interface 65 of the POS terminal 11 to transmit data with the POS terminal 11. Moreover, the CPU 161 transmits the data with the display/operation section 104 via the connection interface 175.
Next, functional components of the CPU 161 and the CPU 61 realized by sequentially executing the programs by the CPU 161 and the CPU 61 are described below with reference to
The image acquirement section 51 outputs a capturing-on signal to the image capturing section 164, so that the image capturing section 164 begins a capturing operation. The image capturing section 164 shoots the frame image R (refer to
The image region detection section 52 detects (extracts) all or part of the targets included in the frame image R acquired by the image acquirement section 51. More particularly, the image region detection section 52 detects an image region including all or part of the commodities G included in the frame image R by utilizing a pattern matching technology and the like. Particularly, contour lines and the like are extracted according to the images obtained by carrying out binaryzation on the acquired frame images. Subsequently, the contour line extracted from the frame image last time is compared with that extracted from the frame image this time, so as to detect the image region including the target.
In the example in
The image region selection section 53 selects any one target in the condition that the image region detection section 52 detects the plurality of targets. More particularly, in the condition that the image region detection section 52 detects a plurality of image regions, the image region selection section 53 selects any one image region based on the positions of the image regions including all or part of the targets (commodities G) in the frame image R.
The positions of the apexes P1 and P2 of the top left corners of the image regions A1 and A2 are compared in the description, but the positions of other parts of the image regions A1 and A2 also can be compared. As the other example, the gravity center positions of the image regions A1 and A2 can be compared.
In the above description, the mutual position relationship of the image regions is compared based on the apex positions and the gravity center positions, but the position of each image region used when the position relationship is compared is not particularly limited, and one image region also can be selected by utilizing other positions. Moreover, in the description, the position relationship is compared by utilizing the gravity center of the image region, but the gravity center also can be solved for aiming at all or part of the targets included in the image region, so as to compare the position relationship. Moreover, in such a condition, the gravity center of the target further can be solved for based on luminosity and color information in the image region. The image region selection section 53 (display control section) displays the target selected in such a format in the plurality of targets on the frame image R by utilizing a measure such as a frame, a mark and the like. That is, the image region selection section 53 displays the frame including one image region selected in such a format or displays the mark and the like near the target, so as to report the selected target to the operator.
In addition, so long as the W1 and the W2 can definitely show that the commodity G1 or the commodity G2 is selected uniquely, their shapes, display positions, sizes and colors are not limited particularly, and a frame in a shape other than a rectangle, such as the circular shape, the elliptical shape and the like can be utilized. Moreover, the size of the frame can be bigger than that of the selected image region and also can be smaller than that of the image region. Moreover, the selected image region can be displayed by a measure other than the frame, and the position where the selected image region is positioned also can be displayed by utilizing the mark such as an arrow and the like, and the like. Moreover, the selected image region further can be displayed by changing the luminosities, the colors, the contrasts and the like of the selected image region and a region other than the selected image region. In addition, the selected image region also can be displayed by combining these measures.
The similarity calculation section 54 reads a surface state such as the hue, the surface concave-convex status and the like of the commodity G as the characteristic quantity according to all or part of the images of the commodities G included in the image region selected by the image region selection section 53. In addition, in order to shorten processing time, the similarity calculation section 54 does not consider the contour and the size of the commodity G.
The similarity calculation section 54 reads the surface state such as the hue, the surface concave-convex status and the like of a registered commodity as the characteristic quantity according to the commodity image of each commodity (called as the registered commodity hereinafter) registered in the PLU file F1, and calculates the similarity of the commodity G and the commodity registered in the PLU file F1 by comparing the read characteristic quantity with the characteristic quantity of the commodity G respectively. Herein, the similarity represents the degree with which all or part of the images of the commodity G are similar in the condition that the commodity image of each commodity stored in the PLU file F1 is set to be 100%=“similarity: 1.0”. In addition, for instance, for the concave-convex statuses of an interface and a surface, the similarity can be calculated by changing weighting.
In this way, a method recognizing an object included in the image is normally called as generic object recognition. About the generic object recognition, various recognition technologies are explicated in the following literature.
Yanai Keiji, “present state and perspectives of generic object recognition”, collected papers of information processing society, Vol. 48, No. SIG16 [retrieved on Heisei Aug. 10, 22], Internet <URL:http://mm.cs.uec.ac.jp/IPSJ-TCVIM-Yanai.pdf>.
Moreover, a technology carrying out the generic object recognition by carrying out region segmentation on the image according to a goal is explicated in the following document.
Jamie Capturedton and the like, “Semantic Texton Forests for Image Categorization and Segmentation”, [retrieved on Heisei Aug. 10, 22], Internet <URL:http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.145.3036&rep=repl&type=pdf>.
In addition, a method calculating the similarity of the image of the captured commodity G and the commodity image of the registered commodity registered in the PLU file F1 is not particularly limited. For instance, the similarity of the image of the captured commodity G and each registered commodity registered in the PLU file F1 can be calculated as absolute evaluation and also can be calculated as relative evaluation. In the condition that the similarity is calculated as the absolute evaluation, the image of the captured commodity G is compared with each registered commodity registered in the PLU file F1 one by one, and the similarity educed from a comparison result is directly adopted. Moreover, in the condition that the similarity is calculated as the relative evaluation, if five registered commodities (commodities GA, GB, GC, GD and GE) are registered in the PLU file F1, the similarities of the captured commodity G are calculated to be 0.6 relative to the commodity GA, 0.1 relative to the commodity GB, 0.1 relative to the commodity GC, 0.1 relative to the commodity GD, 0.1 relative to the commodity GE, and the like, and the sum of the similarities relative to each registered commodity is 1.0 (100%).
The commodity candidate prompt section 55 displays a candidate (called as commodity candidate hereinafter) of the commodity G captured by the image capturing section 164 on the display 106 based on the similarity calculated by the similarity calculation section 54. More particularly, the commodity candidate prompt section 55 uses the registered commodity whose similarity reaches above a fixed value as the commodity candidate. Moreover, the illustration image and the commodity name of the registered commodity are read out from the PLU file F1 and are displayed in sequence on the display picture of the display 106 according to a sequence from a high similarity to a low similarity.
In this way, the similarity calculation section 54 and the commodity candidate prompt section 55 exerts functions as the commodity recognition section, that is, the commodity captured by the image capturing section 164 is recognized based on the similarity of the image of the target included in the image region selected by the image region selection section 53 and the commodity image of the registered commodity.
As shown in
That is, in
In
The input acceptance section 56 accepts various input operations corresponding to the display of the display 106 through the touch panel 105 or the keyboard 107. Moreover, the input acceptance section 56 accepts the selection operation on any one commodity candidate in the commodity candidates displayed by the display 106. The input acceptance section 56 accepts the selected registered commodity as the commodity corresponding to the commodity G.
The information output section 57 outputs the information (such as the commodity ID, the commodity name, the image file name of the selected commodity image, and the like) showing the commodity to the POS terminal 11 from the connection interface 175 aiming at the commodity accepted by the input acceptance section 56.
The information output section 57 also can output a sales number additionally input from the touch panel 105 or the keyboard 107 together with the commodity ID and the like to the POS terminal 11. Moreover, as the information output from the information output section 57 to the POS terminal 11, the commodity ID read out from the PLU file F1 by the information output section 57 can be directly notified, the file name and the commodity name of the commodity image which can specify the commodity ID also can be notified, and the storage location (storage address in the PLU file F1) of the commodity ID further can be notified to the POS terminal 11.
The sales registration section 611 of the POS terminal 11 carries out the sales registration of the corresponding commodity based on the commodity ID and the sales number output from the information output section 57. Particularly, the sales registration section 611 records the notified commodity ID, the commodity classification, the commodity name, the unit price and the like corresponding to the commodity ID and the sales number together in the sales master file and the like with reference to the PLU file F1, so as to carry out the sales registration.
Next, the operations of the checkout system 1 are described in detail.
First, the operations of the commodity reading apparatus 101 are described. When the processing begins corresponding to that the POS terminal 11 begins commodity registration and the like, the image acquirement section 51 outputs a capturing-on signal to the image capturing section 164, so that the image capturing section 164 begins capturing (Act S11).
The image acquirement section 51 acquires the frame image R stored in the RAM 163 after being captured by the image capturing section 164 (Act S12). Subsequently, the image region detection section 52 detects the image region including all or part of the commodities G included in the frame image R acquired by the image acquirement section 51 (Act S13).
The image region selection section 53 judges whether or not the plurality of image regions are detected (Act S14). In the condition that one image region is only detected (Act S14: No), the processing is turned to Act S17. In the condition that the plurality of image regions are detected (Act S14: Yes), the image region selection section 53 selects any one image region based on the position relationship of the image regions (Act S15). Moreover, the image region selection section 53 displays the frame surrounding the image region selected in Act S15 on the frame image R (refer to
Subsequently, in the condition that one image region is judged to be only detected in Act S14 (Act S14: No), the similarity calculation section 54 calculates the similarity of the commodity included in one image region and the registered commodity (Act S17). Moreover, in the condition that the plurality of image regions are judged to be detected in Act S14 (Act S14: Yes), the similarity calculation section 54 calculates the similarity of the commodity included in one image region selected in Act S15 and the registered commodity (Act S17).
Afterwards, the commodity candidate prompt section 55 re-sequences the commodity image and the commodity name of the registered commodity as the commodity candidate according to the sequence from the high similarity to the low similarity based on the similarity calculated in Act S17, and displays the commodity image and the commodity name on the commodity candidate prompt region 83 (refer to
The input acceptance section 56 judges whether or not the selection operation of the commodity image of the registered commodity is accepted (Act S19). In the condition that the selection is not accepted (Act S19: No), the processing is turned to Act S12. In the condition that the selection is accepted (Act S19: Yes), the input acceptance section 56 judges the selected registered commodity as the commodity to be subjected to the sales registration. Subsequently, the information output section 57 outputs the commodity ID and the like of the registered commodity selected in Act S19 together with the sales number additionally input through the keyboard 107 to the POS terminal 11 (Act S20).
The CPU 161 judges whether or not the POS terminal 11 gives a termination notice about the commodity registration and the like to terminate a service (Act S21). In the condition of continuing the service (Act S21: No), the CPU 161 returns the processing to Act S12 to continuously execute the processing. In the condition of terminating the service (Act S21: Yes), the image acquirement section 51 outputs a capturing-off signal to the image capturing section 164, terminates the capturing of the image capturing section 164 (Act S22), and terminates the processing.
In Act S20, the input of the sales number is accepted through the keyboard 107, but a method inputting the sales number is not patricianly limited. For instance, the touched times of the selected image region also can be accepted as the sales number.
The operations of the POS terminal 11 are described. First, when the processing begins corresponding to the beginning of the commodity registration and the like according to the operation indication of the keyboard 22, the CPU 61 receives the commodity ID and the sales number of the determined commodity output from the commodity reading apparatus 101 in Act S20 (Act S31). Subsequently, the sales registration section 611 reads out the commodity category, the unit price and the like from the PLU file F1 based on the commodity ID and the sales number received in Act S31, and registers the sales information of the commodity G read by the commodity reading apparatus 101 in the sales master file (not shown in the figures) (Act S32). Subsequently, the CPU 61 judges whether or not to give the termination notice about the sales registration and the like by the operation indication of the keyboard 22 to terminate the service (Act S33). In the condition of continuing the service (Act S33: No), the CPU 61 returns the processing to Act S31 again to execute the processing continuously. In the condition of terminating the service (Act S33: Yes), the CPU 61 terminates the processing.
While certain embodiments have been described, these embodiments have been presented by way of example only, and are not intended to limit the scope of the inventions. Indeed, the novel embodiments described herein may be embodied in a variety of other forms; furthermore, various omissions, substitutions and changes in the form of the embodiments described herein may be made without departing from the spirit of the inventions. The accompanying claims and their equivalents are intended to cover such forms or modifications as would fall within the scope and spirit of the inventions.
For instance, a method selecting one image region from the plurality of image regions is not limited to the example, and other methods also can be used. As an example, the image region selection section 53 selects one image region from the plurality of image regions by comparing the image data in the image regions, such as the luminosities and the like of the image regions (A1, A2 and the like). Moreover, a comparison method or a parameter and the like used in the comparison method can be selected or altered by a user.
Moreover, as the target in the description, the image region including all or part of the commodities is detected, but the target is not limited to the commodity. For instance, in the condition that the hand of the operator is captured as other target and in the condition that the target is judged to be the hand based on the similarity, the image region selection section 53 can select one target from the image region other than the image region including the hand.
The formation of each display picture is not limited to the examples in
In this embodiment, the form that the POS terminal 11 comprises the PLU file F1 is set, but is not limited to that, the form that the commodity reading apparatus 101 comprises the PLU file F1 also can be available, and the form that an external apparatus which can be accessed by the POS terminal 11 and the commodity reading apparatus 101 comprises the PLU file F1 further can be available.
The commodity reading apparatus 101 in this embodiment has the functions of the similarity calculation section 54, but is not limited to that, and the form that the POS terminal 11 has the functions of the similarity calculation section 54 and outputs the calculation result of the similarity to the commodity reading apparatus 101 further can be available.
In this embodiment, the construction of the POS terminal 11 and the commodity reading apparatus 101 is set, but is not limited to that, and one apparatus comprising the functions of the POS terminal 11 and the commodity reading apparatus 101 can be installed.
The program executed by each apparatus of the embodiment is provided by being previously programmed a storage medium (ROM or storage section) of each apparatus, but is not limited to that, and can be provided by being recorded in a computer-readable recording medium such as a CD-ROM, a floppy disk (FD), a CD-R, a DVD (Digital Versatile Disk) and the like by a file in an installable form or an executable form. In addition, the storage medium is not limited to a medium independent from a computer or an embedded system, and also includes the storage medium storing or temporarily storing the program after the program transmitted through an LAN, the Internet and the like is downloaded.
The program executed by each apparatus of the embodiment further may be stored in a computer connected with a network such as the Internet and provided by downloading through the network, or further may be provided or distributed through the network such as the Internet and the like.
As described above, according to the embodiment, in the condition that the plurality of targets are detected in the frame image, any one target is selected, and the commodity recognition is carried out on the target. Therefore, even though in the condition that the plurality of targets are detected, the plurality of targets also can be reduced to one target to carry out the recognition processing, so that the information processing apparatus and the program which can lighten the load of the recognition processing can be provided.
Number | Date | Country | Kind |
---|---|---|---|
2011-230091 | Oct 2011 | JP | national |