Embodiments described herein relate to a commodity information input apparatus and a commodity information input method.
A commodity management system using a POS (Point of sale) terminal is being popularized. For example, a customer selects commodities and reads names of the commodities by reference to a commodity menu. The operator of the POS terminal listens to the read commodity names and registers (inputs) data of the commodities into the POS terminal. The POS terminal sends the registered commodity data to a management server so that the management server can manage the commodities.
In accordance with an embodiment, a commodity information input apparatus comprises a recognition section configured to recognize an input voice; a detection section configured to detect, from a commodity database, data of more than one candidate for an input commodity based on the result of the recognition on the input voice; an output section configured to output information related to the more than one candidate for the input commodity; an acceptance section configured to accept the determination on the more than one candidate for the input commodity; an input section configured to input the data of the more than one candidate for the input commodity to an input database corresponding to the determination.
An order acceptance system (commodity information input system) is imported in a restaurant such as a family restaurant or pub, and in the embodiment, as such an order acceptance system, a POS system is described below with reference to the accompanying drawings.
A touch panel 14 (display section) is arranged on the side 11A of the first structural body 11 which is exposed when the second structural body 12 is opened with respect to the first structural body 11. The keyboard 15 is arranged on the side 12A of the second structural body 12 which is exposed when the second structural body 12 is opened with respect to the first structural body 11.
A power switch 16 is arranged on the left lateral side 11B (refer to
The operator can visually check the content displayed on the touch panel 14 of the handy terminal 3, and the customer can also visually check the content displayed on the touch panel 14 when the display surface of the touch panel 14 faces the customer. Further, the touch panel 14 may be arranged as a display device for the use by the operator while another different display device is arranged for the customer. In this case, the same content can be displayed on the touch panel 14 and the another display device for the operator and the customer to confirm the displayed content at the same time.
Further, the handy terminal 3 is provided with a microcomputer 60 functioning as an information processing section for carrying out an information processing. The microcomputer 60 is formed by connecting, via a bus line, a ROM (Read Only Memory) 62 and a RAM (Random Access Memory) 63 with a CPU (Central Processing Unit) 61 that executes various processing to control each other section. The keyboard 15 and the touch panel 14 both of which are connected with the CPU 61 of the handy terminal 3 via various input/output circuits. These components are controlled by the CPU 61. The keyboard 15 includes a numeric keyboard 15d on the surface of which numerals ‘1’, ‘2’ , ‘3’ and the like and operators including ‘*’ are indicated, a temporary closing key 15e and a closing key 15f.
Further, the handy terminal 3 comprises a microphone 181, a loudspeaker 182 and an order information registration processing section 131. The microphone 181 inputs the order information (voice) from a customer and outputs the order information as voice data. The loudspeaker 182 outputs the voice such as the result of the recognition on the order information or various kinds of guidance. The order information registration processing of the order information registration processing section 31 will be described later in detail.
For example, a customer selects a commodity and reads the name of the commodity by reference to a commodity menu. The microphone 181 of the handy terminal 3 inputs the name (voice) of the commodity read by the customer (Act 1) and outputs the name of the commodity as first voice data. For example, when an automatic language selection mode is set to be valid in a voice recognition processing section 311 (Yes in Act 2), the voice recognition processing section 311 inputs the first voice data from the microphone 181, selects (determines) the language (e.g. a firs language) of the first voice data (Act 3), and recognizes the first voice data based on the voice feature amount of the selected language (Act 4). In the case of the language selection, the corresponding language is one of the languages used inmost POS systems, including English, French, Chinese, Japanese and the like. When the automatic language selection mode is set to be invalid in the voice recognition processing section 311, for example, English is designated (No in Act 2), the first voice data is recognized based on the designated language (English) (Act 4).
The voice recognition result (data of more than one commodity candidate) output from the voice recognition processing section 311 is output to an ordered commodity information specifying processing section 312. The following processing is described here according to several voice recognition results. For example, the following processing is described on the assumption of a first voice recognition result (orange juice) and a second voice recognition result (orange juice and apple juice). Assume that the former result is the reading of one commodity (orange juice) by the customer and the latter result is the reading of a plurality of commodities (orange juice and apple juice) by the customer.
The ordered commodity information determination processing section 312 detects candidates for the more than one ordered commodity by the customer (Act 5) or specifies the more than one ordered commodity by the customer by comparing the dictionary data (commodity database) stored in a HDD 64 with the voice recognition result (data of more than one commodity candidates). For example, assume that a commodity ‘orange juice’ having a first similarity degree and commodities ‘apple juice’ and ‘lemon juice’ having a smaller second similarity degree than the first similarity degree are detected aiming at the first voice recognition result (orange juice). Further, assume that commodities ‘orange juice’ and ‘apple juice’ having a first similarity degree and commodities ‘lemon juice’ and ‘melon juice’ having a smaller second similarity degree than the first similarity degree are detected aiming at the second voice recognition result (orange juice and apple juice).
A voice synthesis processing section 313 generates second voice data corresponding to the data of the more than one commodity candidate. Further, corresponding to the selection of the first language by the voice recognition processing section 311, the voice synthesis processing section 313 generates the second voice data in the first language. Further, the first voice data and the second voice data may be the same content. Further, a character string generation processing section 314 generates character string data corresponding to the data of the more than one commodity candidate. Here, corresponding to the selection of the first language by the voice recognition processing section 311, the character string generation processing section 314 generates the character string data in the first language. Further, the character string generation processing section 314 may generate the character string data in a plurality of languages including the first language. For example, if a second language is comprehensible to the operator, the character string generation processing section 314 generates the character string data in the first and the second language through designating the second language in advance. Further, an image generation processing section 315 generates illustrative image data corresponding to the data of the more than one commodity candidate.
A commodity information repetition processing section 316 outputs (repeats), via the loudspeaker 182 shown in
When the first voice recognition result (orange juice) is obtained, basically, information related to a commodity ‘orange juice’ having the first similarity degree is output, however, information related to the commodity ‘orange juice’ having the first similarity degree may be output mainly (displayed in a first size), accompanied by the output of commodities ‘apple juice’ and ‘lemon juice’ having the second similarity degree (displayed in a smaller second size than the first size) Similarly, when the second voice recognition result (orange juice and apple juice) is obtained, basically, information related to commodities ‘orange juice’ and ‘apple juice’ having the first similarity degree is output, however, information related to the commodities ‘orange juice’ and ‘apple juice’ having the first similarity degree may be output mainly (displayed in a first size), accompanied by the output of commodities ‘lemon juice’ and ‘melon juice’ having the second similarity degree (displayed in a smaller second size than the first size).
The operator inputs information indicating whether or not to correct the registered commodities information to the handy terminal 3, if needed. If information indicating no correction (confirmation) is input via the keyboard 15 or the touch panel 14 of the handy terminal 3 (No in Act 7), for example, the data of the commodity candidate is input to the input database of the server (Store Computer: SC) via a communication interface 25.
If information indicating a correction operation is input via the keyboard 15 or the touch panel 14 of the handy terminal 3 (Yes in Act 7), the customer reads the name of a commodity again to restart the processing starting from the voice recognition processing carried out by the voice recognition processing section 311 (Act 1-Act 6) (the feedback processing shown in
Alternatively, if information indicating a correction operation is input via the keyboard 15 or the touch panel 14 of the handy terminal 3 (Yes in Act 7), by switching from the information displayed on the touch panel 14 related to the commodity ‘orange juice’ having the first similarity degree to information related to the commodities ‘apple juice’ and ‘lemon juice’ having the second similarity degree (information related to more than one input corrected commodity candidate), the designation on ‘orange juice’ or the designation on ‘lemon juice’ or the designation on ‘orange juice’ and ‘lemon juice’ can be accepted. Alternatively, if information indicating a correction operation is input via the keyboard 15 or the touch panel 14 of the handy terminal 3 (Yes in Act 7), by switching from the information displayed on the touch panel 14 related to the commodities ‘orange juice’ and ‘apple juice’ having the first similarity degree to information related to the commodities ‘lemon juice’ and ‘melon juice’ having the second similarity degree (information related to more than one input corrected commodity candidate), the designation on ‘lemon juice’ or the designation on ‘melon juice’ or the designation on ‘lemon juice’ and ‘melon juice’ can be accepted.
Further, in the description above, the storage of dictionary data (commodity database) in the HDD 64 of the handy terminal 3 is described; however, the dictionary data may also be managed by other devices excluding the handy terminal 3. For example, if needed, the handy terminal 3 may receive the dictionary data from an external server and sends the voice recognition result (data of the more than one commodity candidate) to the external server, and the external server compares the dictionary data with the voice recognition result to detect candidates for the ordered commodity and feeds the result of the detection on the ordered commodity candidate back to the handy terminal 3.
According to the embodiment, when a customer inputs commodity order information to the handy terminal 3, the commodity information can be specified automatically through a voice recognition process, without manually inputting information by an operator, and then registered in a server. At this time, the customer is required to confirm a name corresponding to the commodity information and information associating character string data with image data, further, the correction on the determined commodity information can be accepted.
Thus, the possibility is reduced that the customer, when registering order information, presses an error commodity key or that an error commodity is registered on an order as the languages used by the customer and the operator are different.
While certain embodiments have been described, these embodiments have been presented by way of example only, and are not intended to limit the scope of the invention. Indeed, the novel embodiments described herein may be embodied in a variety of other forms; furthermore, various omissions, substitutions and changes in the form of the embodiments described herein may be made without departing from the spirit of the invention. The accompanying claims and their equivalents are intended to cover such forms or modifications as would fall within the scope and spirit of the invention.