This Application claims priority of Taiwan Patent Application No. 101141725, filed on Nov. 9, 2012, the entirety of which is incorporated by reference herein.
1. Field of the Invention
The present invention relates to a system and method for automatically posting a message by an audio signal, and in particular, relates to a system and method capable of receiving and analyzing an audio signal to generate a string, determining a target object from a display screen according to the generated string, generating a message corresponding to the target object, and automatically posting the message to a social network.
2. Description of the Related Art
With advances in technology, electronic devices, such as smart phones, tablet PCs, laptops, and personal computers, have become more and more popular. In addition, many users often post messages on social websites or social networks (e.g. Facebook, Google+). For example, after logging onto a social website, the user may post messages corresponding to products, stores, objects, and events. When the electronic devices are used for word-of-mouth information on social websites, the user may post messages or check in on a community page of a store or a product on the social website. However, the user usually needs to perform many steps before posting messages, such as logging onto a social website, preparing the messages or data (e.g. pictures, photos, or comments) to be posted, and uploading the messages to the social website. Such a complicated procedure is very inconvenient for the user.
Currently, a user's logon data for social websites can be preset in many electronic devices so that the devices can automatically log onto the social websites and quickly post messages. However, when a user wants to post a message on a social website, the user still has to use the interface of the social website to select photos/images or input text in order to post the message on the user's “wall” on the social website (e.g. Facebook). The user cannot automatically post a message on a social website by means of an audio signal using conventional electronic devices.
Posting of the message is based on a “landmark” or a “location” when the user wants to post a message or check in on the community page of a store or a product on a social website. For example, corresponding information and the geographical location of a store can be preset on the social website by the store terminal. When the user reaches the geographical location of the store with his electronic device, the user's geographical location can be confirmed by a positioning mechanism in the electronic device. Accordingly, the user may check in on the social website at the geographical location of the store, and the user's visit history or consumption records at the store will be made public on the social website. In the conventional approach, the geographical location of the user is primarily used to check in on the social website, but the user cannot check in on the social website via an audio signal. In addition, the electronic device cannot post a message associated with the store when the user has not yet reached the geographical location of the store.
In a conventional procedure, many steps, such as logging onto a social website and using the user interfaces of the social website, are required for posting a message or checking in on the social website, and thus the conventional procedure is neither intuitive nor convenient for a user. When a user wants to post a message on a social website by using a smart phone or a tablet PC with lower computation resources or a smaller display screen, it is very inconvenient for the user to use the interfaces of the social website, select photos/images manually, and input text. Accordingly, there is a demand for a system and method capable of automatically posting a message in a more convenient and effective way, such as by using an audio signal, so that the message can be posted intuitively and conveniently for the user.
A detailed description is given in the following embodiments with reference to the accompanying drawings.
In an exemplary embodiment, a system for posting a message by an audio signal is provided. The system comprises: a communication unit used to connect the system to a communications network; an audio receiving unit used to receive a first audio signal; a display unit; and a processing unit, connected to the communication unit, the audio receiving unit and the display unit, and used to recognize the first audio signal to generate a first string, determine a target object from a display screen displayed on the display unit according to the first string, and automatically generate a message corresponding to the target object, and post the message on a social network through the communication unit.
In another exemplary embodiment, a method for posting a message by an audio signal is provided. The method comprises the following steps of: connecting to a communications network via a communication unit; receiving a first audio signal via an audio receiving unit; recognizing the first audio signal to generate a first string via a processing unit; determining a target object according to the first string and a display screen displayed on a display unit via the processing unit; automatically generating a message corresponding to the target object via the processing unit; and posting the message on a social network via the communication unit.
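By way of illustration only, the method steps above may be sketched in Python as follows. The names `ScreenObject`, `post_by_audio`, `recognize`, and `publish` are hypothetical and do not appear in the disclosure; speech recognition and posting to the social network are passed in as callables because the disclosure does not prescribe any particular recognizer or network interface, and simple substring matching is assumed for determining the target object.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence


@dataclass
class ScreenObject:
    """Hypothetical record for an object recognized on the display screen."""
    name: str
    description: str = ""


def post_by_audio(
    audio: bytes,
    screen_objects: Sequence[ScreenObject],
    recognize: Callable[[bytes], str],
    publish: Callable[[str], None],
) -> Optional[str]:
    """Recognize audio, determine a target object, generate and post a message.

    Returns the posted message, or None when no on-screen object matches.
    """
    # Recognize the first audio signal to generate the first string.
    first_string = recognize(audio).lower()

    # Determine the target object: here, the first on-screen object whose
    # name occurs in the first string (an assumed, simplistic matching rule).
    target = next(
        (obj for obj in screen_objects if obj.name.lower() in first_string),
        None,
    )
    if target is None:
        return None

    # Automatically generate a message corresponding to the target object.
    if target.description:
        message = f"{target.name}: {target.description}"
    else:
        message = target.name

    # Post the message through the (injected) communication step.
    publish(message)
    return message
```

In such a sketch, `recognize` would wrap whatever speech recognizer the audio detection module uses, and `publish` would wrap the communication unit's call to the social network.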
The present invention can be more fully understood by reading the subsequent detailed description and examples with references made to the accompanying drawings, wherein:
The following description is of the best-contemplated mode of carrying out the invention. This description is made for the purpose of illustrating the general principles of the invention and should not be taken in a limiting sense. The scope of the invention is best determined by reference to the appended claims.
The audio receiving unit 120 is used to receive an audio signal. For example, the audio receiving unit 120 may be a microphone, a sound receiver, a sound collector, or another transducer capable of converting audio signals to electrical signals, but the invention is not limited thereto. The communication unit 130 may be a network interface, which may be a wired or wireless network interface supporting TCP/IP, Wi-Fi, or 802.11x protocols, or physical components for connecting to communications networks, such as a network adapter, a GPRS module, a 3G module/network adapter, a 3.5G module/network adapter, or a Bluetooth module, but the invention is not limited thereto. The display unit 160 may be a commercially available monitor or display for displaying information or pictures, such as a CRT, LCD, PDP, or LED display, but the invention is not limited thereto.
The processing unit 110, which is connected to the communication unit 130, the audio receiving unit 120 and the display unit 160, is primarily used to recognize a first audio signal received by the audio receiving unit 120 to generate a first string, determine a target object from the display screen displayed on the display unit 160 according to the first string, automatically generate a message corresponding to the target object, and post the message to a social network or web pages through the communication unit 130. For example, the processing unit 110 may post the message corresponding to the target object on the “wall” of the social network or web pages (e.g. Facebook) with the user's account information, or post the message or check in with the user's account information on the store/community pages. The message posted on the website may be a text, a picture, a photo, an image, a sound signal, or a hyperlink corresponding to the target object, or any two in combination. The target object, for example, may be a name, a picture, a sound signal, or text or image information indicating an object, which corresponds to a product, a store, an enterprise, a person or an event.
In another embodiment, the processing unit 110 may further comprise an audio detection module (not shown in
In other embodiments, before the processing unit 110 generates the first string, the processing unit 110 may start the following processes by receiving a specific sensor signal, a specific operation command, or a specific speech for generating the first string, determining the target object, generating the message, and automatically posting the message to a social network.
In another embodiment, when the display screen is a web page, the processing unit 110 may further analyze the syntax of the web page (i.e. web page syntax), thereby obtaining the boundary of the current display screen (e.g. the portion of the web page currently displayed on the display unit 160), and recognize multiple objects and corresponding information from the web page by using any one of the following techniques: tag analysis of the web page, snippet information matching, pattern recognition, hyperlink and title matching, and optical character recognition. Specifically, the processing unit 110 may determine at least one candidate object according to the generated first string and the display screen, and determine the target object according to the at least one candidate object. For example, a display screen usually comprises multiple objects. The processing unit 110 may recognize multiple objects from the display screen, and compare the first string with each object and/or corresponding information, thereby determining, from the objects, at least one candidate object which matches or is highly correlated to the first string. In some embodiments, the processing unit 110 may provide a user interface for selecting one of the candidate objects. In other embodiments, the processing unit 110 may further search for corresponding data of each candidate object from a network or a database, and determine whether each candidate object matches or is highly correlated to the first string, thereby determining one of the candidate objects as the target object.
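As a minimal sketch of how candidate objects that match or are highly correlated to the first string might be determined, the following Python example scores each recognized object name by the fraction of its words that occur in the first string. The function names, the word-overlap score, and the 0.5 threshold are assumptions made for illustration; the disclosure does not specify a particular correlation measure.

```python
import re
from typing import List, Sequence, Set, Tuple


def tokens(text: str) -> Set[str]:
    """Split text into a set of lowercase alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def find_candidates(
    first_string: str,
    object_names: Sequence[str],
    threshold: float = 0.5,  # assumed cut-off for "highly correlated"
) -> List[Tuple[str, float]]:
    """Return (name, score) pairs for objects correlated to the first string.

    The score is the fraction of the object's name words found in the
    first string; results are sorted from highest to lowest score.
    """
    spoken = tokens(first_string)
    scored = []
    for name in object_names:
        name_tokens = tokens(name)
        if not name_tokens:
            continue  # skip objects without a usable name
        score = len(name_tokens & spoken) / len(name_tokens)
        if score >= threshold:
            scored.append((name, score))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

When the returned list holds a single entry, that entry may be taken directly as the target object; when it holds several, a further selection step (for example, a user interface or a second audio signal) is needed.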
In another embodiment, the system 100 may comprise a database (not shown in
In another embodiment, as illustrated in
In some other embodiments, the audio signal sent by the user may comprise information of the target object. After the processing unit 110 converts the audio signal into texts or strings, the processing unit 110 may determine whether a name (e.g. a product name, a store name, or an enterprise name) of the target object is included in the string. Accordingly, when the processing unit 110 determines there is a name of the target object in the string, the processing unit 110 may retrieve the corresponding target object from the display screen according to the name of the target object, and generate a message corresponding to the target object. Then, the processing unit 110 may further post the generated message to a social network through the communication unit 130. Alternatively, when the processing unit 110 determines that there is a name of the target object in the string, the processing unit 110 may automatically search for corresponding data associated with the name of the target object from the network or the database, thereby generating the message corresponding to the target object. However, when the processing unit 110 has converted the audio signal into texts or a string and the processing unit 110 is unable to find a name of the target object, the processing unit 110 may further recognize possible objects from the display screen (e.g. the screen of the web page currently viewed by the user). For example, the processing unit 110 may determine candidate objects from the pictures, the title, or the text description of the web page, and determine a target object from the candidate objects. In addition, the processing unit 110 may search for corresponding information of the target object from a communications network through the communication unit 130, thereby generating the message.
When the display screen is a web page, the processing unit 110 may use techniques, such as tag analysis of the web page, snippet information matching, pattern recognition, hyperlink and title matching, and optical character recognition, to determine a target object from the web page currently viewed by the user. For example, a web page is generally written in HTML and JavaScript, and the processing unit 110 may analyze the syntax of the HTML and JavaScript source code of the web page, thereby retrieving the position of each picture and the description information of the web page in the screen. The processing unit 110 may perform techniques such as tag analysis of the web page, snippet information matching, and hyperlink and/or title matching, thereby obtaining names of possible candidate objects. Then, the processing unit 110 may perform pattern recognition and/or optical character recognition on the pictures of the web page in the display screen, thereby retrieving names and types of the candidate objects and description information in the pictures. When performing pattern recognition, the processing unit 110 may match the image features of the pictures in the screen with the identity reference images in the database, thereby determining whether there are matching candidate objects.
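The tag analysis and hyperlink/title matching described above may be illustrated, under simplifying assumptions, with Python's standard `html.parser` module. The class `CandidateNameParser` and the function `extract_candidate_names` are hypothetical names; the sketch collects only the page title, hyperlink text, and image `alt` text as possible candidate names, and it does not cover JavaScript analysis, pattern recognition, or optical character recognition.

```python
from html.parser import HTMLParser
from typing import List


class CandidateNameParser(HTMLParser):
    """Collects possible candidate object names from HTML tags."""

    TEXT_TAGS = ("title", "a")  # tags whose inner text is treated as a name

    def __init__(self) -> None:
        super().__init__()
        self.names: List[str] = []
        self._capturing = False
        self._buffer: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag in self.TEXT_TAGS:
            # Start buffering the text inside <title> or <a>.
            self._capturing = True
            self._buffer = []
        elif tag == "img":
            # Use the image's alt text, if present, as a candidate name.
            alt = dict(attrs).get("alt")
            if alt:
                self._add(alt)

    def handle_data(self, data):
        if self._capturing:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag in self.TEXT_TAGS and self._capturing:
            self._add("".join(self._buffer))
            self._capturing = False

    def _add(self, text: str) -> None:
        # Normalize whitespace and skip empty or duplicate names.
        text = " ".join(text.split())
        if text and text not in self.names:
            self.names.append(text)


def extract_candidate_names(html: str) -> List[str]:
    """Return candidate names in document order."""
    parser = CandidateNameParser()
    parser.feed(html)
    parser.close()
    return parser.names
```

The names obtained this way could then be compared with the first string to determine the candidate objects.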
If there is information corresponding to only one candidate object of the web page in the screen according to the matching result generated by the processing unit 110, the processing unit 110 may directly determine the candidate object as the target object. In an embodiment, if there is at least one candidate object in the matching result generated by the processing unit 110, the processing unit 110 may provide a user interface to display the at least one candidate object (e.g. name and/or pattern of the candidate objects) on the display unit 160, so that the user may select one of the at least one candidate object as the target object. For example, the user may use a peripheral device (e.g. a mouse or a keyboard) to select one of the candidate objects as the target object on the user interface displayed on the display unit 160.
In another embodiment, if there is at least one candidate object in the matching result generated by the processing unit 110, the processing unit 110 may display the at least one candidate object (e.g. name and/or pattern of the candidate objects) on the display unit 160. Then, the audio receiving unit 120 may further receive an audio signal A3, and the audio detection module may recognize the audio signal A3 to generate a string S3. Accordingly, the processing unit 110 may determine the target object from the at least one candidate object according to the generated string S3.
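A possible way of determining the target object from the displayed candidate objects according to the string S3 is sketched below in Python. The function `select_target` and the `ORDINALS` table are hypothetical; the sketch assumes that the user either speaks the name of a candidate or refers to its position in the displayed list (e.g. "the second one"), neither of which is mandated by the disclosure.

```python
from typing import Optional, Sequence

# Assumed spoken ordinals mapped to positions in the displayed list.
ORDINALS = {"first": 0, "second": 1, "third": 2}


def select_target(candidates: Sequence[str], s3: str) -> Optional[str]:
    """Pick the target object from the candidates according to string S3.

    A candidate whose name occurs in S3 takes priority; otherwise a
    spoken ordinal selects by position. Returns None if nothing matches.
    """
    spoken = s3.lower()

    # Case 1: the user spoke the name of one of the candidates.
    for name in candidates:
        if name.lower() in spoken:
            return name

    # Case 2: the user referred to a position in the displayed list.
    for word in spoken.split():
        index = ORDINALS.get(word.strip(".,!?"))
        if index is not None and index < len(candidates):
            return candidates[index]

    return None
```

A return value of `None` would indicate that the system should prompt the user again or fall back to the selection user interface.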
It should be noted that the processing unit 110 may automatically generate a message SC2 corresponding to the target object after the target object has been determined. Then, the processing unit 110 may post the message SC2 on the social website W2 through the communication unit 130.
The methods, or certain aspects or portions thereof, may take the form of a program code embodied in tangible media, such as floppy diskettes, CD-ROMs, hard drives, or any other machine-readable (e.g., computer-readable) storage medium, or computer program products without limitation in external shape or form thereof, wherein, when the program code is loaded into and executed by a machine, such as a computer, the machine thereby becomes an apparatus for practicing the methods. The methods may also be embodied in the form of a program code transmitted over some transmission medium, such as an electrical wire or a cable, or through fiber optics, or via any other form of transmission, wherein, when the program code is received and loaded into and executed by a machine, such as a computer, the machine becomes an apparatus for practicing the disclosed methods. When implemented on a general-purpose processor, the program code combines with the processor to provide a unique apparatus that operates analogously to application specific logic circuits.
While the invention has been described by way of example and in terms of the preferred embodiments, it is to be understood that the invention is not limited to the disclosed embodiments. To the contrary, it is intended to cover various modifications and similar arrangements (as would be apparent to those skilled in the art). Therefore, the scope of the appended claims should be accorded the broadest interpretation so as to encompass all such modifications and similar arrangements.
Number | Date | Country | Kind |
---|---|---|---|
101141725 | Nov 2012 | TW | national |