1. Field of the Invention
The present invention relates to an information selection method selecting information associated with an audio information source, and an information selection apparatus employing such an audio information selection method.
2. Description of the Background Art
As a conventional technique of the method of selecting one information from a plurality of information, a retrieval engine using character information provided on a display is known. However, the task of continuously viewing the screen will be a burden on the user side. Accordingly, attention is now focused on the usage of voice and sound. In the case where a great number of audio media such as radio programs, music CDs and the like are the subject of selection, it will be easier for the user to actually listen to the contents rather than by selection using only character information.
A method of selecting information associated with a sound source according to a plurality of sound sources is disclosed in, for example, Japanese Patent Laying-Open No. 10-124292. According to this method, a plurality of sound sources are placed around the user. The audio outputs are basically issued at the same volume. The user distinguishes audibly the audio outputs generated simultaneously and specifies a desired direction to select the information associated with the sound of that direction. More specifically, various audio messages such as “play”, “record”, “rewind” and “stop” are determined for the front, right, back, and left, respectively, as the operation of a video equipment. When the user wants to effect recording, the right direction is to be selected using a pointing device such as a cross pad. Another method disclosed in this publication is devised to facilitate the audible feature of each generated sound by issuing the sound of each audio output with a slight time difference to define sound quality difference (male voice, female voice).
A method of playing again information that was missed audibly by the user indicating a certain direction with a single sound source rotating about the user is disclosed in “Dynamic Soundscape: mapping time to space for audio browsing” (CHI97) by Minoru Kobayashi and Chris Schmandt (MIT). According to this method, one sound source moves around a user while issuing audibly various topics at a constant volume and sound quality. In the case where the user missed a certain topic by the ear, the user points out the area providing the audio output of that topic using a pointing device, whereby a sound source is generated at that site. Playback is resumed from the topic that was audibly issued when passing that site of the sound source. In this playback operation, the volume of the former sound source is lowered and the sound source that newly provides the audio output effects playback at a higher volume. Both sound sources move in an orbit at the same time. Up to eight sound sources are allowed simultaneously in this system.
However, even if audio output is provided with time difference or with different sound quality corresponding to each direction to facilitate identification of the position of the sound as in the above-described publication, the direction that can be distinguished audibly by the human being is limited to eight directions at most. The case where there are a great number of selection branches cannot be accommodated. Audio output of only single words such as playback or recording as in the case of video reservation is generally of no problem. However, in the case where audio output of continuous contents such as a plurality of news programs is issued, it is difficult to audibly distinguish the contents even if the sound sources are located at less than eight directions.
In the present specification, “the position of sound” implies the site from which sound is audibly output, or a direction from which a sound can be heard.
In the above method, only one sound is audibly output unless the user provides an input. A plurality of information cannot be obtained at the same time.
The telephone push phone service is known as an information selection interface dedicated to audio (method 1). In selecting information, a voice guidance of “Please depress 1 for . . . ” is output. The user depresses an appropriate button according to the voice guidance.
Another method is known to operate a system by voice using a speech recognition function (method 2). According to this method, a predetermined operation command is input through voice, or a natural language processing function is added to the speech recognition function to operate the system in the manner of ordinary conversation.
A method in which the item to be subjected to selection is altered over time is disclosed in Japanese Patent Laying-Open No. 6-149517. According to this method, the item to be subjected to selection is altered by the user or program request (output at an elapse of a predetermined time). A label of that item is displayed on the screen and a tone scale corresponding to that item is issued from a speaker. The user can select a certain item by carrying out a predetermined input operation when the label of the desired item appears.
Method 1 is disadvantageous in that the number of information that can be selected or the button to be depressed differs each time depending upon the contents. The user has to depend upon the voice guidance at every select operation, which is time-consuming. Since the number of buttons that can be depressed increases, the user will not be able to remember the location of each appropriate button. The user will have to depress the appropriate button while confirming the location of each button. This labor is tedious. Particularly in the case where it is dangerous for the user to carry out a task with his/her view off, this button position confirmation will induce danger.
Method 2 is disadvantageous in that the user must learn and operate a plurality of types of predetermined voice commands, if any is predetermined. To date, the natural language processing function lacks the ability to recognize the meaning of the word input through an audio input of high degree of freedom. A technique at the level of practical usage is not yet established.
According to the technique disclosed in Japanese Patent Laying-Open No. 6-149517, a display device is inevitable. The required information cannot be provided to the user by just through the audio output. Also, the audio information associated with each information corresponds to only the musical scale of a predetermined tone. It will be difficult for the user to master the difference of the tone and the musical interval information.
In view of the foregoing, an object of the present invention is to provide an information selection method including a user-friendly user interface employing voice in selecting one desired information out from a plurality of information.
Another object of the present invention is to provide a sound information selection apparatus that facilitates selection of desired sound information from a plurality of sound information.
A further object of the present invention is to provide a sound information selection apparatus that can have the position change of a sound controlled by the user's intention.
Still another object of the present invention is to provide an information selection apparatus that allows selection of an information source without depending upon only visual recognition.
A still further aspect of the present invention is to provide an information selection apparatus that allows the user to identify at one time more information retained by an information source.
Yet a further object of the present invention is to provide a sound information selection apparatus that allows the presentation status of sound information to be easily modified according to the user status.
Yet another object of the present invention is to provide an information selection apparatus including a user-friendly user interface employing voice in selecting one desired information from a plurality of information.
Yet a still further aspect of the present invention is to provide a computer readable recording medium including a user-friendly user interface employing voice in selecting one desired information out from a plurality of information.
According to an aspect of the present invention, a method of selecting desired information according to a plurality of sound information includes the steps of time-controlling independently the position of each sound, and selecting a sound. In selecting a sound, information associated with the sound is selected. As a result, a sound selection method can be provided that allows desired sound information to be easily selected from a plurality of sound information. Preferably, the method includes the step of controlling volume in association with the position of each sound. Information can be selected in association with the volume.
The step of controlling independently the position of each sound facilitates distinction between each sound by arranging the position of each sound on the circumference of substantially a circle to move in an orbit.
Further preferably, the position change of all the sounds can be set earlier than that of the normal time control or returned to the former position. As a result, the position change of sound can be controlled by the user's intention.
According to another aspect of the present invention, an information selection apparatus with a sound source to select desired information includes a unit time-controlling independently the position of each sound, a sound selection unit selecting a sound, and a selector selecting information associated with the sound in response to a selection signal of the sound selection unit.
According to a further aspect of the present invention, an information selection apparatus with a sound source to select desired information source from a plurality of information sources includes means for sequentially switching a plurality of information sources as audio information and presenting the same by a sound source, means for selecting audio information relevant to a desired information source from the presented audio information.
By presenting an information source as audio information by voice, an information source can be selected without depending upon only visual recognition. By selecting that audio information when presented, an information source can be selected by unitary operation.
Preferably, the information selection apparatus includes means for indicating switching of the audio information.
This allows switching of the audio information by an indication of the user. Audio information can be presented at a timing suiting the user's wishes.
Further preferably, the switch specify means specifies presentation of audio information presented after or before the currently-presented audio information.
Accordingly, switching can be effected by the user's specification so as to present audio information that was to be presented afterwards or that is already presented during presentation of audio information. Therefore, the user can sequentially refer to information according to his/her wishes. This switching is not limited to audio information immediately preceding or succeeding the currently-presented audio information. Several audio information can be skipped by rewinding or fast-forwarding to present an appropriate audio information.
Further preferably, the information source retains information other than the audio information. The information selection apparatus can present the other information when the desired audio information is selected.
By presenting information other than the voice information such as image information retained by the information source, another information of audio information that is not presented can be provided in addition to the another information of the currently-presented audio information. Thus, the user can identify at one time more information retained by the information source.
The information other than the audio information retained by the information source is preferably image information. Information depending upon the five senses of the user (the tactual sense, the palate, the sense of smell) is allowed as long as it conveys information. For example, information can be presented by conducting a current through the clothing of the user, the scent, the taste, or the like.
According to the information stored in information storage unit 816, main control unit 812 commands play control unit 815 to play the audio information stored in audio storage unit 811 or to play the audio which is the contents of information storage unit 816 converted by audio conversion unit 814.
By presenting information other than audio information such as the image information retained by the information source during presentation of that audio information, the user can identify at one time more information retained by the information source.
According to still another aspect of the present invention, an information selection method includes the step of gradually narrowing down the information from categorized information sources by repeating the information selection method.
By gradually narrowing down the information out from categorized information sources, the user can easily arrive at the desired information out from a large amount of information sources.
According to a still further aspect of the present invention, a recording medium can be applied in which a program is recorded to cause a computer to execute the step of sequentially switching a plurality of information source as audio information and presenting the selected information source by a sound source, and the step of selecting from the presented audio information the audio information relevant to desired information source.
According to yet a further aspect of the present invention, an information presentation apparatus presenting a plurality of information as sound information includes a unit to modify, in presenting sound information, the presentation status according to the property of the sound information to be presented or the user status.
Accordingly, the presentation status of sound information can be easily modified according to the property such as the length volume, stereo/monaural audio, and sampling frequency of the audio information to be presented and the sound quality depending upon the reproduction hardware, or according to the user status such as the case where the user is aware or not aware of the information preceding or succeeding the information to be presented, or the acceptability of the information applied as sound to the user.
Preferably in the information presentation apparatus, the presentation status to be modified includes either the status of presenting at the same time the plurality of sound information with the presentation position changed, or the status of sequentially presenting the plurality of sound information.
By selecting the status of presenting sound information simultaneously or sequentially, the sound information can be presented, not in an inflexible manner, but in a more appropriate manner suiting the situation when the information is to be presented as audio output.
Further preferably, the information presentation apparatus includes an arrangement unit time-controlling independently the position of sound information for arrangement in presenting the plurality of sound information at the same time with the presentation position changed. The position of each sound is arranged on the circumference of substantially a circle to move in an orbit. The rotation condition and sound placement condition are set according to the property of the sound information to be presented.
By arranging the sound information on the circumference of a circle to move in an orbit with the user present at a certain contact point of that circle, the user himself/herself can easily recognize the mutual position relationship of the sound such as the most closest sound information position, the farthest sound position information, and the like in presenting a plurality of sound information at the same time. By setting the rotation condition such as the rotation speed or rotation radius, and the sound placement condition such as the distance between sounds, an information presentation environment suiting each user can be established.
The circle on which the sound information is to be arranged does not have to be a complete circle. Depending on the number of sound information presented at the same time or the user's preference, sound information can be presented with the portion of the circumference of the circle that is most remote from the user cut.
Further preferably, the information presentation apparatus includes an arrangement unit time-controlling independently the position of sound information in presenting a plurality of sound information with the presentation position altered. The position of each sound is modified to the position specified by the user independent of rotation of the position of the sound information.
By the provision of means for modifying the position of each sound to a position specified by the user independent of the rotation, i.e., without waiting for the orbital motion, sound information associated with the desired information can be obtained immediately within the reach of the user, i.e., obtained at one step without having to depress the “rewind”, “forwarding” buttons several times.
Various information selection is allowed according to the information presentation apparatus such as directly calling up another sound from a slight position change of all the sounds being presented.
According to yet another aspect of the present invention, an information presentation apparatus groups, when there is difference in the property between each sound information retained by the information, the property of each sound information prior to presenting the sound information.
By grouping the property of each sound information and presenting the normalization as sound information after monauralizing the stereo audio in advance when stereo audio and monaural audio are mixed in the sound information in the case where there is difference in the volume level of the sound information, the user can recognize the placement position of the sound information for all sound information.
Preferably, the information includes presentation information other than the sound information in the information presentation apparatus. The presentation information other than the sound information can be displayed together in displaying each sound information.
By presenting together presentation information other than sound information such as image information or information by touch, the user can obtain more easily the desired information.
Further preferably, the information presentation method presenting a plurality of information as at least sound information includes the step of modifying, in presenting sound information, the presentation status according to the property of the sound information to be presented.
According to yet a still further object of the present invention, a computer readable recording medium is provided in which a program is recorded to cause a computer to execute the step of presenting a plurality of information as sound information, and the step of modifying the presentation status of the sound information to be presented according to the property thereof.
The foregoing and other objects, features, aspects and advantages of the present invention will become more apparent from the following detailed description of the present invention when taken in conjunction with the accompanying drawings.
Embodiments of the present invention will be described hereinafter with reference to the drawings.
Referring to
At step S12, sounds differing in volume move in an orbit as described above. At step S13, the depression of select button 5 at the status of
Referring to step S12 of
A second embodiment of the present invention will be described with reference to
In the case where the user is to select information, the user operates a pointer F on display device 8 using a pointing device such as a cross pad 7, for example, to directly select an image on display device 8. Alternatively, when that user wishes to select another sports program under the status of
Referring to step S32 of
When categorization is possible in the case where there are many selection menus, an index of the categories is produced by the sound corresponding to a read out of the label of each index. By selecting that sound, the contents of a relevant category can be made to move in an orbit as a certain sound.
Control device 31 may be a personal computer. A drive device 40 of a recording medium 41 is provided at personal computer 31. A program to cause the personal computer to carry out an operation according to the control flow shown in
Following the above description of the method of the first and second embodiments, an apparatus realizing the present invention will be described hereinafter with reference to
According to the information stored in information storage unit 814, main control unit 812 commands play control unit 815 to play the audio information stored in audio storage unit 811 or to play the audio which is the contents of information storage unit 814 converted by audio conversion unit 812.
When there is an input from presentation indication device 84, or when there is a command from main control unit 812, play control unit 815 is directed to switch the presentation information according to menu control unit 813 for playback.
The operation of the third embodiment will be described hereinafter with reference to
User 1 listens to the audio information sequentially presented by control device 32 through headphone device 2. This differs from the operation of the previous first and second embodiments. Control device 32 includes a controller with a rewind button 4, a select button 5, and a forward button 6. Control device 32 sequentially switches the audio information corresponding to a plurality of information sources. More specifically, when there is a total of S information P to be presented, information P is sorted from P1 to PS. Control device 32 sequentially switches the information for presentation such as presenting information P2 following the presentation of P1, presenting information P3 following the presentation of P2, . . . .
The sorting sequence (arranged order) of the information presented to the user may reflect the user's preference corresponding to a preceding selection.
By providing three operation buttons which are actually presented by voice and selecting appropriate audio information to proceed the operation, an interface that alleviates the confirmation task of the operation buttons and that can be specified audibly is realized.
By dividing the information presentation into four stages, i.e., presentation of a plurality of mails (S302) and a menu of operations for the selected mail (S304, S306, S308), the required information can be presented to the user corresponding to each situation. Selection of the mail per se and the operation with respect to the selected mail can be effected according to the same method and same buttons.
The operation of the memory will be described with reference to
According to the contents of information storage unit 816, main control unit 812 provides control to enter the number of mails in the appropriate portion of the information “A new mail has arrived. There are now ◯◯ new mails.” (S301) The information is converted into an audio message at audio conversion unit 812, and play control unit 815 is commanded to play the converted audio message. Then, information of a list of the senders of the received mails stored in information storage unit 816 and that are now converted to the voice of “◯◯th mail, from ◯◯” through audio conversion unit 812 (S3021–S3023), and also the message of “End” stored in audio storage unit 811 (S3024) are stored in information storage unit 816 as the mail selection menu.
Menu control unit 813 directs play control unit 815 to play in order each information in the stored audio menu S302.
In the case where user 85 depresses the select button in presentation indication device 84 during playback of the voice reading out the name of the mail sender (S3021–S3023), main control unit 812 makes an inquiry to information storage unit 816 as to the label information corresponding to the mail that was currently displayed when the user depressed the button. The relevant information is converted into a voice message indicating the number, sender name, and title label of the selected mail such as “◯th mail from ◯◯ regarding ◯◯” (S303) via audio conversion unit 812. The converted audio information is stored in information storage unit 816 and played by representation control unit 815.
Then, the voice information reading out the operation that can be carried out with respect to the selected mail such as “Read out (S3041)”, “The next mail will be read out (S3042)” is provided from audio storage unit 811, and stored in information storage unit 816 as the function menu (S304). Menu control unit 813 directs play control unit 815 to play in each sequential order the operations in function menu (S304) stored in information storage unit 816.
In a similar manner thereafter, the voice menu is read into information storage unit 816 according to the instruction of user 85, and audio presentation is directed by menu control unit 813.
The fixed messages of S3024, S304, S305, S306, S307 and S308 in
In contrast, the messages of S301, S3021–S3023 and S303 differ according to the contents of the mail. The mail contents stored in information storage unit 815 are converted into appropriate speech via audio conversion unit 814. The converted audio information is stored in information storage unit 815 and played by play control unit 815.
Here, audio conversion unit 814 applies voice synthesization on the text since mail is handled in the present embodiment. Not only voice synthesization, but audio other than the human voice can be handled depending upon the information of interest.
The present invention is not limited to a mail application, and is also applicable to the program guidance and the like of music CDs and multichannel broadcasting.
According to the example of
One or a plurality of the “Return to one previous menu” message can be prepared in a series of indexes. When there is a great number of indexes, this message can be prepared at a constant interval.
In the case where the present invention is applied to the program guidance of a multichannel broadcast, indexes such as for each genre, target age and the like can be prepared in addition to the indexes for each channel and time to facilitate the user's selection of a desired program. The index label may be presented in the form of a synthesized speech, or in the form of the exact audible output of the program.
When a list of the programs that is currently being broadcasted is to be presented according to the method of the present invention, presentation can be provided in a fashion switching the channel for every predetermined time. Alternatively, an audio output characteristic of respective programs can be presented as the program information. In this case where the program of a certain channel changes to another program of that same channel over time, the audio output of the new program is sequentially presented corresponding thereto.
The sixth embodiment is directed to presenting information visually and audibly using visual information in addition to the auditory information. The sixth embodiment employing visual information in a supplementary manner will be described hereinafter with reference to
Referring to
Information other than the information that is currently presented by audio can be viewed by employing the image information as described above. Also, information that cannot be specified for presentation by just one depression of rewind button 4 and forward button 6 can be specified using the cross pad. The cross pad used as the pointing device may be a shuttle ring, a jog dial, a mouse, a joy stick or the like.
Following the description of the method of the present invention, an audio information selection apparatus according to the third to sixth embodiments will be described hereinafter with reference to
An audio information selection apparatus 200 of the third to sixth embodiments includes a headphone 72, a hard disk 73, a memory 74, a CPU 75, and a controller 76 with a cross pad, a select button, a forward button, a rewind button, and a button to return to the previous hierarchy. A program is supplied via a network or from hard disk 73 in the form of a program medium to be stored in memory 74. CPU 75 reads out the program on memory 74 and supplies the information to headphone 72. Also, control is provided to emphasize the image of the information that is currently presented corresponding to the presented audio output. Upon an instruction from controller 76, CPU 75 determines that the information according to the audio information that was supplied to headphone 72 at the time controller 76 was depressed is the desired information of the user.
The present invention is applicable to a wide range as the technique of selecting a desired one out from a menu of a plurality of information in addition to the above-described mail application, CD categorizing, and multichannel broadcasting applications corresponding to the third to fifth embodiments.
The seventh embodiment of the present invention is basically similar to the first to third embodiments. The method of presenting the plurality of information as sound information differs from the previous-described embodiments. The block diagram and the system diagram of the audio information selection apparatus are basically similar to
The flow of directing the switching of the presentation method will be described with reference to
The threshold value may be a consistent value of the system, or a value set for each user according to the operation history of each user.
A step of normalizing the volume or stereo/monaural audio prior to initiating presentation can be carried out(S1205). The system executes the process automatically up to S1205, and then initiates presentation(S1206). A switching specification from the user (switching between the methods (S1207, S1208), and specifying/canceling normalization of audio parameters (S1209, S1212)) is always acceptable after initiation of the presentation and until the end of the presentation (S1210).
Method 1 will be described with reference to
Control device 11 carries out the operation to realize the left and right volume adjustment of headphone device 2 or a stereo speaker so as to allow user 1 to recognize sounds P1–P2N located on a horizontal plane in the front direction of user 1, and provides control to effect simultaneous play as if the plurality of sounds P1–P2N at a constant interval move in an orbit. In controlling the rotation of the orbit, the volume is altered according to the rotation so that the volume of a certain sound P is set to the maximum when at a point A closest to the user and to the lowest volume when at a point B farthest from the user. The volume is repeatedly increased and lowered gradually to attain the largest volume at point A and the lowest volume at point B. The relationship of the volume of each sound in
Only a predetermined number of programs can be extracted to move in an orbit for selection when there are so many programs that it is difficult for a simultaneous orbital motion of all the programs such as in the case of a movie broadcast guidance of a multichannel broadcast. By replacing the sound that has not been selected over a predetermined period with information that is not yet included in the orbit at a point B most remote from the user, one information can be selected from a great many selection branches (S7322, S7323 in
In the case where the position of the sound is to be slightly advanced or returned, fast-forward button 6 or rewind button 4 is shortly depressed, respectively, in order to control the rotation of the sound. Fast-forward button 6 or rewind button 4 is to be depressed for a rather long time when another sound is to be drawn close. The manner of button depression is not limited to this example. The position change of the sound is controlled in method 1 in response to the step indicating sound position change by the user (S733 and S734 in
Presentation may be devised to facilitate the usability for the user by altering the parameters related to the sound placement and rotation such as the rotation speed, the rotation radius, the distance between sounds and the like according to the length and the like of the time of play of each sound for method 1. Also, a step or means for commnding the change of the parameters on the user side can be added.
Method 2 will be described with reference to
The operation of method 2 is similar to that of the third embodiment. Therefore, description thereof will not be repeated.
The sorting sequence of the information presented to the user (arranged order) can reflect the user's preference in selecting previous information in methods 1 and 2. When there is difference in the volume of each audio to be presented, normalization can be carried out prior to presentation to play all the audio at substantially the same volume level.
It is needless to say that the sixth embodiment is applicable to the apparatuses for mails and CDs as in the fourth and fifth embodiments.
An example of employing the seventh embodiment in processing mails will be described hereinafter.
When it is not necessary to listen to the message from the beginning to the end in order to ascertain the contents of the information, depression of forward button 6 in
By employing method 1 in the case where the menu to be presented is long or when the information is to be obtained auditory while knowing the preceding and succeeding mails and by employing method 2 in the case where the menu to be presented is relatively short and each one is to be listened, information can be presented at the optimum status for the user regardless of the contents of presentation. Switching between the methods can be effected automatically by main control unit 112. Alternatively, a button for the user to specify switching can be provided so that main control unit 112 executes switching upon receiving the user's instruction.
By providing a common interface including at least a select button 5 with respect to methods 1 and 2 and additionally a rewind button 4 and fast-forward button 6, the user can obtain information with the same button and same operation even if the method is switched.
The operation of the memory is similar to that of the third embodiment. Therefore, description thereof will not be repeated.
The method of switching between method 1 and method 2 appropriately corresponding to the seventh embodiment applied to a music CD retrieval system will be described hereinafter. The normal operation is identical to that of the third embodiment.
Upon starting the system of the present application, a voice that reads out the genre as information is first presented. In the case of voice information by a guidance speech that can be presented in a relatively short time such as “Pops” and “Classic”, determination is made that method 2 is appropriate. The audio information is sequentially switched and presented to the user according to method 2. In the case where music representative of each genre is employed as an index instead of the voice guidance, to determination is made that method 1 is appropriate. The sound of each music is played and moved in an orbit using method 1 to be presented to the user. In the present example, main control unit 112 switches between method 1 and method 2 depending upon the property of the audio to be presented. Information is presented to the user according to either method 1 or 2. A button that allows switching between method 1 and 2 according to the user's preference can be applied so that main control unit 112 switches between method 1 and 2 according to the user's instruction.
By providing a common interface that includes at least determination button 5 for methods 1 and 2 and additional rewind button 4 and fast-forward button 6, the user can obtain information with the same button and operation even if the method is switched.
In the case where the present invention is applied to a program guidance of a multichannel broadcast, indexes such as for each genre, target age and the like can be prepared in addition to the indexes for each channel and time to facilitate the user's selection of a desired program. The index label may be presented in the form of a synthesized speech, or in the form of the exact audible output of the program.
When a list of the programs that is currently being broadcasted is to be presented according to the method of the present invention, presentation can be provided in a fashion switching the channel for every predetermined time. Alternatively, an audio output characteristic of respective programs can be presented as the program information. In this case, when the program of a certain channel changes to another program of that same channel over time, the audio output of the new program is sequentially presented corresponding thereto.
Here, main control unit 112 can implement determination automatically to switch between method 1 or 2 such as selecting method 1 when the audio output of a program is to be directly output and method 2 when a relatively short audio output such as an index is used.
When presentation is effected using method 1 in the case where each audio to be presented is recorded at a different record level, there is a possibility of difference in volume which is one element for the user to recognize the sound placement position that “sound at a remote site is small whereas the sound at a close site is large” may be lost. Presentation can be presented without deteriorating the usability by normalizing the volume of each sound prior to presentation using method 1. By incorporating means to switch between method 1 and method 2, presentation can be provided switched to method 2 with the volume at the same level when the user decides to negate normalization of the volume level. It is also possible to provide presentation by method 1 without altering the volume level even if the audibility is slightly difficult. Thus, a user-friendly presentation means can be selected by the user corresponding to various cases.
In the case where there is audio that includes a spatial wide sense such as recording at a concert in the audio to be presented, it may be difficult for the user to identify the location of the sound when presentation is carried out by method 1 using such audio. By converting such audio including this spatial wide sense into monaural audio prior to presentation using method 1, presentation can be provided without deteriorating the usability.
The present invention affords an application of presenting candidates of home pages disclosing information preferable for the user in the form of voice in browsing the home page on WWW. More specifically, the home page can be presented in voice form by an appropriate method such as voice-synthesizing the text information on the home page, employing the music information attached to the home page, converting the color information of the home page into sound and the like.
By converting the information that is to be presented as a candidate to the user into audio by an appropriate method, information can be presented to the user by method 1 or 2 according to the property (playback time or the like) of the voiced information as audio. The user can browse any type of a plurality of information.
The eighth embodiment of the present invention presents information to the user both visually and audibly using visual information in a supplementary manner in addition to the auditory information of the seventh embodiment. The eighth embodiment employing visual information in a supplementary manner will be described with reference to
In the case where the user is to select information, the user operates a pointer F on display device 8 using a pointing device such as a cross pad 7, for example, to directly select an image on display device 8. Alternatively, when that user wishes to select information located at a remote site, the user waits until the desired image is exactly in front and the sound is heard at the largest volume. Instead of waiting, the user can provide control to let the desired information to be located in front and hear the sound at the largest volume using fast-forward button 6 or rewind button 4. Then, select button 5 or cross pad 7 is depressed.
The change in the screen of display device 87 when the presented information is switched is identical to that described with reference to
An audio information selection apparatus 400 according to the seventh and eighth embodiments will be described hereinafter.
Referring to
The pointing device of the present invention may also have the function of repeating tactual sensation feedback according to the degree of the operation of the user. More specifically, tactual sensation feedback can be incorporated besides sound and visual recognition of the portion of information. For example, it may be generally difficult for the user to identify the location of sound even if image information is added to the audio information. The pointing device such as a track ball itself can be rotated slowly corresponding to the presentation status of the sound. By applying the tactual sensation feedback such as a smooth move when in the fast-forward operation and a move with slight resistance when in a rewind operation in moving the sound information, the user can recognize more properly the sound position. In the operation of drawing sound information near, the operability of the user side can be improved by applying some feedback such as the tactual sensation in the interface device of the pointing device in synchronization with the rotation of the sound information.
The present invention is not limited to the above-described embodiment. The present invention may be a computer-readable recording medium in which a program is recorded to cause a computer to operate as control device 3. For example, it may be any type of a recording medium such as a magnetic tape, a CD-ROM, an IC card, a RAM card or the like.
In addition to the application to mail, CD categorization, multichannel broadcasting, and WWW browsing described in the present specification, the present invention is applicable to a wide range of field as the technique of selecting a desired one from a menu selecting one information from a plurality of information.
Although the present invention has been described and illustrated in detail, it is clearly understood that the same is by way of illustration and example only and is not to be taken by way of limitation, the spirit and scope of the present invention being limited only by the terms of the appended claims.
Number | Date | Country | Kind |
---|---|---|---|
11-036299 | Feb 1999 | JP | national |
11-182801 | Jun 1999 | JP | national |
11-306499 | Oct 1999 | JP | national |
Number | Name | Date | Kind |
---|---|---|---|
5963215 | Rosenzweig | Oct 1999 | A |
6327567 | Willehadson et al. | Dec 2001 | B1 |
Number | Date | Country |
---|---|---|
5-273991 | Oct 1993 | JP |
5-281987 | Oct 1993 | JP |
A6149517 | May 1994 | JP |
09-097154 | Apr 1997 | JP |
A10124292 | May 1998 | JP |
11-039132 | Feb 1999 | JP |