This application claims the priority of Korean Patent Application No. 10-2003-0092468 filed on Dec. 17, 2003 in the Korean Intellectual Property Office, the disclosure of which is incorporated herein in its entirety by reference.
1. Field of the Invention
The present invention relates to a method supporting text-to-speech (TTS) navigation and a multimedia device using the same and, more particularly, to a method for providing information on multimedia files by use of voice signals at a multimedia device such as a multimedia player instead of image signals displayed through a Liquid Crystal Display (LCD) screen and a multimedia device using the method.
2. Description of the Related Art
As shown in
The multimedia file stored in the multimedia device 3 is played through a multimedia output unit 36 by allowing a user to manipulate a functional button, such as a Next/Previous button, of a User Interface (UI) processing unit 30.
However, when intending to select a desired media file according to the prior art, the user has no choice but to observe and search the desired media file through a character displayer of the UI unit, or to directly play and confirm the desired media file only though the corresponding content.
In addition, when the multimedia device has a built-in device for generating a Text-To-Speech (TTS) voice file, for example for providing TTS navigation, as disclosed in Korean Unexamined Patent Publication No. 2002-0048357, the production cost is increased, moreover the multimedia device requires a high-powered processor and mass storage space to provide the TTS navigation ensuring high voice quality.
To address the above-indicated problems, it is, therefore, an objective of the invention to provide a multimedia device that does not require a separate device or cost for the TTS navigation, so that a user of the multimedia device is able to search a multimedia file while hearing a voice without seeing a screen.
To achieve these objectives, consistent with one aspect of the invention, there is provided a multimedia device supporting text-to-speech navigation. The multimedia device comprises a storage unit for analyzing at least one multimedia file and storing both at least one text-to-speech voice file generated and received from a host device and the multimedia file, a user interface processing unit providing a user interface and receiving a command of a user to make it possible to search information on the file, and a data read unit reading the text-to-speech voice file from the storage unit according to a control of the user interface processing unit in order to output the text-to-speech voice file.
Consistent with another aspect of the invention, there is provided a method for providing text-to-speech navigation at a multimedia device consistent with the exemplary embodiment of the invention. The method comprises the step of analyzing information on a stored multimedia file to generate a text-to-speech voice file, receiving the generated text-to-speech voice file from a host device, storing the received text-to-speech voice file, and searching the stored text-to-speech voice file according to selection of a user, and the step of outputting the searched text-to-speech voice file.
The above aspects, features and advantages of the present invention will become more apparent from the following detailed description when taken in conjunction with the accompanying drawings, in which:
Exemplary embodiments of the invention will be described below in detail with reference to the accompanying drawings.
Multimedia devices are divided into two kinds according to a type of a storage unit for storing the multimedia file.
One kind of multimedia device is one having a separate storage unit (write-enable compact disc (CD)) like a CD player. In this case, the following description will be made regarding the multimedia device with reference to FIGS. 2 to 4, and regarding the operation of the multimedia device with reference to
The other kind of multimedia device refers to one providing direct online connection to the host device to transmit a file through a download manager like an MP3 player. In this case, the following description will be made regarding the multimedia device with reference to FIGS. 5 to 7, and regarding an operation of the multimedia device with reference to FIGS. 10 to 12.
Hereinafter, the exemplary embodiment of the invention will be described in detail with reference to the drawings.
As shown in
The multimedia storage unit 10 receives and stores the multimedia file stored in the separate storage unit 5 of the multimedia device.
The multimedia read unit 12 reads meta data, such as a file name, information of a header and so on, through the multimedia file stored in the multimedia storage unit 10.
The multimedia information analysis unit 14 receives the meta data which the multimedia read unit 12 has read to generate a TTS voice file by means of the TTS file generation unit 16.
The multimedia record unit 18 records and stores the generated TTS voice file in the separate storage unit 5.
As shown in
Further, when voice guidance information of the multimedia file as well as the TTS voice file for directory information is required, it is possible to add the TTS voice file for the directory information. For example, in case that the number of multimedia files is increased on the CD, there is a possibility of inconvenience in searching and outputting a specific multimedia file while the user hears the voice guidance information of the TTS voice file. In this case, if the TTS voice file for the directory information is added, the user can retrieve the directory while hearing the voice guidance information of the TTS voice file.
The TTS voice file, whether it contains the directory information or not, may be realized in such a manner that it is stored in a buffer memory of the multimedia device.
The buffer memory, in the case of the portable CD player, is used to pre-store a particular part of multimedia information against a shock caused by a user in action. Generally, the multimedia file stored in the buffer memory is used when such a shock is applied to the CD player and thus the multimedia file may not be read smoothly out from a CD rotating at a high speed. Further, the buffer memory is hardly used on retrieving. Hence, in the case where the TTS voice files are stored in the buffer memory in advance, the user is able to rapidly search the multimedia information contained in the CD while hearing the voice guidance information.
As shown in
The UI processing unit 30 consists of a UI section 300 and a UI control section 302. The UI section 300 provides a user interface for a user and functions to receive the resultant command of the user. The UI control section 302 interprets input information of the user, and reads the TTS voice file stored in the separate storage unit 5 by means of a TTS voice file read section 320 of the data read unit 32, so that the UI control section 302 makes it possible to search information on the file.
The data read unit 32 includes the TTS voice file read section 320 and a multimedia data read section 322. The TTS voice file read section 320 functions to read the TTS voice file from the separate storage unit 5 according to the control of the UI control section 302 over the input command of the UI section 300. The multimedia data read section 322 reads the TTS voice file from the separate storage unit 5.
The multimedia output unit 34 outputs the TTS voice file.
As shown in
The multimedia storage unit 10 reads and stores the multimedia file stored in a built-in storage unit 38 (see
The download management unit 17 records and stores the TTS voice file in the built-in storage unit 38.
The multimedia read unit 12, the multimedia information analysis unit 14 and the TTS file generation unit 16 are the same configuration as that described with reference to
As shown in
The data read unit 32 includes a TTS voice file read section 320 and a multimedia data read section 322. The TTS voice file read section 320 functions to read the TTS voice file from the built-in storage unit 38 according to the control of a UI control section 302 of the UI processing unit 30 on the input command of a UI section 300 of the UI processing unit 30. The multimedia data read section 322 reads the TTS voice file from the built-in storage unit 38.
The built-in storage unit 38 includes a download management section 380, a multimedia record section 382 and a multimedia storage section 384.
The download management section 380 takes charge of transmission of the TTS voice file from the host device 1. A process of transmitting the TTS voice file will be described below with reference to
The multimedia record section 382 records the TTS voice file received from the host device 1 in the multimedia storage section 384.
The multimedia storage section 384 is comprised of a multimedia file directory 3840, a TTS voice file storage part 3842, and a multimedia file storage part 3844. See
The UI processing unit 30 and the multimedia output unit 34 are the same configuration as that described with reference to
As shown in
The multimedia file directory 3840 refers to a table containing information not only on a name of the multimedia file and its storage location but also on a storage location of the TTS voice file, in which these information are used to provide a service for the TTS navigation. The multimedia file directory 3840 has a conceptual structure, which may be realized into the table form as well as another form (e.g., a tree form, a connection list, and so forth) capable of expressing information of the name, the storage location etc.
It is possible to realize the multimedia file directory 3840 to be stored in a memory area, which is distinguished from a memory area in which the multimedia file or the TTS voice file is stored. Information of the multimedia file directory 3840 allows the name and storage location of the multimedia file, the storage location of the TTS voice file, etc. which are stored in the storage section in reality to be rapidly searched.
As shown in
The step S80 of extracting information on files consists of a step S802 of reading files stored in the storage unit 10 and a step S804 of extracting the information on the files.
The step S82 of generating a TTS voice file consists of a step of, at the multimedia read unit 12, analyzing information on a name or header of the multimedia file to generate a meta file in a text format, and a step of, at the TTS file generation unit 16, generating the TTS voice file from the meta file.
In the step S84 of storing the files, the generated TTS voice file and multimedia file as well as location information representing locations of the TTS voice file and the multimedia file are recorded in the separate storage unit 5.
The step S86 of terminating the recording of the files terminates the recording of the files when there is no multimedia file to be recorded.
As shown in
As shown in
The step S100 of extracting information on files consists of a step S1002 of reading files from the multimedia storage unit 10 of the host device and a step S1004 of extracting the information on the files. In the step S104 of storing the files, the generated TTS voice file is recorded in the built-in storage unit 38. The step S102 of generating the TTS voice file and the step S106 of terminating the recording of the files are the same configuration as that described with reference to
As shown in
As shown in
According to the invention, it is possible to provide the multimedia device that does not require a separate device or cost for the TTS navigation, so that a user of the multimedia device is able to search the multimedia file while hearing a desired voice without seeing a screen.
While the invention has been shown and described with reference to a certain exemplary embodiment thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.
Number | Date | Country | Kind |
---|---|---|---|
10-2003-0092468 | Dec 2003 | KR | national |