Information
-
Patent Application
-
20040261133
-
Publication Number
20040261133
-
Date Filed
July 15, 200420 years ago
-
Date Published
December 23, 200420 years ago
-
Inventors
-
Original Assignees
-
CPC
-
US Classifications
-
International Classifications
Abstract
By providing a function of converting one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 into an MPEG-4SL stream (single data stream), and a function of transmitting that MPEG-4SL stream to a host (20) using Isochronous or a bulk transfer complying with USB2.0, one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 can be transmitted to another apparatus (host or the like) using Isochronous or a bulk transfer complying with USB2.0.
Description
TECHNICAL FIELD
[0001] The present invention relates to a video processing apparatus, which handles an elementary stream and, especially, one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999, and the like.
BACKGROUND ART
[0002] In recent years, as one of communication interfaces used to connect a host (personal computer or the like) and peripheral device (including a video processing apparatus such as a digital video camera or the like), USB2.0 (see Universal Serial Bus Specification Revision 2.0, Apr. 27, 2000) is known.
[0003] Also, in recent years, as one of visual and audio object encoding methods, an encoding method called MPEG-4 has received a lot of attention (e.g., see ISO/IEC 14496-1:1999, Information technology—Coding of audio-visual objects—Part 1: Systems, ISO/IEC 14496-2:1999, Information technology—Coding of audio-visual objects—Part 2: Visual and ISO/IEC 14496-3:1999, Information technology—Coding of audio-visual objects—Part 3: Audio).
[0004] However, there is no conventional video processing apparatus (especially, a digital video camera) which can transmit one or a plurality of elementary streams complying with MPEG-4 (ISO/IEC 14496-1:1999) to another apparatus (host or the like) using Isochronous Transfer or Bulk Transfer specified by USB2.0. Furthermore, there is no conventional video processing apparatus which can receive one or a plurality of elementary streams complying with MPEG-4 (ISO/IEC 14496-1:1999) from another apparatus (host or the like) using Isochronous Transfer or Bulk Transfer specified by USB2.0.
DISCLOSURE OF INVENTION
[0005] The present invention has been made to solve such problems, and has as its principal object to provide an apparatus and method, which can transmit one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 to another apparatus (host or the like) using Isochronous Transfer or Bulk Transfer specified by USB2.0.
[0006] It is another principal object of the present invention to provide an apparatus and method, which can receive one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 from another apparatus (host or the like) using Isochronous Transfer or Bulk Transfer specified by USB2.0.
[0007] According to an aspect of the present invention, there is provided a video processing apparatus comprising: conversion means for converting one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 into one data stream; and transmission means for transmitting the data stream to another apparatus using an isochronous transfer complying with USB2.0.
[0008] According to another aspect of the present invention, there is provided a communication method of a video processing apparatus, comprising: converting one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 into one data stream; and transmitting the data stream to another apparatus using an isochronous transfer complying with USB2.0.
[0009] According to further aspect of the present invention, there is provided a video processing apparatus comprising: conversion means for converting one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 into one data stream; and transmission means for transmitting the data stream to another apparatus using a bulk transfer complying with USB2.0.
[0010] According to yet further aspect of the present invention, there is provided a communication method of a video processing apparatus, comprising: converting one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 into one data stream; and transmitting the data stream to another apparatus using a bulk transfer complying with USB2.0.
[0011] According to another aspect of the present invention, there is provided a video processing apparatus comprising: reception means for receiving one data stream transmitted from another apparatus using an isochronous transfer complying with USB2.0; and restoration means for restoring the data stream to one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999.
[0012] According to still further aspect of the present invention, there is provided a communication method of a video processing apparatus, comprising: receiving one data stream transmitted from another apparatus using an isochronous transfer complying with USB2.0; and restoring the data stream to one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999.
[0013] According to further aspect of the present invention, there is provided a video processing apparatus comprising: reception means for receiving one data stream transmitted from another apparatus using a bulk transfer complying with USB2.0; and restoration means for restoring the data stream to one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999.
[0014] According to another aspect of the present invention, there is provided a communication method of a video processing apparatus, comprising: receiving one data stream transmitted from another apparatus using a bulk transfer complying with USB2.0; and restoring the data stream to one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999.
[0015] Other features and advantages of the present invention will be apparent from the following description taken in conjunction with the accompanying drawings, in which like reference characters designate the same or similar parts throughout the figures thereof.
BRIEF DESCRIPTION OF DRAWINGS
[0016] The accompanying drawings, which are incorporated in and constitute a part of the specification, illustrate embodiments of the invention and, together with the description, serve to explain the principles of the invention.
[0017]
FIG. 1 is a block diagram showing principal building components of a communication system according to the first embodiment;
[0018]
FIG. 2 is a block diagram showing building components of a video processing apparatus 10 and host 20;
[0019]
FIG. 3 is a flowchart for explaining a sequence for converting one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 into an MPEG-4SL stream;
[0020]
FIG. 4 is a flowchart for explaining a sequence for restoring an MPEG-4SL stream to one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999;
[0021]
FIG. 5 is a view for explaining the format of an Mux packet;
[0022]
FIG. 6 is a chart for explaining an example of a sequence for transferring an MPEG-4SL stream using Isochronous IN Transaction of USB2.0;
[0023]
FIG. 7 is a chart for explaining an example of a sequence for transferring an MPEG-4SL stream using Isochronous OUT Transaction of USB2.0;
[0024]
FIG. 8 is chart for explaining an example of a sequence for transferring an MPEG-4SL stream using Bulk IN Transaction of USB2.0;
[0025]
FIG. 9 is a chart for explaining an example of a sequence for transferring an MPEG-4SL stream using Bulk OUT Transaction of USB2.0;
[0026]
FIG. 10 is a view for explaining the format of a stream header;
[0027]
FIG. 11 is a view for explaining the format of an MPEG-4SL Stream Format Descriptor; and
[0028]
FIG. 12 is a view for explaining the structure of a stream error code control data.
BEST MODE FOR CARRYING OUT THE INVENTION
[0029] Preferred embodiments of the present invention will now be described in detail in accordance with the accompanying drawings.
[0030] [First Embodiment]
[0031] Principal building components of a communication system in the first embodiment will be described first with reference to FIG. 1. A video processing apparatus 10 and host 20 are apparatuses which can handle one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999. For example, these apparatuses can handle elementary streams of visual objects complying with ISO/IEC 14496-2:1999, audio objects complying with ISO/IEC 14496-3:1999, and scene description complying with ISO/IEC 14496-1:1999. The video processing apparatus 10 comprises, e.g., a digital video camera, and the host 20 comprises, e.g., a personal computer. Note that the video processing apparatus 10 is not limited to a digital video camera, but may comprise a digital video recorder, a portable phone with a camera function, and the like.
[0032] The video processing apparatus 10 and host 20 can communicate with each other using digital interface units complying with the USB2.0 standard. The video processing apparatus 10 is an apparatus having a function of “device” defined by USB2.0, and the host 20 is an apparatus having a function of “host” defined by USB2.0.
[0033] The video processing apparatus 10 has a function of converting one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 into an MPEG-4SL (Synchronization Layer) stream (a single data stream which is uniquely defined by the first embodiment), and a function of transmitting that MPEG-4SL stream to the host 20 using Isochronous Transfer or Bulk Transfer of USB2.0. Furthermore, the video processing apparatus 10 has a function of receiving an MPEG-4SL stream transmitted from the host 20 using Isochronous Transfer or Bulk Transfer of USB2.0, and a function of restoring the received MPEG-4SL stream to one or a plurality of elementary streams. The host 20 also has these functions as in the video processing apparatus 10.
[0034] Building components of the video processing apparatus 10 and host 20 will be described below with reference to FIG. 2. Each of SL packet processors 201-1 to 201-N executes a process for generating a plurality of SL (Synchronization Layer) packets from each of elementary streams 1 to N, and its reverse process. An Mux packet processor 202 executes a process for generating one Mux packet from respective SL packets, and its reverse process. An MPEG-4SL stream processor 203 executes a process for generating one MPEG-4SL stream by multiplexing a plurality of Mux packets, and its reverse process. The MPEG-4SL stream processor 203 also generates and interprets a stream header to be described later. A digital interface unit 204 complies with USB2.0, and transmits or receives an MPEG-4SL stream according to the sequences shown in FIGS. 6 to 9.
[0035] The processing sequence of the function of converting one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 into one MPEG-4SL stream will be described below with reference to FIG. 3.
[0036] Step S301: Each of the SL packet processors 201-1 to 201-N generates a plurality of SL (Synchronization Layer) packets from each of elementary streams 1 to N, and supplies the plurality of generated SL packets to the Mux packet processor 202. Since the format of each SL packet is defined by ISO/IEC 14496-1:1999, a detailed description thereof will be omitted.
[0037] Step S302: The Mux packet processor 202 generates one Mux packet from respective SL packets, and supplies the generated Mux packet to the MPEG-4SL stream processor 203. The Mux packet is different from a FlexMux packet defined by ISO/IEC 14496-1:1999, and is uniquely defined by the first embodiment. The format of the Mux packet will be described later (see FIG. 5).
[0038] Step S303: The MPEG-4SL stream processor 203 generates one MPEG-4SL stream by multiplexing a plurality of Mux packets.
[0039] In this way, the video processing apparatus 10 and host 20 can convert one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 into an MPEG-4SL stream as a single data stream. In other words, these apparatuses can multiplex one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 to an MPEG-4SL stream as a single data stream. Also, since the video processing apparatus 10 and host 20 can convert one or a plurality of elementary streams (complying with ISO/IEC 14496-1:1999) into one MPEG-4SL stream, they can output one or a plurality of elementary streams from one end point (Isochronous end point or Bulk end point).
[0040] The processing sequence of the function of restoring one MPEG-4SL stream to one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 will be described below with reference to FIG. 4.
[0041] Step S401: The MPEG-4SL stream processor 203 demultiplexes one MPEG-4SL stream into a plurality of Mux packets, and supplies these Mux packets to the Mux packet processor 202.
[0042] Step S402: The Mux packet processor 202 extracts ES_ID data and SL packets from the respective Mux packets, and identifies an elementary stream to which SL packets belong using the ES_ID data. The Mux packet processor 202 supplies the SL packets to one of the SL packet processors 201-1 to 201-N. The ES_ID data will be described later (see FIG. 5).
[0043] Step S403: Each of the SL packet processors 201-1 to 201-N reclaims each of elementary streams 1 to N from a plurality of SL packets obtained from the Mux packet processor 202.
[0044] In this way, the video processing apparatus 10 and host 20 can restore one MPEG-4SL stream to one or a plurality of elementary streams. Since the video processing apparatus 10 and host 20 can restore one MPEG-4SL stream to one or a plurality of elementary streams, they can input one or a plurality of elementary streams from one end point (Isochronous end point or Bulk end point).
[0045] The format of a Mux packet generated by the Mux packet processor in this embodiment will be described below with reference to FIG. 5. Referring to FIG. 5, MuxH is the header of a Mux packet, and MuxP is the payload of the Mux packet. MuxH includes additional information such as ES_ID (Elementary Stream Identifier) data and the like. The ES_ID is an identifier used to identify an elementary stream to which an SL packet in MuxP belongs, and is the same as ES_ID defined by ISO/IEC 14496-1:1999. In MuxP, SLH is the header of an SL packet, and SLP is the payload of the SL packet. In the first embodiment, one Mux packet is generated from only one SL packet. Also, in the first embodiment, even when an Mux packet is generated from only one elementary stream (i.e., even when an MPEG-4SL stream is generated from only one elementary stream), the ES_ID is not omitted. Since each Mux packet includes ES_ID data, SL packets obtained from a plurality of elementary streams can be multiplexed and restored.
[0046] The stream format (MPEG-4SL stream format) upon transmitting an Mux packet in various transfer modes of USB2.0 will be described in detail below with reference to FIGS. 6 to 9.
[0047] An example of the MPEG-4SL stream format and sequence upon transferring the Mux packet using Isochronous IN Transaction of USB2.0 will be described first with reference to FIG. 6. When Isochronous IN Transaction is used, the transmission direction of an MPEG-4SL stream is a direction from the video processing apparatus 10 to the host 20. FIG. 6 will explain an example using High-speed mode of USB2.0.
[0048] Referring to FIG. 6, an IN token is a transmission request packet of DATA2, DATA1, or DATA0 from the video processing apparatus 10 to the host 20. DATA2, DATA1, and DATA0 are data packets to be transferred from the video processing apparatus 10 to the host 20 in Isochronous Transfer mode. SH (stream header) is a header which is transmitted every microframe (125 micro-seconds), and payloads #1 to #N (N is an integer equal to or larger than 2) are data obtained from one Mux packet. That is, one Mux packet is formed of payloads #1 to #N.
[0049] The MPEG-4SL stream with the format shown in FIG. 6 is generated for each Mux Packet, and is transmitted in turn. By repeating this process, the video processing apparatus 10 can transmit the MPEG-4SL stream to the host 20. In the first embodiment, in order to identify switching of Mux packets based on the stream header (SH), data obtained from a plurality of Mux packets are prevented from being mixed in one microframe. That is, the video processing apparatus 10 starts transmission of the next Mux packet from the next microframe.
[0050] In this manner, according to the first embodiment, an MPEG-4SL stream generated from an Mux packet can be transmitted using Isochronous IN Transaction of USB2.0. In other words, one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 can be transmitted from the video processing apparatus 10 to the host 20 using Isochronous Transfer of USB2.0.
[0051] An example of the MPEG-4SL stream format and sequence upon transferring the aforementioned Mux packet using Isochronous OUT Transaction of USB2.0 will be described below with reference to FIG. 7. When Isochronous OUT Transaction is used, the transmission direction of an MPEG-4SL stream is a direction from the host 20 to the video processing apparatus 10. FIG. 7 will explain an example using High-speed mode of USB2.0.
[0052] Referring to FIG. 7, an OUT token is a transmission request packet of MDATA, DATA2, or DATA1 from the host 20 to the video processing apparatus 10. MDATA, DATA2, and DATA1 are data packets to be transferred from the host 20 to the video processing apparatus 10 in Isochronous Transfer mode. SH (stream header) is a header which is transmitted every microframe (125 micro-seconds), and payloads #1 to #N (N is an integer equal to or larger than 2) are data obtained from one Mux packet. That is, one Mux packet is made up of payloads #1 to #N.
[0053] The MPEG-4SL stream with the format shown in FIG. 7 is generated for each Mux Packet, and is transmitted in turn. By repeating this process, the host 20 can transmit the MPEG-4SL stream to the video processing apparatus 10. In the first embodiment, in order to identify switching of Mux packets based on the stream header (SH), data obtained from a plurality of Mux packets are prevented from being mixed in one microframe. That is, the host 20 starts transmission of the next Mux packet from the next microframe.
[0054] In this manner, according to the communication system of the first embodiment, an Mux packet can be transmitted in the MPEG-4SL stream format using Isochronous OUT Transaction of USB2.0. In other words, one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 can be transmitted from the host 20 to the video processing apparatus 10 using Isochronous Transfer of USB2.0.
[0055] An example of the MPEG-4SL stream format and sequence upon transferring the Mux packet using Bulk IN Transaction of USB2.0 will be described below with reference to FIG. 8. When Bulk IN Transaction is used, the transmission direction of an MPEG-4SL stream is a direction from the video processing apparatus 10 to the host 20. FIG. 8 will explain an example using High-speed mode of USB2.0.
[0056] Referring to FIG. 8, an IN token is a transmission request packet of DATA0 or DATA1 from the video processing apparatus 10 to the host 20. DATA0 and DATA1 are data packets to be transferred from the video processing apparatus 10 to the host 20 in Bulk Transfer mode. SH (stream header) is a header which is transmitted every microframe (125 micro-seconds), and payloads #1 to #N (N is an integer equal to or larger than 2, N>M) are data obtained from one Mux packet. Note that M is the number of payloads to be transferred within one microframe. That is, one Mux packet is made up of payloads #1 to #N. Note that M indicates the number of payloads to be transmitted per microframe.
[0057] The MPEG-4SL stream with the format shown in FIG. 8 is generated for each Mux Packet, and is transmitted in turn. By repeating this process, the video processing apparatus 10 can transmit the MPEG-4SL stream to the host 20. In the first embodiment, in order to identify switching of Mux packets based on the stream header (SH), data obtained from a plurality of Mux packets are prevented from being mixed in one microframe. That is, the video processing apparatus 10 starts transmission of the next Mux packet from the next microframe.
[0058] In this manner, according to the first embodiment, an MPEG-4SL stream generated from an Mux packet can be transmitted using Bulk IN Transaction of USB2.0. In other words, one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 can be transmitted from the video processing apparatus 10 to the host 20 using Bulk Transfer of USB2.0.
[0059] An example of the MPEG-4SL stream format and sequence upon transferring the aforementioned Mux packet using Bulk OUT Transaction of USB2.0 will be described below with reference to FIG. 9. When Bulk OUT Transaction is used, the transmission direction of an MPEG-4SL stream is a direction from the host 20 to the video processing apparatus 10. FIG. 9 will explain an example using High-speed mode of USB2.0.
[0060] Referring to FIG. 9, OUT tokens are transmission request packets of DATA0 and DATA1 from the host 20 to the video processing apparatus 10. DATA0 and DATA1 are data packets to be transferred from the host 20 to the video processing apparatus 10 in Bulk Transfer mode. SH (stream header) is a header which is transmitted every microframe (125 micro-seconds), and payloads #1 to #N (N is an integer equal to or larger than 2, N>M) are data obtained from one Mux packet. That is, one Mux packet is made up of payloads #1 to #N. Note that M indicates the number of payloads to be transmitted per microframe.
[0061] The MPEG-4SL stream with the format shown in FIG. 9 is generated for each Mux Packet, and is transmitted in turn. By repeating this process, the host 20 can transmit the MPEG-4SL stream to the video processing apparatus 10. In the first embodiment, in order to identify switching of Mux packets based on the stream header (SH), data obtained from a plurality of Mux packets are prevented from being mixed in one microframe. That is, the host 20 starts transmission of the next Mux packet from the next microframe.
[0062] In this manner, according to the communication system of the first embodiment, an Mux packet can be transmitted in the MPEG-4SL stream format using Bulk OUT Transaction of USB2.0. In other words, one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 can be transmitted from the host 20 to the video processing apparatus 10 using Bulk Transfer of USB2.0.
[0063] An example of the format of the stream header will be described below with reference to FIG. 10. The stream header (SH) is a header transmitted every microframe (125 micro-seconds), as shown in FIGS. 6 to 9. Upon transmitting an MPEG-4SL stream from the video processing apparatus 10 to the host 20, the video processing apparatus 10 generates each stream header. Upon transmitting an MPEG-4SL stream from the host 20 to the video processing apparatus 10, the host 20 generates each stream header.
[0064] Referring to FIG. 10, a bHeaderLength field (1 byte) stores a value indicating the data length (2 bytes) of the stream header. A bmHeaderInfo field (a variable length but 1 byte in the first embodiment) includes FID (Frame ID), EOF (End of Frame), PTS (Presentation Time Stamp), SCR (Source Clock Reference), RES (Reserved), STI (Still Image), ERR (Error bit), and EOH (End of Header) fields. Note that the data length of each field in the bmHeaderInfo field is 1 bit.
[0065] Every time an Mux packet is switched, the FID field stores 0 or 1. That is, when the FID field of the immediately preceding Mux packet is 0 (or 1), that of the next Mux packet is 1 (or 0). When a payload to be transmitted after the stream header includes the last part of an Mux packet, the video processing apparatus 10 or host 20 stores 1 in the EOF field; otherwise, it stores 0 in the EOF field. The host 20 and video processing apparatus 10 can detect switching of Mux packets by checking the contents of the FID and EOF fields.
[0066] The PTS stores presentation time stamp data, and the SCR field stores source clock reference fields. However, since the first embodiment does not use these fields, 0 is stored in these fields. Since the RES field is preserved for the future use, 0 is stored in the RES field.
[0067] If all payloads to be transmitted after the stream header include still image data, the STI field stores 1; otherwise, the STI field stores 0. If at least one of payloads to be transmitted after the stream header includes any error, 1 is stored in the ERR field; otherwise, 0 is stored in the ERR field. The EOH field stores a value (1) indicating the end of the stream header.
[0068] As can be seen from the above description, according to the first embodiment, the video processing apparatus 10 can transmit one or a plurality of elementary streams to the host 20 using Isochronous or Bulk Transfer of USB2.0. Also, the host 20 can transmit one or a plurality of elementary streams to the video processing apparatus 10 using Isochronous or Bulk Transfer of USB2.0.
[0069] According to the first embodiment, the video processing apparatus 10 can receive one or a plurality of elementary streams from the host 20 using Isochronous or Bulk Transfer of USB2.0. Also, the host 20 can receive one or a plurality of elementary streams from the video processing apparatus 10 using Isochronous or Bulk Transfer of USB2.0.
[0070] According to the first embodiment, since the video processing apparatus 10 can convert one or a plurality of elementary streams into a single data stream (i.e., an MPEG-4SL stream) and can transmit that stream to the host 20, the number of end points of the video processing apparatus 10 can be reduced to one. In an example that transmits a plurality of elementary streams from independent end points, control among these end points is required. However, the first embodiment can obviate the need for such control.
[0071] According to the first embodiment, since the video processing apparatus 10 can restore a single data stream (i.e., an MPEG-4SL stream) transmitted from the host 20 to one or a plurality of elementary streams, the number of end points of the video processing apparatus 10 can be reduced to one. In an example that transmits a plurality of elementary streams from independent end points, control among these end points is required. However, the first embodiment can obviate the need for such control.
[0072] According to the first embodiment, the video processing apparatus 10 can transfer a plurality of elementary streams with different data formats using a single data stream. As described above, the video processing apparatus 10 can transfer an elementary stream of visual data complying with ISO/IEC 14496-2:1999, and that of audio data complying with ISO/IEC 14496-3:1999 using one data stream, as described above. Furthermore, an elementary stream of scene description complying with ISO/IEC 14496-1:1999 can be added to these elementary streams.
[0073] An example of the format of an MPEG-4SL Stream Format Descriptor used in the first embodiment will be described below with reference to FIG. 11.
[0074] Referring to FIG. 11, a bNumPackets field is set with the number of packets which are sent by the MPEG-4SL Stream Format Descriptor. This number is used as the number of elements of subsequent sequence data. A bPacketLength field indicates the byte length of packet data set in the next bPacket field. The bPacket field is set with an MPEG4_Mux packet of a stream to be transferred. A pair of bPacketLength and bPacket fields is used as an element of a sequence. Upon sending a plurality of packets, pairs of bPacketLength and bPacket fields which describe data of respective packets are successively described.
[0075] Note that the bPacketLength and bPacket fields are options. When there is no packet to be sent by the descriptor, 0 is set in the bNumPackets field to omit subsequent fields. In the MPEG-4SL Stream Format Descriptor of the first embodiment, a sequence data structure used to set packets of a stream to be reliably transferred is defined.
[0076] The sequence data of packets set in the MPEG-4SL Stream Format Descriptor of the first embodiment can be transmitted to the host 20 at the time of initialization or at an arbitrary timing after initialization.
[0077] As described above, the Error bit (ERR field) is defined in the stream header (SH) of the MPEG-4SL stream. When some error has occurred in the video processing apparatus 10 as a device (for example, the data format of one elementary stream has changed), the video processing apparatus 10 sets 1 as the value of the ERR field of the stream header. When the data format of a data stream has changed, the video processing apparatus 10 stops transfer of a payload output from the digital interface unit 204, and transmits the stream header alone. On the other hand, when 1 is set in the ERR field of the stream header, the host 20 requests the video processing apparatus 10 to send stream error code control data to detect the cause of the error that has occurred in the video processing apparatus 10. FIG. 12 shows the structure of the stream error code control data.
[0078] The stream error code control data has a bStreamErrorCode field. If the value of this field is other than 0, it indicates that a stream error has occurred upon transferring a video or still image. When such error has occurred, the video processing apparatus 10 as a device sets an appropriate value in the bStreamErrorCode field. In the first embodiment, when one of the following events has occurred, the host 20 must transmit a GET_CUR request that requests the stream error code control data so as to specify the cause of the error:
[0079] (1) when the device sets 1 in the Error bit (ERR field) of the stream header;
[0080] (2) when the device has issued a “Stream Error” interrupt signal; and
[0081] (3) when a bulk video end point transmits a stall packet to the host 20 in the data or handshake stage of transaction.
[0082] MPEG-4 Systems data may be transferred as a stream on a USB pipe without using any descriptor. Normally, MPEG-4 Systems data is basically transferred as a stream by a highly reliable method such as Bulk Transfer. Also, MPEG-4 Systems data may be transferred as a stream using Isochronous communications. The first embodiment will not touch any recovery process upon occurrence of a transmission error in Isochronous Transfer, but a recovery process that periodically resends MPEG-4 Systems data may be made as needed.
[0083] In the first embodiment, for example, the MPEG-4SL Stream Format Descriptor uses property information of an MPEG-4SL data stream. In the video processing apparatus 10, terminals corresponding to the USB IN and OUT end points and interfaces to which these terminals belong support a Format Descriptor corresponding to one or more formats. Upon selecting a format, host software sends a control request to a corresponding interface. In an MPEG-4SL payload, a Frame Descriptor to be appended need not be defined.
[0084] In MPEG-4 Systems, an initial object descriptor that indicates an initial stream configuration must be transmitted first. To this end, the transmitting side (video processing apparatus 10) of a stream sets the initial object descriptor in a bInitialObjectDescriptor field of the MPEG-4SL Stream Format Descriptor and sends it upon initialization. In the first embodiment, the contents of data to be set in the initial object descriptor are arbitrary as long as they follow the specification of ISO/IEC 14496-1:1999. However, the following data are normally set:
[0085] *profile_and_level_indication
[0086] ES_Descriptor
[0087] DecoderConfigDescriptor
[0088] DecoderSpecificInfo (data contents vary depending on the encoding format of a stream; in case of MPEG-4 Visual, Visual Object Sequence data is set)
[0089] *SLConfigDescriptor (used to interpret the layout of an SL packet header)
[0090] Normally, the initial object descriptor is sent only once in the initialization process upon selection of an MPEG-4SL Payload Format, but it is not sent twice or more. When the initial object descriptor is resent after completion of initialization, the receiving side executes an initialization process again according to the contents of the resent initial object descriptor. MPEG-4 Systems streams such as OD, BIFS, and the like defined by ISO/IEC 14496-1:1999 above are normally transmitted by a method which is free from any data losses due to transmission errors. In the first embodiment, such data stream can be reliably transferred by setting it in the MPEG-4SL Stream Format Descriptor.
[0091] In the first embodiment, the bInitialObjectDescriptor field is an option, and may be omitted. In this case, the video processing apparatus 10 can initialize a stream using default properties without any initial object descriptor.
[0092] [Second Embodiment]
[0093] The first embodiment has exemplified transmission/reception of the MPEG-4SL stream using High-speed mode of USB2.0 with reference to FIGS. 6 to 9. However, the present invention is not limited to such specific embodiment. The first embodiment may be modified to an embodiment that transmits/receives the MPEG-4SL stream using full-speed mode of USB2.0. In this case, the aforementioned stream header is transmitted, e.g., every USB frame (1 ms). If a transfer mode faster than High-speed mode is specified in the future, the first embodiment may be modified to an embodiment that transmits/receives the MPEG-4SL stream using such transfer mode.
[0094] [Third Embodiment]
[0095] The present invention is not limited to embodiments that uses existing USB2.0. The present invention can be applied to, e.g., an embodiment that uses a communication system complying with the extended standard of USB2.0, an embodiment that uses a communication system complying with the standard which specifies functions equivalent to USB2.0, and the like.
[0096] [Fourth Embodiment]
[0097] The present invention is not limited to an embodiment that uses elementary streams complying with existing ISO/IEC 14496-1:1999. The present invention can be applied to, e.g., an embodiment that uses elementary streams complying with an extended standard of ISO/IEC 14496-1:1999, an embodiment that uses elementary streams complying with a standard which specifies functions equivalent to ISO/IEC 14496-1:1999, and the like.
[0098] [Other Embodiments]
[0099] The present invention can be implemented not only in the above embodiments but also in other embodiments. For example, the present invention can be implemented by a program that can be executed by a computer and, in such case, the program itself forms the present invention. As a storage medium that supplies the program, a ROM, floppy® disk, hard disk, optical disk, magnetooptical disk, CD-ROM, CD-R, magnetic tape, nonvolatile memory card, and the like may be used.
[0100] According to the present invention, one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 can be transmitted to another apparatus (host or the like) using Isochronous or Bulk Transfer of USB2.0.
[0101] Also, according to the present invention, one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 can be received from another apparatus (host or the like) using Isochronous or Bulk Transfer of USB2.0.
[0102] As many apparently widely different embodiments of the present invention can be made without departing from the spirit and scope thereof, it is to be understood that the invention is not limited to the specific embodiments thereof except as defined in the appended claims.
Claims
- 1. A video processing apparatus comprising:
conversion means for converting one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 into one data stream; and transmission means for transmitting the data stream to another apparatus using an isochronous transfer complying with USB2.0.
- 2. A communication method of a video processing apparatus, comprising:
converting one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 into one data stream; and transmitting the data stream to another apparatus using an isochronous transfer complying with USB2.0.
- 3. A video processing apparatus comprising:
conversion means for converting one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 into one data stream; and transmission means for transmitting the data stream to another apparatus using a bulk transfer complying with USB2.0.
- 4. A communication method of a video processing apparatus, comprising:
converting one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999 into one data stream; and transmitting the data stream to another apparatus using a bulk transfer complying with USB2.0.
- 5. A video processing apparatus comprising:
reception means for receiving one data stream transmitted from another apparatus using an isochronous transfer complying with USB2.0; and restoration means for restoring the data stream to one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999.
- 6. A communication method of a video processing apparatus, comprising:
receiving one data stream transmitted from another apparatus using an isochronous transfer complying with USB2.0; and restoring the data stream to one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999.
- 7. A video processing apparatus comprising:
reception means for receiving one data stream transmitted from another apparatus using a bulk transfer complying with USB2.0; and restoration means for restoring the data stream to one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999.
- 8. A communication method of a video processing apparatus, comprising:
receiving one data stream transmitted from another apparatus using a bulk transfer complying with USB2.0; and restoring the data stream to one or a plurality of elementary streams complying with ISO/IEC 14496-1:1999.
Priority Claims (1)
Number |
Date |
Country |
Kind |
2003-144189 |
May 2003 |
JP |
|
Continuations (1)
|
Number |
Date |
Country |
Parent |
PCT/JP04/07128 |
May 2004 |
US |
Child |
10891106 |
Jul 2004 |
US |