1. Field
One embodiment of the invention relates to an information storage medium (or data structure), an information recording method and information playback method, and an information recording apparatus and information playback apparatus, which are suited to record/play back a digital stream signal used in digital TV broadcast or the like.
2. Description of the Related Art
In recent years, TV broadcasting has entered the era of digital broadcasts of Hi-Vision programs (programs of high-definition AV information) as principal broadcast content. Current DBS digital broadcasting (and forthcoming terrestrial digital broadcasting) adopts an MPEG2 transport stream (to be abbreviated as MPEG-TS hereinafter as needed). In the field of digital broadcasting of moving pictures, the MPEG-TS will be used as a standard format in the future. In the expectation of such digital TV broadcasting, market needs for a streamer that can directly record digital TV broadcast contents are increasing.
As an example of a streamer that utilizes an optical disc such as a DVD-RAM or the like, “A recording and playback apparatus” is known (for example, Jpn. Pat. Appln. KOKAI Publication No. 2002-84479).
For example, when a long-term music program in which a short news program is inserted is stream-recorded in a news cut mode (video recording is paused at the news part), a stream object of a recorded program is divided into two stream objects at the news cut part. In such case, the two stream objects are normally continuously recorded at physically neighboring locations. However, these stream objects may be discontinuously recorded at physically separate locations. In this example, the playback times of the contents of the two stream objects are logically continuous independently of whether or not the two stream objects are physically continuous. The same applies to a case wherein one movie in which commercials are inserted is stream-recorded in a commercial cut mode. Even when physical discontinuity occurs among a plurality of stream objects at commercial cut parts, the playback times as a series of movie contents are logically continuous as a whole.
On the other hand, when program A of channel X is stream-recorded, and program B of channel Y is then stream-recorded, the playback times of the contents of stream objects of programs A and B are not continuous (logically discontinuous) even when their recording locations are physically continuous.
In this manner, when stream video recording is made by a plurality of stream objects, whether or not neighboring stream objects have logical continuity (continuity of playback times in a single program) in place of physical continuity influences the decoding processing (setting processing of system time clock STC and the like) upon playback. More specifically, when the STC setting is inappropriately made (STC reset, etc.) without the recognizing the continuity of playback times, a wait time for still picture display may be generated for a relatively long time when playback shifts from the end of the former stream object of a single program to the head of the latter stream object.
Note that information to be recorded (digital broadcast, etc.) often includes information such as PSI (Program Specific Information), SI (Service Information), and the like. A case wherein PSI and SI are unknown is not considered so far. Video information to be recorded may have various resolutions. However, a case wherein one (horizontal resolution) of the horizontal and vertical resolutions is unknown in designation of the resolution is not considered.
A general architecture that implements the various feature of the invention will now be described with reference to the drawings. The drawings and the associated descriptions are provided to illustrate embodiments of the invention and not to limit the scope of the invention.
Various embodiments according to the invention will be described hereinafter with reference to the accompanying drawings. In general, one embodiment of the invention, may use an information storage medium (e.g., 100 in
According to one embodiment of the invention, it is possible to provide a management structure of a digital stream signal, considering a case wherein PSI (Program Specific Information) or SI (Service Information) is unknown.
Data area 112 includes areas 120 and 122 which record general computer data, and area 121 that records AV data. AV data recording area 121 is configured to include AV data management information area 130 that stores a video manager (VMG) file used to manage AV data, ROM_Video object group recording area 131 that records a file of object data complying with the DVD-Video (ROM Video) standard, VR object group recording area 132 that records a file (VRO file) of object data (ESOBS: Extended Video Object Set) complying with the video recording (VR) standard, and recording area 133 that records a stream object data (ESOBS: Extended Video Object Set) file (SRO file) which records objects compatible to digital broadcasting. Note that the recording standard for the SRO file will be described as stream recording (SR) as needed.
Note that different file directories are prepared in correspondence with formats (e.g., video title set (VIDEO-TS) for DVD-Video (ROM Video) and DVD-RTAV for recordable/reproducible DVD (DVD-RTR), and a new DVD standard file compatible to digital broadcasting to be described in this embodiment is recorded in, e.g., a DVD_HDVR directory (to be described later with reference to
That is, the DVD_HDVR directory (to be described later with reference to
The structure of each ESOBU includes one or more ESOBs 141. Each ESOB corresponds to, e.g., one program. The ESOB includes one or more ESOBUs (Extended Stream object units), each of which corresponds to object data for a given time interval (which changes depending on the value of an ESOBU_PB_TM_RNG) or one or more GOP data. When the transfer rate is low, one GOP data cannot often be sent within 1 second (VR can freely set the data unit configuration since it adopts internal encoding, but digital broadcast cannot specify the next incoming data since encoding is done by a broadcast station). On the other hand, the transfer rate may be high, and I-picture data may be sent frequently. In such case, the ESOBU is delimited frequently, and ESOBU management information increases accordingly, thus ballooning the whole management information. For this reason, it is appropriate to delimit ESOBUs by a given time interval (minimum limitation is to delimit ESOBUs by picture data except for the last ESOBU of the ESOB: the delimitation unit corresponds to an I-picture [e.g., for each sec]) or by one or more GOP data.
When management information is formed on the PATS base in case of a non-cognizable stream, the AT_SOBUs (Arrival Time based SOBUs) are delimited at time intervals indicated by AT_SOBU_TM data. There are two types of AT_SOBU_TM data: designated in seconds (see
In this embodiment, one ESOBU includes one or more packet groups, each of which can correspond to 16 (or 32) Logical Blocks (1 LB=2048 bytes; 16 LBs=32640 bytes). Each packet group includes a Packet_Group_Header and (170) TS packets. The Arrival Time of each TS packet can be expressed by a PAT (Packet Arrival Time: 4 bytes) which is allocated before each TS packet.
The arrival times of TS packets are to be linearly counted up until a video recording end time to have a video recording start time as 0 (or a predetermined value). Note that the STC and the PATS may not always indicate the same value (due to different default values or the like). However, the count interval of a PATS counter has to be synchronized with that of an STC counter which corresponds to the interval between neighboring PCR fetch timings in a playback synchronized state. Note that the PCR is included in an adaptation field (not shown) in an MPEG-TS. A packet group can include a maximum of two ESOBs. That is, packet groups need not be aligned for respective ESOBs.
The management information will be described below with reference to
As shown in
An MPEG-TS scheme as a basic format common to broadcast schemes which broadcast (distribute) compressed moving picture data such as digital TV broadcast, broadcast that uses a wired network such as the Internet or the like, and so on is divided into a packet management data field and payload.
The payload includes data to be played back in a scrambled state. According to ARIB, a PAT (Program Association Table), a PMT (Program Map Table), and SI (Service Information) are not scrambled. Also, various kinds of management information are generated using the PMT and the SI (SDT: Service Description Table, EIT: Event Information Table, BAT: Bouquet Association Table).
The contents to be played back include MPEG video data, Dolby AC3(R) audio data, MPEG audio data, data broadcast data, and the like. Also, the contents include information used upon playback (e.g., a PAT, a PMT, SI, and the like) although they are not directly related to the contents to be played back. The PAT includes the PID (Packet Identification) of the PMT for each program, and the PMT records the PIDs of video data and audio data.
A normal playback sequence of an STB is as follows. That is, when the user determines a program based on EPG information, the PAT is loaded at the start time of the target program. The PID of a PMT, which belongs to the used program, is determined based on that data, and the target PMT is read out in accordance with that PID. Then, the PIDs of video and audio packets to be played back, which are included in the PMT, are determined. Video and audio attributes are read out based on the PMT and SI and are set in respective decoders. The video and audio data are extracted and played back in accordance with their PIDs. Note that the PAT, PMT, SI, and the like are transmitted at intervals of several 100 ms since they are also used during playback.
Different digital broadcast schemes are adopted in respective countries and broadcast stations: for example, DVB (Digital Video Broadcasting) in Europe; ATSC (Advanced Television Systems Committee) in U.S.A.; and ARIB (Association of Radio Industries and Businesses) in Japan.
In DVB, the video format is MPEG2, the resolutions are 1152*1440i, 1080*1920(i, p), 1035*1920, 720*1280, (576, 480)*(720, 544, 480, 352), and (288, 240)*352, the frame frequencies are 30 Hz and 25 Hz, the audio format includes MPEG-1 audio and MPEG-2 Audio, and the sampling frequencies are 32 kHz, 44.1 kHz, and 48 kHz.
In ATSC, the video format is MPEG2, the resolutions are 1080*1920(i, p), 720*1280p, 480*704(i, p), and 480*640(i, p), the frame frequencies are 23.976 Hz, 24 Hz, 29.97 Hz, 30 Hz, 59.94 Hz, and 60 Hz, the audio format includes MPEG1 Audio Layer 1 & 2 (DirecTV) and AC3 Layer 1 & 2 (Primstar), and the sampling frequencies are 48 kHz, 44.1 kHz, and 32 kHz.
In ARIB, the video format is MPEG2, the resolutions are 1080i, 720p, 480i, and 480p, the frame rates are 29.97 Hz and 59.94 Hz, the audio format includes AAC (MPEG-2 Advanced Audio Coding), and the sampling frequencies are 48 kHz, 44.1 kHz, 32 kHz, 24 kHz, 22.05 kHz, and 16 kHz.
In this manner, digital broadcast schemes are different in different countries, and may also be different for respective broadcast stations. For this reason, a recorder has to record objects as one or a plurality of files in accordance with each individual scheme to be used.
For this reason, in the embodiment of the invention, files to be further added to the existing VR file configuration are configured to allow the presence of a plurality of “x”s in the file names HR_SFIx.IFO and HR_SFIx.bup, as shown in
For example, when “x”=00, such files can be used when the broadcast scheme is unknown or the recorder does not support that broadcast scheme. In this case, a stream whose broadcast scheme is unknown or a stream which is not supported by the recorder can be saved as a stream (SOB_STRB) of TYPE B. Hence, since ESTR_FI as management information for digital broadcast is changed for each broadcast station (or each broadcast scheme), a plurality of pieces of ESTR_FI exist.
Furthermore, each EX_PL_SPRT includes representative picture information (PL_REP_PICTI) of the playlist of interest, which sets representative picture information (a marker of a picture to be displayed as a thumbnail on a title menu or the like) for each playlist. This PL_REP_PICTI is set to include target cell number CN, picture pointer PIC_PT (the start PTM, PATS, E_EVOB_ENT number, and the like of the representative picture of interest) on the target EVOB, ESI number V_ESN of a video stream (default stream) to be played back, the playback time or playback end time of the representative picture of interest, date information PIC_CL_TM indicating the date of creation (updating) of the marker of the representative picture of interest, and the like.
Note that EX_M_AVFITI (VTMAP_LAST_MOD_TM) in
The EVOB_TMAP_GI allows to recognize information of the EVOB without reading out the TMAP file, since the TMAPs are stored in independent files (see
As attributes of the EVOB, as shown in
In this configuration, there are a plurality of pieces of ESTR_FI because there is one ESTR_FI as management information for digital broadcast for each broadcast station (or each broadcast scheme). Hence, ESTR_FI_SRPT information exists to designate an ESTR_FI_— ESTR_FI to be used. As shown in
Especially, if TOTAL_STMAP_SZ sets a limitation of a maximum size of 2 MB or less in the standard, the file size has to be confirmed so as not to exceed that size. More specifically, the STMAP has to be configured so that it can be mapped on a work RAM of an MPU within a maximum size of 2 MB or less. For this purpose, TOTAL_STMAP_SZ allows confirmation of the TMAP size.
Note that the update date information (STMAP_LAST_MOD_TM in
The number of pieces of ESTR_FI is, e.g., four or less, and the number of pieces of ESOBI is 999 or less. A part “nn” of ESTR_FI file name: HR_SFInn.IFO is reflected on that of File Name: HR_STMnn.IFO of the STMAP, thus determining the file name of the STMAP.
The ESTR_FI_GI includes the file name/file number (SFI_ID) of an object managed by the ESTR_FI of interest, the number (ESOBI_SRP_Ns) of ESOBI_SRPs in the ESTR_FI of interest, the Version number (VERN) of the file of interest, a packet type (PKT_TY: e.g., 1=MPEG-TS), a packet group size (PKT_GP_SZ: e.g., fixed to 16 logical blocks), the number of TS packets in a packet group (PKT_Ns: e.g., 0xAA: fixed to 170 TS packets), the STMAP update time (STMAP_LAST_MOD_TM), an STMAP size (STMAP_SZ), and packet arrival time status (PATS_SS), and the like.
Note that a plurality of pieces of STR_FI can be assured in correspondence with the number of pieces of HR_SFIxx.IFO. These pieces of STR_FI can be prepared for respective broadcast stations and/or broadcast schemes (ARIB for Japan, ATSC for U.S.A., DVB for Europe, and the like). Also, a plurality of time maps (see ETMAPI/STMAPI in
In the case of a cognizable stream (STRA of TYPE A), a TMAP can be generated on the PTM base. However, in case of a non-cognizable stream (when data cannot be descrambled or when data of a scheme different from the assumed broadcast station is input: STRB of TYPE B), the TMAP may be generated in a reception time (PATS) base in place of PTM base. However, since the PATS is not a playback time, special playback or the like which is accurate in terms of time cannot be made, but rough special playback (fastforwarding playback, rewinding playback, or the like which allows the user to roughly confirm the recorded contents) can be made.
In
As one of features of digital broadcast, for example, multi-view broadcast is known. In multi-view broadcast, a plurality of video data are broadcasted at the same time (by time sharing), and the user can play back one of these video data of his or her choice. In this manner, the user can select one of a plurality of contents according to his or her taste or the like. For example, when a recorder receives, as one TS, streams X, Y, and Z as multi-view broadcast and stream U as rain attenuation broadcast, the control should be made to allow the user to select and play back a used stream upon playback, and to freely switch among streams using a key. To cope with this, grouping information (GPI) is added to achieve this object.
Note that in case of TYPE_B, a stream is often recorded without being cognized. In this case, since it is determined that the PSI and SI values are unknown (or unreliable), SERVICE_ID, PMT_PID, NETWORK_ID, TS_ID, FORMAT_ID, SERVICE_TYPE, PCR_PID, and the like cannot be described. In such case, a flag indicating invalidity of information of PSI and PI can be set in ESOB_TY: b12. In this case, the values of SERVICE_ID, PMT_PID, NETWORK_ID, TS_ID, FORMAT_ID, SERVICE_TYPE, and PCR_PID become invalid.
In place of the whole flag (b12 of ESOB_TY), invalid values (0xff) may be set for respective values of SERVICE_ID, PMT_PID, NETWORK_ID, TS_ID, FORMAT_ID, SERVICE_TYPE, and PCR_PID, and these values may be set in case of invalidity. However, even in TYPE_B, PSI and SI values may often be valid.
The ESOB_REC_MODE included in the ESOB_GI indicates the TYPE of stream: 00h=a recording mode of Type A, and 01h=a recording mode of Type B. Type A is a stream whose stream structure is cognizable, and whose management information is managed on the PTM base. On the other hand, Type B is a stream whose structure is not cognizable and, hence, whose management information is managed on the PATS base. For this reason, Type A adopts a TMAP on the PTM base, and Type B adopts a TMAP on the PATS base. Also, the ESOB_TY includes a TE flag indicating temporary erase or not, and a Flag indicating the validity/invalidity of data generated based on the PSI and SI.
Note that ESOB_ES_Ns, ESOB_V_ES_Ns, ESOB_A_ES_Ns, and ES_TMAP_Ns have the following relations:
ESOB_ES_Ns≧ESOB_V_ES_Ns+ESOB_A_ES_Ns
ESOB_V_ES_Ns+ESOB_A_ES_Ns ES_TMAP_Ns
Note that “unspecified” is set to describe it if the contents of an object cannot be examined when it is not determined based only on interpretation of PSI and SI. In ARIB, especially, since the vertical resolution alone is specified but the horizontal resolution is unspecified by interpreting only the descriptor, only the vertical resolution can be described.
The ESOB_TMAP_GI includes ADR_OFS (a packet group number (or LB address) from the head of a file to the head of an ESOB), and also includes, in case of the PTM base, ESOBU_PB_TM_RNG (ESOBU playback time range: 1=2s or less, 2=3s or less, 3=is or less), ESOB_S_PKT_POS (the start position of the head of an ESOB in a packet group: 0≦ESOB_S_PKT_POS≦169), ESOB_E_PKT_POS (the end position of the head of an ESOB in a packet group: 0≦ESOB_E_PKT_POS≦169), ESOB_SZ (ESOB size), and ES_TMAP_GI_Ns (the number of ES_TMAPs that belong to the ESOB of interest. Each ES_TMAPI_GI includes ESIN (the number of an ESI of a target ES of the TMAP of interest), ADR_OFS (logical addresses from the head of an ESOB file to the head of the ES of interest), ES_S_PTM (start PTM), ES_E_PTM (end PTM), ES_ESOBU_ENT_Ns (the number of ESOBU_ENTs), LAST_ESOBU_E_PKT_POS (position of the last ESOBU in a packet group), and STMAP_N (the number of a TMAP in the STMAPT, which belongs to the ES of interest: this number may be omitted when TMAPs are recorded in turn in each STMAPT).
Note that TMAPI information can be prevented from becoming extremely large by setting ESOBU_PB_TM_RNG (the same applies to EVOBU_PB_TM_RNG in
Note that the ESOB_TMAP_GI (
For this reason, when TMAP data are extracted (checked) from the PID data of a Video ES to be played back in an ESOB, ESTMAP_GI having an ESI number corresponding to the PID data of the Video ES to be played back is inspected from those in ESOB_TMAPI, and the numbers (order) in that ESOB are retained. Then, ETMAPI_SRPs are determined in the order in the ESOB in the STMAPI_GI corresponding to the ESOB_TMAP_GI, and ETMAPI is specified based on each SRP information. Which ETMAP_SRP belongs to the STMAP_GI is determined by adding number of ETMAPI_SRPs as that in the STMAPI_GI from the first one.
ESOB_SZ is the number of packet groups from a packet group to which the head of an AT_SOB belongs to that to which the end of the AT_SOB belongs. ESOBU1_SZ is the number of packet groups from the first packet group of an ESOBU to the last packet group of the ESOBU. Each ESOBU_S_PKT_POS represents the difference between the divisions of the ESOBU and packet group using the number of packets.
Time information is expressed by PATs to have ESOB_S_PATS as the ESOB start time, and ESOB_E_PATS as the end time, since it is on the PATS base. However, the ESOB_E_PATS is the PATS (arrival start time) of the last packet of the last Packet group, and is not the last reception end time. The edit processing is done for respective ESOBUs, and the playback start time (CELL_S_PATS of CELLI) is designated. Since the edit processing is done for respective ESOBUs, each ESOB_S_PATS always matches the head of the ESOBU.
In case of the PATS base, the ESOB_TMAP_GI includes ESOB_ADR_OFS (a packet group number (or LB address) from the head of a file to the head of an ESOB), AT_SOBU_TM (arrival time interval of ESOBUs: 0=1s, 1=2s), ESOB_S_PKT_POS (the start position of the head of an ESOB in a packet group: 0≦ESOB_S_PKT_POS≦169), ESOB_E_PKT_POS (the end position of the head of an ESOB in a packet group: 0≦ESOB_E_PKT_POS≦169), AT_SOBU_ENT_Ns (the number of AT_SOBU_ENTs which belong to the ESOB of interest), and ESOB_SZ (ESOB size). The edit processing is done for respective AT_SOBUs, and the adjustment processing is done based on PATS start time and PATS end time (CELLI).
A normal DVD recorder has time map information (TMAPI) as video object (VOB) management information. This information is used to divide an EVOB or ESOB into ESOBUs or ESOBUs, and to allow playback, special playback, and the like using these playback units. However, one piece of information is used per a maximum of 0.5 s. For this reason, if the disc size increases or a compression method with high compression efficiency is adopted in the future, the number of pieces of TMAPI increases, and complicated management is used upon execution of edit processing or the like. If the time map information is included in the management information (.IFO), non-related data in other fields has to be moved or rewritten every time TMAPI is changed, resulting in poor efficiency. In order to improve such situation, TMAPI is recorded in an independent field (see
As shown in
On the other hand, as shown in
In case of time search, an ESOBU corresponding to a target time is calculated by accumulating ESOBU_PB_TM data, and the playback start PTM is converted into the number of fields from the head of that ESOBU. Note that the target address is given by:
A=ESOB
—
ADR
—
OFS+ES
—
ADR
—
OFS of target ES+ΣN=1k−1ESOBU—SZ(N)×16+1
where K is the target ESOBU, and A is the target address. Furthermore, the first packet becomes a packet corresponding to the value of ESOBU_S_PKT_POS, and this address is accessed.
On the other hand, there are two types of AT_SOBU_ENTs (
In case of the packet unit, each AT_SOBU_ENT is configured by AT_ESOBU_SZ and AT_SOBU_S_PKT_POS. The AT_ESOBU_S_PKT_POS indicates the first packet position of the AT_SOBU in the Packet_Group by the number of packets.
In case of the packet group unit, each AT_SOBU_ENT is be configured by AT_ESOBU_SZ. In this case, AT_SOB_S_PKT_POS and AT_SOB_E_PKT_POS are fixed to zero.
The ESOB_TMAP_GI describes ADR_OFS, AT_SOB_SZ, AT_SOB_PKT_POS as values associated with those of the whole AT_SOB.
AT
—
ADR
—
E
—
OFS=AT
—
SOB
—
SZ−(AT—ADR—S—OFS+ΣN=1k−1AT_SOBU—SZ(N)+1)
Note that inequalities AT_SOB_SZ>AT_ADR_S_OFS, AT_SOB_SZ>AT_SOBU_SZ, and the like hold.
The EX_PGI also includes RSM_MRKI (included in PL_SRP) to provide a resume marker (a marker indicating the playback location upon interrupting playback) for each program as in the playlist. As information used to restart playback, an EX_CELL number, playback start PTM and date information indicating the date of creation of that marker, an ESI number of a video stream to be played back, an ESI number of an audio stream to be played back, and main/sub information in case of Dual-Mono are set. This information is used as title resume.
Furthermore, the EX_PGI includes PG_REP_PICTI, which is set with representative picture information (a marker of a picture to be displayed as a thumbnail on a title menu or the like) for each PG. This PG_REP_PICTI is set with a cell number, start PTM, date information of the date of creation of that marker, and an ESI number of a video stream to be played back.
In order to utilize the manufacturer's information (MNFI or MNI stored in the EX_MNFIT in
Furthermore, when PG update date information (program update date information at the end in the PGI in
When time is designated by a playback time=real time upon playback, the same access method as in the existing DVD-Video recording (DVD-VR) is allowed although stream recording that records incoming bitstreams intact is made. Since the user can designate a recording position using a playback time, a user's wish can be perfectly reflected. However, this method can be adopted only when the stream contents can be sufficiently cognizable. If the contents of the recorded stream are not sufficiently cognizable, a time has to be designated using a transfer time of a stream packet (MPEG-TS packet in case of digital broadcast recording).
If the recording position is designated using a playback time while the contents of the recorded stream are not sufficiently cognizable, playback cannot always be started from the head of I-picture data. If a frame at the playback start position is not that of I-picture, decoding starts from the immediately preceding I-picture, and display of a playback video picture starts when the target frame is decoded. In this way, a picture can be presented to the user as if playback were started from the designated frame.
As for an ID to be referred to in the playback processing or the like, a method of setting the PID of a representative one of streams to be played back, a method of setting the ID of a component group in case of multi-view TV or the like, and a method of designating an ESI number (example:
By assigning a unique ID number (PG_INDEX: EX_PGI#p, and the like in
STR_A_CELL_EPI_TY_A (ESOB TYPE A) includes EPI_TY (EPI type information), a PTM to which an EP is assigned, a corresponding PID and GP number (PID/GP_N), an ESI number of an ES to which that ES is assigned, an ESI number of an audio ES, and main/sub information in case of Dual-Mono. STR_A_CELL_EPI_TY_B further includes PRM_TXTI (text information) and REP_PIC_PTM (thumbnail pointer) (no PID and GI_N are included in TY_B).
STR_B_CELL_EPI_TY_A (ESOB TYPE B) includes EPI_TY (EPI type information) and a PATS to which an EP is assigned. STR_B_CELL_EPI_TY_B also includes a PID to which that ES is assigned, PRM_TXTI (text information), and REP_PIC_PTM (thumbnail pointer).
For example, information CNT_SEGN indicating the number of CNT_SEGs from the head of the ESOB can be set as follows. That is, in case of an ESOB of type A, the value of CNT_SEGN is valid, but CNT_SEGN is set to zero for objects other than the ESOB. As the values of valid CNT_SEGN, for example, when CNT_SEGN=4, the number of CNT_SEGs in the ESOB of interest is zero; when CNT_SEGN=5, the number of CNT_SEGs in the ESOB of interest is 1; when CNT_SEGN=6, the number of CNT_SEGs in the ESOB of interest is 2; and when CNT_SEGN=7, the number of CNT_SEGs in the ESOB of interest is 3.
The example of the ESOB has been described. In case of an EVOB, the PTM can also have the same data structure. The number of CNT_SEGs (CNT_SEGN) from the head of an ESOB is substituted in time information PTM to confirm in advance if STC Wrap-around has occurred, and can be used in TMAP calculations and the like.
Note that the lower 4 bytes unique to each PATS 162 are included in that PATS, but the upper 2 bytes of the first PATS are included in First_PATS_EXT described in the packet group general information (PKT_GRP_GI) in Packet_Group_Header 161. With this configuration, the data size can be reduced compared to a case wherein 6-byte packet arrival times are independently described in respective PATS.
Furthermore, the PKT_GRP_SS includes bit STUF indicating if stuffing is done (if this STUF bit is set, it indicates that the Valid_PKT_Ns assumes a value other than 0xAA), and PATS_SS. Note that the PATS_SS includes a value indicating the accuracy of the PATS (when PATS_SS=00, both PATS and FIRST_PATS_EXT are valid and accuracy=6 bytes is set; when PATS_SS=01, only PATS is valid and accuracy=4 bytes is set; and when PATS_SS=10, both PATS and FIRST_PATS_EXT are invalid and no accuracy is set).
Note that extended bytes FIRST_PATS_EXT of the PATS of the first packet include the upper 2 bytes of the arrival time of the packet at the head of the packet group, and the remaining 4 bytes are assigned before each packet. In this manner, the playback process with an accurate time is allowed.
Alternatively, CCI (or CPI) may store digital copy control (00=copy never, 01=copy once, 11=copy free), analog copy control (00=no APS, 01=APS type 1, 10=APS type 2, 11=APS type 3), EPN (0=contents protection, 1=no contents protection), and ICT (0=analog video output resolution constraint, 1=no constraint). Note that APS is an abbreviation for “Analog Protection System”, and the embodiment of the invention assumes Macrovision(R).
Also, the copy control information (CCI or CPI) is set on the management information side (ESOBI_GI:
That is, a recorder may have unique functions which are not described in the DVD format depending on the manufacturers and models, and may be differentiated from other manufacturers. In this case, manufacturer unique information has to be embedded in object data in some cases. Hence, in the embodiment of the invention, MNI (Manufacturer's Information) is assured in a Packet_Group_Header as its field.
DCI_CC_SS (a flag indicating the presence of DCI and CCI) includes DCI_SS and CCI_SS. The DCI_SS sets “0=no valid DCI exists, 1=only valid aspect information exists, 3=all pieces of DCI exist”, and CCI_SS sets “0=no valid CCI exists, 1=only source information exists, 2=only APS exists, 3=only source information and APS exist, 4=only CGMS exists, 5=only CGMS and source information exist, 6=only CGMS and APS exist, 7=all exist”.
The DCI (Display Control Information) includes Aspect_Ratio (0=4:3, 1=16:9, 8=14:9 letter box (center), 4=14:9 letter box (top), 13=16:9 letter box (center), 2=16:9 letter box (top), 11=>16:9 letter box (center), 7=14:9 full), Subtitling Mode (0=non-open subtitle, 1=subtitles in active image area, 2=subtitles out of active image area), and film/camera (0=camera mode: the source is a camera, 1=film mode: the source is a film).
The CCI (Copy Control Information) or CPI (Contents Protection Information) includes CGMS (0=copy never; 1=copy free), APS (0=no APS, 1=append APS type 1, 2=append APS type 2, 3=append APS type 3), -Source (0=analog pre-encoded media), and -EPN (1=contents protection (protection upon home network output), 0=no contents protection).
The RECI includes International Standard Recording Codes, the contents of which are the same as those in DVD-VIDEO.
That is, as shown in
Non-seamless angle information (NSML_AGLI) in the EX_PCI includes jump addresses (NSML_AGL_C#1 DSTA to NSML_AGL_C#9 DSTA) of a maximum of nine non-seamless angle cells.
On the other hand, as shown in
Seamless playback information (SML_PBI) in the EX_DSI includes a category (EVOBU_SML_CAT) of a seamless EVOBU, the end address (ILVU_EA) of an interleaved unit, the start address (NXT_ILVU_SA) of the next interleaved unit, the size (NXT_ILVU_SZ) of the next interleaved unit, a video start time (EVOB_V_S_PTM) in an EVOB, a video end time (EVOB_V_E_PTM in the EVOB, an audio stop time (EVOB_A_STP_PTM) in the EVOB, and an audio gap length (EVOB_A_GAP_LEN) in the EVOB.
Seamless angle information (SML_AGLI) in the EX_DSI includes jump addresses (SML_AGL_C#1 DSTA to SML_AGL_C#9 DSTA) of a maximum of nine seamless angle cells.
EVOBU search information (EVOBU_SRI) in the EX_DSI describes start addresses before and after the playback start time of an EVOBU that includes the EX_DSI in predetermined time units (e.g., in integer multiple units of 0.5 sec). More specifically, the start address before the playback start time of the EVOBU that includes the EX_DSI is described using FWDIxx, and that after the playback start time of the EVOBU that includes the EX_DSI is described using BWDIxx.
Sync information (SYNCI) in the EX_DSI includes address information of audio data and sub-picture data synchronized with video data of the EVOBU that includes the EX_DSI. More specifically, the SINCI includes the addresses (A_SYNCA0 to A_SYNCA7) of a maximum of eight target audio packs, and the addresses (SP_SYNCA0 to SP_SYNCA31) of a maximum of 32 target sub-picture packs.
Note that “interoperable VTS (INT-VTS)” is provided as a bridge used to play back EVOB data of HD_DVD-VR by an HD_DVD-VIDEO player. INT-VTS has a configuration as shown in
In
In
In this embodiment, a concept called “ESOBU_Cluster” has been introduced, as shown in
Note that the REF-PIC means a Picture which corresponds to I-PIC in case of the conventional MPEG2 compression method, and can form one frame (field) by only this picture. Since the embodiment of the invention supports a plurality of image compression methods (MPEG4-AVC, VC-1, and the like), since a picture corresponding to I-PIC has to be defined even in a stream which is encoded by a method other than MPEG2, “REF-PIC” is used as a more generic term in place of I-PIC.
The encoder unit includes an analog-to-digital unit, video encode unit, audio encode unit, SP encode unit, formatter unit, and buffer memory unit. The decoder unit includes a demultiplexer, video decode unit, SP decode unit, audio decode unit, TS packet transfer unit, V-PRO unit, and audio digital-to-analog unit. Furthermore, an antenna for receiving digital broadcast is connected to the STB unit. Note that the STC unit is configured to count on a 27-MHz base.
The flow of signals upon recording is, for example, as follows. That is, TS packet data received by the STB unit (or terrestrial digital tuner) are packed into packet groups by the formatter unit and the packet groups are saved in the temporary storage unit. When the saved packet groups reach a predetermined size, they are recorded on the disc. Internal counter 90a for PATS is connected to this formatter unit 90. The arrival time of each TS packet is counted by PATS counter 90a, and that count value is appended to the head of each TS packet when the packet is buffered. This counter 90a can perform fine adjustment of count intervals by PCR (or SCR) values, but never loads the PCR (or SCR) values unlike STC 102.
As the operations to be executed at that time, upon reception of TS packets, a group is formed every 170 packets, and a Packet_Group_Header is generated.
In this case, only the upper 2 bytes (First_Pats_Ext) of the PATS of the first packet of the packet group are stored in the header, and only the lower 4 bytes of each of other PATS are saved together with the TS packet (before the TS packet: in the PATS). An analog signal input from the terrestrial tuner or line input is converted into a digital signal by the analog-to-digital unit. That digital signal is input to respective encoder units. That is, a video signal is input to the video encode unit, an audio signal is input to the audio encode unit, and text data of, e.g., teletext broadcasting is input to an SP encode unit. The video signal is compressed by MPEG, the audio signal is compressed by AC3 or MPEG audio, and the text data is compressed by runlength coding.
Each encoder unit (for VR) packs compressed data to form 2084-byte packets and inputs them to the formatter unit. The formatter unit packs and multiplexes the packets as a program stream, and sends it to the D-PRO unit.
The D-PRO unit forms ECC blocks for every 16 logical blocks, appends error correction data to them, and records the ECC packets on the disc via the disc drive unit. When the disc drive unit is busy due to seek, track jump, and the like, data are stored in an HDD buffer unit, and wait until the DVD-RAM disc drive unit is ready. Furthermore, the formatter unit generates each segmentation information during video recording, and periodically sends it to the MPU unit (GOP head interrupt or the like). The segmentation information includes the number of packs of an EVOBU (ESOBU), the end address of I-picture data from the head of the EVOBU (ESOBU), the playback time of the EVOBU (ESOBU), and the like.
In the flow of signals upon playback, data are read out from the disc by the disc drive unit, undergo error correction by the D-PRO unit, and are then input to the decode unit. The MPU unit determines the type of input data (i.e., VR or SR data) (based on Cell TYPE), and sets that type in the decoder unit before playback. In case of SR data, the MPU unit determines the PID to be played back based on the ESI number to be played back, determines the PIDs of items (video, audio, and the like) to be played back based on that PTM, and sets them in the decoder unit. In the decoder unit, the demultiplexer sends TS packets to the respective decode units based on the PIDs. Furthermore, the TS packets are sent to the TS packet transfer unit, and are transmitted to the STB unit (1394 interface unit) in the form of TS packets in the order they arrived. The respective decode units execute decoding, and decoded data are converted into an analog signal by the digital-to-analog unit, thus displaying data on the TV. In case of VR data, the demultiplexer sends data to the respective decode units according to the fixed IDs. The respective decode units execute decoding, and decoded data are converted into an analog signal by the digital-to-analog unit, thus displaying data on the TV.
Upon playback, pack data read out from the disc are interpreted by the demultiplexer. Packs that store TS packets are sent to the TS packet transfer unit, and are then sent to the decoders, thus playing back data. When pack data are transferred to the STB unit (or are transmitted to an external apparatus such as a digital TV or the like), the TS packet transfer unit transfers only TS packets at the same time intervals as that when they arrived (see
The features of medium 100 (100a) used in the apparatus of
The apparatus shown in
That is, if the key input is, for example, a key operation made to set timer program recording, program setting processing starts (block ST20). If the key input is a key operation made to start video recording, video recording processing starts (block ST22). If the key input is a key operation made to start playback, playback processing starts (block ST24). If the key input is a key input made to output digital data to the STB, digital output processing starts (block ST26). If the key input is a key operation of edit processing, the edit processing starts (block ST28).
The processes in blocks ST20 to ST28 are parallelly executed as needed for respective tasks. For example, the processing for outputting digital data to the STB (ST26) is executed in parallel during the playback processing (ST24). Or new program setting processing (ST20) can be executed in parallel during the video recording processing (ST22) which is not timer program recording. Or by utilizing the feature of disc recording that allows high-speed access, the playback processing (ST24) and digital output processing (ST26) can be parallelly executed during the video recording processing (ST22). Also, the disc edit processing (block ST28) can be executed during video recording on the HDD.
When one of the program information EX_PGI, cell information EX_CI, or EVOB/ESOB has been changed, this program update date may be set. When EVOBI and/or ESOBI have/has been changed, the edit times/time (EDIT_TIME) of the EVOBI and/or ESOBI can be set in ESOB_EDIT_TIME (not shown) or the like. Alternatively, this program update date may be set.
In this connection, in the process in block ST284, the manufacturer ID of the apparatus that has made the operation in one of blocks ST282A to ST282D may be set in an editor ID (LAST_MNF_ID). Every time one of the PGI, CI, and SOB (or VOB) has been changed, this editor ID can be set (or updated) to the ID information of the apparatus used at that time.
d1) A program to be recorded is determined using an EPG (Electronic Program Guide) in the program setting processing, reception of that program starts, and the determined program is recorded.
d2) Upon reception of a recording command from key input unit 103, MPU unit 80 loads management data from disc 100 (or HDD unit 100a) via disc drive unit 51 and determines a write area. At this time, the MPU unit checks the file system to determine whether or not recording can be proceeded. If recording cannot be proceeded, the MPU unit displays a message that advises accordingly so that the user can abort the recording processing. On the other hand, if recording can be proceeded, the MPU unit executes pre-recording processing (block ST105 in
d3) If data to be recorded is digital broadcast data (YES in block ST106), MPU unit 80 checks if the stream to be recorded is cognizable. If the stream to be recorded is cognizable (YES in block ST107), MPU unit 80 makes settings to generate management information as a type A stream on the PTM base (block ST109A); otherwise (NO in block ST107), MPU unit 80 makes settings to generate management information as a type B stream on the PATS base (block ST109B). After that, the MPU unit sets the write start address of stream data (video data) in disc drive unit 51, thus preparing for data recording (block ST112).
d4) In this preparation process, the MPU unit resets the count time of STC unit 102. Note that STC unit 102 is a system timer, and recording and/or playback are or is done with reference to this STC value.
d5) The PAT of a program to be recorded is loaded to determine the PID used to fetch the PTM of the target program. Then, the target PTM is loaded to determine the PIDs of data (video, audio) to be decoded (to be recorded). At this time, the PAT and PTM are saved in work RAM unit 80A of MPU unit 80, and they (PAT, PTM) are written in the management information (HDVR_MG). At this time, VMG file data is written in the file system (see
d6) Video recording settings are made in respective units (block ST114). At this time, a segmentation setting of data and a reception setting of TS packets are made in formatter unit 90. Also, the PID of data to be recorded is set to record only a target video stream. Furthermore, buffer 91 is set to start holding of TS packets (block ST116). Then, formatter unit 90 starts its operation as follows.
d7) ESOB_ESI is generated based on the PTM (block ST120 in
d8) Next, a TS packet stream to be recorded is fetched onto buffer 91 (block ST130). If data stored in buffer 91 reaches a predetermined size (YES in block ST140), ECC processing is done via D-PRO unit 52, thus recording the data that have undergone the ECC processing on disc 100 (and/or 100a) (block ST142).
d9) During video recording, segmentation information is saved in work RAM 80A of MPU unit 80 periodically (before buffer RAM 91 of formatter unit 90 becomes full of data) (YES in block ST144; block ST146). The segmentation information to be saved is ESOBU segmentation information, which includes the ESOBU start address, ESOBU pack length, I-Pic (reference picture) end address, the ESOBU arrival time (PATS), or the like.
d10) After the segmentation information is saved in work RAM 80A (block ST146) or if the save timing of the segmentation information is not reached (NO in block ST144), MPU unit 80 checks if ESOB data are to be delimited. If ESOB data are to be delimited (YES in block ST147), the MPU unit executes ESOB delimitation processing (
d11) It is checked if video recording is to end (if the user has pressed a video recording end key or if no recordable space of the disc (100 or 100a) remains). If video recording is to end (YES in block ST148), the remaining segmentation information is fetched from formatter unit 90, and is added to work RAM 80A. These data are recorded in management data (VMGI or HDVR_MGI), the average recording rate upon video recording is recorded, and the remaining information is recorded in the file system (block ST150).
d12) If video recording is not to end (NO in block ST148), the control returns to d8) to continue the data fetch and write processes.
In order to display the contents of stream data, whose video recording is in progress, on the TV or the like, the stream data to be recorded is sent to decoder unit 59 simultaneously with D-PRO unit 52, so as to make simultaneous video recording monitor. In this case, MPU unit 80 makes settings upon playback in decoder unit 59, which then automatically executes playback processing. D-PRO unit 52 forms ECC groups by combining, e.g., every 16 packs of stream data to be recorded, appends ECC data to each group, and sends them to disc drive unit 51 (and/or HDD 100a). When disc drive unit 51 is not ready to record on disc 100, D-PRO unit 52 transfers the ECC groups to temporary storage unit 53 and waits until disc drive unit 51 is ready to record. When disc drive unit 51 is ready, D-PRO unit 52 starts recording onto disc 100. As temporary storage unit 53, a large-capacity memory is assumed since it should hold recording data for several minutes or longer by high-speed access. Note that MPU unit 80 can make read and write accesses to D-PRO unit 52 via a dedicated microcomputer bus, so as to read and write the file management area and the like.
The flow of signals upon recording will be briefly summarized below. That is, MPEG-TS packet data received by STB 83 (or terrestrial digital tuner 89) are converted into packet groups by formatter 90, and the packet groups are saved in buffer 91. When data stored in buffer 91 reach a predetermined size (for one or an integer multiple of CDA size), they are recorded on the disc (100 and/or 100a).
e1) It is checked if data is to be continuously recorded. If data is not to be continuously recorded (NO in block ST1600), this processing ends.
e2) If it is determined that data is to be continuously recorded (YES in block ST1600), “1” is set in ESOB_CONN_SS (
e3) ESTR_FI of the ESOB of interest is set (ST1618), thus ending this process.
f1) A TS packet is received (block ST1300).
f2) It is checked if the STC has reached an end (Wrap-around). If the STC has reached an end (YES in block ST1301), a CNT_SEG is generated based on position information of the TS packet at the time of Wrap-around. In this manner, position information CNT_SEG_S_PKT_POS (see
f3) If the packet of interest is the first one of a packet group (YES in block ST1306), Header_ID: 0x00000fa5 is set (block ST1308A); otherwise (NO in block ST1306), the control advances to step f6).
f4) In block ST1308A, PATS data is used as the arrival time of the TS packet, the lower 4 bytes of the PATS data are allocated before that TS packet, and the upper 2 bytes of the first PATS data are set in the Packet_Group_Header as FIRST_PATS_EXT.
f5) In the TS packet fetched in the TS packet data area, the lower 4 bytes of the PATS data are appended before that TS packet (block ST1317C), and the TS packet is set in a packet group data area (block ST1317D).
f6) It is checked if a packet group is formed (if 170 TS packets are grouped). If a packet group is not formed yet (NO in block ST1322), the flow returns to f1). If the packet group is formed (YES in block ST1322), PKT_GRP_GI setting processing (block ST1340), CCI or CPI processing (ST1330), and MNFI processing (ST1350) are executed, and group data for one packet group are temporarily saved in buffer RAM 91 (block ST1332).
g1) The packet type is checked. If the packet type indicates an MPEG-TS packet, a value “01” is set in PKT_GRP_TY; otherwise, a value suited to that type is set in PKT_GRP_TY (block ST13400).
g2) A value (e.g., “11”) corresponding to the BOOK version of the standard of interest is set in VERSION, and a STUF bit indicating if stuffing is done is set (to be, e.g., “0”) (block ST13400).
g3) If 0 is set in the STUF bit, “0xaa” is set in Valid_PKT_Ns (including the number of valid packets in a packet group and the upper 2 bytes of PATS data appended to the first packet) (block ST13406).
h1) PSI and SI are examined to check the number of set streams (block ST1201).
h2) f4) and f5) are repeated in correspondence with the number of set streams (in case of YES in block ST12130).
h3) A stream type is checked based on PSI and SI (block ST1203) to determine if the stream of interest is a video or audio stream, or another type of stream to branch the control to the next stream check processes.
h4) In this case, the stream type is categorized to MPEG1 video, MPEG2 video, MPEG1 audio, MPEG2 audio, . . . , and internal data are checked depending on the determined type to read out respective kinds of attribute information.
h5) In case of a video stream, ES_TY=0 (block ST1213A), and respective kinds of attribute information are set (especially, resolution data, aspect information, and the like are extracted) to generate V_ATR (block ST1213C). The control then advances to f8).
h6) In case of an audio stream, ES_TY=0x40 (block ST1215A), and respective kinds of attribute information are set (especially, the coding mode, sampling frequency, the number of channels, and the like are extracted) to generate A_ATR (block ST1215C). The control then advances to f8).
h7) In case of another kind of stream, ES_TY=0x80 (block ST1217A), and respective kinds of attribute information are set (block ST1217C). The control then advances to f8).
h8) It is checked if streams for which ESI is to be generated still remain. If such streams remain, the control returns to check the next stream (NO in block ST1230).
j1) The number of search pointers (ESOBI_SRP) is increased by one to add another ESOBI, an area for that ESOBI is assured, and 0: MPEG_TS is set in PKT_TY (block ST1500).
j2) The video recording time is set in ESOB_REC_TM (block ST1502A). Note that the internal clock of the apparatus is set and corrected based on a TDT (Time Data Table), so that an accurate time can always be obtained.
j3) In this case, ESOB_S_PTM and ESOB_E_PTM data are extracted from the stream, and STC discontinuity information (e.g., CNT_SEGN in
j4) If the stream type is a TS stream (ARIB, DVB) (YES in block ST1506), “188” is set in AP_PKT_SZ and “16” is set in PKT_GRP_SZ (block ST1508A); otherwise (NO in block ST1506), a value corresponding to the broadcast scheme is set in AP_PKT_SZ (block ST1510). For example, in block ST1508A JPN (Japan) is set as country_code, and JapanISDB is set as AP_FORMAT1. Also, in block ST1510 the country code (e.g., USA) of the apparatus of interest is set as country_code, and the corresponding broadcast scheme (e.g., ATSC) is set as AP_FORMAT1.
j5) It is checked if PSI information and SI information are valid. If the PSI information and SI information are invalid (i.e., an unknown stream: NO in block ST1511), “1” is set in ESOB_TY: b12 or 0xff is set in respective values (block ST1513), and the control advances to j9).
j6) If the PSI information and SI information are valid (that is, a known stream: NO in block ST1511), TS_ID, NETWORK_PID, and PTM_ID (the PID of PTM data used by the ESOB of interest: there are two description methods of the PID: a method of describing the PID using 13-bit real data, and a method of describing the order in the PTM) are set based on PAT data (block ST1514).
j7) Program_Number (SERVICE_ID in PTM), PCR_PID, and the like are set based on PTM data. Furthermore, as for FORMAT_ID and VERSION, default values in the apparatus (in case of the built-in tuner) or Registration_Descriptor values sent via a digital input (in case of an external digital input) are set. Also, ESOB_TY is set according to the TMAP type (block ST1516A).
j8) Moreover, the number of recorded ESs, the number of Video ESs, and the number of audio ESs are set (block ST1516A). (The PTM is set with information: the number of all broadcasted ESs, but since not all ESs are always recorded upon video recording, the number of recorded ESs is set.)
j9) GPI setting processing (ST1530), TMAP setting processing (block ST1540), and the like are executed, and TMAPI is generated for each stream based on each segmentation information.
j10) The video recording start LB address is set in ADR_OFS (block ST1550A), and a default PID is set. Note that the default video PID corresponds to that with a smallest component tag value or the ESI number of a stream corresponding to a component tag described in a main component group in case of multi-view TV.
j11) An edit date is set (block ST1554).
k1) A stream type is checked (block ST15300B).
k2) If a plurality of programs form one stream (YES in block ST15300B), information indicating the presence of GPI is set in ESOB_TY, GPI_TY=0, PRIORITY=0 for all programs, one GPI is generated per program, and the number of groups is set (block ST15302B). The control then advances to k5).
k3) In case of rain attenuation broadcast (YES in block ST15304B), information indicating the presence of GPI is set in ESOB_TY, GPI_TY=40h, the top layer is set to be PRIORITY: 1, and other layers are set to be PRIORITY: 2. One GPI is generated per layer, and the number of groups is set (block ST15306B). The control then advances to k5).
k4) In case of multi-view broadcast (YES in block ST15308B), information indicating the presence of GPI is set in ESOB_TY, GPI_TY=40h, the top layer is set to be PRIORITY: 1, other layers are set to be PRIORITY: 2, and one GPI is generated per view (block ST15310B). Otherwise (NO in block ST15308B), 1 is set in ES_TMAP_Ns and information indicating the absence of GPI is set in ESOB_TY (block ST15312B). It is checked if ESs to be grouped (GP) still remain. If such ESs still remain (YES in block ST15314B), the control returns to k1); otherwise (NO in block ST15314B), the number of groups is set, and the control advances to k5).
k5) It is checked if another group (GP) remains. If such group remains, the control returns to k1); otherwise, a playlist is generated based on the PID of the currently selected group (block ST15316B), thus ending this processing.
k6) In this way, if playback is made using the currently selected group, the playlist automatically generated in block ST15316B can be played back.
m1) The ESOB/EVOB structure is determined (block ST15400).
m2) In case of the ESOB, TMAP TY is determined (block ST15403). If this ESOB is on the PTM base, ESs used to generate an STMAP are determined in consideration of the number of GPs, the number of ESs (the number of video ESs) is set as the number of TMAPs, and ES_PID to be generated is set for each TMAP. (However, one TMAP need not always be assigned to one GP. If no TMAP is available, another ES_TMAP of the identical ESOB is used to implement playback, search, special playback, and the like.) On the other hand, in case of an ESOB (AT_ESOB) on the PATS base or EVOB, one TMAP is added (see
m3) The ESOB (PTM base)/EVOB start and end times, the start and end times for each TMAP, the number of entries, the arrival time of the first packet of the ESOB (PATS base), the arrival time of the last packet of the ESOB, and the like are set based on segmentation information (block ST15405).
m4) A TMAPT is added, and ENTRY information (in case of an ESOB) is generated based on segmentation information. That is, 1ST_REF_PIC_SZ (the end address of the first I-pic of a target VES; if no I-Pic is available, zero is set), ESOBU_SZ (indicating the ESOBU size by PacketGP units), and ESOBU_S_PKT_POS (the head position of an SOBU in a PacketGP) are set for each VIDEO_ES. As ENTRY information in case of an AT_SOBU, AT_SOBU_SZ (indicating the ESOBU size by PKT units) and AT_SOBU_S_PKT_POS (the position of the first packet of the AT_SOBU in a packet group (PKT unit)) are set. Furthermore, as ENTRY information in case of an EVOBU, 1ST_REF_PIC_SZ (set the end address of the first I-pic), EVOBU_SZ, the number of playback frames, and the like are set (block ST15407). Note that TMAPT information is recorded as an independent file.
m5) The STMAP edit date is updated (block ST15409).
m6) It is checked if the sum total of the STMAP data of the STR_FI of interest exceeds 2 MB. If the sum total exceeds 2 MB (YES in step ST15411), “2 MB” (or the sum value of an ESOB segmented at 2 MB) is set in TOTAL_STMAP_SZ, and an ESOB is segmented not to exceed 2 MB. New STR_FI is generated, and a new ESOB is registered there (block ST15413), thus ending this processing.
m7) If the sum total does not exceed 2 MB (block ST15413), the sum total of STMAPs is set in TOTAL_STMAP_SZ (block ST15415), thus ending this processing.
With the above processing, the sum total of STMAPs can be prevented from exceeding 2 MB (the upper limit of an available memory size) (the sum total of STMPAPs is expected to equal the sum of previous TOTAL_STMAP_SZ and STMAP_SZ of the currently added ESOB).
As a method of setting the STMAP size to fall within 2 MB, a method of increasing the number of pieces of STR_FI by segmenting an ESOB into two objects as in the above processing, a method of increasing the number of pieces of STR_FI without segmenting an ESOB, and registering the ESOB in that information, and a method of broadening the STMAP interval by changing ESOBU_PB_TM_RNG may be used.
n1) The recorded time is checked (block ST154000). If the recorded time is equal to or shorter than two hours, the control advances to n2); if it falls within the range from two to four hours, the control advances to n3); or if it is equal to or longer than four hours, the control advances to n4) (block ST154001).
n2) “0” is set in EVOB/ESOB_PB_TM_RNG, and EVOBU/ESOBU_ENT data are generated based on segmentation information (information of 0.4 to 1 s) so that each ESOBU has a time range of 0.4 to 1 s (block ST154002). The control then advances to n5).
n3) “1” is set in EVOB/ESOB_PB_TM_RNG, and EVOBU/ESOBU_ENT data are generated based on segmentation information (information of 0.4 to 1.0 s) so that each ESOBU has a time range of 1.0 to 2.0 s (block ST154003). The control then advances to n5).
n4) “2” is set in EVOB/ESOB_PB_TM_RNG, and EVOBU/ESOBU_ENT data are generated based on segmentation information (information of 0.4s to 1.0s) so that each ESOBU has a time range of 2.0s to 3.0s (block ST154004). The control then advances to n5).
n5) This processing ends.
p1) It is checked if the latest PTM and EIT include copy information (digital copy control descriptors). If copy information is included (YES in block ST12200), the copy control descriptors are extracted (block ST12204), and CCI (APS, digital copy control information, etc.) is formed and set based on the extracted information (block ST12206). The control then advances to p3). At this time, “1” is set in PKT_GRP_GI: STUF, and the number of valid packets is set in PKT_GRP_GI: VALID_PKT_Ns.
p2) If no copy information is included in the received TS packet (NO in block ST12200), “copy free” is set (block ST12202).
p3) It is checked if the latest PTM and EIT include contents use descriptors. If the contents use descriptors are included (YES in block ST12208), the contents use descriptors are extracted (block ST12212), and ICT and EPN are set based on the extracted information (block ST12214A).
p4) If no contents use descriptors are included (NO in block ST12208), ICT and EPN are formed as “copy free” (block ST12210). Note that the ICT, EPN, and the like in block ST12214A or ST12210 have been described in the description of CCI with reference to
Another example of the CCI setting processing will be additionally explained below.
1) It is checked if the latest PTM and EIT include copy information. If copy information is included, copy information is formed and set based on that information. The control then advances to 3).
2) If the received TS packet does not include any copy information, the same information as in the previous pack is formed as copy information.
3) It is checked if the latest PTM and EIT include contents use descriptors. If the contents use descriptors are included, the following processing is made. That is, if the values of the contents use descriptors have changed in the middle of a packet group, dummy data is inserted in the previous packet group to form a new packet group after the changed position, and CCI is set based on that information. At this time, 1 is set in PKT_GRP_GI: STUF, and the number of valid packets is set in PKT_GRP_GI: VALID_PKT_Ns.
4) If the received TS packet does not include any copy information, CCI or CPI is formed as “copy free”.
q1) It is checked if a disc of interest undergoes the first video recording. If the disc of interest undergoes the first video recording (YES in block ST1600Z), new ORG_PGC is generated (block ST1602Z); otherwise (NO in block ST1600Z), a setting is made to add program PG after the already recorded PGC (ORG_PGC) (block ST1604Z).
q2) Erase permission: 0 is set in PG TY, the number of CELLs is set in Cell Ns, and the video ESI number is also set (block ST1700Z).
q3) In the setting of block ST1700Z, if the digital broadcast to be recorded is ARIB, and if language_code in a short event descriptor in an EIT is “jpn”, “0x12” is set in CHR in VMG_MAT, EVENT_NAME is set in the second field of PRM_TXTI, and representative picture information is set in REP_PICTI.
q4) The absolute number of PG is set in PG_INDEX to allow another application software or the like to refer to each PG (block ST1702Z). In this case, the start cell number and start time (start PTM) are set in resume information (PG_RSM_IFO).
q5) Information indicating a streamer is set in CELL_TY (e.g., cell type included in cell information EX_CI in
q6) In the setting of block ST1704Z, the reference ESOB number is set, the representative (video) ES number (ESIN) is set as the ES to be played back, and the number of pieces of entry point information EPI (
q7) Moreover, in the setting of block ST1704Z, start information is set in RG_RSM_INF (playback start PTM, video ESI number, audio ESI number, main/sub information of Dual-Mono) so that playback can start from the head of the program. The factors of automatic EP assignment in the video and time relationships are a constant time and a video mode change (an aspect ratio, and large motion vectors), and the first packet (the first packet of a sequence header, the first packet of I-PIC) of the first packet (Unit Start Indicator) GOP of a video frame is combined with these conditions. Furthermore, the factors of automatic EP assignment in the audio relationship are a change in audio (a change in audio volume or the like)/audio mode (ST/MONO), and the first packet (Unit Start Indicator, frame header) of an audio frame is combined with these conditions.
r1) Disc check processing is made first to check if the disc of interest is a Recordable/Rewritable Disc (R, RW, RAM). If the disc of interest is not a Recordable/Rewritable Disc, a message that advises accordingly is returned, and the processing ends.
r2) If the disc of interest is a Recordable/Rewritable Disc, the file system of the disc is read out to check if data has already been recorded (block ST207). If no data is recorded, a message “no data is recorded” is displayed, thus ending the processing.
r3) The VMG file is loaded (block ST207) and programs and cells to be played back are determined (they are determined as defaults or are selected by the user) (block ST208). In this case, if playback processing in the recorded order is selected, playback is made according to ORG_PGCI; if playback processing for each program (edited by the user) is to be made, playback is made according to UD_PGC (playlist) with a number corresponding to the program to be played back.
r4) The ESOB/EVOB to be played back, playback start PTM, and the like are determined based on title information (if PSI information and SI information are unknown, the setting is made to execute only transfer processing to the STB), resume information (PL_RSM_IFO, PG_RSM_IFO), cell information EX_CI) to be played back, and the like, and a playback start file pointer (logical address) and ESI of a stream to be played back are determined based on the playback start PTM. Furthermore, respective decoder units are set based on STI and ESI values to prepare for playback (block ST211A).
r5) Next, a playback method is determined based on AP_FORMAT1 and AP_FORMAT2 (see
r6) If PSI information and SI information are valid (YES in block ST211C), a stream to be played back is determined based on the PSI information and SI information, and the PSI information and SI information are saved in the work RAM (block ST211D). If the PSI information and SI information are invalid (NO in block ST211C), a setting is made to transmit all streams to the STB (block ST211E).
r7) Processing upon playback start is executed. It is checked if an object to be played back is an ESOB. Even if the object to be played back is an ESOB (YES in block ST213), decoder setting processing starts (block ST217); otherwise (NO in block ST213), only TS packet transmission processing is executed (block ST219).
r8) Next, cell playback processing is executed (block ST220), and it is then checked if playback is to end. If playback is to end (YES in block ST230), error check processing is executed. If any error is found (YES in block ST240), a message that advises accordingly is displayed (block ST242), and playback end processing is executed (block ST244). If no error is found (NO in block ST240), anther playback end processing is executed (block ST246), thus ending this operation.
r9) If playback is not to end (NO in block ST230), the next cell is determined based on PGCI (block ST232), and the flow returns to block ST211A. It is checked if the settings of decoder unit 59 (block ST217) have been changed. If the settings of decoder unit 59 have been changed, changed attributes are set in decoder unit 59 so as to change decoder settings in response to the next sequence end code.
r10) The same processes (blocks ST211A to ST232) are repeated while checking if playback is to end (block ST230).
s1) If an object to be played back is an ESOB (YES in block ST2170), a group to be played back is determined, and ESs to be played back are determined in accordance with GPI (block ST2171). If an object to be played back is an EVOB (NO in block ST2170), the control skips block ST2171.
s2) Attribute information (STI, ESI) of the ESOB (or EVOB) to be played back is loaded (block ST2172).
s3) It is checked if the ESOB (or EVOB) to be played back has a format supported by the recorder (the apparatus in
s4) If video data to be played back can be played back (YES in block ST2173), playback preparation is made (block ST2174A). In this case, the PID can be used intact if the 13-bit PID is set. However, the PID is determined with reference to PTM data if it is set based on the order in the PTM data.
s5) It is checked if audio data to be played back can be played back. If the audio data can be played back (YES in block ST2176), playback preparation is made (block ST2177A). In this case, the PID can be used intact if the 13-bit PID is set. However, the PID is determined with reference to PTM data if it is set based on the order in the PTM data. If the audio data cannot be played back (NO in block ST2176), apparatus settings are made not to play back the audio data and audio mute is set (block ST2178).
s6) Copy control processing is executed based on CCI or CPI information which includes the contents generated in the processing of, e.g.,
t1) Start file pointer FP (logical block number LBN) and end file pointer FP (logical block number LBN) of EX_CELL are determined based on the contents of TMAPI. Furthermore, start ESOBU_ENTRY and end ESOBU_ENTRY are determined based on the start and end times in EX_CI, and the data lengths of entries until target ESOBU_ENTRY are accumulated in ADR_OFS, thus obtaining a start address (LB=FP) and end address. The remaining EX_CELL length is calculated by subtracting the start address from the end address, and the playback start time is set in the STC (block ST2200). The PID to be played back is determined and is set in the decoder (STB, digital tuner). In this case, the PID can be used intact if the 13-bit PID is set. However, the PID is determined with reference to PTM data if it is set based on the order in the PTM data.
t2) ESOB continuity check processing is executed (block ST2201).
t3) Read processing during playback is executed to determine the read address and read size based on the start file pointer (block ST2206).
t4) The read unit size to be read out is compared with the remaining cell length. If the remaining cell length is larger than the read unit size (YES in block ST2207), a value obtained by subtracting the read unit size to be read out from the remaining cell length is set as the remaining cell length (block ST2208). If the remaining cell length is smaller than the read unit size (NO in block ST2207), the read unit size is set to be the remaining cell length, and the remaining cell length is set to be zero (block ST2209).
t5) The read length is set to be a read unit length, and the read address, read length, and read command are set in the disc drive unit (block ST2210).
t6) If data transfer starts (YES in block ST2212), the control waits until data for one ESOBU are stored. If data for one ESOBU are stored (YES in block ST2214), data for one ESOBU are loaded from the buffer (block ST2216), and buffer decoder transfer processing is executed (block ST2220). After read file pointer FP is incremented and the MPEG decoder is set in a normal mode (block ST2224), the control advances to t7).
t7) It is checked if transfer is complete. If transfer is complete (YES in block ST2226), the control advances to t8).
t8) It is checked if an angle key or the like has been pressed. If the angle key has been pressed (YES in block ST2238), it is checked if GPI is available. If GPI is available (YES in block ST2239), GP switching processing is executed (block ST2240); otherwise (NO in block ST2239), the control advances to the processing in block ST2228 without any process.
t9) If the angle key or the like has not been pressed (NO in block ST2238), it is checked if a Skip SW has been pressed. If the Skip SW has been pressed (YES in block ST2248), SKIP processing is executed (block ST2250).
t10) If the Skip SW has not been pressed (NO in block ST2248), it is checked if a STOP SW has been pressed. If the STOP SW has been pressed (YES in block ST2258), resume information (RSM_IFO) is saved in PG_RSM_IFO in case of title playback or in PL_RSM_IFO in case of playlist playback, and end processing is executed (block ST2260A).
t11) If the STOP SW has not been pressed (NO in block ST2258), the remaining cell length is checked. If the remaining cell length is not “0”, i.e., if the current cell is not the last one (NO in block ST2228), the flow returns to block ST2206; if it is “0” (YES in block ST2228), this processing ends.
u1) It is checked if the current ESOB is continuously recorded with the previous ESOB (ESOB_CONNI in
u2) If the two ESOBs are continuously recorded (YES in block ST22010), a setting is made to continuously play back the two ESOBs (to stop processing for, e.g., inserting a black frame between ESOBs until playback starts) (block ST22011).
v1) The number of packet groups in the buffer RAM is checked. If no packet group is found, the control skips the processing in
v2) A target packet group is read out from the buffer RAM (block ST22201). The head of the packet group is detected based on the packet group length and Header_ID (
v3) The STUF bit (
v4) Each TS packet is sent to the decoder unit (STB unit) at a time which is calculated using FIRST_PATS_EXT as the upper 2 bytes of PATS data of the first packet of the packet group and the lower 4 bytes of PATS data of the TS packet allocated immediately before that packet (block ST22202B). In other words, the PATS accuracy is detected based on PATS_SS, a transfer time of each TS packet is calculated from the PATS data (FIRST_PATS_EXT+PATS of the immediately preceding TS packet: in case of 4-byte accuracy) and PATS_SS (block ST22202B), and each TS packet is sent to the decoder unit (STB unit) at that time (block ST22203).
In case of the 6-byte accuracy, the transfer time of each TS packet is calculated using FIRST_PATS_EXT as the upper 2 bytes of PATS data of the first packet of the packet group, and the lower 4 bytes of PATS data of the TS packet allocated immediately before that packet. In case of 4-byte accuracy, PATS data is calculated from the immediately preceding PATS data in consideration of carry. In case of no accuracy, after packet data is extracted, a TS packet is output as soon as a request is received.
v5) Upon completion of packet transfer to the decoder unit (YES in block ST22204), copy control setting processing (CCI processing) is executed (block ST22205).
v6) After that, it is checked if manufacturer information MNF is available. If manufacturer information MNF is available, it is checked if its manufacturer ID matches that of the manufacturer of the apparatus of interest. If the two IDs match, data of manufacturer information MNF is loaded to execute predetermined processing (processing unique to each manufacturer) (block ST22270).
v7) Next, discontinue processing is executed (block ST22280).
v8) The control waits for completion of transfer, and it is checked if packet groups still remain in the buffer RAM. If no packet group remains in the buffer RAM (NO in block ST22206), this processing ends.
v9) If packet groups still remain in the buffer RAM (YES in block ST22206), a setting is made to process the next packet group (block ST22207), and the flow returns to block ST22201.
x1) The type of selector switch SW is checked (block ST22400X).
x2) Grouping information GPI of packet group GP whose playback is currently in progress is loaded (block ST22401X).
x3) It is checked if the GPI is stored. If no GPI is stored (NO in block ST22403X), this processing ends.
x4) If the GPI is stored (YES in block ST22403X), the GPI information is loaded to switch another GP (block ST222405X), and decoder setting processing is executed (block ST22410).
y1) Discontinuity information DCNI is read out and checked (block ST22800). If a CNT_SEG gap is found at the playback position (YES in block ST22802), the playback mode of the decoder is shifted to an internal clock mode (an operation mode that ignores the PTS value, makes playback using only internal clock values, and enables PTS data again at the time of reception of PCR data: external sync mode) (block ST22804), thus ending this processing.
y2) If no CNT_SEG gap is found at the playback position (NO in block ST22802), this processing ends without any processing.
z1) Entry point information table EPIT is loaded (block ST22500).
z2) The SKIP direction (determined by the type of SKIP key) is checked. If the SKIP direction is a forward direction (YES in block ST22502), entry point EP which is located after the current playback position and has the same PID as the current playback PID is searched for, and its information is loaded (block ST22504). On the other hand, if the SKIP direction is a backward direction (NO in block ST22502), entry point EP which is located before the current playback position and has the same PID as the current playback PID is searched for, and its information is loaded (block ST22506).
z3) An ESOBU_ENT to be played back is determined based on the detected EPI (block ST22508).
z4) ESOBU_ENT information is loaded to determine the playback start time (STC) (block ST22510). In this case, an ESOBU_Cluster (
z5) It is checked if the target ESOBU_ENT includes I-PIC (by checking if 1ST_REF_SZ=0). If the target ESOBU_ENT includes no I-PIC (NO in block ST22512), the immediately preceding ESOBU_ENT information of the identical group is loaded (block ST22514) to repeat the processes in blocks ST22512 to ST22514.
z6) If the target ESOBU_ENT includes I-PIC (or reference picture) (YES in block ST22512), sequence header SH in the ESOBU_ENT is loaded and is set in the decoder (block ST22522). Then, the I-PIC (or reference picture) found previously is read out, and the decoder is set to start decoding from that position, and to start display from the playback time designated by the EP (block ST22514), thus shifting to normal playback processing.
With the aforementioned processing, a flexible control operation corresponding to digital broadcast can be implemented.
1. In a digital recorder (DVD streamer or the like) which can record digital streams, if Wrap-around of the STC has occurred, that position is set in ESOBI as CNT_SEG, and the CNT_SEG count information from the head of the ESOB is appended to each PTM.
2. In the digital recorder (DVD streamer or the like) which can record digital streams, an ESI number used in a video stream upon playback is appended to each representative picture information so as to specify a video stream.
3. In the digital recorder (DVD streamer or the like) which can record digital streams, an ESI number used in a video stream upon playback, an ESI number used in an audio stream, and main/sub information when audio is Dual-Mono are appended to each resume information so as to specify a stream to be played back.
4. In the digital recorder (DVD streamer or the like) which can record digital streams, an ESI number used in a video stream upon playback, an ESI number used in an audio stream, and main/sub information when audio is Dual-Mono are appended to each EP information so as to specify a stream to be played back.
5. An STC continuous flag and/or PATS continuous flag and its offset value are appended as seamless information indicating continuity among logically continuous ESOBs in addition to a continuous recording flag.
Even when PSI or SI is unknown, various kinds of digital broadcast recording and playback can be supported.
<Corresponding Example Between Embodiments and Inventions>
<Information Storage Medium (Part 1) . . . TOTAL_STAMP_SZ in FIG. 12>
In an information storage medium (100 in
the information storage medium has a management area (111, 130 in
the management area (DVD_HDVR in
the management information (HDVR_MG in
<Information Storage Medium (Part 2) . . . TYPE_B/ESOB_TY/b12 in
(PSI=Program Specific Information)(SI=Service Information)
In an information storage medium (100 in
the information storage medium has a management area (111, 130 in
the management area (DVD_HDVR in
<Information Storage Medium (Part 3) . . . HDVR_MG in FIG. 8/EX_MB_VOB_STI in FIG. 10/V_ATR in FIG. 11>
In an information storage medium (100 in
the information storage medium has a management area (111, 130 in
the data area (131 to 133 in
the management area (DVD_HDVR in
the management information (DVD_HDVR in
<When Resolution Information in
The management information (ESTR_FIT in
<Recording Method using Information Storage Medium>
An information recording method (
<Playback Method using Information Storage Medium>
An information playback method (
<Recording Apparatus using Information Storage Medium>
An information recording apparatus (the encoder side in
<Playback Apparatus using Information Storage Medium>
An information playback apparatus (the decoder side in
Note that the invention is not limited to the aforementioned embodiments, and various modifications may be made based on techniques available at that time without departing from the scope of the invention when it is practiced at present or in the future. The respective embodiments may be combined as needed as much as possible, and combined effects can be obtained in such case. Furthermore, the embodiments include inventions of various stages, and various inventions can be extracted by appropriately combining a plurality of constituent elements disclosed in this application. For example, even when some constituent elements are omitted from all the constituent elements disclosed in the embodiments, an arrangement from which those constituent elements are omitted can be extracted as an invention.
While certain embodiments of the inventions have been described, these embodiments have been presented by way of example only, and are not intended to limit the scope of the inventions. Indeed, the novel methods and systems described herein may be embodied in a variety of other forms; furthermore, various omissions, substitutions and changes in the form of the methods and systems described herein may be made without departing from the spirit of the inventions. The accompanying claims and their equivalents are intended to cover such forms or modifications as would fall within the scope and spirit of the inventions.
Number | Date | Country | Kind |
---|---|---|---|
2005-118733 | Apr 2005 | JP | national |
This application is a divisional application of U.S. application Ser. No. 11/403,003, filed Apr. 13, 2006, and is based upon and claims the benefit of priority from Japanese Patent Application No. 2005-118733, filed Apr. 15, 2005, the entire contents of each of which are incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
Parent | 11403003 | Apr 2006 | US |
Child | 13955389 | US |