The present application claims priority from Japanese application JP 2008-180886 filed on Jul.11, 200, the content of which is hereby incorporated by reference into this application.
The present invention relates to an information record/reproduction apparatus and information record/playback methodology.
Person recognition and face detection technologies have recently been built in surveillance systems, digital still cameras and digital video cameras or camcorders. Use of these technologies results in commercial popularization of advanced imager products, including a still camera of the type having its ability to recognize the face of a subject of shooting, e.g., person, and using the resultant information for focusing and exposure settings.
For example, Japanese Patent Bulletin JP-A-2007-19845 relates to a “surveillance camera, surveillance method and surveillance program,” and discloses in its paragraph [0028] that the camera's recognized person information is used to perform stream division in a way such that a “moving-picture divider 60 splits a plurality of frame images, which are received from an image storage 30, into a plurality of streams with respect to each person as extracted in a moving-picture extraction unit 55.”
In addition, prior known hybrid digital video cameras having a plurality of types of built-in recording media include a camcorder capable of performing video dubbing to removable media within the camera per se, called the in-camera dubbing.
Examples of the record media housed in such hybrid camcorder are undetachable fixed storages, such as large-capacity hard disk drive (HDD), semiconductor flash memory, etc. Other examples are removable storage media, such as optical disks—e.g., digital versatile disc (DVD), Blu-ray™ disc (BD) or else—and secure digital (SD) cards or secure memory cards or like solid-state storages.
Known hybrid digital video cameras having multiple built-in recording media include a camcorder capable of performing in-camera video dubbing to removable media.
Examples of the record media housed in such hybrid camcorder are fixed storages, such as large-capacity HDD, flash memory or else, and removable media, such as optical disks—e.g., DVD or BD—and SD cards or the like.
The fixed storage media as built in a hybrid video camera are large in storage capacity and simultaneously offer long-time recordability. On the other hand, for purposes of long-term data saving along with browsing and playback of videos captured, it is general to use a method of dubbing a video saved on HDD to removable media, such as optical disks or else.
In such known technologies, most removable media are less in storage capacity than camera built-in fixed media. For example, an 8 cm BD having a single recording layer is about 7.5 gigabytes (GB) in storage capacity. An SD card is 2 GB in maximum capacity. A secure digital high capacity (SDHC) card is 32 GB in max capacity. Regarding the in-camera fixed media, an HDD, which is one known example thereof, has a storage capacity of 60 GB or more. A flash memory, which is another example, is 32 GB in capacity.
Under these circumstances, when performing dubbing of a video from a fixed media to removable media, a problem occurs as to the failure to record an entirety of video data on a single piece of removable medium. In such case, the only remedy for this problem is to perform dubbing to each of a plurality of removable media after having divided the video into two or more parts.
In this event, it is general to simply split image data having been recorded at almost equal interval of time period. However, this method accompanies a penalty which follows: a scene to be played back continuously is undesirably interrupted in a midstream thereof and its playback suddenly stops when watching and enjoying the recorded image as a home-made video movie.
The above-cited JP-A-2007-19845 is silent about this point.
This invention has been made in view of the above-stated technical background, and an object of the invention is to provide an information record/reproduction method and apparatus capable of appropriately performing dubbing while improving the usability for users.
To attain the foregoing object, this invention provides a technique for performing the dubbing using a specific kind of marking codes, which are different from usually used marks, to separate or “split” a video stream, although this invention should not exclusively be limited to embodiments as disclosed herein.
According to the invention, it is possible to provide an information record/reproduction method and apparatus capable of appropriately performing dubbing while retaining increased usability for users.
Currently preferred embodiments of the present invention will be described with reference to the accompanying drawings below.
As shown in
The CPU 101 performs control of respective components or modules, including the DSP 103, codec 104 and ATA/ATAPI controller 107. The AV sensor 102 is for converting video data of captured scene/subject images into electrical signals, which is sent forth to the DSP 103. DSP 103 is responsive to receipt of this input video signal from sensor 102, for applying thereto signal processing and for performing control of the face detector 105 and processing of a face detection signal. In this way, in this embodiment, the face detector 105 automatically detects the face of a subject of shooting, e.g., person, when a moving picture is taken. Upon receipt of an input video signal from DSP 103, the codec 104 applies compression processing thereto. This compression is performed while using the SDRAM 106 as a work buffer. A resultant video signal that was compressed by the codec 104 is passed via ATA/ATAPI controller 107 to either HDD 108 or a recording medium being presently loaded in BD drive 109 and then recorded thereto. In a playback mode, a video signal which is read out of either HDD 108 or the recording medium of BD drive 109 is transferred to the codec 104 through ATA/ATAPI controller 107, and is applied extension processing in codec 104. This extension processing is performed by using the SDRAM 106 as a work buffer therefor. The resulting extended or “stretched” video signal is sent to the monitor output unit 110 for visual display on its screen, such as a color liquid crystal display (LCD) panel. In a dubbing mode, a video signal as read out of the HDD 108 is transferred to the codec unit 104 via ATA/ATAPI controller 107 and is then applied extension processing by codec 104. For this processing, the SDRAM 106 is used as a work buffer.
During the video shooting, when the face detector 105 detects the presence of a human face in a scene being captured, an interruption signal is uploaded from the face detector 105 to CPU 101 via DSP 103. In responding thereto, CPU 101 recognizes the face detection. Upon recognition of the face detection, CPU 101 generates information indicative of a mark which becomes a break point or “transition” of the scene. This mark information is sent to the HDD 108 via ATA/ATAPI controller 107 and is then stored in HDD 108.
In cases where a scene transition is created due to stoppage of video shooting, the CPU 101 generates a usual or standard mark information, which is sent and recorded to the HDD 108 via ATA/ATAPI controller 107.
Turning to
The marks M1 and M4 are such that each is to be added when a user starts the video shooting, which mark is for routine use as a scene transition. The marks M2 and M3 are the ones that are automatically generated and inserted by the CPU 101 of the camcorder in responding to the occurrence of interruption from the face detector 105. The marks M1 and M4 are usually used marks which are to be displayed as scene transitions during both playback and dubbing sessions; the marks M2 and M3 are the marks that are used for dubbing only, which are recorded as a specific type of marks that are different from the usual marks.
It should be noted here that the marks M2 and M3 are designed so that these are not visually displayed during playback in order to prioritize the continuity of a scene being played back; however, in cases where it is required during playback to perform quick head search also at a position whereat the number of faces of subjects changed to increase or decrease, the marks M2-M3 may alternatively be designed so that these are displayable during the playback in order to meet the user's needs. Furthermore, it may also be a good idea to use different types of marks for different kinds of cases—i.e., when the subject face number increased and when the face number decreased. This makes it possible to manage the subject face number increase/decrease timings by means of such different types of marks, thereby enabling accommodation of a wide variety of user needs.
Alternatively, the marks may be designed, for example, so that a mark is added only at the timing of an increase in subject face number in accordance with the user's end-usage while preventing addition of such mark at the timing of a decrease in subject face number. Adversely, the marks may be designed so that a mark is added only at the timing of a decrease in person face number while preventing such mark addition at the timing of an increase in face number. Note that the mark to be added here may be either a dubbing-dedicated mark or a usual mark. With such the arrangement, it is possible to accommodate various kinds of user needs, thereby enabling improvement of the usability and user friendliness. In such arranged system, since image data having been recorded is not simply split into a plurality of parts with at almost equal interval of time period, it then becomes possible to resolve the problem in a general system, that is, a scene to be played back continuously is undesirably interrupted in a midstream thereof and its playback suddenly stops when watching and enjoying the recorded image as a home-made video movie.
In this embodiment, a directory 902 named “MARK” is further formed at a level just below the root directory. In the MARK directory 902, certain information as to dubbing-use marks are stored, such as the above-stated marks M2-M3. Generally, in a case where an extra directory different from the standard-defined directories is formed, this directory is not played back by standard apparatus, such as video players complying with the standards. In view of this, an extra or “special” directory which is different from usual directories or folders pursuant to the BDMV/AVCHD standards is intentionally prepared for storing therein the information relating to the dubbing-use marks, such as the marks M2 and M3. By doing so, it is possible, when dubbing is performed, to refer to the dubbing-use marks M2-M3 on the dubbing device side. This in turn makes it possible to make these marks M2-M3 invisible, or hidden, in any way from video players during playback sessions.
Alternatively, in case a need is felt to refer to the marks M2 and M3 in a playback session also, the information as to these marks M2-M3 may be saved in one of the standard directories complying with the standards, e.g., the PLAYLIST folder.
As for the dubbing procedure shown in
As apparent from the foregoing, it is possible for the hybrid camcorder embodying the invention to perform, by use of dubbing-dedicated marks, the dubbing to two or more separate recording media without impairing the inherent scene continuity and also to play a resultant home video movie while avoiding occurrence of the user's unintentional scene splitting during playback.
In this embodiment, there has also been stated an example which performs face detection during video shooting. With the face detection, it becomes possible to acquire in advance the face detection information when performing video shooting. This makes it unnecessary to reacquire such face detection information prior to execution of dubbing, thereby enabling smooth and rapid execution of the dubbing with the face detection information being taken into consideration.
Although in the above-stated embodiment a specific case was explained where a mark is added at a position at which the number of face images is changed from zero to one or, alternatively, from one to zero, it is not always necessary for this face number to become zero. For example, the mark may be added at a position at which the face number increased from one to two, a position whereat the number increased from two to three, a position whereat it reduced from five to four, etc. In this case, it is possible to achieve more fine scene splitting at shorter time intervals, which leads to an advantageous ability to use the capacity of a to-be-dubbed disk more efficiently.
The camcorder of
In video shooting and playback modes, signal processing is performed in a similar way to the first embodiment, except that face detection is done not during shooting but during dubbing. The dubbing also is similar to the embodiment 1 in that a video signal as read from the HDD 108 is sent to codec 104 via ATA/ATAPI controller 107 and subject to expansion processing at codec 104, with SDRAM 106 being used as a work buffer.
In this embodiment the video signal that was expanded during dubbing is passed to the face detector 105, which applies thereto face detection processing. In addition, the expanded video signal is again sent to the codec 104, which applies thereto compression processing, causing the resulting signal to be recorded on a recording medium of the BD drive 109 in a similar manner to that during recording. More specifically, upon startup of the dubbing, a stream is read out of a dubbing source, such as HDD 108, followed by execution of face detection.
When a face image is detected by the face detector 105 during dubbing, an interruption signal is uploaded from face detector 105 to CPU 101 via codec 104, resulting CPU 101 recognizing such face detection. Upon recognition of the face detection, CPU 101 determines that it must be a scene transition and then adds a mark to the scene to thereby perform scene splitting. Mark-related information is temporarily saved in a buffer memory, such as SDRAM 106 or else for example. The mark added in response to the face detection is a temporarily used mark which will possibly be deleted in the middle of the dubbing processing; so, this mark is called the temporary mark.
Upon completion of the dubbing of video contents up to a tail end of the scene from the HDD 108 to the disk in BD drive 109, the temporary mark of this scene is deleted, thereby enabling the user to care nothing about unnecessary scene splitting.
In a case where the first disk's remaining capacity becomes empty in the middle of the dubbing of a scene, the position of a last added temporary mark of those temporary marks added in the process of dubbing such scene is regarded as the tail end of this scene while simultaneously deleting those temporary marks other than the last added mark, thereby enabling the user not to bother about unnecessary scene splitting. After having loaded another disk, e.g., blank BD, in place of the fully recorded disk, the dubbing is restarted from the position of the last added temporary mark.
Regarding the timings of reading the stream from the dubbing source and performing the face detection along with recording to the dubbing destination, the readout and face detection plus the recording may be performed in a parallel way; alternatively, the recording may be done after having completed the readout and face detection up to the tail end of a stream. In the case of the readout and face detection plus the recording being performed in parallel, it becomes possible to shorten a total time as taken for the dubbing. In the case of the recording being done after completion of the readout and face detection, it is possible to perform the recording efficiently. This can be said because a mark position to be accommodated within the storage capacity of the dubbing destination medium is determinable in advance.
The dubbing flow in this embodiment will be described with reference to
Firstly, at step S1201, the dubbing gets started. Then at step S1202, a stream is read out of a dubbing source medium, such as HDD 108. In steps S1203 to S1207, processes are carried out, which are basically similar to those at steps S103 to S107 of
Repeating the above-stated procedure makes it possible to achieve the scene splitting at a position with no face images even when performing the dubbing by splitting the scene into parts to be recorded on two or more disks. This brings an advantageous effect unique to this embodiment: it becomes possible to realize scene splitting without damaging the scene continuity.
In this embodiment, the face detection is performed in the process of dubbing. Thus, it becomes possible to reduce the processing load during video shooting, when compared to the method for acquiring face detection information during shooting. Another advantage of the camcorder arranged to perform face detection not during shooting but during dubbing is as follows: in case a camcorder with no face detection function is used to shoot a subject(s) for making a home video movie and, thereafter, this camcorder is upgraded to have an add-in firmware module with the face detection function, it becomes possible to perform face detection with respect to the already shot video even after completion of the video shooting activity.
Note that in this embodiment, a specific case was explained where a mark is added at a position at which the number of face images is changed from 0 to 1 or, alternatively, from 1 to 0, it is not always necessary for this face number to become zero. For example, the temporary mark may be added at a position at which the face number increased from 1 to 2, a position whereat the number increased from 2 to 3, a position whereat it reduced from 5 to 4, etc. In this case, it is possible to achieve more fine scene splitting at shorter time intervals, which leads to an advantageous ability to use the capacity of a to-be-dubbed disk more efficiently.
Also note that although this embodiment camcorder is arranged to perform face detection only in the process of dubbing while preventing execution of the face detection during video shooting, this embodiment may be modified to perform the face detection both during shooting and during dubbing. In this case, when an attempt is made to perform highly precise face detection, the camcorder increases in load because it must perform in a shooting session an increased amount of processing tasks—that is, processing for the face detection in addition to the signal processing for video shooting. An approach to avoiding this is to design the camcorder to perform face detection during video shooting while prioritizing the signal processing for the shooting and separately perform precise face detection when dubbing is performed thereafter. With this arrangement, it is possible to achieve face detection with increased accuracy. The face detection may be based on a change in number of subject faces as stated previously, with or without additional detection of the kinds of such faces. With simultaneous execution of the face kind detection and the face number detection, it is possible to improve the usability for users while increasing the accuracy of face detection.
By applying this invention to hybrid digital video cameras or “camcorders” of the type having a built-in large-capacity fixed storage media, such as HDD, and a relatively small capacity of removable media such as DVD, BD or like optical recordable disks, it becomes possible to achieve the dubbing with automatic scene splitting feature without impairing the scene continuity in the event of watching and enjoying a home-made movie.
It is noted that the present invention should not exclusively be limited to the illustrative embodiments stated supra and may also include a variety of modifications and alterations as well as equivalents thereto. For example, any one of the above-stated embodiments is for detailed explanation of the invention, and the invention should not be interpreted to be limited only to apparatus or equipment having all of the constituent elements or components as disclosed herein. Additionally, it is possible to replace a part of the configuration of one of the embodiments by the configuration of another embodiment. It is also possible to add the configuration of one embodiment to that of another embodiment.
It should be further understood by those skilled in the art that although the foregoing description has been made on embodiments of the invention, the invention is not limited thereto and various changes and modifications may be made without departing from the spirit of the invention and the scope of the appended claims.
Number | Date | Country | Kind |
---|---|---|---|
2008-180886 | Jul 2008 | JP | national |