The present invention relates generally to authoring, delivering and reproducing auxiliary media in a Digital Cinema application.
This section is intended to introduce the reader to various aspects of art, which may be related to various aspects of the present invention that are described and/or claimed below. This discussion is believed to be helpful in providing the reader with background information to facilitate a better understanding of the various aspects of the present invention. Accordingly, it should be understood that these statements are to be read in this light, and not as admissions of prior art.
The term “Digital Cinema” refers generally to the creation and/or distribution of motion pictures to cinemas in digital form, in contrast to traditional cinema whose content, including the image, soundtrack and subtitles, exists in an analog form on motion picture film. Digital Cinema projection systems now exist and make use of a variety of technical solutions, rather than a single standardized solution embraced by all. The existence of content in digital form affords the opportunity to provide alternative information along with the original content with little effort. For that reason various organizations, such as the Digital Cinema Initiative (“DCI”) and the Society for Motion Picture Engineering (“SMPTE”) have devoted much effort towards establishing specifications for digital cinema content distribution.
The DCI specification defines preferred solutions in many areas, such as encoding, distribution, and security. The SMPTE has sought to refine the DCI specification from a technical and implementation perspective. The goal of these specifications is to provide an architecture that promotes high level and uniform technical performance. The system can provide enhanced sound and image content while providing the theatre patron with a more consistent movie experience. The system also affords potential simplification of production and distribution, delivery, and storage of the movie content.
Although Digital Cinema and the DCI specification affords potential simplification of production and distribution, delivery, and storage of the movie content, they currently do not support the inclusion of different kinds of auxiliary media, such as auxiliary soundtracks, in a normative and interoperable way. This drawback is important. For example, consider the case in which a digital film is to be presented in a cinema auditorium to viewers that communicate in different languages. If only one set of viewers can hear the audio in their spoken language and only another set of viewers can view subtitles in their written language, all the remaining viewers may have a difficult time following and enjoying the digital film. As a result, by not providing auxiliary media such as auxiliary soundtracks, Digital Cinema may be viewed as not providing all the viewers of a digital film with a rewarding experience.
The present disclosure is directed towards overcoming these drawbacks.
The disclosed embodiments of the present invention are directed towards creating, packaging, distributing, and reproducing auxiliary media tracks, such as a second language audio track, within a digital mastering and distribution system for motion pictures, such as that used in Digital Cinema. One embodiment of the present invention appends data structures and elements to the current existing management structure for representing digital cinema presentations maintaining complete interoperability and causing minimal impact to the existing structure while providing an enhanced service and experience to movie audience.
Another embodiment of the present invention includes a composition playlist having main assets identifying a sound file and a picture file to be reproduced in parallel, and an auxiliary asset associated with at least one of the main assets and identifying an auxiliary file to be reproduced in parallel with the sound file and the picture file.
A further embodiment of the present invention includes a method for packaging an auxiliary media track, the method includes the steps of receiving a digital cinema distribution master that contains media tracks, identifying main media tracks, identifying an auxiliary media track, and creating a digital cinema package containing the main media tracks, the auxiliary media track, and instructions to reproduce the main media tracks and the auxiliary media tracks in parallel.
Yet another embodiment of the present invention includes a method for reproducing an auxiliary media track, the method including the steps of receiving a digital cinema package containing presentation content, separating a main presentation content from an auxiliary presentation content, processing the main presentation content to extract main media tracks, processing the auxiliary presentation content to extract an auxiliary media track, and providing the main media tracks in parallel with the auxiliary media track.
In the drawings:
The characteristics and advantages of the present invention may become more apparent from the following description, given by way of example.
One or more specific embodiments of the present invention will be described below. In an effort to provide a concise description of these embodiments, not all features of an actual implementation are described in the specification. It should be appreciated that in the development of any such actual implementation, as in any engineering or design project, numerous implementation-specific decisions must be made to achieve the developers' specific goals, such as compliance with system-related and business-related constraints, which may vary from one implementation to another. Moreover, it should be appreciated that such a development effort might be complex and time consuming, but would nevertheless be a routine undertaking of design, fabrication, and manufacture for those of ordinary skill having the benefit of this disclosure.
The following describes a system for delivering auxiliary media, e.g., auxiliary soundtracks, associated with a movie presentation using digital cinema. Other systems utilized to deliver auxiliary media related to other presentations where the input may be supplied by some other means may include very similar structures. Those of ordinary skill in the art will appreciate that the embodiment of the system and circuits described herein is merely one potential embodiment. As such, in alternate embodiments, the components of the system and circuits may be rearranged or omitted, or additional components may be added.
Turning now to the drawings and referring initially to
The Central Storage Server 130 supplies a DCP to one or more auditoriums 160a,b,c over a network using a network interface 150. In a preferred embodiment, the network interface 150 is an Ethernet hub. The auditoriums 160a,b,c may be located together within one building complex. Alternately, the auditoriums 160a,b,c may be located in different buildings and even located significant distances from each other. The arrangement of the auditoriums 160a,b,c may determine the type of network employed for delivery of the DCP.
In order to prevent unauthorized reception of the DCP from the central storage server 130, a security system 170 may be provided. The security keys may be generated in conjunction with or separately from the delivery system 100. In one embodiment, the security keys are provided by the DCP content provider based on valid authorization information for both the content and the auditorium using the content. Further, each of the keys may be associated with only one of the auditoriums 160a,b,c, preventing unauthorized delivery of content to any of the other auditoriums 160a,b,c.
The auditoriums 160a,b,c receive a DCP based on requests made by equipment within the auditoriums 160a,b,c, not shown. Alternately, the auditoriums 160a,b,c may receive an DCP based on a delivery schedule provided by either the central storage server 130 or by an additional auditorium content manager, not shown. The auditoriums 160a,b,c process the DCP in order to produce the media for both audio and picture presentation to the patrons in attendance. The auditoriums 160a,b,c may also be able to transmit any auxiliary content provided with the DCP to the patrons. The delivery and reception of the auxiliary content is further described below.
Turning now to
The DCP signal is supplied from a network interface to a cinema server 210 located in the equipment room of auditorium 200. The cinema server 210 provides the signal processing necessary to separate the various elements of the DCP stream into appropriate picture, audio, and subtitle signals for presentation. The cinema server 210 may also provide any picture signal formatting necessary, such as converting the picture signal to a correct format and aspect ratio for display in the presentation room of the auditorium 200. The cinema server 210 may also process and convert the audio signal in order to separate and further generate the various audio channels as needed for the audio reproduction system employed.
The cinema server 210 also separates and manages any auxiliary media provided through the DCP stream. The cinema server 210 establishes any timing for delivery of the auxiliary media content by decoding synchronization information supplied within the DCP.
A picture content signal output from the cinema server 210 is supplied to a cinema projector 220. The cinema projector 220 receives the picture content signal and generates a light emitting picture output for display on a cinema screen 230 located in the presentation room of the auditorium 200. In a preferred embodiment, the cinema projector 220 receives a picture content signal in the form of a digital data stream representative of the luminance levels of the three colors red, green, and blue. Picture information pertaining to each of these colors is separated and provided to a digital light projection (DLP) circuit that uses a high intensity polarized light source in order to produce and project the picture through an opening in the wall adjoining the two rooms in the auditorium 200. The projected light source, representing the cinema presentation picture image, is projected to the other end of the presentation room of the auditorium 200 and displayed on the cinema screen 230.
A subtitle content signal from the cinema server 210 may also be supplied to the cinema projector 220. The cinema projector 220 receives the subtitle content signal and combines the subtitle content signal with the picture content signal and generates a light emitting picture output for display on the cinema screen 230 located in the presentation room of the auditorium 200.
An audio content signal from the cinema server 210 is supplied to speakers 240a-f located in the presentation room of auditorium 200. The speakers 240a-f may include additional signal processing such as audio equalization and/or amplification. The audio content signal may alternately use a centrally located processing system, not shown, for audio equalization and amplification. The central processing system may then provide the equalized and amplified signals to speakers 240a-f. The number and location of the speakers used in the presentation room may vary depending on requirements and design. In a preferred embodiment, six speakers 240a-f are located with three on each side wall of the presentation room of the auditorium 200. The speakers 240a-f are positioned equidistant spanning the length of a side and pointed perpendicular to the cinema screen 230.
An auxiliary media signal (e.g., an auxiliary soundtrack signal) from the cinema server 230 is supplied to antenna 250. The cinema server 230 may include any signal processing, including signal amplification, to put the auxiliary media signal in a condition for transmission by antenna 250. Alternately, a separate amplifier, not shown, may process and amplify the auxiliary media signal prior to transmission. The transmission method may involve the protocols associated with a wireless network system such as Wi-Fi. The transmission method may also involve use of cellular phone network communications. The transmission method may also involve a proprietary approach using either analog or digital communications techniques such as frequency modulation (FM) or quaternary phase shift keying (QPSK).
The auxiliary media signal is transmitted from antenna 250 and may be received by auxiliary media device 260. The auxiliary media device 260 receives the auxiliary media signal, decodes the auxiliary media signal, and provides the auxiliary media signal to the patron via the electronic media device 260. The auxiliary media device 260 may be embodied as a wireless network or Wi-Fi terminal, a headphone, a cellular phone, a pager, or a proprietary communications device.
It should be appreciated that alternative delivery systems for providing the auxiliary media signal (e.g., the auxiliary soundtrack signal), other than the wireless delivery system described above, may be utilized as known by those skilled in the art. For example, the auxiliary media signal may be delivered (in a wireless or wired manner) to speakers in a special sound-proof room contained within the presentation room. The special sound-proof room would permit the viewers within the room to view the cinema presentation on the same cinema screen 200 as the viewers outside of the sound-proof room but would allow the viewers within the sound-proof room to listen to the auxiliary sound associated with the cinema presentation. Another example may include the auxiliary media signal be delivered through a wired network to headphones provided at seats positioned in the presentation room of the auditorium 200.
The auxiliary media device 260 may include a display and some type of audio driver such as a speaker or a headphone connection. The auxiliary media device 260 may further include user controls for permitting a patron to control operation.
The cinema server 210 may additionally include a receiver circuit and include processing for determining and maintaining a roster of auxiliary media devices 260 that are active and in use in the auditorium 200. The cinema server 210, through an identifier signal for an auxiliary media device, could further target delivery of auxiliary audio tracks only to patrons who desire receive the auxiliary audio tracks.
Turning now to
The converted data stream from the input interface 310 is provided to a presentation processor 320. The presentation processor 320 separates the converted data stream into sets of individual presentation content, such as picture, audio, subtitles, and auxiliary media content. The presentation processor 320 also separates and decodes any code instructions supplied as part of the DCP. The presentation processor 320 operates on the incoming converted data stream following code instructions provided within the DCP. The presentation processor 320 may operate using instructions included internally. The presentation processor 320 may also separate and decode any security information, and may perform such functions as key validation for valid receipt of the DCP. The presentation processor 320 may also provide initial signal processing for the individual presentation content.
The presentation processor 320 also processes content synchronization information for the presentation. The synchronization information may be supplied along with, or as part of, the instructions provided in the DCP. Synchronization of the content then proceeds based on instructions within the DCP structure as well as instructions within the presentation processor 320. Time base information required to perform the synchronization may also be supplied within the instruction provided in the DCP, or alternately may be generated by the presentation processor 320.
A memory 322 may be connected to the presentation processor 320. Memory 322 may primarily store portions of the incoming converted data stream as well as store portions of the presentation signals in order to facilitate content synchronization. Memory 322 may also be used to store control information and operating code for the presentation processor 320 as well as intermediate computational values for any processing. In a preferred embodiment, memory 322 is in the form of random access memory (RAM) and is used for all memory requirements. In another embodiment, memory 322 includes RAM for operations control of the presentation processor 320 as well as storage of portions of the data stream and presentation signal. A read only memory (ROM) is used to store initialization and control software for the presentation processor 320.
Memory 322 may also store critical security information such as keys or algorithms. Portions of the security information may be stored in either RAM or ROM. It is also important to note that some other method of storing and retrieving security information may be used as is well known to those skilled in the art.
A storage device 324 may also be connected to the presentation processor 320. The storage device 324 has more storage capacity than the memory 322 and may also be capable of storage over a longer period of time. Storage device 324 may be used to store larger segments of the incoming converted data stream. Alternately, storage device 324 may store an entire DCP, allowing the cinema server 300 to essentially download a DCP in its entirety prior to processing the DCP using the presentation processor 320. In a preferred embodiment, storage device 324 is a hard disk drive.
The presentation processor 320 outputs several presentation signals, including the picture, audio and subtitle presentation signals as required for presentation.
The picture presentation output signal from the presentation processor 320 is provided to the picture output driver 330. In a preferred embodiment, the picture presentation output signal is a digital picture signal and the picture output driver 330 provides the picture presentation output signal to the cinema projector 220 for projection onto the cinema screen 230.
The audio presentation output signal from the presentation processor 320 is provided to the audio output driver 340. The audio output driver 340 provides the audio presentation signal to the speakers 240a-f.
The subtitle presentation output signal from the presentation processor 320 is provided to the subtitle output driver 345. The subtitle output driver 345 provides the subtitle presentation output signal to the cinema projector 220 for projection onto the cinema screen 230.
The auxiliary media output signal (e.g., an auxiliary soundtrack signal) from the presentation processor 320 is provided to the auxiliary processor 350. The auxiliary processor 350 provides any additional processing of the auxiliary media output signal, if necessary. The auxiliary processor 350 manages the auxiliary audio media output signal and also manages any additional auxiliary data.
It should be noted that although the presentation processor 320 and auxiliary processor 350 are illustrated as separate processors, the processors may be combined into a single processor as known by those skilled in the art.
The auxiliary media signal is output from the auxiliary processor 350 to the auxiliary transmitter 390. The auxiliary transmitter 390 may format the auxiliary media signal into a suitable transmission signal such as a QPSK signal. The auxiliary transmitter 390 may also process the transmission signal to add elements such as error correction, as required by a particular transmission standard or as is well known to one skilled in the art. As discussed above, the auxiliary media signal may be delivered in an alternative wired manner such as through a wired network to speakers in a soundproof room or to speakers provided at theater seats. When the auxiliary media signal is delivered in a wired manner, the auxiliary transmitter 390 may be replaced by an auxiliary driver, as known by those skilled in the art.
A memory 352 may be connected to the auxiliary processor 350. Memory 352 may primarily store portions of the auxiliary media content or any additional auxiliary data to facilitate content synchronization. Memory 352 may also be used to store control information and operating code for the auxiliary processor 350 as well as intermediate computational values for any processing. In a preferred embodiment, memory 352 is in the form of random access memory (RAM) and is used for all memory requirements. In another embodiment, memory 352 includes RAM for operations control of the auxiliary processor 350 as well as storage of portions of auxiliary media content. A read only memory (ROM) is used to store initialization and control software for the auxiliary processor 350.
A storage device 354 may also be connected to the auxiliary processor 350. The storage device 354 has more storage capacity than the memory 352 and may also be capable of storage over a longer period of time. Storage device 354 may be used to store larger segments of auxiliary media content. Alternately, storage device 354 may store additional auxiliary media content provided to or generated by the cinema server. The storage device 354 allows the cinema server 300 to receive additional auxiliary media content in its entirety at period of time in advance of delivering the content as auxiliary audio tracks in conjunction with a presentation. In a preferred embodiment, storage device 354 is a hard disk drive.
A controller 360 is connected to both the presentation processor 320 and auxiliary processor 350. Controller 360 may manage the interaction between the two processors as well as execute or process instructions delivered with the DCP. Controller 360 may maintain identifiers for devices capable of and/or receptive of delivery of auxiliary media. Controller 360 may also be capable of processing return data.
A user interface 370 is connected to controller 360 and may allow interactive control information between a person operating or controlling the presentation and the cinema server 300. The user interface 370 may provide external connections for a control display monitor, touch screen system, mouse, and/or keyboard.
Turning now to
The digital post-production of a film occurs at the studio or post production house 410. More specifically, film is scanned from camera-original film negatives into a digital format on a scanner or high-resolution telecine. Alternatively, data from digital motion picture cameras may be converted to a convenient image file format for processing. All of the files are conformed to match an edit list created by the film editor, and are then color corrected under the direction of the film's staff. The end result of post-production is a digital intermediate. The digital intermediate may be used to record the motion picture to film or as the basis for the digital cinema release. When all of the sound (e.g., main soundtrack and all auxiliary soundtracks), picture, and data elements of a production have been completed, they may be assembled into a Digital Cinema Distribution Master (DCDM) that contains all of the digital material needed for a presentation. The mastering stage for creating the DCDM may occur internally within the studio or post-production house or may occur external to the studio or post production house.
The DCDM is sent to a packaging system 420. The packaging system 420, in accordance with embodiments of the present invention, compresses, encrypts, and packages the pictures, sounds, and subtitles contained in the DCDM to form a Digital Cinema Package (DCP). The DCP contains the essence to be presented including compressed and encrypted track files containing audio, picture, and subtitle data, and a composition playlist (CPL) that dictates, among other things, how the track files are to be played.
The DCP is transmitted from the packaging system 420 to the central storage server 130 via different methods that may include hard drives, DVDs, or satellite, as previously discussed. The packaging system 420 may also interact with a key generation network 430 that ensures that the proper encryption keys are also communicated to the Digital Cinema theaters (i.e., to the security system 170 associated with the Digital Cinema theaters).
Turning now to
The auxiliary sound asset 540, in accordance with an embodiment of the present invention, links to an auxiliary soundtrack file 560 containing sound essence that is not already contained in the main soundtrack file 550. Although only one auxiliary sound asset 540 and auxiliary soundtrack file 560 is shown, multiple auxiliary sound assets and soundtrack files may be provided. Some examples of auxiliary sound essence include, but are not limited to, foreign soundtracks, narration sound for a Digital Cinema presentation, and the like. As discussed above in the description of
It should be noted that the auxiliary sound preferably has the same operational constraints as the main sound. This enables the reuse of soundtrack files 550, 560 in CPLs as either main or auxiliary sound, as discussed in further detail below. It should also be noted that auxiliary sound preferably has the same audio characteristics as main sound. More specifically, the bit depth should be 24 bits per sample, the sample rate should be 48.000 kHz or 96.000 kHz matching the sample rate of the main sound in the composition, the channel count should be ≦16 full-bandwidth channels and the reference level should equal −20 dBFS.
The total channel count between the main and auxiliary soundtracks 550, 560 in the CPL 500 is preferably limited to 16 channels. This enables the cinema server 210 (or more precisely a Media Block (not shown) in the cinema server 210) to handle CPLs with both main and auxiliary sound.
The auxiliary soundtrack files 560 make use of additional decryption contexts. The number of parallel decryption contexts for a Digital Cinema presentation may be limited. Therefore, the number of auxiliary sound assets 540 in CPL 500 and associated auxiliary soundtracks 560 may also be limited.
The reproduction of the auxiliary soundtrack file 560 is preferably synchronized with the reproduction of the main picture track file 545 and main soundtrack file 550. Although sample accuracy is not needed, at least frame accuracy is desirable (i.e., synchronization error shall be between +/−one frame).
Although the main soundtrack files 550 in the CPL 500 could be altered to contain auxiliary soundtracks such as additional language soundtracks, one advantage of using the auxiliary sound structure of the embodiment of the present invention is the flexibility of reusing existing soundtrack files 550, 560 for multiple CPLs 500. For example, consider the case in which the same digital film is to be presented in a first auditorium 160a to a primarily Spanish-speaking audience and in a second auditorium 160b to a primarily English-speaking audience. Using the auxiliary sound structure of an embodiment of the present invention, the same Spanish and English soundtrack files 550 and 560 could be used for two different CPLs: the first CPL using Spanish as the main sound and English as auxiliary sound (not shown), and the second CPL using English as the main sound and Spanish as the auxiliary sound (as shown in
The auxiliary sound structure of an embodiment of the present invention defines additional sound essence that may be reproduced in parallel with the main soundtrack files and the other media assets in a reel. In particular, the AuxiliarySound element (
Referring now to
Referring now to
The AuxiliarySoundTrackFileAssetType structure 700 is defined using XML schema, as shown below. A TrackId element 710 uniquely identifies the auxiliary soundtrack to which the auxiliary soundtrack file belongs. The TrackID element 710 is needed to distinguish between different auxiliary soundtracks in reels that contain multiple auxiliary soundtracks files (e.g., a reel containing a foreign soundtrack and a narration soundtrack). An AnnotationText element 720 is a free-form, human-readable text annotation associated with the auxiliary soundtrack file. The AnnotationText element 720 appears in at least one of, and preferably the first, auxiliary soundtrack file for each auxiliary soundtrack. The AnnotationText element 720 describes the purpose and use of the auxiliary soundtrack (e.g., “secondary Spanish soundtrack”).
One possible methodology for implementing the integration of auxiliary media content with main presentation content is based on XML (eXtensible Markup Language), a standardized, clear-text computer code. XML is a standardized language intended for use in publishing content on the Internet, but is being used to define data in other areas. In Digital Cinema for instance, XML is the common language for describing content, the method to play the content, sub-titles, and associated metadata.
An example of the definition of an auxiliary sound structure in a CPL in XML schema is provided below.
An example of a valid instance of a CPL and the auxiliary sound structures is provided below. The example represents a composition with one reel that contains one picture track file, one main soundtrack file, and one auxiliary soundtrack file.
Referring now to
Referring now to
While the invention may be susceptible to various modifications and alternative forms, specific embodiments have been shown by way of example in the drawings and are described in detail herein. However, it should be understood that the invention is not intended to be limited to the particular forms disclosed. Rather, the invention is to cover all modifications, equivalents and alternatives failing within the spirit and scope of the invention as defined by the following appended claims.
This application claims the benefit under 35 U.S.C. § 119 of a provisional application 60/699,264 filed in the United States on Jul. 14, 2005.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US2006/027289 | 7/14/2006 | WO | 00 | 1/11/2008 |
Number | Date | Country | |
---|---|---|---|
60699264 | Jul 2005 | US |