The present invention relates to energy-efficient data distribution systems and methods in general and, more particularly, relates to data distribution systems and methods for partial display updates in holographic light projection devices.
In conventional display applications, pixel values are refreshed periodically in a row-wise manner. Updating the display given a new frame of video content data thus requires an entire scan of the display area, which entails a dead time during which the pixel values of the display cannot be updated and an amount of energy spent to distribute and upload the new video content data to all the display pixels. Consequently, well-performing underlying data distribution hardware is needed which can cope with the timely and energy-efficient distribution and updating of large data volumes in a received stream of input video data. This is particularly important in the case of high resolution displays having a large number of pixels that need regular updating, and is even more important in compact displays offering enhanced viewing comfort and capabilities, e.g. stereoscopic displays or 3D displays such as 3D light field displays or holographic displays offering depth perception, for a single viewer or multiple viewers, at large viewing angles.
Known examples of systems for data distribution or for dynamic addressing (for write and read) of data are crossbar switch designs or DRAM memory access technology. In the former, the number of switching components grows quadratically, so that crossbar designs do not provide a sufficiently energy-efficient solution when scaling up the access bandwidth; the latter conventionally overwrites an entire row concurrently even if only one data location needs updating. Moreover, at least for volatile CMOS designs, the distributed, stored data requires regular refreshing, typically in the millisecond range. Therefore, there exists a need for solutions that save resources as much as possible and are also characterized by high throughput and low update latencies.
It is an object of embodiments of the present invention to provide an efficient, high-throughput, low-latency system for distributing data for 3D light field and holographic projection.
The above objective is accomplished by a method and device according to the present invention.
In a first aspect, the present invention relates to a system for distributing data for 3D light field projection. It comprises a plurality of input terminals which are suitable for receiving a stream of input data, and a plurality of output terminals which are connectable to pixel elements of a display. A plurality of data paths exists between input terminals and output terminals, and a plurality of data switches is suitable for controlling, via control variables, a transfer of input data, when received at the input terminals, on a data path. The system for distributing data further comprises a control plane which is adapted for applying control variables to the data switches. The control plane includes a plurality of control switches for selecting, via enable variables, one or more control variables from sequences of control variables and for applying the one or more control variables to the data switches. At least one first delay line suitable for propagating sequences of control variables, and at least one second delay line suitable for propagating sequences of enable variables, are also included in the control plane. The at least one first delay line and the at least one second delay line each comprise one or more delay units. Each of the one or more delay units of the at least one first delay line is in a synchronized relationship with exactly one of the one or more delay units of the at least one second delay line. The system for distributing data also comprises means for detecting patterns contained in the stream of input data, when received during system run-time. The detected patterns determine the sequences of control variables.
It is an advantage of embodiments of the invention that a selection of control variables provides partial updating of the holographic image data applied to the connectable pixel elements of a display, whereby an energy per area unit overhead is reduced.
It is also an advantage that control variables are sequentially sent on short delay lines having a lower capacitive load and shorter latency.
It is an advantage of embodiments of the invention that control variables or entire control sequences can be stored on the data switches for a long time without being refreshed.
In some embodiments of the present invention, the means for detecting patterns may include a run-time engine which is deciding on the selection of update patterns for the control plane, which positively influences the energy-efficient use of the system.
It is an advantage of embodiments of the present invention that the control plane design reduces a routing overhead, thereby providing an energy- and area efficient data distributing system.
It is an advantage of embodiments of the present invention that high input data volume traffic is handled by the system and high throughput rates are achieved.
According to some embodiments of the present invention, the system for distributing data further comprises means for carrying out local postprocessing computations on transferred input data for at least one of the plurality of output terminals.
The means for carrying out local postprocessing computations may, in particular embodiments of the present invention, comprise local data decoders operating on transferred input data.
In other embodiments of the present invention, the means for carrying out local postprocessing computation may comprise a circuit for identifying whether newly transferred input data for at least one of the plurality of output terminals has been changed compared to input data previously transferred to that output terminal.
For some embodiments of the present invention, the means for detecting patterns may also be adapted to control the execution of local postprocessing computations.
It is an advantage of some embodiments of the present invention that local postprocessing computation may be carried out on a local level and in a distributed fashion. This allows for more postprocessing functionality and an increased pixel-level control.
It is an advantage of some embodiments of the present invention that local decoding means may reduce the number of wires necessary for input data transfer and allow a more compact representation of input data by means of input data compression. Higher input data throughput rates are therefore achievable.
It is an advantage of some embodiments of the invention that many devices may be implemented locally, without the need for latches or registers, as is the case, e.g., in CMOS logic, and that the many devices may perform postprocessing computations independently from each other in a distributed fashion.
It is an advantage of some embodiments of the present invention that already transferred input data is reused locally. Therefore, redundant writing of input data to a pixel element may be avoided and the data distributing system is operating in a more energy-efficient way.
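The reuse of already transferred input data can be illustrated with a minimal software sketch. It is not the circuit of the invention: the function name `terminals_to_write` and the list-based representation of output terminals are assumptions made purely for illustration. The sketch compares newly transferred data with the data previously held at each output terminal and reports only the terminals whose value changed.

```python
def terminals_to_write(previous, incoming):
    """Return indices of output terminals whose data differs from the stored data.

    `previous` and `incoming` are equal-length sequences holding, per output
    terminal, the previously transferred value and the newly transferred value
    (hypothetical software stand-ins for the local comparison circuit).
    """
    if len(previous) != len(incoming):
        raise ValueError("previous and incoming must cover the same terminals")
    return [i for i, (old, new) in enumerate(zip(previous, incoming)) if old != new]


# Only terminals 1 and 3 changed, so only those two would be rewritten.
changed = terminals_to_write([5, 7, 7, 0], [5, 8, 7, 1])
```

In this sketch an unchanged terminal costs only a comparison, which mirrors the stated goal of avoiding redundant writes to a pixel element.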
According to some embodiments of the present invention, a synchronized relationship between each of the one or more delay units of the at least one first delay line with exactly one of the one or more delay units of the at least one second delay line is established by a synchronous clock signal distributed to the one or more delay units of the at least one first and second delay line. This has the advantage that very precise control of delay times of the various delay units and near perfect matching of delay times may be obtained.
According to some embodiments of the present invention, a synchronized relationship between each of the one or more delay units of the at least one first delay line with exactly one of the one or more delay units of the at least one second delay line is established by clock-free wave-pipelining circuits. This is of advantage since there is no need for clock distribution circuitry and an associated clock load is avoided. Therefore, a very energy-efficient implementation of the data distributing system may be provided.
According to some embodiments of the present invention, the plurality of data switches and/or the plurality of control switches comprises at least one thin film transistor (TFT). This has the benefit that TFT backplane technology may be used to stack multiple layers in a back-end-of-line process, each layer comprising thin film transistors and being connected to the next layer via intermediate metal layers.
It is an advantage of some embodiments of the present invention that TFT devices with larger nodes are manufactured at a lower cost.
It is an advantage of some embodiments of the present invention that TFT devices with a high threshold voltage allow better long term storage of control data.
For some embodiments of the present invention, the TFT devices may be implemented in IGZO materials, which is beneficial, as this allows for very low leakage currents.
According to some embodiments of the present invention, the system for distributing data further comprises means for generating sequences of control variables and/or means for generating sequences of enable variables. These sequence generating means may, in particular embodiments of the present invention, be algorithms for compression and holographic data transformations which are running on a computing device off-line. The compressed and/or transformed data may be stored on disks from which it is streamed to the system for distributing data.
In a second aspect, the present invention relates to a 3D light field projection device which comprises a system for distributing data according to any of the embodiments of the first aspect, and a display comprising pixel elements arranged on a display surface. Each output terminal of the plurality of output terminals is connected to and addresses at least one pixel element such that a transfer of received input data to output terminals causes an updating of the addressed pixel elements.
A group of pixel elements (e.g. color pixels, block of pixels in coarse rendering) may be addressed by only one output terminal, which further reduces the wiring overhead. A single pixel element may be addressed by a group of output terminals (e.g. phase and intensity information). It is an advantage that the display can be updated partially, which greatly reduces the power consumption of the device.
According to some embodiments of the present invention, the sequential selection of each control variable from a sequence of control variables propagating along the at least one first delay line defines a corresponding sequence of pixel elements or groups of pixel elements being addressed such that a curve sequentially connects the pixel elements or groups of pixel elements of said corresponding sequence on said display surface.
According to the same or other embodiments of the present invention, one sequence of control variables determines at least one curve of updated pixel elements or groups of pixel elements on said display surface. The updated pixel elements or groups of pixel elements along the at least one curve are addressed sequentially by the order of selection of control variables from said sequence, and the at least one curve does not intersect itself on said display surface.
This is advantageous because partial updating of the display occurs only in a local area of the total display, whereby spatial correlations in the image content are exploited more easily. An advantage of non-intersecting curves of sequentially addressed pixel elements or groups of pixel elements is given by the less complex and more compact design layout.
According to the same or other embodiments of the present invention, one sequence of control variables is determining at least one curve of updated pixel elements or groups of pixel elements on said display surface. The updated pixel elements or groups of pixel elements along the at least one curve are addressed sequentially by the order of selection of control variables from said sequence and the at least one curve is connecting neighboring pixel elements or groups of pixel elements of the display.
This is of advantage, as partial updating of the display along at least one curve connecting nearest neighbor pixel elements or groups of pixel elements on the display avoids or reduces the length of wire routing.
According to the same or other embodiments of the present invention, at least one curve is a space-filling winding curve along which straight curve segments are joined by right-angled turns such that the curve connects all the pixel elements belonging to a connected region of the display.
It is an advantage of these embodiments of the present invention that this organization of the display plane leads to a geometry which is still simple and allows for compact spatial clusters which do not require long chain lengths. Therefore, shorter wiring distances may be obtained, resulting in lower latencies.
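One simple curve of this kind is a serpentine (boustrophedon) path over a rectangular region. The sketch below is only an illustrative assumption: the source does not prescribe a specific curve, and the function names `serpentine` and `is_neighbor_chain` are hypothetical. It generates the path and checks that successive pixel elements are nearest neighbors and that no pixel element is visited twice.

```python
def serpentine(rows, cols):
    """Visit every (row, col) of a rows x cols region along a winding curve.

    Straight horizontal segments are joined by right-angled turns at the
    region border, so consecutive entries are always adjacent pixel elements.
    """
    path = []
    for r in range(rows):
        # Even rows run left to right, odd rows right to left.
        columns = range(cols) if r % 2 == 0 else range(cols - 1, -1, -1)
        path.extend((r, c) for c in columns)
    return path


def is_neighbor_chain(path):
    """True if the path never revisits a pixel and each step moves to a 4-neighbor."""
    steps_ok = all(
        abs(r1 - r0) + abs(c1 - c0) == 1
        for (r0, c0), (r1, c1) in zip(path, path[1:])
    )
    return steps_ok and len(set(path)) == len(path)


curve = serpentine(3, 4)
```

Because every step connects nearest neighbors, a control chain laid out along such a path would need only short links between consecutive pixel elements, which is the property the preceding paragraph relies on.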
According to some embodiments of the present invention, a plurality of curves on said display surface are defined and each curve is a straight line on said display surface. A straight line corresponds to a row of pixel elements of said display.
This has the advantage that addressing pixel elements of the display by rows allows easier routing/floor planning.
According to some embodiments of the present invention, each pixel element comprises electrically controllable phase change material.
It is an advantage of those embodiments of the present invention that existing phase change material technology is used for implementing pixel elements, resulting in a fully integrated solution offering fast and efficient electronic control of pixel elements with memory.
According to some embodiments of the present invention, a plurality of disjoint clusters of pixel elements provides a spatial partitioning of the display, the pixel elements of each cluster having similar update rates for each stream of input data out of a collection of representative streams.
Clusters of pixel elements have the advantage that they can be assigned at design time based on prior knowledge gathered by profiling. Therefore, an energy- and resource efficient system may be implemented.
According to some embodiments of the present invention, the projection device further comprises a splitter for splitting a received stream of input data into several smaller chunks of input data and for applying these smaller chunks of input data to the data input electrodes of more than one cluster.
It is an advantage of some embodiments of the present invention that the data distributing system can handle very high data rates, e.g. it can support terabits per second (Tbps) data transfer rates, which are necessary for frame rates/display update rates high enough to give the viewer(s) an impression of continuity. By breaking up the input data into more chunks, the latency of the system can also be controlled, so as to stay low when needed.
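The splitting step can be sketched in software as follows. This is a minimal illustration under stated assumptions: the source does not specify how the stream is partitioned among clusters, so the contiguous, near-equal split and the name `split_stream` are hypothetical choices.

```python
def split_stream(stream, n_clusters):
    """Split a sequence of input data into n_clusters contiguous chunks.

    Chunk sizes differ by at most one element; the first chunks receive the
    remainder. Each chunk would feed the data input electrodes of one cluster.
    """
    if n_clusters < 1:
        raise ValueError("n_clusters must be at least 1")
    base, extra = divmod(len(stream), n_clusters)
    chunks, start = [], 0
    for i in range(n_clusters):
        size = base + (1 if i < extra else 0)
        chunks.append(stream[start:start + size])
        start += size
    return chunks


chunks = split_stream(list(range(7)), 3)
```

With more clusters each chunk becomes shorter, so in this simplified picture the time needed to shift one chunk into its cluster shrinks accordingly.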
According to some embodiments of the present invention, a shape of each of the plurality of disjoint clusters in the display plane is assigned at design-time, based on histograms obtained through profiling of the system for distributing data, when fed with a collection of representative streams.
This has the advantage that cluster shapes can be optimized for a particular application with regard to energy-efficiency, image quality, etc. Existing representative video data may be efficiently exploited to optimize the cluster shapes and cover many possible applications.
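A design-time profiling flow of this kind might be sketched as below. The sketch is an assumption-laden illustration, not the profiling method of the invention: frames are modeled as nested lists, update rates are quantized into `n_bins` histogram bins, and pixel elements with the same bin for every representative stream share a cluster label. Spatial connectivity and cluster shape, which a real floor plan would also consider, are ignored here.

```python
def update_rates(frames):
    """Fraction of frame transitions in which each pixel value changes."""
    rows, cols = len(frames[0]), len(frames[0][0])
    changes = [[0] * cols for _ in range(rows)]
    for prev, cur in zip(frames, frames[1:]):
        for r in range(rows):
            for c in range(cols):
                if prev[r][c] != cur[r][c]:
                    changes[r][c] += 1
    transitions = len(frames) - 1  # requires at least two frames per stream
    return [[changes[r][c] / transitions for c in range(cols)] for r in range(rows)]


def cluster_by_update_rate(streams, n_bins=4):
    """Label pixels so that equal labels mean similar rates for every stream."""
    per_stream = [update_rates(frames) for frames in streams]
    rows, cols = len(per_stream[0]), len(per_stream[0][0])
    labels, result = {}, [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            # One histogram bin index per representative stream.
            key = tuple(min(int(rates[r][c] * n_bins), n_bins - 1) for rates in per_stream)
            result[r][c] = labels.setdefault(key, len(labels))
    return result


# One stream of three 1x2 frames: the left pixel is static, the right one toggles.
stream = [[[0, 0]], [[0, 1]], [[0, 0]]]
clusters = cluster_by_update_rate([stream])
```

The choice of `n_bins` trades cluster count against how tightly the update rates inside a cluster agree.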
According to some embodiments of the present invention, the profiling of the system for distributing data is obtained by simulating it in software. According to other embodiments of the present invention, the profiling of the system for distributing data is obtained by emulation in hardware.
According to some embodiments of the present invention, the update rates of pixel elements of each cluster are adapted dynamically, at run-time, by the means for detecting patterns. This allows for a flexible design in which clusters assigned at design-time are efficiently exploited during system run-time.
In a third aspect, the present invention describes a method for distributing streams of concurrent input data to a 2D or 3D storage medium for writing. First, streams of concurrent input data are provided and applied to one or more input terminals. Patterns contained in the stream of concurrent input data are then detected and sequences of control variables are determined as a function thereof. Next, the sequences of control variables are injected into at least one first delay line and at least one enable variable is injected into at least one second delay line. The at least one first delay line and the at least one second delay line each comprise one or more delay units. Each of the one or more delay units of the at least one first delay line is in a synchronized relationship with exactly one of the one or more delay units of the at least one second delay line. Control variables from one of the sequences of control variables propagating along the at least one first delay line are selected and the selected control variables are applied to data switches such that data paths between input terminals and output terminals are established. Furthermore, a plurality of control switches controls the selection of control variables. A state of each control switch depends on the at least one enable variable propagating along the at least one second delay line. Finally, concurrent input data is transferred along each of the established data paths such that transferred input data at an output terminal can be written to a memory location of a connectable 2D or 3D storage medium.
Particular and preferred aspects of the invention are set out in the accompanying independent and dependent claims. Features from the dependent claims may be combined with features of the independent claims and with features of other dependent claims as appropriate and not merely as explicitly set out in the claims.
For purposes of summarizing the invention and the advantages achieved over the prior art, certain objects and advantages of the invention have been described herein above. Of course, it is to be understood that not necessarily all such objects or advantages may be achieved in accordance with any particular embodiment of the invention. Thus, for example, those skilled in the art will recognize that the invention may be embodied or carried out in a manner that achieves or optimizes one advantage or group of advantages as taught herein without necessarily achieving other objects or advantages as may be taught or suggested herein.
The above and other aspects of the invention will be apparent from and elucidated with reference to the embodiment(s) described hereinafter.
The invention will now be described further, by way of example, with reference to the accompanying drawings, in which:
The drawings are only schematic and are non-limiting. In the drawings, the size of some of the elements may be exaggerated and not drawn to scale for illustrative purposes. The dimensions and the relative dimensions do not necessarily correspond to actual reductions to practice of the invention.
Any reference signs in the claims shall not be construed as limiting the scope.
In the different drawings, the same reference signs refer to the same or analogous elements.
The present invention will be described with respect to particular embodiments and with reference to certain drawings but the invention is not limited thereto but only by the claims.
The terms first, second and the like in the description and in the claims, are used for distinguishing between similar elements and not necessarily for describing a sequence, either temporally, spatially, in ranking or in any other manner. It is to be understood that the terms so used are interchangeable under appropriate circumstances and that the embodiments of the invention described herein are capable of operation in other sequences than described or illustrated herein.
Moreover, directional terminology such as top, bottom, front, back, leading, trailing, under, over and the like in the description and the claims is used for descriptive purposes with reference to the orientation of the drawings being described, and not necessarily for describing relative positions. Because components of embodiments of the present invention can be positioned in a number of different orientations, the directional terminology is used for purposes of illustration only, and is in no way intended to be limiting, unless otherwise indicated. It is, hence, to be understood that the terms so used are interchangeable under appropriate circumstances and that the embodiments of the invention described herein are capable of operation in other orientations than described or illustrated herein.
It is to be noticed that the term “comprising”, used in the claims, should not be interpreted as being restricted to the means listed thereafter; it does not exclude other elements or steps. It is thus to be interpreted as specifying the presence of the stated features, integers, steps or components as referred to, but does not preclude the presence or addition of one or more other features, integers, steps or components, or groups thereof. Thus, the scope of the expression “a device comprising means A and B” should not be limited to devices consisting only of components A and B. It means that with respect to the present invention, the only relevant components of the device are A and B.
Reference throughout this specification to “one embodiment” or “an embodiment” means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, appearances of the phrases “in one embodiment” or “in an embodiment” in various places throughout this specification are not necessarily all referring to the same embodiment, but may. Furthermore, the particular features, structures or characteristics may be combined in any suitable manner, as would be apparent to one of ordinary skill in the art from this disclosure, in one or more embodiments.
Similarly it should be appreciated that in the description of exemplary embodiments of the invention, various features of the invention are sometimes grouped together in a single embodiment, figure, or description thereof for the purpose of streamlining the disclosure and aiding in the understanding of one or more of the various inventive aspects. This method of disclosure, however, is not to be interpreted as reflecting an intention that the claimed invention requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in less than all features of a single foregoing disclosed embodiment. Thus, the claims following the detailed description are hereby expressly incorporated into this detailed description, with each claim standing on its own as a separate embodiment of this invention.
Furthermore, while some embodiments described herein include some but not other features included in other embodiments, combinations of features of different embodiments are meant to be within the scope of the invention, and form different embodiments, as would be understood by those in the art.
It should be noted that the use of particular terminology when describing certain features or aspects of the invention should not be taken to imply that the terminology is being re-defined herein to be restricted to include any specific characteristics of the features or aspects of the invention with which that terminology is associated.
In the description provided herein, numerous specific details are set forth. However, it is understood that embodiments of the invention may be practiced without these specific details. In other instances, well-known methods, structures and techniques have not been shown in detail in order not to obscure an understanding of this description.
A control chain, as referred to in figures and exemplary embodiments of the detailed description, corresponds, in the context of the present invention, to a portion of the control plane that includes all the elements necessary to steer the distribution of input data to the location(s) where it is used to modify pixel elements of a display operatively coupled to the data distributing system. So the control chain also steers the full or partial update of these pixel elements. The actual distribution of input data is performed by switches which belong to the data plane of the data distributing system. The control plane may comprise several control chains, each control chain being adapted for steering input data to specific locations and locations addressed by different control chains may overlap.
Figures and exemplary embodiments of the invention having data input electrodes and data output electrodes as its respective input and output terminals are described hereinafter. The skilled person will appreciate, however, that embodiments of the invention are not limited to electrodes and that any type of suitable electrical contact or electrical connector may be provided as an input or output terminal.
Exemplary embodiments of the invention commonly refer to holographic displays/projectors as an example of 3D light field creating displays or projection devices; the terms holographic display and holographic projector are used interchangeably in the sense that they enable viewing of a full 3D scene. This does not exclude other displays or projection devices creating 3D light fields from falling under the scope of embodiments of the second aspect of the invention. The skilled person would know, for example, how to encode, format, or organize a stream of 4D light field information for projection with a (near-eye) light field display such that the encoded, formatted, or organized stream can be distributed with a data distributing system in accordance with embodiments of the first aspect of the invention. A similar reasoning also applies to data distributing systems in non-projecting displays with or without depth perception, e.g. to autostereoscopic lenticular displays which use microlenses and define macro-pixels to achieve non-uniform angular lighting and depth perception. As a consequence, scaled, denser, and larger pixel arrays are striven for in 3D applications of 2D displays augmented by lens arrays. More generally, the invention may also be put into practice for data distributing systems in standard 2D displays having large pixel densities and/or a large pixel count, e.g. compact LCD or TFT 2D displays having millions of pixels, the latter being non-limiting examples of projection-free devices, i.e. suitable for direct viewing. Furthermore, the present invention can be applied to any display where partial dynamic updates are useful and that requires energy-efficient, high-throughput, low-latency data distribution. High-throughput, low-latency data distribution in the context of the above-cited example of a holographic projector implies dense data throughputs exceeding 10 Gbits/cm² for good resolution also with blue light at sub-wavelength ranges, e.g. at quarter wavelength resolution. This data is likely to be distributed at typical frame rates for fluid motion perception without flicker, e.g. at 24 fps, 48 fps, etc. This frame rate may triple for a three-color display. Thus, for typical pixel counts beyond 1 megapixel and 24-bit color depth, a low-latency data distributing system supports overall data throughput rates beyond 0.5 Gbit/sec, and data throughput densities beyond 1 Tbit/sec/cm². The related power consumption preferably ranges below a few Watts, more preferably below a few milliwatts. However, there may be no unique or preferred way of specifying the overall system performance, and in general the following rule applies: if the system operation at the maximum performance (speed) cannot be met, additional parallelism is introduced to cope with the expected overall system specifications.
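The overall throughput figure quoted above can be checked with simple arithmetic. The sketch below assumes an uncompressed stream in which every pixel is rewritten in every frame; the function name `raw_throughput_gbit_per_s` is hypothetical and used for illustration only.

```python
def raw_throughput_gbit_per_s(pixel_count, bits_per_pixel, frames_per_second):
    """Uncompressed data rate in Gbit/s when every pixel is rewritten each frame."""
    return pixel_count * bits_per_pixel * frames_per_second / 1e9


# 1 megapixel, 24-bit color depth, 24 fps: 10**6 * 24 * 24 bit/s = 0.576 Gbit/s.
baseline = raw_throughput_gbit_per_s(1_000_000, 24, 24)
# Doubling the frame rate to 48 fps doubles the rate to 1.152 Gbit/s.
doubled = raw_throughput_gbit_per_s(1_000_000, 24, 48)
```

The baseline of 0.576 Gbit/s is consistent with the stated lower bound of 0.5 Gbit/sec for pixel counts beyond one megapixel.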
A system for distributing data according to embodiments of the first aspect of the present invention is now described referring to
The data output electrodes 103 are connectable to pixel elements 110 of a display, e.g. a holographic display. The connection to the pixel elements 110 may be such that exactly one data output electrode 103 is connected to one pixel element. More generally, more than just one data output electrode 103 may be connected to one pixel element (e.g. to parallelize the number of bits that can be written to a pixel element concurrently or to separate information on intensity and phase levels for each pixel element). The system 100 for distributing data comprises a control chain containing elements to steer the distribution of input data 101 from the input electrodes 102 to the output electrodes 103. Such a control chain of the system 100 for distributing data comprises a first delay line 111, and control switches 104 which are electrically coupled to the first delay line 111 and to the data switches 105. The electrical coupling is such that, if a control switch 104 is in a predetermined state, e.g. an on-state for which it is conducting, a control variable propagating along the first delay line 111 is selected and directed towards at least one of the data switches 105, whereby a transfer of input data 101 from corresponding data input electrodes 102 towards corresponding data output electrodes 103 is controlled. The control switches 104 are also operatively connected to a second delay line 112 so as to receive enable variables that control the switching events of the control switches 104. As more extensively described in EP17182232.3, incorporated herein by reference, the first delay line 111 comprises one or more delay units 106, 107 and is in a synchronized relationship with the second delay line 112, which also includes one or more delay units 108, 109. The synchronized relationship between both delay lines may be established by pairwise matching of delay units of both delay lines.
For example, a delay time T22 of a delay unit 108 of the second delay line 112 is determined as a function of a delay time T12 of a corresponding delay unit 106 of the first delay line 111, and a delay time T23 of a delay unit 109 of the second delay line 112 is determined as a function of a delay time T13 of a corresponding delay unit 107 of the first delay line 111, etc. In some embodiments of the invention, it may be advantageous to design corresponding delay units of first and second delay line such that the delay time T22 of a delay unit 108 of the second delay line 112 is, within error margins that are acceptable for a given application, twice the delay time T12 of the corresponding delay unit 106 of the first delay line 111, the delay time T23 of a delay unit 109 of the second delay line 112 is, within said error margins, twice the delay time T13 of the corresponding delay unit 107 of the first delay line 111, etc. This synchronized relationship between delay units 106, 107 of the first delay line 111 and delay units 108, 109 of the second delay line 112 has the effect that an enable variable (propagating along the second delay line 112) is aligned temporally with successive control variables composing a sequence of control variables (propagating along the first delay line 111), each time it advances by one delay unit 108, 109. Equivalently, the synchronized relationship states that an offset in time between two sequences, e.g. a control sequence of control variables and an enable sequence of enable variables, is increased in a controlled fashion. The two sequences are co-propagating on the first and second delay line 111, 112. In an exemplary embodiment, delay units 108, 109 of the second delay line 112 may be provided as clocked 2-bit shift registers, whereas corresponding delay units 106, 107 of the first delay line 111 may be provided as 1-bit shift registers clocked by the same clock signal as the 2-bit shift registers of the second delay line 112. 
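The clocked shift-register embodiment can be mimicked with a small discrete-time simulation. This is a behavioral sketch under explicit assumptions, not the circuit of EP17182232.3: one control variable is injected per clock tick, a single enable pulse is injected at tick `enable_tick`, tap k of the first delay line sees a value k ticks after injection, and tap k of the second delay line sees it 2k ticks after injection. The function name `select_control_variables` and the tap placement are hypothetical.

```python
def select_control_variables(control_seq, n_taps, enable_tick=0):
    """Return, per control switch (tap), the control variable it selects.

    First delay line: one register stage per delay unit (delay 1 tick).
    Second delay line: two register stages per delay unit (delay 2 ticks).
    """
    first = [None] * n_taps               # control variables in flight
    second = [False] * (2 * n_taps - 1)   # enable pulse in flight
    selected = [None] * n_taps
    for t in range(2 * n_taps + enable_tick):
        ctrl_in = control_seq[t] if t < len(control_seq) else None
        enable_in = (t == enable_tick)
        # Clock edge: both lines shift by one register stage.
        first = [ctrl_in] + first[:-1]
        second = [enable_in] + second[:-1]
        for k in range(n_taps):
            if second[2 * k]:             # enable present at tap k
                selected[k] = first[k]    # control switch k passes this variable
    return selected


# The enable pulse meets c0 at tap 0, c1 at tap 1, c2 at tap 2, and so on.
picked = select_control_variables(["c0", "c1", "c2", "c3"], n_taps=4)
```

In this model the enable pulse reaches tap k at tick 2k, at which time tap k of the first delay line carries the control variable injected at tick k. The time offset between the two sequences therefore grows by one tick per delay unit, which is the controlled increase in offset described above. Injecting the enable pulse later (a larger `enable_tick`) shifts the selection window along the control sequence.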
In another exemplary embodiment, delay units 106, 107, 108, 109 may be provided by flip-flops or registers which are not synchronized by a common clock signal, but wherein the temporal alignment of an enable variable with successive control variables is achieved by wave-pipelining circuitry, e.g. by the controlled insertion of delay buffers into the second delay line 112. In yet another exemplary embodiment, delay units 106, 107, 108, 109 may be provided as transmission line segments which can be modeled and built as lumped RC circuits. A careful matching of resistance values and capacitances of transmission line segments results in the desired control of the time offset between the two co-propagating sequences. A delay time of a delay unit 108, 109 of the second delay line 112 is not necessarily exactly twice a delay time of a corresponding delay unit 106, 107 of the first delay line 111. There exists some degree of tolerable variability as long as sufficient temporal overlap between an enable variable and successive control variables is ascertained. For instance, the enable variable may be chosen to be a pulse which is present or absent, and likewise for control variables. A shorter pulse duration for the enable variable, e.g. 10% shorter, compared to control variable pulse durations provides some flexibility in terms of time jitter in the offset during propagation. It may therefore be possible that a delay time T22 of a delay unit 108 of the second delay line 112 is 2.1 times, instead of a targeted value of 2.0, the delay time T12 of the corresponding delay unit 106 of the first delay line 111, and the delay time T23 of a delay unit 109 of the second delay line 112 is 1.9 times, instead of a targeted value of 2.0, the delay time T13 of the corresponding delay unit 107 of the first delay line 111, etc.
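By way of a non-limiting illustration, the clocked shift-register embodiment described above can be sketched in a few lines of Python. The sketch rests on assumptions that are not taken from the description: the function name `simulate_control_chain` is hypothetical, the enable pulse is assumed to be injected together with the first control variable, tap k is assumed to read cell k of the first delay line and cell 2k of the second delay line, and a control switch is assumed to latch whatever its first-line tap holds while the enable pulse is present. Under these assumptions the single travelling enable pulse meets successive control variables, one per delay unit.

```python
def simulate_control_chain(control_seq):
    """Clock two shift-register delay lines and latch control variables.

    Hypothetical model: the first delay line advances one delay unit per
    clock (1-bit registers), the second delay line needs two clocks per
    delay unit (2-bit registers), so its delay per unit is twice as long.
    """
    n = len(control_seq)
    first = [0] * n          # one cell per 1-bit delay unit
    second = [0] * (2 * n)   # two cells per 2-bit delay unit
    latched = [None] * n     # control variable stored at each data switch

    for t in range(2 * n - 1):
        # Shift both lines by one clock and inject the next symbols.
        first = [control_seq[t] if t < n else 0] + first[:-1]
        second = [1 if t == 0 else 0] + second[:-1]
        for k in range(n):
            if second[2 * k]:            # enable pulse has reached tap k
                latched[k] = first[k]    # control switch k conducts
    return latched


if __name__ == "__main__":
    print(simulate_control_chain([1, 0, 1, 1, 0]))
```

In this model the k-th data switch ends up holding the k-th control variable of the injected sequence, which is the temporal alignment that the factor of two between the delay times is intended to produce.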
In some embodiments of the invention, the first delay line 111 and/or the second delay line 112 may be conceived to propagate control variables or enable variables which are represented as multiple bits. In other embodiments of the invention, the first delay line 111 and/or the second delay line 112 may be conceived to propagate control variables or enable variables which are represented as single bits.
The means for providing input data 101, e.g. data transfer means for transferring input data 101 from a storage medium to the data distributing system, possibly through a pre-routing network, may be included in the data distributing system 100 or may be external to it. The input data 101 may, without being limited thereto, be transferred from an external storage disk of a computer or server to the data distributing system 100 via a wired connection or a wireless connection.
In particular embodiments, the input data 101 is encoded and/or compressed so as to achieve suitable error correction and/or data compression of the stream of input data 101 with the advantage of higher data transmission, distribution and display update rates being obtained. For some applications, the input projection data 101 may be preprocessed input data, obtained off-line, e.g. compressed and/or encoded input data, and/or input data transformed for holographic data projection, and the so preprocessed input data is then stored on a suitable storage medium, e.g. storage disks on a computer, from which it is retrieved and streamed to the data distributing system 100 when it is operative. In the context of the present invention, the input data 101 typically represents a primitive out of a pre-defined set of primitives forming a representative, high-level description of a 3D scene. Each primitive is then interpreted in a local decoding and/or local postprocessing step and results in a physical representation of the 3D scene information at the level of individual or groups of pixel elements of the (holographic) display, e.g. as optical phase and/or amplitude distributions for light interacting with the display. As a very simple example, one may imagine a simple point in 3D space constitutes the whole 3D scene. A physical representation of this 3D scene point at the level of individual or groups of pixel elements of the (holographic) display may correspond to transmission modulated pixel elements of the display exhibiting quasi-continuous or discrete versions of Fresnel zone plates, or to phase modulated pixel elements of the display mimicking Fresnel lens profiles. A primitive may then address the transmissivity/reflectivity or the phase response of an entire ring of pixel elements or an arcuate portion thereof.
In some embodiments of the invention, the streamed input data 101 may be split into several smaller chunks of input data 101 that are applied to the data input electrodes 102 associated with more than one control chain. It is an advantage of embodiments of the invention that the data distributing system 100 can handle very high data rates, e.g. it can support terabits per second (Tbps) data transfer rates, which are necessary for frame rates/display update rates high enough to give the one or more viewers an impression of continuity. By breaking the input data 101 up into chunks, the latency of the system 100 can also be controlled, so as to stay as low as possible when needed.
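The order of magnitude of such data rates can be made plausible with a short back-of-the-envelope calculation, given here as a sketch only. The pixel pitch of 100 nm and the update rate of 100 per second are figures mentioned elsewhere in this description; the display side length of 1 cm, the single bit per pixel element, and the function name `required_bit_rate` are assumptions introduced purely for illustration.

```python
def required_bit_rate(display_side_nm, pixel_pitch_nm, bits_per_pixel,
                      updates_per_second):
    """Raw bit rate needed to rewrite every pixel element at every update."""
    pixels_per_side = display_side_nm // pixel_pitch_nm
    n_pixels = pixels_per_side ** 2
    return n_pixels * bits_per_pixel * updates_per_second


if __name__ == "__main__":
    # Assumed: square display of 1 cm side (10_000_000 nm), 1 bit per pixel.
    rate = required_bit_rate(10_000_000, 100, 1, 100)
    print(rate / 1e12, "Tbit/s")
```

With these assumed values a full rewrite of the display at every update already requires a raw rate of the order of one terabit per second, which illustrates why partial updates and chunking of the input data 101 are attractive.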
Means 115 for detecting patterns contained in the stream of input data 101 may receive and analyze streamed input data 101 (e.g. primitives contained therein or their interpreted counterparts, i.e. their decoded representations), and in response thereto, may send instructions to one or more control chains. Non-limiting examples of such instructions are, among others, instructions for updating the sequences of control variables applied and stored at the data switches 105 or instructions to local computation means for repeating a postprocessing step of already distributed input data, but with updated parameters. Patterns which are analyzed and detected by the pattern detection means 115 may encompass translations, rotations, and scale transforms of the whole 3D scene to be displayed or only parts thereof. Detection of these patterns is useful as the primitives change in a deterministic way under these patterns, e.g. under translations (e.g. moving object of a 3D scene) or rotations (e.g. rotating object of a 3D scene). A modification of the already transferred input data by virtue of local postprocessing/re-computation may be more energy efficient under these circumstances as compared to starting a complete new cycle of input data reloading and redistribution. While above described patterns are all implying functional transformations of the input data 101, also dynamic patterns may be analyzed and detected. The dynamic patterns are concerned with temporal aspects of input data 101 distribution, for instance, at which rates data output electrodes 103 are updated. 
Under normal conditions it is expected to have display regions which have quickly evolving image or scene content requiring frequent updates, meaning frequent updating/overwriting of input data 101 to the data output electrodes 103 corresponding to these regions, whereas other regions of the display may have image or scene content which is slowly evolving and thus requires less frequent distribution and overwriting of input data 101. That is, the plurality of data output electrodes 103 are only partially updated at every display refresh cycle. Therefore, means of detecting patterns 115 may also be configured for detecting quasi-static patterns in the input data 101 updates over some period of time. For these quasi-static input data 101 upload patterns the control chains of the control plane do not undergo an updating action, which enables a more energy-efficient use of the control plane. For this purpose, the means for detecting patterns may compare how quickly detected attributes of a primitive for update change or detect if a whole new primitive is updated. A stable input data 101 upload pattern detected over some segmented, non-overlapping region of the display is typically associated with a cluster of data output electrodes 103 and a constant sequence of control variables stored on the data switches 105. Hence, the input data 101 update rates of one cluster are tied together as long as the quasi-static pattern persists (and a sequence of control variables is not renewed), which may be an approximation to the actual input data 101 update rates of the streamed input data 101. However, this approximation is allowed as long as the neglected deviations would not contribute or not critically contribute (e.g. by significantly affecting the image quality) to a change in the reconstructed 3D scene if they were accounted for in every refresh cycle.
Ultimately, the means for detecting patterns 115 may also detect input data 101 which is identical to the one previously transferred to a particular data output electrode 103. This also applies within the detected quasi-static patterns. In this case input data 101 is preferably not distributed again for overwriting the old one and also local postprocessing is unnecessary. This is achieved by keeping the respective data switches 105 closed once the reusable input datum has been transferred for the first time, i.e. the respective data switches 105 (e.g. high threshold voltage transistor devices) act like a pass gate.
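The detection of reusable input data described above can be sketched as a simple comparison between the data already held at the data output electrodes and the newly received data. The function name `electrodes_to_update`, the representation of data as plain numbers, and the optional `tolerance` parameter are hypothetical and serve illustration only; a tolerance of zero corresponds to the case of identical data.

```python
def electrodes_to_update(previous, incoming, tolerance=0):
    """Return indices of data output electrodes whose datum must be rewritten.

    previous  -- data currently stored per electrode (None if never written)
    incoming  -- newly received data per electrode
    tolerance -- hypothetical similarity bound; 0 means "identical only"
    """
    return [
        i
        for i, (old, new) in enumerate(zip(previous, incoming))
        if old is None or abs(new - old) > tolerance
    ]


if __name__ == "__main__":
    stored = [3, 5, None, 7]
    received = [3, 6, 1, 7]
    print(electrodes_to_update(stored, received))
```

Only the electrodes returned by such a comparison would need a transfer; for all others the previously transferred datum is simply retained.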
Pattern detection may be applicable to input data 101 update events at each data output electrode 103, or to updating events of a cluster of data output electrodes 103 corresponding to clustered regions of a connected display addressed thereby. Detected patterns for the input data 101 update events may be subject to a thresholding operation, deciding whether or not the detected pattern shall be used as an input for local postprocessing of already distributed input data 101. If not used to initiate local postprocessing, it may trigger the reloading and redistribution of new input data 101. A rapidly moving object in a 3D scene, for instance, would require a more frequent input data 101 updating, e.g. fresh input data to be distributed, as compared to a slowly rotating object of a 3D scene for which there is time enough to reprocess the already distributed input data 101 still present at one or several clusters of data output electrodes 103, e.g. clusters of data output electrodes whose dominant solid angles optimally support the rotating 3D object of the scene. As a consequence, control variables belonging to one or more control chains that steer the distribution of input data 101 are only updated if strictly necessary in order to ameliorate energy efficiency. The necessity criterion generally depends on the target application and/or desired image quality. As a result, the system for distributing data 100 is adapting dynamically to a more efficient way of input data 101 distribution. If a new sequence or new sequences of control variables become necessary, e.g. in response to a detected pattern beyond a threshold value or no detected patterns, those are determined by the control plane and provided to the first delay line 111 for injection (or first delay lines if several control chains are involved). 
Updating of the control variables at respective data switches 105 is achieved by simultaneously generating and injecting into the second delay line 112 an enable sequence, e.g. a travelling “one” (single pulse). The means for generating the sequences of control variables 113 and the means for generating the sequences of enable variables 114 may be included in the control plane structure of the data distributing system 100, but may also be provided as external sequence generating means, e.g. as programmable bit pattern generators, FPGAs, or other computing hardware implementations.
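The thresholding operation described above may be pictured, in a strongly simplified form, as a three-way decision per cluster. The sketch below is an assumption-laden illustration and not the decision logic of any particular embodiment: the scalar `pattern_magnitude` (for instance a rotation speed of a detected object), the `threshold` value, and the names `Action` and `choose_action` are all hypothetical.

```python
from enum import Enum


class Action(Enum):
    KEEP = "keep stored data"
    POSTPROCESS = "local postprocessing of stored data"
    RELOAD = "reload and redistribute new input data"


def choose_action(pattern_magnitude, threshold):
    """Pick an update strategy for one cluster of data output electrodes.

    pattern_magnitude -- hypothetical scalar strength of the detected
                         pattern, or None if no pattern was detected
    threshold         -- application-dependent necessity criterion
    """
    if pattern_magnitude is None:
        return Action.RELOAD          # no pattern: fresh data is needed
    if pattern_magnitude == 0:
        return Action.KEEP            # nothing changed in this cluster
    if pattern_magnitude <= threshold:
        return Action.POSTPROCESS     # slow change: reprocess locally
    return Action.RELOAD              # fast change: redistribute


if __name__ == "__main__":
    for magnitude in (None, 0, 0.2, 5.0):
        print(magnitude, "->", choose_action(magnitude, threshold=1.0).value)
```

Only the RELOAD outcome would require new sequences of control variables and a corresponding enable sequence to be injected into the delay lines.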
An example of a rapid change of input data 101, corresponding to fast input data distribution and update rates at the data output electrodes 103 (and corresponding pixel elements 110 of the display), is a complete scene change in the transferred associated data, e.g. video data. Moving objects in the foreground of a static scene or sudden texture changes in slowly moving and static objects are other non-limiting examples of video content data that induce a change in input data 101 update rates (if no local postprocessing is available) and/or allow for a segmentation/clustering of the holographic display surface, and hence of the underlying data output electrodes 103, into regions of high input data 101 update rates, e.g. triggered by a moving object or a texture change requiring update rates of the order of 100 frames per second (fps), and regions of moderate or low input data 101 update rates, e.g. triggered by static backgrounds or slowly moving objects requiring update rates significantly less than 100 fps or no updating at all. These clustered regions may be supported by rectangular sub-matrices of the larger matrix of pixel elements 110 of the connected display, or may be supported by circular sections of the larger matrix of pixel elements 110. The skilled person will know that other choices are possible. Small imperfections in the reconstituted 3D scene may be tolerable. Therefore, some regions may not require updating if the 3D scene is partially modified, for instance, regions whose dominant solid angles support parts of the 3D scene that do not change during the modification.
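The segmentation into regions of high and of low update rates can be illustrated by counting, for each data output electrode, how often its datum changes over a sequence of frames. The sketch below is illustrative only; the function names `update_rates` and `segment`, the list-of-lists frame representation, and the threshold of 50 updates per second are assumptions.

```python
def update_rates(frames, frame_rate_fps):
    """Estimate the update rate (changes per second) of each electrode.

    frames -- list of at least two equal-length lists, one datum per
              data output electrode and frame
    """
    if len(frames) < 2:
        raise ValueError("at least two frames are needed")
    n = len(frames[0])
    changes = [0] * n
    for prev, cur in zip(frames, frames[1:]):
        for i in range(n):
            if prev[i] != cur[i]:
                changes[i] += 1
    intervals = len(frames) - 1
    return [c * frame_rate_fps / intervals for c in changes]


def segment(rates, high_threshold):
    """Split electrode indices into a high-rate and a low-rate region."""
    high = [i for i, r in enumerate(rates) if r >= high_threshold]
    low = [i for i, r in enumerate(rates) if r < high_threshold]
    return high, low


if __name__ == "__main__":
    frames = [[0, 0, 0], [1, 0, 0], [2, 0, 1], [3, 0, 1], [4, 0, 1]]
    rates = update_rates(frames, frame_rate_fps=100)
    print(rates, segment(rates, high_threshold=50))
```

In this toy input the first electrode changes at every frame, the second never changes, and the third changes once, so that only the first electrode falls into the high update rate region.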
Hence, it is possible to study a representative ensemble of streams of input data 101, including large varieties of video scenes and image contents, and to profile the update rates of addressed data output electrodes 103, which, if connected to their respective pixel elements 110 of a holographic display, would recreate the video scene or image content once it has been successfully distributed. This profiling happens during design-time of the data distributing system 100. It may, for example, comprise the recording of histograms which are informative on the distribution of update rates across the plurality of data output electrodes 103 for a given scene scenario. As a result of this design-time profiling, e.g. by emulation or simulation of the data distributing system 100, clusters of data output electrodes 103 can be identified that have similar input data 101 update rates for a given scenario. The display plane, and hence the plurality of underlying data output electrodes 103 is therefore partitioned into a plurality of disjoint clusters. Each of the so identified clusters of data output electrodes 103 may be provided with a separate control chain in the control plane of the data distributing system 100. A control chain provides the necessary system infrastructure to realize the changes in the update rates. Since changes in the update rates correspond to updates in the control variables of the data switches 105, a single control chain provides all the elements of the control plane discussed so far, i.e. first and second delay lines 111, 112, and a plurality of control switches 104. However, the control plane as a whole may comprise a plurality of control chains, because the control plane as a whole is controlling all the clusters identified. An exemplary embodiment of the invention as shown in
A display comprising pixel elements 110 is connectable to or may be included, for some embodiments of the invention, in the data distributing system 100 such that the pixel elements 110 of the display are electrically coupled to respective data output electrodes 103 when the data distributing system 100 is operative. The pixel elements 110 of the display may be formed as electronically controllable cells comprising a phase change material, but are not limited thereto. Liquid crystal materials, electro-optic materials, actively controlled light emitting diodes are other non-limiting examples of electronically controllable pixel elements 110, any particular choice depending on the display type and application. An advantage of phase change material pixel elements 110 is given by the fact that they do not require a constant power supply to remain in their current state; a power source powering the phase change material pixel elements 110 may effectively be disconnected, yet the pixel elements 110 will remember and remain in their latest applied state. Therefore, any suitable memory material may be used for the pixel elements 110 of a holographic projector display if it provides sufficient interaction with light incident on the display, e.g. changing the optical phase, angular distribution, and/or amplitude of incident light via diffraction, reflection, absorption, or combinations thereof. In preferred embodiments of the invention, the display comprising the pixel elements 110 is an integral part of the data distributing system 100, e.g. is formed on top of a semiconductor substrate in which the data distributing system 100 is laid out. 
For example, the pixel elements 110 are formed as cells, comprising for example a suitable phase change material, which are deposited and patterned on top of a semiconductor substrate such as a silicon substrate which includes the electronic control structures for addressing the individual cells and also includes the data distributing system, e.g. in a vertical back end of line (BEOL) stack. This has the advantage that a compact, fully integrated holographic display and data distributing system 100 can be obtained which is also wafer processable, thus well-suited for mass manufacturing at reduced costs and high repeatability. Alternatively, it is possible to provide the data distributing system 100 and the display separately and connect them via suitable connecting means such as wires. Still other embodiments may provide pixel elements 110 at a bottom surface of a semiconductor substrate, e.g. by connecting them by TSVs with the electronic control structures and BEOL stack. In preferred embodiments of the invention, the pixel elements 110 are characterized by very small lateral dimensions, for instance a single pixel element 110 may be as small as 100 nanometers in both lateral dimensions or may be even smaller. Pixel elements 110 of reduced lateral dimensions are particularly useful for holographic displays or other stereoscopic displays that accommodate a larger range of viewing angles, e.g. a full 180-degree viewing angle is achieved even with 400 nm blue light if pixel elements 110 are designed smaller than 200 nm in their lateral dimensions, e.g. designed to have 100 nm in both lateral dimensions. A reduced area occupied by each pixel element 110 also allows the design of more compact displays with a reduced wafer die area necessary for each display, which may further decrease manufacturing costs.
Alternatively, a given die area for a display may be filled with more pixel elements 110 which is favorable for an increased display aperture yielding better display resolution. In preferred embodiments of the invention, a display comprises at least one megapixel.
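The relation between pixel size and viewing angle mentioned above is consistent with the usual grating estimate sin(θ_max) = λ / (2p), where λ is the wavelength and p the pixel pitch, giving a full viewing angle of 2·θ_max. The following sketch evaluates this estimate; it assumes that the pixel pitch equals the lateral pixel dimension, and the function name `full_viewing_angle_deg` is hypothetical.

```python
import math


def full_viewing_angle_deg(wavelength_nm, pixel_pitch_nm):
    """Full diffraction-limited viewing angle 2*asin(lambda / (2 * pitch)).

    The argument of asin is clamped to 1, i.e. pitches at or below half
    the wavelength are treated as giving the full 180 degrees.
    """
    s = wavelength_nm / (2.0 * pixel_pitch_nm)
    return 2.0 * math.degrees(math.asin(min(1.0, s)))


if __name__ == "__main__":
    for pitch in (400, 200, 100):
        print(pitch, "nm ->", full_viewing_angle_deg(400, pitch), "degrees")
```

For 400 nm light the estimate reaches 180 degrees at a pitch of 200 nm and stays there for smaller pitches, whereas a pitch of 400 nm yields only about 60 degrees.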
In some embodiments of the invention, the data switches 105 and/or control switches 104 may be provided as microelectronic transistor devices. For example, the data switches 105 and/or control switches 104 may be provided as pass transistors. This benefits data distributing systems 100 in that fewer transistors are required to build logic gates, e.g. the pass transistor switches, hence reducing energy per area overhead as well as circuit and routing complexity. In some embodiments of the invention, each data switch and/or each control switch may correspond to exactly one transistor device (typically an n-MOS transistor device) controlling the transfer of input data 101 from a data input electrode 102 to a data output electrode 103. In other embodiments of the invention, each data switch and/or each control switch may correspond to more than just one transistor device, for instance it may be, but is not limited thereto, a full complementary n-p CMOS switch, for controlling the transfer of input data 101 from a data input electrode 102 to a data output electrode 103.
In particular embodiments of the invention, the data switches 105 and/or control switches 104 are provided as cheaply fabricated thin film transistors (TFT). This has the advantage that TFT technology can be used to vertically stack many TFT layers integrated between the metallization layers of a BEOL process, thus achieving up to ten vertically stacked TFT device layers or more. Accordingly, it is possible to obtain a reduced area for each pixel element of the display and, at the same time, provide the distributed electronic pixel control logic and optional data postprocessing logic locally, at a per pixel basis. In contrast to conventional planar CMOS technology, the energy per area overhead due to the up-and-down routing between die stacks, by means of through-silicon vias (TSV), is avoided, even though a typical TFT node may be scaled to 35 nm to 40 nm. In particular embodiments of the invention, the control switches 104 are high threshold voltage, high impedance devices, e.g. tunnel FETs or TFT devices. This has the advantage of realizing low power, low leakage devices. It is particularly advantageous to use high threshold voltage, high impedance TFT devices, for instance TFT devices implemented with Indium-Gallium-Zinc-Oxide (IGZO) materials or with suitable 2D materials such as graphene, MoS2, etc. The term IGZO encompasses all realizable varieties of the compound InxGayZnzOw in terms of the values of the atomic numbers x, y, z, and w, for example In2Ga2ZnO. However, embodiments of the invention are not limited to the devices that combine a high impedance characteristic with a high threshold voltage. Alternative embodiments may implement devices with only high impedance or devices with only high threshold voltages.
For embodiments of the invention in which the control switches 104 are implemented as high threshold voltage, high impedance TFT devices, very low leakage currents for a charge stored on the TFT device, e.g. a charge stored on the gate of a pass-gate thin film transistor, can be realized. A single TFT device implemented with IGZO materials, for example, may have leakage currents not higher than 10 fA, e.g. between 1 fA and 10 fA, at threshold voltages that may be below 2 V. Consequently, the control variables may be preserved for days or even weeks, whereas a typical CMOS device would require a refreshing action of the leaked charge in a regular time interval on the order of seconds, even for a static scene or static portion of a scene which would not require any updating action. It is an advantage of such embodiments of the present invention that the control plane, which contains most of the devices, does not need to be updated often. This is realistic for practical scene updates. User-defined quality measures for similarity between subsequent update values may further limit refreshing actions of the control plane and increase the time intervals during which control variables are stored. The data switches 105 are preferably TFT devices having lower threshold voltages, e.g. significantly below 2 V, e.g. below 1 V, and may be implemented with IGZO material as well. A decrease in the threshold voltage for TFT devices as data switches 105 according to some embodiments of the invention is acceptable if the gain in switching frequency is appreciable, for example data switches 105 having switching frequencies greater than 1 MHz, for instance larger than 10 MHz, e.g. switching frequencies of 100 MHz. Data switches 105 operating at faster switching rates are also possible for transferring input data 101 even more rapidly, but lead to an increase in power consumption.
It is generally preferred to not maximize switching frequencies but to exploit parallelism in the data distributing system 100 instead, e.g. by providing a plurality of chains that operate in parallel, i.e. that transfer and update input data 101 at data output electrodes 103 that correspond to pixel elements 110 belonging to distinct regions of the display. However, for some embodiments of the invention, speed requirements for the data switches 105 prevail, e.g. in applications that demand the highest holographic image quality, e.g. at input data 101 streaming rates larger than about 50 frames per second (fps), e.g. 72 fps for three-color, 24 fps video quality. For those cases, the threshold voltage of the data switches 105 may be lowered to achieve the higher switching frequencies. In contrast thereto, more energy-sensitive applications demand lower input data update rates, e.g. corresponding to input data streaming rates of about 30 frames per second or lower, and in return accept a somewhat lower holographic image quality. For this case embodiments of the invention may implement high threshold data switches 105, e.g. having threshold voltages greater than 1V, greater than 2V, or greater than 5V, e.g. 10 V, depending on factors such as the device stack or material choices. As a result thereof, a transferred projection input datum 101 will be efficiently stored on a data output electrode 103, e.g. as a stored charge value, as long as the relevant data switch 105 that connects to that data output electrode 103 is interrupting/blocking and has low leakage characteristics. Implementing the data switches 105 with high threshold voltage CMOS devices (e.g. high-k oxide CMOS), for example, may cause a charge to be stored on the connected data output electrode 103 for several seconds after the CMOS data switch is switched off. 
This is advantageous in embodiments of the invention which exploit redundancy or repetition of information in the stream of received input data 101. Indeed, if a received projection input datum 101 for transfer to a particular data output electrode 103, or pixel element 110 connected thereto, is identical or similar enough to the projection input datum 101 previously transferred to that data output electrode 103 or connected pixel element 110, it may be more energy-efficient to detect these reuse/repetition patterns and decide not to re-transfer the datum. Hence, in embodiments of the invention that are adapted for such energy-efficient input data 101 reuse, high threshold data switches 105 ensure that a previously transferred projection input datum 101 does not leak away from a particular data output electrode 103. It is expected that typical input data 101 streams show a large amount of these repetitive patterns given the temporal correlations between frames in typical video projection datasets.
In alternative embodiments of the invention, planar CMOS technology may be used to implement the transistor devices of the data switches 105 and/or control switches 104, and active device dies may be stacked vertically by means of TSV technology. This has the advantage that the very advanced technology nodes, e.g. below 14 nm, e.g. a 10 nm node, may lead to very compact devices and dense logic. Although
The system for distributing data 100 of
For the particular embodiment shown in
In a second aspect the invention relates to a 3D light field projection device, e.g. a holographic display, which comprises a data distributing system of the first aspect. The 3D light field projection device also includes a display having pixel elements with the pixel elements being coupled to the data output electrodes of the data distributing system. A plurality of pixel elements are arranged on a display surface, which preferably is a planar surface. However, for some embodiments relating to the projection device it may be useful to arrange the plurality of pixel elements on a flexible substrate. This has the advantage that the display may be mounted or removably attached to an uneven, non-planar support structure, and also withstands higher flexural strain. The display and the pixel elements of the display may share some or all of the properties already described in respect of previous embodiments relating to the first aspect of the invention, in particular in respect of the description relating to
For some embodiments of the projection device, the data distributing system may be optimized to work with a particular display or a particular region of a display as explained in more detail hereafter.
As each data switch enables the transfer of input data to a particular data output electrode, and hence to a particular pixel element or group of pixel elements of the display connected thereto, the ordering of data switches, driven by control sequences of a control chain, leads naturally to an ordering of the addressed pixel elements or group of pixel elements on the display surface. Connecting the geometric centers of addressed pixel elements in that order results in a curve that is defined and restricted to the display surface. Addressed groups of pixel elements are hereby considered as a block pixel or macro-pixel entity being represented by only one point on the curve. As a result, each cluster of data output electrodes, and connected pixel elements, is provided with at least one control chain. Usually one control chain per cluster will be provided, but if the control chain becomes too long (e.g. in terms of latency for updating pixel elements), it needs to be broken up into a plurality of control chains. The control chain(s) associated to a cluster control the update rates of pixel elements in that cluster and a change of update rates in that cluster is performed sequentially along a curve connecting (block) pixel elements of that cluster. Therefore, it is also possible to design particular chains such that they exhibit desirable update sequence shapes (fine granularity) within a cluster.
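The curve obtained by connecting the addressed (block) pixel elements in the order of their data switches can be illustrated for one possible, purely hypothetical ordering, namely a serpentine (boustrophedon) scan of a rectangular cluster. The function names `serpentine_order` and `steps_are_adjacent` and the (row, column) coordinates are assumptions; other orderings and cluster shapes are equally conceivable.

```python
def serpentine_order(rows, cols):
    """Order (row, col) pixel positions of a rectangular cluster row by row,
    reversing direction on every other row so the curve never jumps."""
    order = []
    for r in range(rows):
        col_range = range(cols) if r % 2 == 0 else range(cols - 1, -1, -1)
        order.extend((r, c) for c in col_range)
    return order


def steps_are_adjacent(order):
    """Check that consecutive points on the curve are grid neighbours."""
    return all(
        abs(r1 - r0) + abs(c1 - c0) == 1
        for (r0, c0), (r1, c1) in zip(order, order[1:])
    )


if __name__ == "__main__":
    curve = serpentine_order(2, 3)
    print(curve, steps_are_adjacent(curve))
```

For this particular ordering every step of the curve connects two directly adjacent pixel positions, so that a change of update rates propagating along the control chain sweeps the cluster without jumps.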
For some embodiments of the present invention, the sequences of control variables and enable variables are precomputed (off-line calculation) during system design time, e.g. a limited set of instruction sequences is created and used during system use. For other embodiments of the present invention, the sequences of control variables and enable variables are generated at system run-time with the benefit of achieving more flexibility and of not having to build a fully predefined instruction set at design-time any longer. Based on design-time profiling, a proper set of run-time seed scenarios grouping the most likely to occur input data sets (streams) and their corresponding sequences of control and enable variables is determined such that the granularity (defining “shapes” of update sequences) thereof is much smaller than the final grain size of control and enable variable sequences used by the control plane during system run-time. At run-time, starting with the available seed scenarios of likely input data sets to occur, combinations thereof may be formed such that larger composite ‘likely’ input data sets and related control/enable variable sequences will develop and which, in consequence, will cover larger surfaces or volumes of the 2D or 3D holographic transducer, respectively. This may be achieved by a run-time decision engine which explores the most promising composites and, based thereon, performs the final implementation of both the control/enable variable sequences for the control plane and the ‘likely’ data sets for the data plane. This means that the streamed input data is approximated by a combination of seed scenario data sets. Additionally, particular embodiments of the present invention may also decide on design-time rules/conditions which govern the way the primitive seed scenario clusters can be combined into the composites, whereby the amount of exploration effort and time that has to be spent at run-time is further limited.
In some embodiments of the invention, a plurality of control chains is thus corresponding to a plurality of such curves, and preferably, but not limited thereto, the curves are non-intersecting on the display surface. This means that such curves do not cross themselves, nor do they cross other curves on the display surface. However, the skilled person will appreciate that the non-intersecting curves are not limiting embodiments of the invention, as known 3D BEOL stack technology does also allow for intersecting curves to be designed if appropriate for the desired application. In the same or other embodiments, such curves connect neighboring (block) pixel elements on the display surface. Here neighboring (block) pixel elements refers to (block) pixel elements that are nearest neighbors. An example therefore is shown in
While the invention has been illustrated and described in detail in the drawings and foregoing description, such illustration and description are to be considered illustrative or exemplary and not restrictive. The foregoing description details certain embodiments of the invention. It will be appreciated, however, that no matter how detailed the foregoing appears in text, the invention may be practiced in many ways. The invention is not limited to the disclosed embodiments.
Other variations to the disclosed embodiments can be understood and effected by those skilled in the art in practicing the claimed invention, from a study of the drawings, the disclosure and the appended claims. In the claims, the word “comprising” does not exclude other elements or steps, and the indefinite article “a” or “an” does not exclude a plurality. The mere fact that certain measures are recited in mutually different dependent claims does not indicate that a combination of these measures cannot be used to advantage. Any reference signs in the claims should not be construed as limiting the scope.
Number | Date | Country | Kind |
---|---|---|---|
18176173.5 | Jun 2018 | EP | regional |

Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2019/064374 | 6/3/2019 | WO | 00 |