The present invention is directed to methods of and systems for information processing, information mapping, pattern recognition and image analysis in computer systems.
With the increasing proliferation of imaging capabilities, information transactions in computer systems increasingly require the identification and comparison of digital images. In addition to conventional viewable digital images, other types of information, both viewable and non-viewable, are subject to pattern analysis and matching. Image identification and pattern analysis/recognition usually depend on analysis and classification of predetermined features of the image. Accurately identifying images using a computer system is complicated by relatively minor data distorting the images or patterns, resulting from changes caused when, for example, the images are shifted, rotated or otherwise deformed.
Object invariance is a field of visual analysis which deals with recognizing an object despite distortion such as that caused by shifting, rotation, other affine distortions, cropping, etc. Object invariance is used primarily in visual comparison tasks. Identification of a single object or image within a group of objects or images also complicates the image identification process. Selective attention, or “priming”, deals with how a visual object can be separated from its background or other visual objects comprising distractions.
Current pattern recognition, image analysis and information mapping systems typically employ a Bayesian Logic. Bayesian Logic predicts future events through the use of knowledge derived from prior events. In computer applications, Bayesian Logic relies on prior events to formulate or adjust a mathematical model used to calculate the probability of a specific event in the future. Without prior events on which to base a mathematical model, Bayesian Logic is unable to calculate the probability of a future event. Conversely, as the number of prior events increases, the accuracy of the mathematical model increases as does the accuracy of the resulting prediction from the Bayesian Logic approach.
Currently, two common paradigms accommodate some degree of distortion (i.e., image deformation) of a visual object: point-to-point mapping and high order statistics. Point-to-point mapping, or matching with shape contexts, achieves measurement stability by identifying one or more sub-patterns within the overall patterns or images being compared. Once these sub-patterns are identified, the statistical features of the sub-patterns are compared to determine agreement between the two images. Point-to-point mapping methodologies are further described in “Matching with Shape Contexts” by Serge Belongie and Jitendra Malik, presented in June 2000 at the IEEE Workshop On Content-based Access of Image and Video Libraries (CBAIVL). A second method of point-to-point mapping uses what/where networks and assessments of Lie groups of transformations based on back propagation networks. In this method an optimal transformation is identified for a feature in a first image and is used to compare the same feature in a second image. This approach deconstructs the image into a sum or a multiplicity of functions. These functions are then mapped to an appropriately deconstructed image function of a compared, or second, image. What/where networks have been used by Dr. Rajesh Rao and Dana Ballard from the Salk Institute in La Jolla, Calif. The point-to-point mapping techniques described attempt to map a test or input image to a reference or target image that is either stored in memory directly or is encoded into memory. The point-to-point approach achieves limited image segmentation and mappings through the use of a statistical approach.
In the high order statistical approach both the original input image and the compared target image are mapped into a high dimensional space and statistical measurements are performed on the images in the high dimensional space. These high order statistical measurements are compared to quantify an amount of agreement between the two images indicative of image similarity. This approach is used by Support Vector Machines, High Order Clustering (Hava Siegelmann and Hod Lipson) and Tangent Distance Neural Networks (TDNN). Support Vector Machines are described by Nello Cristianini and John Shawe-Taylor, ISBN 0-521-78019-5.
Both the point-to-point mapping and the high order statistics approach have been used in an attempt to recognize images subject to various transformations due to shifting, rotation and other deformations of the subject. These approaches are virtually ineffective for isolating a comparison object (selective attention) from the background or other visual objects.
In contrast to these two common paradigms, the human brain may compare two objects or two patterns using “insight” without the benefit of prior knowledge of the objects or the patterns. A Gestalt approach to comparing objects or comparing patterns attempts to include the concept of insight by focusing on the whole object rather than individual portions of the object. Gestalt techniques have not been applied to computer systems to perform pattern recognition, image analysis or information mapping. Gestalt mapping is further described in Vision Science: Photons to Phenomenology by Stephen E. Palmer, ISBN 0-262-16183-4.
According to one aspect of the present invention, a method of comparing an input pattern with a memory pattern comprises the steps of loading a representation of said input pattern into cells in an input layer; loading a representation of said memory pattern into cells in a memory layer; loading an initial value into cells in intermediate layers between said input layer and said memory layer; comparing values of cells in said intermediate layers with values stored in cells of adjacent layers; updating values stored in cells in said intermediate layers based on said step of comparing; and mapping cells in said memory layer to cells in said input layer.
The following terms and their definitions are provided to assist in the understanding of the various embodiments of the present invention.
is the Christoffel symbol. The General Relativistic energy function is a model of the cosmos as a transformation engine in equilibrium.
such that i is the index of the ith dimension of the vectors V and U.
One embodiment of the present invention provides a “transformation engine” that incorporates a Gestalt approach to image comparisons, pattern recognition and image analysis. The present invention may be used to create a point-to-point mapping or a point-to-point map between an input pattern and a stored pattern (for convenience of explanation together referred to as two visual objects although applicable to other data having or representative of a pattern). This point-to-point mapping may be implemented by performing a transformation between the two visual objects and creating a multi-layered gradual transition between the input pattern and the stored or memory pattern. The point-to-point mapping is performed by Resonance as defined hereinabove. Thus, Resonance is used to construct a transformation between the two visual objects using a multi-layered gradual transition. This multi-layered entity according to one embodiment of the invention constitutes a matrix formed of a plurality of layers. Each layer, within the multi-layered approach, maintains the topological features of each of the two visual objects. That is, intermediate layers maintain and exhibit characteristics common to the two visual objects such that significant features of the pattern are recognizable. As the number of layers increases, the difference between two adjacent layers is reduced. While this embodiment of the present invention does not explicitly minimize the global “energy” function, the embodiment may achieve a Gestaltist energy minimization of a transformation between the two visual objects.
Energy minimization is a consequence of the structure of the transformation engine of the embodiment. Energy minimization may be effected by an information “smearing” procedure, which causes a migration of data through the matrix to achieve a minimized energy potential between layers. Transitions between layers may be created by providing “waves” of modifying functions. Two waves, a top down wave and a bottom up wave, may be used to create a Gestalt transformation between the two visual objects. The input pattern may be stored in the input layer or the short term memory (STM) layer and the stored “reference” or “target” pattern may be stored in the memory layer or the long term memory (LTM) layer. The transformation engine of the present invention achieves the point-to-point mapping between the two visual objects without either directly or indirectly calculating statistical values. Once the point-to-point association between the two visual objects is achieved, stable statistical measurements may be calculated to determine the amount of agreement between the two visual objects. This embodiment of the present invention may be used for comparing patterns of visual objects (a comparison mode) and for isolating an object from the surrounding background or other objects (selective attention task).
The transformation engine of the present invention includes or operates in two phases. In the first phase the transformation engine builds a map between the LTM (referenced or target image) pattern and the STM (input pattern under examination) pattern. In the second phase, the transformation engine uses “vibration” waves to turn the map created into either an explicit point-to-point mapping or into an explicit pattern recognition engine. The vibration waves cause the intermediate layers to maintain a low energy state while encouraging conformance to LTM and STM patterns based on distance from such.
The map created by the transformation engine may be created in a number of ways, each of which is included in the present invention. In one embodiment the map may be created by digital means, e.g., a digital computer. In another embodiment the map may be created by physical means, i.e., incorporating a medium responsive to patterned conditions so as to provide a transition between patterns, such as by placing elongated pieces of conducting material within an electric field such that the conducting material becomes aligned within the electric field. Other means of creating the mapping from the LTM pattern to the STM pattern are also included within the present invention.
During the second phase of operation of the transformation engine, vibration waves cause the values in the intermediate layers between the LTM and the STM layers to oscillate, which prevents the transformation engine from permanently storing a local minimum of the corresponding Action Integral within the intermediate layers. In other words, the transformation engine is aimed at eliminating the Variation of the (static) Action Integral:
Where LTM=P(X,Y,0) and STM=P(X,Y,1).
The elimination of the variation of the Action Integral means that the potential p maintains up to second order derivatives as the pattern that is encoded by p changes from one layer to the next adjacent layer. Convergence in a digital implementation, where neither a real potential nor its gradient (the induced vector field) exists, may be achieved through digital means, i.e., by use of a digital representation of the vector field using, for example, a mathematical model to be explained below. In a preferred embodiment, the transformation engine forces values to change in the intermediate layers through the use of a learning rate (R) that is typical of the concurrent neural networks approach. In this embodiment, if P(S) represents the value stored in an intermediate layer, and it is desired to change P(S) into a new value W, then the learning rate 0<R<1 defines the learning paradigm, Pnew(S)=(1−R)*Pold(S)+R*W. If R is close to 1 then the learning process is rapid but relatively unstable. Reducing this parameter from 0.995 to 0.835 when the number of layers is 128, along with reducing the Far Span k-Distance Connections (a), is a way to minimize the Action Integral. In a “real” physical system, i.e., one relying on some physical property of a component of the system, neither the learning rate nor the Far Span K-Distance Connection would be necessary.
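By way of illustration only, the learning paradigm Pnew(S)=(1−R)*Pold(S)+R*W may be sketched in Python as follows. The function names and the linear form of the rate schedule are assumptions made for this sketch; the description states only the end points 0.995 and 0.835 for 128 layers, not how the rate is lowered between them.

```python
from typing import List


def update_value(p_old: float, target: float, rate: float) -> float:
    """Blend a stored value toward a target: Pnew = (1 - R) * Pold + R * W."""
    if not 0.0 < rate < 1.0:
        raise ValueError("learning rate R must satisfy 0 < R < 1")
    return (1.0 - rate) * p_old + rate * target


def rate_schedule(start: float, end: float, steps: int) -> List[float]:
    """Hypothetical linear schedule lowering R from start to end.

    The linear shape is an assumption of this sketch, not taken from the text.
    """
    if steps < 2:
        return [start]
    return [start + (end - start) * i / (steps - 1) for i in range(steps)]


if __name__ == "__main__":
    value = 0.0
    for r in rate_schedule(0.995, 0.835, 5):
        value = update_value(value, 1.0, r)  # move the stored value toward W = 1.0
    print(value)
```

With R near 1 a single update almost replaces the stored value with W, which is consistent with the remark that a high rate is rapid but relatively unstable.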
A digital implementation of the comparison mode of the present invention will be described first.
In Step 103 data values representing some pattern to be identified (e.g., an input image) are stored in the input layer and copied into each level of the top or upper half of the multiple layers. Similarly, in Step 104 data values representing some previously stored pattern are stored in the memory layer and copied into each level of the lower or bottom half of the matrix. Steps 103 and 104 provide the cells of the inner layers of the matrix with initial data values to initialize the matrix. In Step 105 a top-down “wave” and a bottom-up “wave” are promulgated through the multi-layer matrix as further described below. When Step 105 is performed during a comparison mode, the values in the input layer and the memory layer are held constant, but the initial values stored in each of the inner layers of the matrix may be adjusted to provide a gradual transition of pattern data between layers. During this step the “intensity” of the waves is gradually reduced, i.e., the magnitude of change between layers is gradually reduced, as is the span of data checked as the method progresses towards convergence and energy minimization. This reduction in the intensity of the waves occurs in the first phase of the transformation engine when a map is constructed between the LTM and the STM pattern. Ebbing Resonance is also performed in Step 105 as further described below. The values in each of the layers are held constant in Step 106 and a one-to-one mapping is performed in Step 107 between the input layer and the memory layer.
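The initialization of Steps 103 and 104 may be sketched as follows for a one-dimensional engine. This is a minimal sketch under stated assumptions: the name init_matrix is hypothetical, layer index 0 is taken to be the input (STM) layer and the last index the memory (LTM) layer, and an even number of layers is assumed so that the matrix splits into equal upper and lower halves.

```python
from typing import List, Sequence


def init_matrix(input_pattern: Sequence[float],
                memory_pattern: Sequence[float],
                n_layers: int) -> List[List[float]]:
    """Build the multi-layer matrix with initial values.

    The upper half of the layers (starting at the input layer, index 0)
    receives a copy of the input pattern; the lower half (ending at the
    memory layer, last index) receives a copy of the memory pattern.
    """
    if n_layers < 2 or n_layers % 2 != 0:
        raise ValueError("this sketch assumes an even number of layers >= 2")
    if len(input_pattern) != len(memory_pattern):
        raise ValueError("both patterns must have the same number of cells")
    half = n_layers // 2
    upper = [list(input_pattern) for _ in range(half)]   # independent copies
    lower = [list(memory_pattern) for _ in range(half)]
    return upper + lower


if __name__ == "__main__":
    matrix = init_matrix([1.0, 2.0, 3.0], [7.0, 8.0, 9.0], n_layers=6)
    for layer in matrix:
        print(layer)
```

Each layer is an independent copy, so later adjustment of an inner layer does not alter the input or memory layer.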
Various methodologies may be used to perform this one-to-one mapping including sequentially sending a single wave for each input cell and determining, based on the single wave, the best matching memory cell. In this case, for each pixel of the STM input board/layer represented by (i, j) the value of the pixel (i,j) is oscillated or varied over a predefined range while the value for all other pixels (i.e., data elements of the pattern) of the STM board are held constant. This process is repeated serially for each one of the pixels of the STM board. The best matching memory cell using this methodology is the memory cell which receives the strongest vibration wave from the single wave applied to the input cell. Once the best matching memory cell is identified, the corresponding input cell is mapped to the memory cell. Alternatively, a coherent vibration wave may be sent from all of the cells in the input layer and spectral analysis of the wave may be performed to determine the associations between input cells and memory cells. This second methodology creates a vibration wave pattern in each of the memory cells which is a superposition of small ripple vibration waves. By measuring and performing spectral deconstruction on the resulting pattern in the memory layer, a Bayesian pattern may be identified.
Once each layer of the matrix stores initial values, Step 105 is performed and both top-down and bottom-up waves are passed alternately through the matrix. The top-down and bottom-up waves may modify the values stored in each of the cells to form a mapping between layers. (Note that, in a comparison mode, the values in the cells in the input layer and the memory layer remain constant through the comparison.) Also in Step 105, an ebbing resonance process is applied to the multi-level matrix. The ebbing resonance process is a relaxation process that achieves convergence via hierarchical zooming. In a preferred embodiment, both the intensity of the resonance and the radius at which differences between a cell and its neighboring cells are checked are gradually reduced. The complete ebbing resonance uses convolutions and time varying patterns in a top-down and a bottom-up approach alternately. The convolutions, as described more fully below, are instrumental in overcoming energy pits and process degeneration around, for example, local minima.
At the completion of Step 104 the present value of each cell is equal to the initial cell value as described previously, i.e., to the value stored in the closest top and bottom layer. Additionally, the determination of the future value of a cell is dependent on the position of the cell within the layer. For interior cells, those cells which are not positioned at the start or the end of the layer, the future value of the cell is dependent on a number of cells, e.g., three other cells contained in nearby positions of the adjacent row according to a one dimensional transformation engine as described below. For end cells, those cells in the first or last position of a layer, the future value of the cell is dependent on one other cell in a one dimensional transformation engine as described below. Note that the dependency of a cell on three other cells represents one embodiment of the present invention and that other dependencies are within the scope of the present invention.
For the bottom-up wave, the future value of an interior cell is determined by checking the present value in three closest cells in the adjacent layer above. In one embodiment of the present invention, when the multi-level matrix is operating in a comparative mode, the values of the top most row (the input layer) are not adjusted and continue to contain values representing the input pattern. Processing of the bottom-up wave therefore begins in the second layer. For each interior (i.e., non-end) cell in the second layer, the three closest cells in the first layer are examined to determine the cell in the first layer which has the value closest to the current value of the cell in the second layer. If “t” represents the cell index/location within a layer, and “s” represents the layer, Unit (s, t) may be used to represent the cell (or unit) in layer “s” in position “t”. Using this convention, the reference for cell 402 in
In the bottom-up wave, once the value for each cell in a layer has been determined using the three cells in the adjacent layer above, the values for the cells of the adjacent layer below are calculated. In other words, for the bottom-up wave, once the values for the cells in layer 2 are determined using the values of the cells in layer 1, the value for the cells in layer 3 are determined using the values of the cells in layer 2. This process continues until the values stored in the cells of layer 4, are used to update the values of the cells in the next to the bottom layer (layer 5 in
The top-down wave operates similarly. For the top-down wave, the future value of the cell is determined by examining the three closest cells in the adjacent layer below. In one embodiment of the present invention when in comparison mode, the values of the lowest-most row (the memory layer) are not adjusted and continue to contain values representing the memory pattern. The top-down wave therefore begins in the second from the bottom layer (layer 5 of
As described, the bottom-up wave travels from the layer below the input layer (or the second layer in
This alternating process completes one application of the bottom-up and top-down waves in the 6 row matrix of the present example. Note that bottom-up and the top-down waves must reach the middle layers in intermittent order. If there are n=2m layers then in one cycle the bottom-up wave reaches layer m first and only then does the top-down wave reach the m−1 layer. In the following cycle the top-down wave reaches the m−1 layer first and only then does the bottom-up wave reach the m layer.
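One bottom-up pass and one top-down pass through a one-dimensional matrix may be sketched as follows. Several points are assumptions of this sketch rather than statements of the description: the function names are hypothetical; a cell is moved toward the closest-valued neighbor using the learning rate R; end cells simply use whichever of the t−1, t, t+1 neighbors exist; ties are resolved in favor of the lowest index; and each wave is run to completion, so the alternation of the two waves at the middle layers is not modeled.

```python
from typing import List


def closest_neighbor(value: float, adjacent: List[float], t: int) -> int:
    """Index, among t-1, t, t+1 in the adjacent layer, of the closest value."""
    lo, hi = max(0, t - 1), min(len(adjacent) - 1, t + 1)
    return min(range(lo, hi + 1), key=lambda j: abs(adjacent[j] - value))


def _sweep(matrix: List[List[float]], order, source_offset: int, rate: float) -> None:
    for s in order:
        source = matrix[s + source_offset]
        updated = []
        for t, value in enumerate(matrix[s]):
            j = closest_neighbor(value, source, t)
            # Assumed update rule: Pnew = (1 - R) * Pold + R * W
            updated.append((1.0 - rate) * value + rate * source[j])
        matrix[s] = updated


def bottom_up_wave(matrix: List[List[float]], rate: float) -> None:
    """Start below the input layer; each layer reads the layer above it."""
    _sweep(matrix, range(1, len(matrix) - 1), -1, rate)


def top_down_wave(matrix: List[List[float]], rate: float) -> None:
    """Start above the memory layer; each layer reads the layer below it."""
    _sweep(matrix, range(len(matrix) - 2, 0, -1), +1, rate)


if __name__ == "__main__":
    m = [[0.0] * 3, [0.0] * 3, [10.0] * 3, [10.0] * 3]
    bottom_up_wave(m, 0.5)
    top_down_wave(m, 0.5)
    for layer in m:
        print(layer)
```

The input layer (index 0) and the memory layer (last index) are never written, matching the comparison mode in which those two layers are held constant.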
As described, each interior cell or unit is compared to three other cells or units in either a layer above (for the bottom-up wave) or a layer below (for the top-down wave). Thus, the cell is compared to the t−1, t and t+1 cells in the corresponding layer. For end cells either the t−1 or the t+1 cell is unavailable. For the t=1 cells, the t−1 cell in the layer above or below is unavailable and for the t=maximum (t=8 in
Note that using simple Euclidean distance for updating Unit (s, t) cannot guarantee convergence to a global minimum of the sum of square distances. A more efficient method of achieving the global minimum is by use of a Far Differential−Far Span K Distance Connection. The Far Differential−Far Span K Distance Connection describes the metrics necessary to achieve a global minimum. This procedure minimizes a more complex energy or distance function in order to implement complex visual comparison tasks. The Far Differential−Far Span K Distance Connection between Unit (s, t) and Unit (s, t+k) is Unit (s, t+k)−Unit (s, t). Similarly, the Far Differential−Far Span K Distance Connection between Unit (s, t−k) and Unit (s, t) is Unit (s, t−k)−Unit (s, t).
There is a gradual reduction in the intensity of the changes in the values stored in the cells during the application of the top-down and bottom-up waves and an ebbing resonance stage is reached. Ebbing Resonance is a relaxation process. The word “relaxation” as used herein has the same meaning as in non-linear optimization theory. Here, relaxation is accomplished by gradually reducing the Far Span K Distance Connections. While this reduction alone is insufficient to guarantee convergence, it provides additional stability to the procedure. In order to guarantee convergence using a full numeric solution, a power of logarithmic Radial Basic Decay function is used in a Hierarchical Zooming convolution and is applied to the input/STM layer and the memory/LTM layer simultaneously. In addition, the Convolution scale is also gradually reduced.
The Far Span K Distance Connections modify the Euclidean distance that is used by the top-down and bottom-up waves. For example, application of Far Span K Distance Connections causes the Euclidean distance |U(s,i)−U(s+1,i+1)| to become Distance(U(s+1,i+1))=|U(s,i)−U(s+1,i+1)|+a*|U(s,i+k)−U(s,i)−U(s+1,i+1+k)+U(s+1,i+1)|+a*|U(s,i−k)−U(s,i)−U(s+1,i+1−k)+U(s+1,i+1)|. During the ebbing resonance, the real number “a” is also gradually reduced with “k” and with the radius of the Hierarchical Zooming convolution. If a convolution tool is not available and the values in the cells or units are kept constant during the Ebbing Resonance, convergence may still be obtained by the use of a very high Resonance rate (0.995 for 128 layers) and gradually reducing the resonance rate, the “k” parameter and the “a” parameter. This procedure allows the resonance rate to be gradually reduced during Hierarchical Zooming.
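The modified distance may be sketched as follows, reading it as the plain difference between a cell in layer s and a candidate cell in layer s+1, plus two terms weighted by “a” that compare the far differentials at offsets +k and −k in the two layers. The function name is hypothetical, and skipping a far term whose offset falls outside either layer is an assumption of this sketch; the description does not state how layer boundaries are treated.

```python
from typing import Sequence


def far_span_distance(upper: Sequence[float], lower: Sequence[float],
                      i: int, j: int, k: int, a: float) -> float:
    """Distance between upper[i] (layer s) and lower[j] (layer s+1).

    base term : |U(s,i) - U(s+1,j)|
    far terms : a * |(U(s,i+/-k) - U(s,i)) - (U(s+1,j+/-k) - U(s+1,j))|
    In the example of the text the candidate index is j = i + 1.
    """
    distance = abs(upper[i] - lower[j])
    for step in (k, -k):
        in_upper = 0 <= i + step < len(upper)
        in_lower = 0 <= j + step < len(lower)
        if in_upper and in_lower:  # assumed boundary handling: skip the term
            far_upper = upper[i + step] - upper[i]
            far_lower = lower[j + step] - lower[j]
            distance += a * abs(far_upper - far_lower)
    return distance


if __name__ == "__main__":
    layer_s = [0.0, 1.0, 2.0, 3.0, 4.0]
    layer_next = [0.0, 1.0, 2.0, 3.0, 4.0]
    print(far_span_distance(layer_s, layer_next, i=1, j=2, k=1, a=0.5))
```

When the two layers have the same local shape around the compared cells, the far terms vanish and only the plain difference remains, which is how the connection penalizes matches that would distort geometry.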
Once the Ebbing Resonance process has been completed, update waves may be used to carry information between the layers of the matrix. For example, an update wave may be used to propagate index related information from the input layer, through each of the intermediate layers to the memory layer to create a point-to-point mapping between cells in the input layer and cells in the memory layer. Top-down and bottom-up update waves may be applied to the matrix to determine point-to-point mappings. In a one-dimensional transformation engine, each unit or cell checks three units in an adjacent layer (the layer above for bottom-up and the layer below for the top-down). One of the three units in the adjacent layer holds a value of x, y which is closest to the values of x, y of the current unit in Euclidean distance. An intrinsic index, which represents the unit in the adjacent layer which had the closest value, is then stored in the current unit. This process is repeated for each cell in the layer, and for each layer in the matrix. After each cell of each layer has been examined and an intrinsic index has been stored, the intrinsic index propagates from the input layer to the memory layer and vice versa.
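The propagation of intrinsic indices from the input layer to the memory layer may be sketched as follows. This is an illustrative sketch only: the function names are hypothetical, each cell is assumed to hold a single scalar value rather than an x, y pair, end cells use whichever of the three neighbors exist, and ties go to the lowest index.

```python
from typing import List


def intrinsic_indices(layer: List[float], adjacent: List[float]) -> List[int]:
    """For each cell t, the index among t-1, t, t+1 of the adjacent layer
    whose value is closest to the value of cell t."""
    indices = []
    for t, value in enumerate(layer):
        lo, hi = max(0, t - 1), min(len(adjacent) - 1, t + 1)
        indices.append(min(range(lo, hi + 1),
                           key=lambda j: abs(adjacent[j] - value)))
    return indices


def map_input_to_memory(matrix: List[List[float]]) -> List[int]:
    """Follow the stored indices layer by layer, from layer 0 (input)
    to the last layer (memory), giving one memory cell per input cell."""
    mapping = list(range(len(matrix[0])))
    for s in range(len(matrix) - 1):
        step = intrinsic_indices(matrix[s], matrix[s + 1])
        mapping = [step[m] for m in mapping]
    return mapping


if __name__ == "__main__":
    # A single bright cell shifted one position per layer.
    matrix = [[5.0, 0.0, 0.0, 0.0],
              [0.0, 5.0, 0.0, 0.0],
              [0.0, 0.0, 5.0, 0.0]]
    print(map_input_to_memory(matrix))
```

Because each layer can move an index by at most one position, the reach of the final mapping grows with the number of layers, which is one way the layer count influences the result.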
The final mapping is influenced by the number of layers contained in the matrix and the pattern degeneracy. Here a degenerated pattern is a function f that is defined on a domain Df such that there exists an open set Q such that the Gaussian curvature K of the manifold Dff on Q is 0.
In performing Selective Attention the background pixels of an image that are stored in the LTM memory layer are “turned loose.” They can converge to any value. This requires that the Convolution that is used in the Hierarchical Zooming process that is part of the Ebbing Resonance must be redefined.
Referring to
such that q1 and q2 are the charges of pixel1 and pixel2 respectively and d1 and d2 are the “horizontal” distance from the interacting point on the memory/LTM board. As r increases, the influence of d1 and d2 is reduced and the image on the memory board becomes blurred. The loose cells in this case are on the post memory layer.
Note that while the examples used to explain the invention used a matrix containing 6 layers with each layer including eight cells or units, the present invention may include a matrix with any number of layers with each layer containing any number of cells. As the number of layers increases, the process becomes finer grained, such that the difference in values stored from one cell to the corresponding cell of an adjacent layer is minimized.
Note also that while the examples used to describe the present invention were limited to a one-dimensional numeric transformation engine, other embodiments exist, including multi-dimensional matrices and data spaces that are within the present invention.
The embodiments of the present invention described thus far incorporate a digital means to produce the metamorphosis of the patterns from the LTM pattern to the STM pattern. Instead of using digital numeric calculations, another embodiment of the present invention uses electric fields to accomplish the time varying convolutions and the vibration waves used to complete the gradual metamorphosis. In this physical embodiment, the units that constitute the layers between the input/STM layer and the memory/LTM layer may be either passive (for example induced or constant dipoles) or active (for example CMOS components). In either case, the units that constitute the intermediate layers cannot be fixed in space. If passive units constitute the intermediate layers, these passive units cannot be spherical in shape, but must provide some form of asymmetry with respect to a desired physical characteristic, e.g., electric charge (dipole), magnetic orientation (North/South), etc. One requirement of the passive units is that they be able to point in any direction, i.e., free to orient in a position under influence of a field.
The resonance and the update waves, the “reduction of the intensity of the waves” (the Simulated Annealing) and the Far Span k-Distance Connections (Far Differentials) described thus far employ a numeric, digitally implemented means of achieving a gradual metamorphosis between the input STM pattern and the memory LTM pattern such that in every intermediate pattern the Topology (Shape) is preserved. That is, manipulation of data patterns produces a numeric representation of the metamorphosis.
In the digital means of achieving a transformation engine and other strongly cooperative systems, local coupling of degrees of freedom leads to scaling phenomena through a cascade effect which propagates throughout the entire system. The transformation engine approach generates new potentials (known as Hamiltonians) from older potentials by incrementally removing degrees of freedom during each iteration. At the completion of each iteration a Hamiltonian representing unchanged length scale interaction remains.
The present invention also includes a physical transformation engine which includes mixed terms in the corresponding Hamiltonian. In this case, it is unnecessary to progressively integrate out degrees of freedom by the use of Ebbing Resonance and Far Span k-Distance Connections.
The transformation engine may be described by using the idea of Minimization of an Action Integral and Calculus of Variations in order to describe a potential p (scalar function p(x, y, z, t)) that preserves the geometric properties of itself as it changes from the input STM pattern to the memory LTM pattern. Note that a single potential is insufficient in order to generate real-world metamorphosis between patterns. Also note that a physical transformation engine should work on two-dimensional Grey Scale patterns.
The minimization of an Action Integral may be used to derive Einstein's Gravity equations and the Quantum Mechanics equations. As further described below, the transformation engine may be described in terms of an Action Integral. Additionally, the Action Integral of the transformation engine may be written in Tensor form and may have applications in String Theory.
Both Far Span k-Distance Connections and the Ebbing Resonance process are redundant in a physical implementation of a transformation engine consistent with the invention. A hardware transformation engine is capable of performance beyond the Turing limit, meaning that a pure software solution will not be able to perform as well as the transformation engine. Instead, a physical hardware based transformation engine demonstrates a behavior that is dictated by a local Hamiltonian. For this reason the Far Span k-distance connections are rendered redundant. Far and near geometric features are encoded by Spatial Frequency functions. As will be shown the principles of Dielectrophoresis and of Electrorotation are an outcome of the Hamiltonian that is minimized by the transformation engine.
An Action Integral is an integral over space and time of an operator over a function. The operator in the integral is a Lagrangian. If the integrand is a sum of a potential and the kinetic energy then the Lagrangian is a Hamiltonian.
Consider the classical physics action of a charged particle in the electric field of another charge that is fixed. The Hamiltonian (L) for speed (v), mass (m), positive (q1) and negative (q2) charges, and the free particle coordinates x(t), y(t), z(t) at time (t) is given as:
Note that the Hamiltonian of the transformation engine will have a hardware implementation using units that behave like induced electric dipoles in the sense that they depend on the gradient of the local electric field E.
The Hamiltonian is an operator that is defined over the location vector function (x(t),y(t),z(t)). Minimizing the Action Integral results in the equations of motion of the particle that is not fixed. In the calculus of variations the minimization of an action integral simply means that the variation of the Action Integral vanishes, i.e., is 0.
The transformation engine provides: 1) intermediate patterns representing a metamorphosis between and affected by both input STM pattern and memory LTM pattern, and 2) intermediate patterns that change between the STM and LTM layers such that their respective geometric characteristics are not lost.
In a two dimensional transformation engine a gradual metamorphosis is generated between the input STM pattern and a memory LTM pattern. If the pattern is encoded by potentials (or some other scalar function) then each pixel in the STM and LTM boards will hold a different potential. For the intermediate potential between the two boards to gradually change it is sufficient to require that the integral of the square of the gradient of the potential be minimized. In electric field terms, that fits the minimization of the integral of the squared electric field. Multiplying this value by half of the permittivity factor yields the static electric field energy,
The incomplete Hamiltonian of a transformation engine may be written as:
0 in the expression above means that the variation of the integral vanishes. This Hamiltonian is equivalent to the square distance between units previously described with respect to reaching the ebbing resonance stage and using the for span k-distance connection to amend the Euclidean distance. In a transformation engine that explicitly uses plurality of layers, information is not lost as a pattern changes from one intermediate layer to an adjacent intermediate layer. As previously described, the Far Span k-Distance Connections is responsible for the conservation of geometry.
Writing this term locally means that the difference of a potential p1 and p2 on layer S will not change excessively in view of the counterparts q1, q2 in layer S+1 or S−1. Thus, ((p2−p1)−(q2−q1))² must be added to the Hamiltonian, wherein the direction p1 to q1 is the direction of the derivative of the potential.
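As a rough numerical illustration of this local term, the sketch below sums the squared mismatch of neighbouring potential differences over two adjacent layers. It assumes, for simplicity, that the counterpart of each pixel is the pixel at the same index in the next layer, whereas the text takes the counterpart along the derivative of the potential; the function name `geometry_penalty` is hypothetical.

```python
import numpy as np

def geometry_penalty(layer_s, layer_next):
    """Sum of ((p2 - p1) - (q2 - q1))**2 over adjacent pixel pairs.

    p values come from layer S, q values from the adjacent layer.
    Counterparts are taken at the same pixel index (simplifying assumption).
    """
    p = np.asarray(layer_s, dtype=float)
    q = np.asarray(layer_next, dtype=float)
    total = 0.0
    for axis in (0, 1):
        dp = np.diff(p, axis=axis)  # p2 - p1 for neighbours along this axis
        dq = np.diff(q, axis=axis)  # q2 - q1 for their counterparts
        total += float(np.sum((dp - dq) ** 2))
    return total
```

A layer and a copy of it shifted by a constant potential give a penalty of zero, because all in-layer differences are preserved.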
The difference between two adjacent intermediate potential layers is also held constant. Referring to
In order to write the last term such that the potential p may change in any direction, the general form is (∇∇P·∇P)·(∇∇P·∇P).
In second derivative terms this term can be written as,
The second derivative matrix is multiplied by a vector so
Adding the energy Hamiltonian to the Hamiltonian that preserves the geometry in local terms yields:
The optional terms may involve higher derivatives of the potential P. This function is the Hamiltonian of a transformation engine according to the present invention; the plurality of metamorphosis patterns between two boundary conditions which are the input STM pattern and the memory LTM pattern as described.
Note that a second derivative of the potential is multiplied by the first derivative, resulting in a product in the form of a vector. Such a term is consistent with elongated induced dipoles in a local electric field. This is because electric dipoles respond to or “feel” force when they are positioned in an electric field such that the second derivative does not vanish. Elongated dipoles that are made of conductive material align with the first derivative of the potential and therefore the Hamiltonian that is the result of the previous illustration implicitly dictates the use of elongated dipoles. In order to be totally consistent with the Hamiltonian the dipoles should be ideal needles at sufficiently high resolution and with no or minimal resistance.
Dielectrophoresis itself is insufficient to explore the description of elongated induced dipoles. The net force on an isotropic elongated dipole in an electric field depends on the field's alternating and constant components, e.g., V=Vbase+A*cos(ωt) such that ω is the frequency of the alternating component, A is the amplitude in voltage and Vbase is the voltage baseline. The force on a spherical dipole may be expressed as:
F_Dipole = 2πR³ ε_m Re[K(ω)] ∇E²
Such that K is the Clausius-Mossotti factor. This refers to the relationship:
such that
and σ is the conductivity of the particle
wherein ‘m’ and ‘p’ stand for medium and particle (respectively) and j is the square root of −1. Note that the third power of R makes it difficult to exert strong forces using spherical particles especially when R is very small. Further, the conductivity is extremely important in order to achieve highly responsive particles. A more complex model is of particles on which the electric force depends on the frequency ω. A highly conductive needle shaped particle redirects the electric field because the potential depends on the orientation of the needle. The term 2πR³ can be ignored for needles. Thus, the needle will be parallel to the field. We know from the simple case of constant dipoles that the minimum energy of a dipole D in a field E is u=−D·E where D is the dipole moment.
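The frequency dependence of the force on a spherical particle can be sketched as follows. The code assumes the standard textbook form of the Clausius-Mossotti factor, K(ω) = (ε*p − ε*m)/(ε*p + 2ε*m) with complex permittivity ε* = ε − jσ/ω; this form and the function names are assumptions of the illustration, not a restatement of the relationships given in the text.

```python
import numpy as np

def clausius_mossotti(omega, eps_p, sigma_p, eps_m, sigma_m):
    """Complex Clausius-Mossotti factor K(omega), standard textbook form (assumed)."""
    eps_p_c = eps_p - 1j * sigma_p / omega  # complex permittivity of the particle
    eps_m_c = eps_m - 1j * sigma_m / omega  # complex permittivity of the medium
    return (eps_p_c - eps_m_c) / (eps_p_c + 2 * eps_m_c)

def dep_force_prefactor(radius, eps_m, k):
    """Scalar 2*pi*R^3*eps_m*Re[K] that multiplies the gradient of E^2."""
    return 2 * np.pi * radius**3 * eps_m * np.real(k)
```

With this form, Re[K] stays between −0.5 and 1, so a highly conductive particle in a weakly conductive medium approaches the upper limit, while the R³ factor shrinks the force rapidly for small spheres.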
The transformation engine Hamiltonian and the square force on a local dipole are linearly dependent
Note that minimizing the geometric part of the transformation engine Hamiltonian means that the integral of the square force on all the dipoles is minimized. This result may not be readily apparent in light of the fact that the energy of a spring is
where K is the spring constant.
The variation of the transformation engine Hamiltonian must vanish between the input STM pattern (first boundary condition) and the memory LTM pattern (second boundary condition), so:
If the number of layers in the transformation engine is sufficiently large and the distance between the STM and LTM is also sufficiently large and fields are relatively strong then the constant K can be very large. In practice the transformation engine can work with very high values of K of about 4096 and if the gradient of p is calculated over far points within the same layer then K can even be 10⁵ or larger. The larger the K constant is, the closer the transformation engine is to a complex springs machine. Note that for practical applications of the transformation engine one potential is insufficient.
The theory about spherical isotropic particles that are placed in an electric field is insufficient when the shape of the particle is not spherical. The Hamiltonian of the transformation engine requires ideal needle shaped particles. The net force on a dipole depends on the gradient of the electric field, and that feature also applies to non-spherical dipoles.
Instead of using a single potential, p, several potentials should be used in order to resolve two dimensional ambiguities. That is, e.g., different alternating voltage frequencies and elongated pieces of material/particles/proteins/neurotransmitters that reside between the LTM and STM pixel boards can serve as vector fields because an elongated particle points to a direction and behaves like a needle that is placed in an electric field and turns until it reaches equilibrium in minimum energy. If such “needles” respond to different frequencies and each “needle” is wrapped in an insulating ball, then the needles align like vectors in local fields. Since needle characteristics may be selected so that the needles selectively respond to different frequencies, the system of two pixel boards, STM and LTM, and dipoles that respond to different voltage frequencies (“different needles”) simulates a multi potential system.
The Hamiltonian of multiple potentials P(i) may be expressed as:
such that the variation over the n potentials vanishes, as follows:
in a gray scale image. This mathematical formulation is consistent with at least one preferred embodiment of a transformation engine according to the invention.
The mixed term of:
is made of two parts, namely:
The part of the Hamiltonian that forces the pattern to gradually change from the input STM to the memory LTM pattern may be expressed as:
Minimization of this term alone yields electric-field-like lines between the input STM pattern and the memory LTM pattern:
such that K is some constant.
A gray scale image can be decomposed into high and low frequency Fourier Transforms. A single picture can be translated into multiple pictures using either Fourier Transforms or Wavelet Transforms. If the input STM pattern has wide vertical stripes and the memory LTM pattern has narrow stripes, the Fourier Transform frequencies that dominate the STM pattern will be lower than the Fourier Transform frequencies that dominate the LTM pattern. The Hamiltonian that is minimized therefore must include mixed terms if the potentials P(i) need to represent Fourier Transforms of different frequencies.
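The relationship between stripe width and dominant Fourier frequency can be checked with a short sketch. The helper names `stripes` and `dominant_stripe_frequency`, the 256-pixel row, and the stripe widths are hypothetical choices for illustration.

```python
import numpy as np

def stripes(width, n_pixels=256):
    """One image row of vertical stripes, each `width` pixels wide."""
    x = np.arange(n_pixels)
    return ((x // width) % 2).astype(float)

def dominant_stripe_frequency(row):
    """Frequency (cycles per pixel) with the largest Fourier magnitude."""
    row = np.asarray(row, dtype=float)
    spectrum = np.abs(np.fft.rfft(row - row.mean()))  # drop the DC component
    freqs = np.fft.rfftfreq(row.size)
    return freqs[np.argmax(spectrum)]

wide = stripes(16)    # wide stripes: period of 32 pixels
narrow = stripes(4)   # narrow stripes: period of 8 pixels
f_wide = dominant_stripe_frequency(wide)
f_narrow = dominant_stripe_frequency(narrow)
```

The wide stripes peak at a lower spatial frequency than the narrow stripes, which is why a mapping between the two patterns has to couple potentials that represent different frequency bands.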
A Hamiltonian that is made of different potentials reduces the degrees of freedom of the transformation between the input STM pattern and the memory LTM pattern.
A potential is a function from Rⁿ to R. The explicit second order (second derivatives) Hamiltonian is:
such that P(i) are the potentials.
Renormalization group methods are introduced to provide a method of treating systems in which multiple degrees of freedom spanning many scales of length are locally coupled. At the end of each step a Hamiltonian results that represents the interactions over length scales not yet treated. In particular, changes in gray-scale levels occur over many scales of length for which the image is organized. Thus, the transformation engine takes much prior data that comes from lower level neurons such as V1, V2 and LGN and integrates all this data into single high-level neurons.
Coupling is expressed in terms of the Hamiltonian because the different scales of features in one image may fit different scales of features in another. For example, wide lines in the input image may be fit to narrow lines in the memory image. The fitting problem may be even more complex. As shown in
Coupling may involve dipoles that are subject to maximum torque force when exposed to two different frequencies of alternating voltage. Thus the mixed terms in the transformation engine Hamiltonian can be implemented using, for example, Evotec OAI technology. A parceling of two particles that each respond to a different frequency provides an exact solution to the coupling problem.
Two boundary conditions of the LTM and STM are well defined. Those are the LTM board and the STM board. The potential LTM=P(X,Y,0) and STM=P(X,Y,1) denote the LTM and STM patterns respectively. The convergence of the mapping between the LTM and the STM such that the topology is preserved depends on the distance between the LTM and the STM. Based on this constraint, the gradient of the potential should be as perpendicular as possible to the LTM and STM boards.
Referring to
The transformation engine Hamiltonian of a single potential is consistent with the laws defined by General Relativity. It is probable that the description of matter in the cosmos as a transformation engine requires more than one potential. However, for the purposes of the present illustration, one potential is used herein and developed into tensor form for simplicity of explanation. The transformation engine can take an interesting General Relativistic form with the following geometric implications. To express the transformation engine in terms of the covariance principle articulated by Einstein, it is sufficient to replace the x,y,z parameterization with x,y,z,ct such that c is the speed of light and t is the time, and to replace each derivative with a Covariant or with a Contravariant derivative. Einstein's summation conventions will be used (upper and lower indices are summed up).
Writing L in Einstein's tensor convention provides:
$L = \left(p^{i} p_{i} + K\, p^{n} p_{k;n}\, p^{m} p^{k}{}_{;m}\right)\sqrt{-g}$
such that p is the pattern potential that gradually changes between the STM and the LTM patterns.
In the present case, LTM can be a known distribution of the potential field p in the past t=0 and the STM can be a current distribution for t=1 such that a description of the cosmos may be expressed in terms of what happens in-between. In such a case the Hamiltonian L can have terms higher orders for example:
$L = \left(p^{i} p_{i} + K\, p^{n} p_{k;n}\, p^{m} p^{k}{}_{;m} + K^{4/3}\, p^{n} p_{k;L;n}\, p^{m} p^{k;L}{}_{;m}\right)\sqrt{-g}$
where −g is the negative of the determinant of the metric tensor, the square root of −g is the scaling factor of a volume element, K is a constant and the semicolon (;) denotes the covariant derivative. The power 4/3 is provided to account for dimensionality. The Hamiltonian can be written in another form
$L = \left(p^{i} p_{i} + p^{n} A_{kn}\, p^{m} A^{k}{}_{m} + p^{n} B_{kLn}\, p^{m} B^{kL}{}_{m}\right)\sqrt{-g}$
where A and B are high-order tensors
In terms of differential geometry, $p^{m} p_{k;m}$ is an evaluation of how parallel the field $p^{m}$ changes in relation to itself. If this term is 0 then $p^{m}$ is a geodesic field and it represents an inertial particle. If this term is not 0 then it means that a “force” acts on matter. For this reason the transformation engine can be one of the models for a unified force theory of the fine fundamental forces.
Using the Christoffel symbols {kn,u}, comma for derivative and gij as the metric tensor, results in the relationship:
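$L = \left(P_{,m} P_{,k}\, g^{mk} + K\, g^{Lk} \left[P_{,j}\, g^{jn} \left(P_{,k,n} - \{kn,u\} P_{,u}\right)\right] \left[P_{,i}\, g^{im} \left(P_{,L,m} - \{Lm,u\} P_{,u}\right)\right]\right)\sqrt{-g}$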
the square root of the −g term is always used in General Relativity for the measurement of a 4-volume element on the space-time manifold.
Writing the Christoffel symbols explicitly provides:
The variation over the space-time manifold should vanish, that is,
such that Ω is the 4-volume space-time domain.
Transformation engines according to another embodiment of the present invention may also be constructed using FM and microwave frequency electric dipoles. As described, the transformation engine uses units that align and generate a plurality of intermediate patterns between the input STM pixels board and the LTM memory board. One method of accomplishing this is by using a spring or a coiled shaped polymer that is coated with metal. The two tips of the spring are ball shaped and the entire spring is embedded in a spherical polymer as appears in
Each unit in
The approximate maximum force that will be felt by the unit depends on 1/LC, where L is the coil's inductance in henries and C is the capacitance of the dipole. A formulation of the optimum frequency of an external electric field also depends on other factors such as resistance.
A single unit can contain two different parallel coils and can thus respond to two independent frequencies. This property is useful in order to obtain appropriate coupling. Coupling in the transformation engine sense is broader than in Ising spin models because real induced dipoles can point in any direction and the coupling relates to frequencies and not to quantum spins. The coil or spring shaped dipole or two dipoles in each unit cause the unit to turn, or align, in response to local fields at different frequencies (e.g., 10000 kHz and 57000 kHz) and other whole-number multiples of these two basic frequencies (e.g., n*10000 kHz and m*57000 kHz) wherein m and n are whole numbers greater than or equal to 1.
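The dependence on 1/LC can be made concrete under a simplifying assumption: if a unit behaves like an ideal, lossless LC resonator, its angular resonance satisfies ω₀² = 1/(LC). The sketch below computes the corresponding frequency; the function name and the component values in the usage note are hypothetical, and resistance is ignored.

```python
import math

def lc_resonant_frequency(inductance_h, capacitance_f):
    """Resonant frequency in Hz of an ideal LC circuit: 1 / (2*pi*sqrt(L*C))."""
    return 1.0 / (2.0 * math.pi * math.sqrt(inductance_h * capacitance_f))
```

For example, a 1 µH coil with 1 nF of capacitance resonates at roughly 5 MHz, and quadrupling the inductance halves that frequency.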
These units can be placed on many boards, or inner layers, between the input pixels board (containing the STM and LTM patterns) and they can be suspended in liquid between the input and the memory boards.
As shown by
As shown in
Camera 1401 is used to convert an image into electrical signals on pixel board 1402. Edge detectors check groups of adjacent pixels for horizontal, vertical and diagonal lines. Identified edges are contained in pixel board 1403. This may be accomplished by the edge detector which identifies the edge firing at a certain frequency into pixel board 1403. Additional and more complex detection may also be performed on the pixel boards. For example, object recognition software may be used to identify various objects such as a human heel, an ear, a nose, or other predefined objects. Detection units which identify objects within portions of the pixel image may also fire at specific frequencies into corresponding pixels of pixel board 1404. Both pixel boards 1403 and 1404 are connected to transformation engine 1405. Specific pixels in the transformation engine's input pixel board receive pulses both for the existence of an edge and for the existence of identified objects. Electric dipoles 1406 within the transformation engine align such that the energy between the input board 1407 and the memory board 1408 is minimized. Within the transformation engine the suspended units maneuver or are otherwise positioned and/or oriented to accomplish this alignment within the surrounding material, such as the liquid. Individual units may respond to different frequencies and create a mapping between dissimilar objects such as an input of “x's” to a memory of “o's”. The memory board is connected to a regeneration oscillator chip. The chip is able to “learn” (i.e., characterize) the pulses coming from the input board and later to regenerate these pulses. Object recognition unit 1409 analyzes the signals received from the input board. If these signals match the learned and stored signals then the transformation engine fires in order to indicate or signal a match between the image and the stored information.
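The edge detection stage that feeds pixel board 1403 can be approximated in software. The sketch below is a minimal illustration that marks horizontal and vertical edges by thresholding differences between adjacent pixels; diagonal detection and the frequency-coded firing are omitted, and the function name and threshold are hypothetical.

```python
import numpy as np

def detect_edges(pixels, threshold=0.5):
    """Return boolean maps (horizontal, vertical) of edges in a gray scale image."""
    img = np.asarray(pixels, dtype=float)
    vertical = np.zeros(img.shape, dtype=bool)
    horizontal = np.zeros(img.shape, dtype=bool)
    # A vertical edge is a jump between horizontally adjacent pixels.
    vertical[:, 1:] = np.abs(np.diff(img, axis=1)) > threshold
    # A horizontal edge is a jump between vertically adjacent pixels.
    horizontal[1:, :] = np.abs(np.diff(img, axis=0)) > threshold
    return horizontal, vertical
```

An image whose right half is bright and left half is dark produces a single column of vertical-edge pixels and no horizontal-edge pixels.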
Note that a solid solution may be less efficient than a liquid or gaseous solution. In the solid machine the spherical units also have a core that is an elongated dipole, possibly spring shaped, that is free to turn in space. The units should have a limited angular freedom of movement. The spheres denote the cells that contain spring shaped elongated dipoles.
as in
In order to allow stationary transformations with this embodiment, each unit in layer S should have the same distance from its counterpart in layer S+3 as from the closest 3 units in layer S+1.
Note that some energy will be required for a dipole to change position because of the elongated nature of the dipole. This elongation along with the nonlinear nature of dipole-dipole interactions allows local competition, a trait important for topology conservation and fundamental to a transformation engine according to a present embodiment of the invention.
Each unit in layer S must have the same distance from four other units, one in layer S+3 and 3 in layer S+1. This condition imposes the following distance d between layers,
The distance between layer S to S+1 is d and therefore the distance from the counterpart dipole in layer S+3 is 3d. The distance to each one of the closest units in layer S+1 is by Pythagoras
From this equation
and the distance between the closest units is therefore R=3d≅0.6124.
An important value is the ratio
that is an approximated ratio between the forces of closest units within layer S and between each unit in S and each one of the 4 closest units, 3 in layer S+1 and one in layer S+3.
Q=7 1/9. This high quotient implies that most of the influence on each dipole comes from interactions between adjacent layers.
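The values 0.6124 and 7 1/9 can be reproduced under three assumptions that are not spelled out above: the in-layer spacing between closest units is normalized to 1, the units of each layer form a triangular lattice with the units of layer S+1 sitting over the centroids of the triangles of layer S, and the force between two dipoles falls off as the fourth power of their distance.

```python
import math

R_IN_LAYER = 1.0                           # assumed in-layer spacing (normalized)
horizontal = R_IN_LAYER / math.sqrt(3.0)   # unit to centroid of a neighbouring triangle

# Equal-distance condition by Pythagoras: (3d)^2 = d^2 + horizontal^2
d = horizontal / math.sqrt(8.0)            # layer spacing, about 0.2041
r_inter = 3.0 * d                          # distance to the 4 closest inter-layer units

# Ratio of inter-layer to in-layer force, assuming force ~ 1 / r^4
Q = (R_IN_LAYER / r_inter) ** 4
```

Under these assumptions `r_inter` evaluates to about 0.6124 and `Q` to 64/9, i.e. 7 1/9, in agreement with the figures quoted in the text.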
Another issue is the resolution of the transformation engine. In particular, if the transformation is stationary then the layers resolution is only ⅓ in the case where the transformation includes all the layers. This fact implies an emergent property of the triangular structure of the dipolar transformatron: different levels of transformations use different paths. A collision between two paths may be a sign that there is a contradiction in the mapping process and therefore an AC signal, even if the induced dipoles are quick, will lead to a welcome instability.
As mentioned previously, preprocessing may be performed on the values associated with the input pattern and the memory pattern. For example, preprocessing may be performed to replace the ACCGTGGA sequence for DNA comparison with a logical “1”. The complement of the ACCGTGGA sequence is a CCCTGGCACCTA sequence because the ACCGTGGA sequence may bind to the TGGCACCT sub-string. Preprocessing may be performed on the CCCTGGCACCTA sequence to replace the sequence with a logical “0”. The preprocessed values may then be inserted into the appropriate layers.
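A minimal sketch of this preprocessing step follows. It assumes that binding is modelled by base-wise Watson-Crick complementarity (A with T, C with G) and that a sequence counts as complementary when it contains the complement of the probe as a sub-string; the function names and the use of `None` for unrelated sequences are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def complement(seq):
    """Base-wise complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)

def preprocess(sequence, probe="ACCGTGGA"):
    """Map the probe to 1, a sequence containing its complement to 0, else None."""
    if sequence == probe:
        return 1
    if complement(probe) in sequence:
        return 0
    return None
```

Calling `preprocess("ACCGTGGA")` gives the logical 1, while `preprocess("CCCTGGCACCTA")` gives the logical 0 because the longer sequence contains the base-wise complement of the probe.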
Another embodiment of the invention provides a Transformatron model in Riemannian Geometry. In contrast to the prior embodiment implementing a Far Span K−Distance Connections, the following embodiment uses covariant generalization in Riemannian geometry and thus allows complex matching between low dimensional patterns which is more difficult in Euclidean geometry.
As was described above in connection with the first embodiment, a transformation between boundaries, memory layer and input layer (e.g. ellipse drawn on the plane x,y,0 in R3 as p(xellipse,yellipse,0)=1 and 0 is elsewhere in the plane where the ellipse curve is not present and between a circle drawn on the plane x,y,1) can be done such that an intermediate pattern on the intermediate plane x,y,0.5 will emerge (e.g. an ellipse that is an intermediate pattern between the ellipse on the plane x,y,0 and between a circle on the plane x,y,1 in R3). This is done by defining a local cost function such that minimizing that local cost function causes a potential function p to be defined throughout the entire region between the boundaries. This method is used in order to match patterns by a later step that was named, “vibration waves”.
By matching through curved space, more complex matching tasks can be successfully achieved as curved geometry allows parallel lines to exist between the input pattern and the memory pattern even if in flat geometry this task is impossible. For example one can imagine parallel straight lines connecting a circle on the x,y,0 plane with a circle drawn on the x,y,1 plane. If instead of a circle, an eccentric ellipse is drawn on the x,y,0 plane then one cannot match between the ellipse on x,y,0 and a circle on x,y,1 by using only straight parallel lines. This task, however, can be achieved if the space itself is curved as defined in Riemannian geometry. Further, the minimum energy function has a formalism such that the Euler Lagrange operator of the minimum energy function yields tensor densities. As a result, the matching process is stable in curved spaces. This minimum energy function can further be used as computational basis for the development of new theories in physics.
The modification to the minimum energy function used according to an embodiment of a method of the invention may be described as follows. In Riemannian geometry language the minimum energy to be minimized can be written as
Where: g is the determinant of the metric tensor, upper indices denote contravariant tensor property, lower indices denote covariant tensor properties, Pi denotes a gradient
of a gradually changing function P, That changes between the memory and input layers, (−d<x1<d,−d<x2<d, . . . xj, . . . −d<xn<d), (−d<x1<d,−d<x2<d, . . . xj+δ, . . . ,−d<xn<d) (when n is the dimension of a Riemannian manifold and d>0 is a very large in relation to δ>0) Semi colon denotes a covariant derivative and comma denotes an ordinary derivative,
R denotes the scalar curvature also known as Ricci scalar. K1, K2, K3, K4 denote constants of the model.
The curvature R is included in the cost function in order to penalize curvature, which means that matching between patterns will not be achieved at any cost but will be regulated because curvature also costs energy.
According to the earlier described embodiment, the energy function that was minimized was:
and in full tensor language PsPr;sPsPr;s. This minimum energy function is applicable only in flat space and its Euler Lagrange operator does not yield tensor densities.
A thorough explanation for the theory is as follows. As described above in connection with the previous embodiment, the Far Span k−Distance Connections has both digital and continuous forms. The continuous form of the energy described above may be expressed as
when upper indices are x coordinate indices and not powers and n is the dimension of the compared memory and input patterns and K is constant or changing in a relaxation process. In tensor form the function may be written as $\left(P^{v} P_{v} + K\, P^{j} P_{i;j}\, P^{L} P^{i}{}_{;L}\right)\sqrt{g}$ where $P^{j} P_{i;j}\, P^{L} P^{i}{}_{;L}$ was the continuous local form of the Far Span k−Distance Connection. Semi colon denotes the covariant derivative and indices of P denotes the pattern function that is known on the memory layer and on the input layer. Also,
$P^{j} P_{i;j}\, P^{L} P^{i}{}_{;L}$ has several problems, most significantly a numerical instability of the matching between the memory and input layers.
To address these problems, we can define the Far Span k−Distance Connection as a minimum energy function that depends on the second order derivatives of a pattern P that is defined on two boundaries, with upper indices describing coordinate indices and not powers, boundary (−d<x1<d,−d<x2<d, . . . xj, . . . −d<xn<d) and boundary (−d<x1<d,−d<x2<d, . . . xj+δ, . . . ,−d<xn<d) (when n is the dimension of a Riemannian manifold and d is very large in relation to δ).
This is in contrast to the prior definition of the domain, (−d<x1<d,−d<x2<d, . . . ,−d<xn<d) and the boundaries (−d<x1<d,−d<x2<d, . . . xj, . . . −d<xn<d), (−d<x1<d,−d<x2<d, . . . xj+δ, . . . ,−d<xn<d), which referred to a Euclidean space, e.g. R3.
If one boundary (−d<x1<d,−d<x2<d, . . . xj, . . . −d<xn<d) contains a pattern drawn as p and the other boundary (−d<x1<d,−d<x2<d, . . . xj+δ, . . . ,−d<xn<d) has another definition of p, we called these boundaries the memory layer and the input layer. We then tried to connect the pattern p on both layers by the vector field which is generated by the gradient of this field p such that the gradient of p will form a vector field on the domain (−d<x1<d,−d<x2<d, . . . ,−d<xn<d) between the memory layer and the input layer and such that the gradient of p will form curves as parallel and as straight as possible.
Transformations between p that is defined on the input layer and p that is defined on the memory layer need not be linear, e.g. a transformation between two hand written signatures defined on two parallel planes x,y,0 and x,y,1 as p≠0 for ink dot and p=0 for blank paper.
Restricting the matching between two such patterns to Euclidean space is not justified because matching can be easier to perform in curved space.
The reason is that if parallel
for all i will not always be able to form parallel curves in Euclidean spaces but will be able to form parallel geodesic curves in curved space and thus allow a method by which a wider range transformations will be covered by the disclosed method.
The geometric motivation for extending the method may be explained as follows. The problem is: given two Gaussian coordinate boundaries, (−d<x1<d,−d<x2<d, . . . xj, . . . −d<xn<d), (−d<x1<d,−d<x2<d, . . . xj+δ, . . . ,−d<xn<d) (where n is the dimension of a Riemannian manifold and d is very large in relation to δ) and a scalar field P whose P² values are known on these boundaries, where the geometry of the boundaries as geometric objects is also known and 0≤P²≤1 on each one of the boundaries, define a Lagrangian such that the integral of $(P^{i} P_{i})^{m}$ s.t.
when the power m≥1 will be globally minimal under the conditions: P²≥ε² for some minimal value ε² and |Pi|²≤L², for some maximal value L². Here upper indices denote the contravariant property, lower ones the covariant one and xi denotes the coordinates. The problem is also known as optimization of Homotopy. See, e.g., Victor Guillemin, Alan Pollack Differential Topology, Homotopy and Stability, pages 33,34,35, ISBN 0-13-212605-2. Surprisingly, minimizing Lagrangians that involved only first order derivatives of the scalar field P did not accomplish the required global minimum.
An interesting problem is to assign the root of the Ricci scalar (see David Lovelock and Hanno Rund, Tensors, Differential Forms and Variational Principles 261, 3.26, ISBN 0-486-65840-6) to P or in pattern recognition language, to use the Ricci scalar to encode two compared boundaries on which a geometric pattern is known. Intuitively speaking, the problem quite resembles comparing two hand written signatures such that each signature is etched (as the Support P≠0) on a two dimensional plane and two planes with such signatures are parallel at distance δ one from the other, (−d<x1<d,−d<x2<d, . . . xj, . . . −d<xn<d), (−d<x1<d,−d<x2<d, . . . xj+δ, . . . ,−d<xn<d).
A good matching between these two signatures will be achieved by a vector field
that can be interpreted into curves connecting the two planes. If for example one signature is an ellipse and the other one is a circle then checking the intersection of the field Pi with the intermediate plane,
will be also an ellipse with eccentricity half the one of an ellipse which is defined by P≠0 on, (−d<x1<d,−d<x2<d, . . . xj+δ, . . . ,−d<xn<d).
Surprisingly, according to computer simulations, in order to achieve such a field as a result of Euler Lagrange equations and of the calculus of variations, derivatives of the vector field Pi must be used. Using the well known covariant and contravariant indices notation of modern Riemannian Geometry, interesting candidate Lagrangians for exploration are
when semi colon denotes the Covariant derivative and g denotes the determinant of the Metric Tensor. An alternative is
The latter is suitable for yielding a meaningful theory also when Pi is not a gradient of some scalar field p when possibly
As we found out in low dimensions, a stabilizing Lagrangian is the KG operator PiPi and a meaningful theory is to minimize an integral of the form,
(see, U.S. patent application Ser. No. 10/144,754, Apparatus For And Method Of Pattern Recognition And Image Analysis) for some constants K1, K2, K3.
When g denotes the determinant of the metric tensor gij and semi colon denotes the covariant derivative and comma denotes an ordinary derivative as commonly used in differential geometry.
It appears that minimizing the term that involves only the field Pi is responsible for minimizing the field intensity between the two boundaries, and that minimizing terms that involve derivatives of the field, such as Pi;j, ensures that the field P will not lose geometric structure as it changes from the boundary, (−d<x1<d,−d<x2<d, . . . xj, . . . −d<xn<d) to the boundary, (−d<x1<d,−d<x2<d, . . . xj+δ, . . . ,−d<xn<d).
It seems that for fields that are the gradient of some scalar function (conserving fields) minimizing,
is sufficient.
The theory and method that will unfold is of purely geometric meaning and with typical Riemannian formalism. The Lagrangians result as described below.
The Transformatron theory can further be explained as follows. Let us consider a specific Lagrangian of the form
Such that xm is our local coordinates system.
gμv is the metric tensor and Pi is a contravariant vector field.
From now on semi colon ‘;’ will denote covariant derivative and comma ‘,’ will denote ordinary derivative as in the following examples,
when Γkij are the Christoffel symbols
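For orientation, the semicolon and comma notation can be written out in the standard textbook form shown below. This is the usual definition of the covariant derivative of a contravariant and of a covariant vector field, together with the Christoffel symbols of the second kind; it is given as a reminder of the convention and is assumed, not confirmed, to match the examples referred to above.

```latex
\begin{align}
P^{i}{}_{;j} &= P^{i}{}_{,j} + \Gamma^{i}_{kj}\, P^{k}, &
P^{i}{}_{,j} &= \frac{\partial P^{i}}{\partial x^{j}}, \\
P_{i;j} &= P_{i,j} - \Gamma^{k}_{ij}\, P_{k}, &
\Gamma^{k}_{ij} &= \tfrac{1}{2}\, g^{km}\left(g_{mi,j} + g_{mj,i} - g_{ij,m}\right).
\end{align}
```

In flat space with Cartesian coordinates the Christoffel symbols vanish and the covariant derivative reduces to the ordinary one.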
We can now write a simple Lagrangian
The motivation for (2) is explained in further detail below.
Definition 1: The Lagrangian (2) is the Riemannian Far Span k Distance Connection. The term
turns out to be very useful in high dimensional pattern matching tasks in the field of analog computation and visual analysis.
Minimizing this term means that the square norm of the field Pi along its curves stays as stationary as possible. In directions perpendicular to the field Pi, the field intensity is allowed to change. This is why (2.3) is a preferable Lagrangian for describing a field related to matter. An example of a real problem is two parallel R2 boards in R3 on which an image is encoded using an electric field.
One board encodes a memorized image and the other encodes a new input. The boards are divided into tiny pixels such that in the middle of each pixel there is an electrode. The potential of the memorized image pixels is positive and the potential of the pixels of the input image is negative. Between the two boards needle shaped pieces of conductive material are suspended in liquid. These needle shaped pieces are coated with non-conductive spheres of polyethylene.
Since dipoles respond to the gradient of the square norm of an electric field, the needles will align such that this gradient will be minimized along the direction they point to. L doesn't exactly describe this machine but is basically designed as a result of the same general topological idea.
Please note that (2) uses first order derivatives of Pi, providing Pi=√ρ Ui for pattern density ρ.
Definition 2: The Lagrangian (2.1) is named Angular or Curl deviation.
Definition 3: The Lagrangian (2.2) is named Forward Deviation.
Another form is Forward Deviation for Conserving Fields.
The equation
is also referred to as a Transformatron.
The present embodiment can be summarized by the flow diagram of
In any case K2=0 yields a more stable theory.
A difficult question is whether K4=0 yields a flat geometry theory. K2 (or K3) is a result of optimization of
for some m>0, by (2.3) and is determined by knowing K1. g is the determinant of the metric tensor in space-time, R is the Ricci scalar field and Ω is a domain of the form, (−d<x1<d,−d<x2<d, . . . xj, . . . −d<xn<d), (−d<x1<d,−d<x2<d, . . . xj+δ, . . . ,−d<xn<d).
Clearly (2),(2.1),(2.2) are scalar densities (see, David Lovelock and Hanno Rund, Tensors, Differential Forms and Variational Principles page 113, 2.18, and the transformation law in page 114, 2.30, 4.2 The Numerical Relative Tensors, ISBN 0-486-65840-6) Additional constraints may be also required to define a unique solution.
A thorough differential geometry discussion on how to reach (2.1) and (2.2) is included hereinbelow. However, a difficult and important open problem is whether there is an operator of the form,
s.t. P=√R, the root of the Ricci scalar, and Ps=(√R),s, such that the fourth order Euler Lagrange operator of L will yield a tensor with a new physical meaning. Until now, attempts to prove that there is such a non-trivial tensor in which third and fourth order derivatives of the metric tensor do not vanish have not succeeded. The significance of that open problem is due to section 7 and the introduction because structure conservation cost and curvature may yield a purely metric classical theory. The failure is partly due to the lack of clear mathematical theories on Lagrangians in which the order of derivatives of the metric tensor is higher than 2. (2.4) is defined as a Transformatron problem if the Ricci scalar P=√R is known on two three dimensional boundaries, (−d<x1<d,−d<x2<d,−d<x3<d,x4),(−d<x1<d,−d<x2<d,−d<x3<d,x4+δ).
We proceed with the purpose of showing that the Euler Lagrange operator of (2.4) yields tensor densities.
The forward deviation may be addressed as follows. We will calculate the Euler Lagrange operators (see, David Lovelock and Hanno Rund, Tensors, Differential Forms and Variational Principles, page 323, 5.2, Combined Vector-Metric Field Theory, page 325, Remark 1, ISBN 0-486-65840-6) in order to prove that they yield tensor densities and thus that the model is a feasible physical model.
We now calculate,
In which the term
−PiPvZmΓiμm−PμPiZmΓivm (6)
spoils the tensor density character of (5).
Please note the following:
Please also note
Which led to the first term in (5). We continue calculating,
Please note
So the non-tensor components of (5) are the same as in (9) which assures that the following is a tensor density,
In which the terms −PiPvZmΓiμm−PμPiZmΓivm cancel each other which clearly shows (11) is a tensor density.
We continue calculating the other Euler Lagrange terms,
Adding (14) and (13) we have
Clearly (15) is a tensor density because:
−4ΓiμkPiZk=−2(ΓiμvPiZv+ΓiμvPvZi) (16)
By (11) and (15), which we have proved to be tensor densities, the Euler Lagrange operator of (2.2) is indeed a tensor density.
Angular/Curl deviation is addressed as follows. We will calculate the Euler Lagrange operators for
Obviously
We continue with
We continue by calculating,
So we can write,
It will be recognized that (20) is a tensor density.
We continue by calculating
We may write
It should be apparent that
Bμv=−Bvμ (23)
which is of crucial importance as will be apparent.
We continue calculating,
By (23) we have
BkvΓkμv=0 (25)
So (24) reduces to,
which is obviously a tensor density.
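By way of non-limiting illustration, the vanishing of (25) rests on a general fact: contracting an array that is antisymmetric in a pair of indices, as B is by (23), with an array that is symmetric in the same pair gives zero. The following minimal Python sketch checks this numerically with NumPy. The function name, the dimension n=4, the random test data and the index ordering of the array `Gamma` are assumptions made for illustration only; `Gamma` is a generic array symmetric in its first and last index and is not computed from any metric.

```python
import numpy as np


def antisym_sym_contraction(n=4, seed=0):
    """Contract a random antisymmetric B[k, v] with a random Gamma[k, mu, v]
    that is symmetric under exchange of k and v (illustrative data only)."""
    rng = np.random.default_rng(seed)

    # Antisymmetric array, mimicking property (23): B[k, v] == -B[v, k].
    a = rng.standard_normal((n, n))
    B = a - a.T

    # Array symmetric in its first and last index: Gamma[k, mu, v] == Gamma[v, mu, k].
    g = rng.standard_normal((n, n, n))
    Gamma = g + g.transpose(2, 1, 0)

    # Sum over k and v, leaving the free index mu.
    return np.einsum("kv,kmv->m", B, Gamma)


result = antisym_sym_contraction()
print(np.allclose(result, 0.0, atol=1e-12))
```

Each product B[k, v]·Gamma[k, mu, v] is paired with B[v, k]·Gamma[v, mu, k], which has the opposite sign, so the sum over k and v cancels pairwise for every value of the free index.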
We continue calculating,
Adding (27) and (26) we have,
(28) is a tensor density. So we have proved that the Euler Lagrange operators of the Angular (or Curl) deviations yield tensor densities as required by physics. We therefore can consider (2.3) as a possible theory.
To address the forward deviation for conserving fields, we calculate the Euler Lagrange operators (see, Lovelock et al., supra) in order to prove that they yield tensor densities and thus that the model is a feasible physical model.
We now calculate,
In which the term
−PiPvPmΓiμm−PμPiPmΓivm (32)
spoils the tensor density character of (31).
Please note the following:
Please also note,
Which led to the first term in (31). We continue calculating,
So the non-tensor components of (31) are the same as in (35) which assures that the following is a tensor density,
In which the terms −PiPvPmΓiμm−PμPiPmΓivm cancel each other which clearly shows (36) is a tensor density.
We continue calculating the other Euler Lagrange terms,
Adding (38) and (39) we have
It is apparent that (40) is a tensor density because the non-tensor terms cancel each other.
The motivation for the Lagrangian presented in (2) above may be explained by first providing a definition for the term “Transformatron”. In particular, the geometric meaning of the “Transformatron” may be described as follows. According to one explanation, the Transformatron functions by “penalizing” the Lagrangian for a vector field which is not geodesic, and can thereby yield a global minimum of another integral under certain restrictions. This penalty (or cost function), in turn, can take over part of the “job” that the Ricci tensor does, which means that our vector field will not have to be geodesic, but deviation from a geodesic field will have to be “penalized”. We bear in mind that the term “job” refers to a deep Differential Topology issue which is known as Stable Homotopy (see, e.g., Guillemin et al., supra). Moreover, the original Transformatron was an image processing device and was implemented as a complex triangular grid structure, and space was not considered as a continuum.
We continue by repeating the following. Suppose we have a scalar field Pi whose values are known on the Gaussian coordinates boundaries, (−d<x1<d,−d<x2<d, . . . xj, . . . −d<xn<d), (−d<x1<d,−d<x2<d, . . . xj+δ, . . . ,−d<xn<d).
Now we want to minimize PiPi on one hand and keep the field as geodesic as possible in flat geometry on the other. We assume that the Euler Number (see, John W. Milnor, Topology from the Differentiable Viewpoint, pages 32-41, ISBN 0-691-04833-9) of our vector field Pi is 0 at every point of the vacuo. This is a necessary condition for the following technique to work. If our field Pi is smooth and does not possess catastrophic Euler Numbers, then differential geometry tells us that there exists an arc length curve xi(s) such that
Such a curve is geodesic only if
If our geodesic field Pi fulfills the following, Pi;j=Pj;i or in other words, is a conserving field, then it is sufficient for the norm of the field to be stationary along the curve xi(s) because then
(PiPjgij);m=(Pi;mPi+Pj;mPj)=0 ⇒ Pi;mPi=0 (44)
by which it is obvious that (43) vanishes.
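The step from (44) to the vanishing of (43) may be spelled out as the following short worked derivation. It is a sketch under a stated assumption: the geodesic condition (43) is taken here to have the standard form P^i P_{j;i} = 0 for a field of stationary norm, and the index placement shown is likewise an assumption made for illustration.

```latex
\begin{align*}
P^{i} P_{j;i}
  &= P^{i} P_{i;j}
  && \text{(conserving field, } P_{i;j} = P_{j;i}\text{)} \\
  &= \tfrac{1}{2}\,\bigl(P^{i} P_{i}\bigr)_{;j}
  && \text{(metric compatibility, } g^{ik}{}_{;j} = 0\text{)} \\
  &= 0
  && \text{(stationary norm, as in (44))}
\end{align*}
```

Under these assumptions, a conserving field whose square norm is stationary automatically satisfies the geodesic condition, which is why a cost term built only from derivatives of the square norm suffices in that case.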
If we want to construct a cost function term that relies solely on the derivatives of the square norm of a vector field then we will have a problem when the field is not conserving.
One elegant way to solve this problem is to look at the following terms,
(2.1) is simply UrUr√(−g) and (2.2) is simply VrVr√(−g).
In the past I used to think that
or its identical form,
are preferable tensor densities because (48) has a clear topological meaning and is more stable in pattern recognition engineering applications.
If, for example, Pi=√ρ Ui for some scalar field ρ, then (48) is not sensitive to gradients of ρ which are perpendicular to the field Pi, that is, if
This is a very useful property because a gradient ρi can represent structural information and forcing it to 0 causes this information to be lost.
The problem is that
which is not useful if Pi is not a conserving field. (2.1) solves that problem.
The problem that was left was to show that the Euler Lagrange operators of (2.1), (2.2) and (2.3) are indeed tensor densities. That is a critical condition for considering such a theory to describe Nature.
We can thus conclude that Stable Homotopy need not require geodesics to exist, but does require minimizing deviations from the geodesic properties of the field that describes matter in vacuo.
As described, the Energy function, which is also described hereinabove as Far Span k−Distance Connection, can be extended to Riemannian geometry in the example forms (2.4) and (2.5). Further, the system referred to as the Transformatron can be numerically simulated such that the memory layer, the input layer and the domain in between are all described as coordinates of a Riemannian manifold, thus allowing convergence for difficult transformations between the pattern stored in the input layer and the pattern stored in the memory layer.
The above referenced conservation constraint may be explained as follows. A natural constraint relates to the conservation of matter and can be
Adding the constraint (A) to (2) we have
(A) along with (B) define a constrained variational equation.
In this case the constant λ can be chosen to be −K3 and yield a theory in which a non-conserving field is not lost or added to vacuo.
The present application includes a significant amount of theory and derivation of equations. This information is included to assist one skilled in the art in understanding the invention in detail. The inclusion of this theory and these derivations is not intended to limit the present invention.
It should be noted and understood that all publications, patents and patent applications mentioned in this specification are indicative of the level of skill in the art to which the invention pertains. All publications, patents and patent applications are herein incorporated by reference to the same extent as if each individual publication, patent or patent application was specifically and individually indicated to be incorporated by reference in its entirety.
This application is a continuation-in-part (CIP) of U.S. patent application Ser. No. 10/144,754 filed May 15, 2002 which claims priority to U.S. Provisional Patent Application Ser. No. 60/291,000, filed on May 16, 2001.
Number | Date | Country |
---|---|---|
WO 9704400 | Feb 1997 | WO |

Number | Date | Country |
---|---|---|
20050163384 A1 | Jul 2005 | US |

Relation | Number | Date | Country |
---|---|---|---|
Parent | 10144754 | May 2002 | US |
Child | 10810651 | | US |