DISCRETE FOURIER TRANSFORM IN A COMPLEX VECTOR SPACE

Information

  • Patent Application
  • 20120109559
  • Publication Number
    20120109559
  • Date Filed
    March 08, 2011
    14 years ago
  • Date Published
    May 03, 2012
    13 years ago
Abstract
An image-based phase retrieval technique has been developed that can be used on board a space based iterative transformation system. Image-based wavefront sensing is computationally demanding due to the floating-point nature of the process. The discrete Fourier transform (DFT) calculation is presented in “diagonal” form. By diagonal we mean that a transformation of basis is introduced by an application of the similarity transform of linear algebra. The current method exploits the diagonal structure of the DFT in a special way, particularly when parts of the calculation do not have to be repeated at each iteration to converge to an acceptable solution in order to focus an image.
Description
BACKGROUND OF THE INVENTION

1. Field of the Invention


This application relates to Discrete Fourier Transforms, and in particular, to Discrete Fourier Transforms in phase retrieval for use in waveform analysis.


2. Background


Phase retrieval is an image-based wavefront sensing method that utilizes point-source images (or other known objects) to recover optical phase information. The most famous application of phase retrieval was the diagnosis of the Hubble Space Telescope mirror edge defect discovered soon after the launch of Hubble and subsequent correction using Corrective Optics Space Telescope Axial Replacement (COSTAR.) It may be ironic that a phase retrieval wavefront-sensing method lies at the very heart of the commissioning process for Hubble's successor. The earliest suggestion of using phase retrieval as a wavefront sensing method for JWST was in 1989, nearly a year before the Hubble launch and deployment. Details documenting the Hubble Space Telescope phase retrieval analysis are discussed in the literature.


A number of image-based phase retrieval techniques have been developed that can be classified into two general categories, the (a) iterative-transform and (b) parametric methods. Modifications to the original iterative-transform approach have been based on the introduction of a defocus diversity function or on the input-output method. Various implementations of the focus-diverse iterative-transform method have appeared which deviate slightly by utilizing a single wavelength or by varying the placement and number of defocused image planes. Modifications to the parametric approach include minimizing alternative merit functions as well as implementing a variety of nonlinear optimization methods such as Levenburg-Marquardt, simplex, and quasi-Newton techniques. The concept behind an optical diversity function is to modulate a point source image in a controlled way; in principle, any known aberration can serve as a diversity function, but defocus is often the simplest to implement and exhibits no angular dependence as a function of the pupil coordinates


The discrete Fourier transform (DFT) calculation is presented in “diagonal” form. By diagonal we mean that a transformation of basis is introduced by an application of the similarity transform of linear algebra.


What is needed is to find a more efficient implementation of the DFT for applications using iterative transform methods, particularly phase retrieval.


BRIEF SUMMARY

The image-based wavefront sensing method of and apparatus for focusing by determining phase difference between received images, removing that phase difference by calculating a DFT for each signal and iteratively multiplying the resultant until a convergence is achieved comprises the steps of accepting point-source stellar objects to recover embedded optical phase information, receiving data from a sensor array wherein the data includes embedded phase information, calculating a similarity transform and pre-transforming the data once at the beginning of the calculation.


The image-based wavefront sensing method of and apparatus for further includes iteratively processing the data using arbitrary frequency spacing in a diagonal fashion until a wavefront convergence is achieved to determine a phase difference by implementing the DFT in a 1 dimensional linear array using just N operations, where 0<N<N×log(N)<∞, wherein N×log(N) is the number of operations of an FFT counterpart, back-transforming the data at the end of the calculation in a single step and focusing the received images based on the determined phase.


The general problem of diagonalizing the DFT matrix (DFT eigen-problem) has been around for years. Apparently some aspects of the problem are related to topics in number theory that were even solved by Gauss. Such considerations have not led to more efficient implementations of the DFT over the FFT implementation, but we note that for certain applications, e.g., phase retrieval, that it is possible to exploit the diagonal structure of the DFT in a special way, particularly when parts of the calculation do not have to be repeated at each iteration to converge to an acceptable solution.





BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS


FIG. 1 is a functional block diagram of a system which contains a discrete Fourier transform processor in a complex vector space, according to an example embodiment;



FIG. 2 is a flowchart representing the methodology of a discrete Fourier transform processor in a complex vector space , according to an example embodiment; and



FIG. 3 is another flowchart representing the methodology of a discrete Fourier transform processor in a complex vector space, according to an example embodiment.





DETAILED DESCRIPTION

Detailed illustrative embodiments are disclosed herein. However, specific structural and functional details disclosed herein are merely representative for purposes of describing example embodiments. Example embodiments may, however, be embodied in many alternate forms and should not be construed as limited to only the embodiments set forth herein.


Accordingly, while example embodiments are capable of various modifications and alternative forms, embodiments thereof are shown by way of example in the drawings and will herein be described in detail. It should be understood, however, that there is no intent to limit example embodiments to the particular forms disclosed, but to the contrary, example embodiments are to cover all modifications, equivalents, and alternatives falling within the scope of example embodiments. Like numbers refer to like elements throughout the description of the figures.


It will be understood that, although the terms first, second, etc. may be used herein to describe various elements, these elements should not be limited by these terms. These terms are only used to distinguish one element from another. For example, a first element could be termed a second element, and, similarly, a second element could be termed a first element, without departing from the scope of example embodiments. As used herein, the term “and/or” includes any and all combinations of one or more of the associated listed items.


The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of example embodiments. As used herein, the singular forms “a”, “an” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will be further understood that the terms “comprises”, “comprising,”, “includes” and/or “including”, when used herein, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.


Hereinafter, example embodiments will be described with reference to the attached drawings.


Example embodiments of the present invention include a Discrete Fourier Transform (DFT) in phase retrieval for use in waveform analysis in a space based application.


One such space based application under consideration to use DFT phase retrieval methods includes the James Webb Space Telescope (JWST), one of NASA's great observatories of the Origins program and is scheduled to launch later this decade. The JWST design includes using advanced optical technology and 0-30 grade beryllium mirrors. The primary mirror is a 6.5 meter segmented primary using 18 segments in a hexagonal array. The optical design and manufacturing tolerances are specified for diffraction-limited performance at 2 μm, and the telescope is designed for operation over a temperature range of 30-60 K with an areal density of less than 15 kg/m2. As a result, JWST commissioning and periodic optical maintenance incorporate san image-based wavefront sensing and control (WFSC) system to align the mirror segments, minimize the effects of figure error, and position the secondary mirror of the three-mirror anastigmat design. The wavefront sensing method specified for JWST is image-based in the sense that point-source stellar images are collected and processed to recover optical phase information. The primary camera for this function is the JWST Near Infrared Camera (NIRCam), although it has been shown that the Near Infrared Spectrograph (NIRSpec) is suitable for this function as well.



FIG. 1 shows a block diagram of a satellite or platform in which an embodiment of the instant invention is contained. The sensor array 130 is connected both physically and electronically to the platform housing 120 which is in turn likewise connected the processor 110 which contains the primary embodiments of the instant invention. The platform housing includes all the necessary parts (not shown) of a satellite such as fuel, electrical power, communications etc. Image-based wavefront sensing is computationally demanding due to the floating-point nature of the process. With certain data configurations and large array sizes, and particularly for under-sampled data sets, the calculations can take tens of minutes or even hours to reach full convergence for the optical wavefront. In addition, processors are generally susceptible to opto-mechanical instabilities induced by environmental factors such as jitter and turbulence. These factors not only limit fine-phasing performance but also the stability of the phasing results. In such cases, an obvious mitigation strategy is to increase the computational bandwidth of the WFSC system, i.e., decrease the time required for an entire closed-loop WFSC cycle.


To improve convergence times without sacrificing accuracy and to mitigate these environmental factors, we have developed special-purpose floating-point intensive computing hardware. These processors communicate using a series of multi-processor computing architectures specifically tailored to the JWST fine-phasing methodology. The computing system is based on a cluster of 64 DSPs (digital signal processors) operating in parallel under a shared memory architecture. The system is portable and callable from a laptop computer using a typical Ethernet connection, reaching a sustained floating-point performance of approximately 100 Giga-Flops with 64 DSPs for development, testing and hardware redundancy.


An embodiment of the instant invention is illustrated by some practical aspects of calculating in a diagonal DFT basis. Aside from the application to iterative Fourier methods, calculating in the diagonal DFT basis also has an interesting geometrical interpretation as solving for the Fourier transform pair, (Fn, fn), that are co-linear or “parallel” with one another. Additional discussion on this interpretation is given below.


As an overview we begin by writing down the matrix form of the DFT. Next the hybrid-index notation of Schouten is introduced as an efficient notation for performing the DFT calculations in a complex vector space. The similarity transformation is presented in detail for a low-dimensional DFT example (4×4 array size) and then applied to illustrate how the DFT can be carried out in a diagonal basis using the hybrid-index notation. The main results show that by “pre-transforming” the data once at the beginning of the calculation, and then “back-transforming” the data at the end of the calculation, the DFT can potentially be implemented as a 1-d vector product using just N operations, compared to the N×log(N) operations needed by the FFT counterpart. The “pre-” and “back-” transforms are implemented using the DFT similarity transform and we further note that the calculation can be implemented using arbitrary frequency spacing.


DFT as a Unitary Transform


The DFT calculation can by implemented in matrix form:












F


(

v
m

)




DFT


(

v
m

)



=

Δ





x





n
=
0


N
-
1





f


(

x
n

)




V
mn





,




(
1
)







where Vmn is a non-singular M×N Vandermonde array:






V
mn
=e
−i2πv

m

(nΔx)
=e
−i2π(m·n)/M=(e−i2π/M)m·nMm·n,   (2)


with components given by










V
mn

=


(



1


1


1


L


1




1



ω
M




ω
M
2



L



ω
M

N
-
1






1



ω
M
2




ω
M

2
·
2




L



ω
M

2
·

(

N
-
1

)







M


M


M


O


M




1



ω
M

M
-
1





ω
M


(

M
-
1

)

·
2




L



ω
M


(

M
-
1

)



(

N
-
1

)






)

.





(
3
)







Next we define Wmn as






W
mn
=V
mn
/√{square root over (M)}=(e−i2π/M)m·n/√{square root over (M)},   (4)


and then it is straightforward to show that Wmn is a unitary matrix:






W·W

=I,   (5)


using boldface matrix notation where I is the identity matrix and the “†” symbol denotes the complex-conjugate transpose or “adjoint.” We comment that the adjoint operation will be presented more generally in the next Section. Next letting






f
n
≡Δxf(xn),   (6)


Equation (1) takes the form of a unitary transformation:












F


(

v
m

)




F
m


=




n
=
0


N
-
1





W
mn



f
n




,




(
7
)







with the inverse transformation given by (generally but not always we take M=N):










f
n

=




m
=
0


M
-
1






(

W
mn

)






F
m

.







(
8
)







Given (5) and (7), the DFT can be analyzed using the transformation theory of complex vector spaces. The hybrid-index notation of Schouten supplies a natural framework for this approach and is presented in the next Section.


Hybrid Index Notation

As illustrated above, the DFT is a unitary transformation in a complex vector space. Therefore, the calculations can be performed elegantly and efficiently using the tensor-transformation formalism of Schouten. In particular, Schouten's hybrid-index notation provides a natural framework for the DFT calculations. The goal in using this approach is to give a more general representation of the calculations by identifying components with quantities of a geometric manifold. Often, but not always, such identification can lead to alternative or more efficient calculation strategies as discussed later. Therefore, the goal in the next Section is to sort out the details of the hybrid-index notation for the DFT, and thus identify the DFT with quantities in a geometric manifold. In addition, by adopting a notation that makes the transformation rules of a quantity explicit, the meaning of the calculations can often be clarified and the notation serves as an efficient bookkeeping device for checking consistency throughout the calculations.


DFT in Hybrid-Index Notation

The DFT is defined here as a unitary transform between conjugate Fourier domains. The 1-d DFT transform is then expressed using Schouten's hybrid-index notation as






F
m
=W
m
·n
f
n,   (9)


using the Einstein sum convention defined over repeated indices in an upper/lower combination:












n







W
mn



f
n






W

·
n

m




f
n

.






(
10
)







The lower/upper indices denote the covariant and contravariant transformation rules—the upper indices are not exponents in this notation. Exponents of ( a quantity will be denoted using parentheses, or brackets, respectively, for either a transformation of coordinates, or a “point transformation.” Allowable transformations of coordinates for this application will be discussed later. The distinction between the transformations of coordinates and point transformations are fundamental to Schouten's kernel-index method. To elaborate briefly, coordinate transformations are denoted by a transformation of basis, say, m→m′, where the new basis set is denoted by m′. The components in the new basis are thus






u
m
′=A
m
m
′u
m,   (11)


where AmM′ is the transformation matrix and the um′ are the vector components in the new basis set (u is arbitrary).


A point transformation, on the other hand, is denoted by






U
m
=P
·n
M
u
n,   (12)


and is the form adopted for the DFT in (9). Another important distinction between a transformations of basis and a point transformation is that the indices for the point transformation are ordered and the positions are marked using over- or under-dots as in P·nm (12). The main take-away is that for coordinate transformations, the kernel symbol, u, remains the same as in (11). But for point transformations, the kernel symbol changes (as in (12)) but the indices remain the same. So initially at least in this context, the DFT transformation in (9) is defined as a point transformation, but later we find that when calculating the DFT in a diagonal basis, it is natural to reconsider (9) using a common kernel symbol for the Fourier transform pair, specifically, (Fm, fn)→(fm, fn). We note also that it is possible to mix the two transformations (basis and point) as is required when considering the allowable transformations of basis in digital signal processing. These details will be discussed in more detail later.


Continuing with an explanation of the formalism, the complex conjugate of (9) is given by







F


m

= W

m

n

f


n

f

m,
  (13)


using the over-bar to denote the complex conjugation of components and kernel symbols in accordance with Schouten's hybrid-index notation and noting for consistency that







F
m
=Fmcustom-characterFm=custom-character=Fm.   (14)


The conjugation of indices and kernel objects is fundamental to the study of transformations in a complex vector space, and as illustrated below, is perfectly suited for calculations of the DFT transform.


We choose an initial convention that the data values are labeled by contravariant components, fm, and then by complex conjugation of the kernel symbol and indices in various combinations, several other types of components are represented in Schouten's notation by the “raising” and “lowering” of indices using the “fundamental” or metric tensor, αkl:





fm,fm, fm, fm,   (15)


where the latter two follow by complex conjugation of the first two. Specifically, we mean that






f
m
custom-character
f

m

=a

mm

f
m
; f

m

custom-character

f

m
=a

mm


f


m
,   (16)


As mentioned above, co- and contravariant transformation rules can be switched back and forth using the metric or fundamental tensor, akl, to raise or lower indices as appropriate. The metric tensor also defines how the differential element of distance is calculated in the manifold as well as the inner product between two vectors, say, uk and vl:






custom-character
u|v
custom-character=aklūkvllvtkvk,   (17)


pointing out that in a linear complex vector space, the “bra-ket” notation of Dirac is easily incorporated in this approach as illustrated on the LHS of (17). Illustrating the use of the metric for raising and lowering indices we have:






W
·n
m
=a

ln

W
m l and fnn kfk,   (18)


and then substituting these expressions back into (9) we see that an alternative form of the DFT is obtained:






F
m
=W
·n
m
f
n=(alnWm l)(an kfk)=alnan kWm lfkl· kWm lfkcustom-characterFm=Wm nfn.   (19)


Adjoint in a Geometric Manifold

In general, the adjoint operation applied to the W·nm, denoted by {tilde over (W)}·nm=(W·nm), is defined through a two-step operation using both the metric tensor and complex conjugation:

    • (a) each index is raised or lowered as appropriate using the metric:






a

nk

a
l m
W
·l
k
=W

n

· m,   (20)

    • (b) the result in (a) is then complex conjugated to give





(W·nm)†=custom-character=custom-character=Wn·m, tm (21)


and finally we write






{tilde over (W)}
·n
m=≡(W·nm)= Wn·m,   (22)


For comparison, in matrix boldface notation:





(W)\= WT=a Wa−1=W31 1,   (23)


where “T” defines the usual transpose operation. We note especially that the example in (23) illustrates a degree of non-generality in the way the adjoint is usually calculated in matrix notation. Specifically, defining the adjoint using components in a geometric manifold (using steps (a) and (b) above) can give different results than simply complex conjugating and then performing the transpose operation using the “T” as in (23). In special coordinates, ank≐δnk, these two are identical, but in general, that is not always the case for arbitrary ank.


Since the W·nm are unitary, the inverse,








W

-
1



·
n

m

,




is defined using the adjoint and is expressed (from (22)):












W
~


·
n

m

=



W

-
1



·
n

m

=



(

W

·
n

m

)



=


W
_

n

·
m





,




(
24
)







such that






W
·k
n
text missing or illegible when filed·kl.  (25)


We further comment that Schouten did not initially use an over-bar on the kernel symbol as in we have in (25). Considering the W·nm using two upper indices, Wm n (see (18) and (19)) the corresponding adjoint is similarly calculated:





(Wm n)\=amkalnWk l=custom-character=Wm n=text missing or illegible when filed.  (26)


The inverse DFT transformation to (9) is thus defined:






{tilde over (W)}
·m
n
F
m=text missing or illegible when filedW·kmfkδknfk=fn,   (27)


as expected, or when using the form of (19) the inverse DFT is expressed






{tilde over (W)}
m n
F
m=text missing or illegible when filed(Wm kfk)=δnkfk=fn.   (28)


A few additional comments from this Section: in tensor transformation theory, there is geometric meaning attributed to these differing but complementary transformation rules. Specific transformations are defined by an allowable Group of transformations, Ga, as discussed further in Schouten. The corresponding allowable transformations of basis for the DFT application (digital signal processing) can include fractional and integer shifts of the data origin in addition to scaling and phase transformations. The details for these various transformations in this notation will be discussed in a later set of notes.


Matrix Multiplication Using Index Notation

When calculating with components, index order is important for proper matrix multiplication and thus we adopt the following convention for a rank-2 quantity:










A

·

n


col




m

row


,




(
29
)







and the inverse of A·nm is notated as,








A

-
1



·
n

m

,




such that








A

·
n

m




A

-
1



·
k

n


=

δ

·
k

m





where δ·km is the Kronecker delta. To illustrate, the matrix multiplication of two matrices A and B is notated as:






C
·k
m
=A
·n
m
B
·k
n, (30)


and to be specific we calculate (30) using Mathematica with 2×2 arrays:





Table [Sum [(A[[um,n]]B[[n,lk]]), {n,1,NN}], {um,1,2}, {lk,1, 2}, ]]  (31)


where um denotes “upper-m” and lk denotes “lower-k;” the upper/lower dummy variable, n, is left un-notated in Mathematica with regard to its position. The result is of course (30):











C

·
k

m

=



A

·
n

m



B

·
k

n


=

AB
=

(






a

·
1

1



b

·
1

1


+


a

·
2

1



b

·
1

2








a

·
1

1



b

·
2

1


+


a

·
2

1



b

·
2

2










a

·
1

2



b

·
1

1


+


a

·
2

2



b

·
1

2








a

·
1

2



b

·
2

1


+


a

·
2

2



b

·
2

2






)




,




(
32
)







where AB is the conventional matrix product using matrix boldface notation. For comparison, transposing both the m-n and n-k index positions in (30) reverses the order and gives the BA matrix product:











C

·
k

m

=



A
n

·
m




B
k

·
n



=

BA
=

(






b

·
1

1



a

·
1

1


+


b

·
2

1



a

·
1

2








b

·
1

1



a

·
2

1


+


b

·
2

1



a

·
2

2










b

·
1

2



a

·
1

1


+


b

·
2

2



a

·
1

2








b

·
1

2



a

·
2

1


+


b

·
2

2



a

·
2

2






)




,




(
33
)







and the corresponding Mathematica code for (33) is





Table[Sum[(A[[n,um]]B[[lk,n]]), {n,1,NN}], {um,1,2}, {lk,1,2}]  (34)


Metric Tensor Deriving from Unitary Invariance


In summary of Schouten's notation, various geometric quantities are defined that are distinguished by their transformation rules using index position. A group of allowable transformations, Ga, is introduced and then several invariant quantities may be constructed based on invariance of a given symmetry group. One such quantity is the metric tensor, akl, defining “distance” in the manifold and also defining the inner product between any two vectors. Therefore, one goal in these notes is to solve for the DFT metric tensor, akl, by postulating nothing more than unitary invariance under the action of the point transformation, W·nm, as illustrated below. Specifically, given that the W·nm are defined as a unitary transformation, we introduce a metric tensor defined by invariance under the action of the W·nm transformation such that






W
·l
m
W

k

· n
a

nm

=a

kl
,   (35)


or in matrix form:






W a W
−1
=a.  (36)


Quantities that are invariant under the action of the unitary group are hermitian and thus the akl satisfy






a

kl
=(a kl)\lk,   (37)


The introduction of a hermitian metric akl into the En geometric manifold defines the manifold as a Un which differs from the earlier notation used by Schouten where this manifold was labeled as the {tilde over (R)}n.


General form of Parseval's Theorem


We can show that Parseval's theorem is really implied by unitary invariance, and not necessarily “energy conservation” as it is commonly referred to in the engineering literature. To begin, we consider the magnitude squared of Fm→|F|2, giving






custom-character
F|F
custom-character=|F|2=amnFmFn.  (38)


Substituting the DFT transformation rule from (9) into (38) we have





|F|2=anmW·kmWl· nfkfl,   (39)


but since the akl are invariant under this unitary transformation (see (35)) then the RHS of (39) simplifies and we get





|F|2=aklfkfl.   (40)


Therefore, by comparison with (38), Parseval's theorem is seen to hold for any solution of the aklas implied by unitary invariance (35), and thus in general:






a

mn


F


n

F
n
=a

kl


f


k

f
l.   (41)


The common form of Parseval's theorem is expressed in vector notation as





|F|2=|f|2.  (42)


To give some perspective on these results we look forward a bit and comment that the standard DFT approach adopts a special solution (coordinate system) for the akl given by the Kronecker delta (as shown later):






a

nk
≐δnk  (43)


and so in index notation (42) takes the familiar “Euclidean” form:





δmnFmFnklfkfl,   (44)


or as it is commonly written:






|F
1|2+|F2|2+L+|FN|2=|f1|2+|f2|2+L+|fN|2.  (45)


The point is that for a general akl, Parseval's theorem (41) will not necessarily take the Euclidean form as demonstrated later below (see (67)), although unitary invariance implies that (41) still holds.


A couple of additional comments: as discussed later the δkl is a solution for the akl and corresponds to a positive definite signature solution. The approach taken here is motivated by that fact that in general, the eigen-problem requires the use of a general metric, and by no means does this necessarily correspond to the Kronecker delta in a unitary complex vector space, Un. But even so, we later adopt the Kronecker delta solution to make progress on the main point of calculating the DFT in a diagonal basis, and for easy comparison with previously published results.


Metric Solutions

By postulating unitary invariance of the metric we can solve for the akl using (35) and (37). The W·km are known through identification with (4):











W

·
k

m




1

M




(




-

2π


(
mk
)



/
M


)



=


1

2





(



1


1




1



-
1




)

.






(
46
)







As discussed above, the inverse transformation of (46) is defined through the adjoint operation in (20). But at this point in the discussion we have not yet solved for the metric akl, which is needed to form the adjoint. Therefore, we first solve for the







W

-
1



·
k

n




such that












W

·
n

m




W

-
1



·
k

n


=

δ

·
k

m


,




(
47
)







and the







W

-
1



·
k

n




are given by:











W

-
1



·
k

m

=



W
_

k

·
m


=



1

M




(




2π


(
mk
)


/
M


)


=


1

2





(



1


1




1



-
1




)

.








(
48
)







Continuing to the general text missing or illegible when filed×2 metric, akltext missing or illegible when filed defined as











a


k
_


l


=


(




z


1
_


1





z


1
_


2







z


2
_


1





z


2
_


2





)

=

(





x
11

+








y
11







x
12

+








y
12









x
21

+








y
21







x
22

+








y
22






)



,




(
49
)







where the complex notation is dropped on indices of real components. The condition for unitary invariance (35) then gives the following simultaneous system of 4 equations and 8 unknowns:











1
2



(





z


1
_


1


+

z


1
_


2


+

z


2
_


1


+

z


2
_


2







z


1
_


1


-

z


1
_


2


+

z


2
_


1


-

z


2
_


2









z


1
_


1


+

z


1
_


2


-

z


2
_


1


-

z


2
_


2







z


1
_


1


-

z


1
_


2


-

z


2
_


1


+

z


2
_


2






)


=


(




z


1
_


1





z


1
_


2







z


2
_


1





z


2
_


2





)

.





(
50
)







The system (50) is under-determined and can therefore have many possible solutions but will be further constrained by the hermitian condition (37). Solving (50) eliminates 4 of the unknowns (z11 and z12) in terms of the other 4 unknowns (z21 and z22) showing that akl has the general form:










a


k
_


l


=


(





2


z


2
_


1



+

z


2
_


2






z


2
_


1







z


2
_


1





z


2
_


2





)

.





(
51
)







Next applying the hermitian (symmetry) condition (35) to (51) gives the condition that the 2×2 akl must be real valued:











(





2


y
21


+

y
22





y
21






y
21




y
22




)

=



(



0


0




0


0



)



Im


(

a


k
_


l


)



=
0


,




(
52
)







but as expected there are multiple solutions for the akl in terms of the 2 final parameters x21 and x22. The general solution for the 2×2 case is thus reduced from 8 to 2 unspecified real components and is given by










a


k
_


l


=


(





2


x
21


+

x
22





x
21






x
21




x
22




)

.





(
53
)







As a check, substituting (53) back into (35) verifies that this akl is invariant under the action of the W·km for arbitrary (but real) numerical values of x21 and x22 in (53).


We have shown that an infinite number of solutions exist for the 2×2 metric, akl, as implied by unitary invariance in terms of two real parameters. We note that one of the simplest and most important solutions is a special case of (53) when x21=0 and x22=1 giving the Kronecker delta as mentioned earlier:













a
1



k
_


l








x
21


0



x
22


1






=
*




(



1


0




0


1



)



=
*



δ



k
_


l









,




(
54
)







where the complex conjugation is kept for consistency, and the numeral “1” beneath the kernel symbol in Schouten's notation labels this case as a distinct solution, different from other solutions, say








a
2



k
_


l


.




The labeling of kernel symbols in this way is used extensively in the eigen-problem discussed later.


We comment that since (54) is positive definite, as discussed at greater length below in the Section titled Generalized DFT, there will be no distinction between the other various quantities that may be derived from the akl in the U2, and thus, no numerical distinction exists for components labeled by upper/lower indices, or even the complex conjugation of indices. This is seen in the solution (54) above.


As illustrated above there are an infinite number of solutions for the akl. Several specific solutions are shown in (55) based on various choices for the x21 and x22 components:














a
1



k
_


l








x
21


0



x
22


1






=
*



(



1


0




0


1



)


;




a
2



k
_


l








x
21


1



x
22



-
1







=
*



(



1


1




1



-
1




)














a
3



k
_


l








x
21


1



x
22


0






=
*



(



2


1




1


0



)


;




a
4



k
_


l








x
21



-

1
2





x
22


1






=
*



(



0



-

1
2







-

1
2




1



)



,





(
55
)







Given some specific solutions for the akl in (55), we examine various other quantities that are derived in Schouten's notation. For instance, consider the second solution in (55) that is a multiple of the W·nm:












a


k
_


l





a
2



k
_


l





=
*





(



1


1




1



-
1




)




a
_


k


l
_






=
*




1
2



(



1


1




1



-
1




)




,




(
56
)







and then for generality define a Complex data vector:






f
n=(z1,z2)=(x1+iy1,x2+iy2)  (57)


The DFT transformation (9) is thus given by










F
m

=



W

·
n

m



f
n


=


1

2





(



z
1

+

z
2


,


z
1

-

z
2



)

.







(
58
)







As a consistency check on both the solution for the akl of (56), and the legitimacy of constructing the inverse transformation by use of the adjoint operation in (20), i.e.,














W

-
1



·
n

m

=



(

W

·
n

m

)





?



,






?



indicates text missing or illegible when filed







we can check the inverse transformation calculated by way of the adjoint:




















(

W

·
n

m

)



=


a


n
_


k




a

l


m
_





W

·
l

k








=


?



1

2




?



1
2



?








=


1

2




(



1


1




1



-
1




)









=


W

-
1



·
n

m


,











?



indicates text missing or illegible when filed







(
59
)







in agreement with (48). The inverse text missing or illegible when filed transformation is also text missing or illegible when filed for consistency giving












W
~


·
n

m



F
m


=



1

2




(



1


1




1



-
1




)



(





z
1

+

z
2








z
1

-

z
2





)


=


(


z
1

,

z
2


)

=


f
n

.







(
60
)







Several other quantity are derived from the fn using the akl:






f
n=(z1,z2); fn=ankfk=(z1+z2,z1−z2) custom-character= fk=(z1, z2); fk=ankfn=(z1+ z2, z1z2),  (61)


and comparing the RHS's in (61), as expected we have fn=custom-character, showing consistency using the hybrid-index notation. Correspondingly, various other Fm quantities are given by












F
m

=


1

2




(



z
1

+

z
2


,


z
1

-

z
2



)



;


F

m
_


=



a


m
_


k




F
k


=


2



(


z
1

,

z
2


)














F
_


m
_


=


1

2




(




z
_


1
_


+


z
_


2
_



,



z
_


1
_


-


z
_


2
_




)



;



F
_

m

=



a


n
_


m





F
_


n
_



=


2



(



z
_


1
_


,


z
_


2
_



)









(
62
)







and components for the Wm n and the Wmn are also calculated:












W

m


n
_



=



a

k


n
_





W

·
k

m


=

(




1

2




0




0



1

2





)



;








W


m
_


n


=



a


k
_


n




W

m
_


·

k
_




=


(




1

2




0




0



1

2





)

.







(
63
)







Using the fn from (61) and then comparing with the Fm in (62) we can check the forward DFT for consistency using the Wm n of (63):














W

m


n
_





f

n
_



=


(




1

2




0




0



1

2





)



(





z
1

+

z
2








z
1

-

z
2





)








=


1

2




(



z
1

+

z
2


,


z
1

-

z
2



)








=


F
m

.








(
64
)







The adjoint operation on the Wm n and Wmn of (63) is similarly verified:




















(

W

·
n

m

)



=

?







=


a

m


k
_





a


n
_


l




W

m


n
_










=


(



1


1




1



-
1




)



(




1

2




0




0



1

2





)



1
2



(



1


1




1



-
1




)









=

(




2



0




0



2




)


,











?



indicates text missing or illegible when filed







(
65
)







such that






W
m n
text missing or illegible when filedkn.   (66)


Finally, Parseval's theorem is demonstrated:






a

mn


F


m

F
n
=a

kl


f


k

f
l
=|z
1|2−|z2|s,   (67)


which is not in Euclidean form but is nevertheless invariant.


Comments on a General DFT

Further restrictions may be placed on the solution values for the x21 and x22 in (53), in accordance with the classification of the akl as being positive definite, negative definite, or indefinite, corresponding to the number of values (or signature)>0, <0, or mixed, respectively, along the diagonal of the akl). Note that when akl is definite that there is no distinction between the various quantities that may be derived from akl in the U2 and thus, no numerical distinction exists for components labeled by upper/lower indices, or even the complex conjugation of indices; these notations are merely formal in this instance. We comment further that since the akl are required when solving the eigen-problem in general (more details on this below) there can in principle at least, be different solutions for the eigen-problem corresponding to different solutions for the akl. Note that all earlier work on the DFT eigen-problem uses the akl≐δkl solution without considering the possibility that the akl might be defined in a more general way based on unitary invariance. Thus, different solutions for the akl may lead to possibly different DFT eigen-quantities, but all the while preserving unitary invariance and the hermitian character of the akl. Thus, from the perspective of the transformation theory of complex vector spaces, we can regard some properties of the DFT as deriving from the characteristics of the akl, i.e., that (a) akl is invariant under unitary transformations and (b) is hermitian. Condition (b) really derives from (a) but it is convenient when solving for the akl components (as illustrated above) to regard (b) as an independent condition.


Similarly as in the 2×2 example, we can solve for the akl corresponding to higher dimensional data sets. Of special interest are solutions in which the akl are complex valued since these solutions provide further consistency checks on the proper use of Schouten's' notation. We proceed by deriving the 4×4 W·km and its inverse,








W

-
1



·
k

m

,




from (46):












W

·
k

m

=


1
2



(



1


1


1


1




1



-
i




-
1



i




1



-
1



1



-
1





1


i



-
1




-
i




)



;












W

-
1



·
k

m

=

?







=


W
_

k

·
m









=


1
2



(



1


1


1


1




1


i



-
1




-
i





1



-
1



1



-
1





1



-
i




-
1



i



)



,











?



indicates text missing or illegible when filed






(
68
)







and a general 4×4 metric is defined by










a


k
_


l




(




z


1
_


1





z


1
_


2





z


1
_


3





z


1
_


4







z


2
_


1





z


2
_


2





z


2
_


3





z


2
_


4







z


3
_


1





z


3
_


2





z


3
_


3





z


3
_


4







z


4
_


1





z


4
_


2





z


4
_


3





z


4
_


4





)





(
69
)







The condition for unitary invariance (35) then gives a simultaneous system of 16 equations and 32 unknowns which can be further reduced to 6 free parameters by imposing the hermitian condition (37) on the akl. After considerable algebra the general solution to (35) for the akl, constrained by the hermitian condition (37) is given by










a


k
_


l


=


(





x
33

+

2


(


x
41

+

x
43


)







x
41

-

iy
43





(


x
33

+

x
41

-

x
42

+

x
43

-

x
44

+

2


iy
43



)





x
41

-

iy
43








x
41

+

iy
43





x
44





x
43

+

iy
43





x
42






(


x
33

+

x
41

-

x
42

+

x
43

-

x
44

+

2


iy
43



)





x
43

-

iy
43





x
33





x
43

-

iy
43








x
43

+

iy
43





x
42





x
43

+

iy
43





x
44




)

.





(
70
)







As before, there are an infinite number of both real and complex valued 4×4 solutions for the akl in terms of the 6 free parameters in (70). Several example solutions corresponding to basic numerical values for the x and y components are listed in (71):











a
1


k


l
_












x
33


1

,


x
44


1





x
41


0

,


x
42


0






x
43


0

,


y
43


0








=
*



(



1


0


0


0




0


1


0


0




0


0


1


0




0


0


0


1



)


;







a
2


k


l
_












x
33


1

,


x
44


1





x
41


0

,


x
42


0






x
43


0

,


y
43


1







=
*




(



1



-
i




2

i




-
i





i


1


i


0






-
2


i




-
i



1



-
i





i


0


i


1



)








a
3


k


l
_












x
33


1

,


x
44


1




x

41
,






-
1



x
42



0






x
43


1

,


y
43


0








=
*



(



1



-
1



0



-
1






-
1



1


1


0




0


1


1


1





-
1



0


1


1



)


;







a
4


k


l
_












x
33


1

,


x
44


1





x
41



-
1


,


x
42


0






x
43


0

,


y
43


0







=
*




(




-
1




-
1




-
1




-
1






-
1



1


0


0





-
1



0


1


0





-
1



0


0


1



)

.
















(
71
)







Once again we pick a solution and then illustrate a few of the basic quantities that are derived for the DFT in Schouten's notation. For generality, consider the complex-valued solution of (71) for the akl:












a


k
_


l





a
2



k
_


l





=
*



(



1



-
i




2

i




-
i





i


1


i


0






-
2


i




-
i



1



-
i





i


0


i


1



)


,




(
72
)







and then the inverse is given by











a

-
1




k
_


l


=


a

k


l
_





=
*




1
7




(



1




-
2

-
i





-
2

+

2

i






-
2

-
i







-
2

+

2

i




5



2
+
i




-
2







-
2

-

2

i





2
-
i



1



2
-
i







-
2

+
i




-
2




2
+
i



5



)

.







(
73
)







To illustrate some specific quantities we define a general complex “data” vector:






f
n=(z1,z2,z3,z4),  (74)


and then the DFT transformation (9) is given by










F
m

=



W

·




n

m



f
n


=


1
2




(





z
1

+

z
2

+

z
3

+

z
4








z
1

-

iz
2

-

z
3

+

iz
4








z
1

-

z
2

+

z
3

-

z
4








z
1

+

iz
2

-

z
3

-

iz
4





)

.







(
75
)







Applying the inverse DFT transformation shows that as expected:















W
~


·
m

n



F
m


=




1
4



(



1


1


1


1




1


i



-
1




-
i





1



-
1



1



-
1





1



-
i




-
1



i



)



(





z
1

+

z
2

+

z
3

+

z
4








z
1

-

iz
2

-

z
3

+

iz
4








z
1

-

z
2

+

z
3

-

z
4








z
1

+

iz
2

-

z
3

-

iz
4





)








=



(


z
1

,

z
2

,

z
3

,

z
4


)







=




f
n

.








(
76
)







Other quantities are derived from the fn using the akl of (72), e.g., consider











f

k
_


=



a


k
_


n




f
n


=


1
2



(





z
1

-

iz
2

+

2






iz
3


-

iz
4








iz
1

+

z
2

+

iz
3









-
2







iz
1


-

iz
2

+

z
3

-

iz
4








iz
1

+

iz
3

+

z
4





)




,




(
77
)







and we further check that fn=text missing or illegible when filed, by calculating:














a


k
_


n





(

f
k

)

_


=


a


k
_


n





f
_


k
_









=


f
_

n







=


1
2




(






z
_

1

+

i



z
_

2


-

2





i



z
_

3


+

i



z
_

4










-
i




z
_

1


+


z
_

2

-

i



z
_

3










-
2






i



z
_

1


+

i



z
_

2


+


z
_

3

+

i



z
_

4










-
i




z
_

1


-

i



z
_

3


+


z
_

4





)

.









(
78
)







Comparing fn in (77) to the complex conjugation of fn in (78) we see that as expected:






f

n
=text missing or illegible when filed,   (79)


demonstrating consistency using the hybrid-index notation. Correspondingly, we can lower indices on the Fm to get













F

k
_


=




a


k
_


m




F
m








=




1
2




(





z
1

+


(

1
-

2

i


)



z
2


+


(

1
+

4

i


)



z
3


+


(

1
-

2

i


)



z
4










(

1
+

2

i


)



z
1


-

iz
2

-


(

1
-

2

i


)



iz
3


+

iz
4









(

1
-

4

i


)



z
1


-


(

1
+

2

i


)



z
2


+

z
3

-


(

1
+

2

i


)



z
4










(

1
+

2

i


)



z
1


+

iz
2

+


(


2

i

-
1

)



z
3


-

iz
4





)

.









(
80
)







Parseval's theorem is invariant as expected:













a
2



m
_


n





F
_


m
_




F
n


=




a
2



k
_


l





f
_


k
_




f
l


=





z
1



2

+




z
2



2

+




z
3



2

+




z
4



2




,




(
81
)







but the Euclidean form in (81) deriving from this particular







a
2



m
_


n





is not necessarily expected. As pointed out earlier, Parseval's theorem can take different forms for the different akl solutions. E.g., considering solution 4 of the akl in (71) we have















a
4



m
_


n





F
_


m
_




F
n


=





a
4



k
_


l





f
_


k
_




f
l









=




-




z
1



2


+




z
2



2

+




z
3



2

+




z
4



2

-

2



z
1



(


z
2

+

z
3

+

z
4


)





,







(
82
)







which is invariant but not real valued.


Eigenvalue-Eigenvector Problem

Given the formalism above, we can examine the eigenvalue-eigenvector (eigen-problem) from a more general viewpoint than what is normally discussed. The additional generality stems from the possibility that the metric is not necessarily the Kronecker delta, constrained only by its invariance under unitary transformations and hermitian symmetry. We have shown from a general viewpoint that the akl≐δkl solution is an important “base” solution without additional worries that perhaps something was overlooked. We have also shown that other solutions for the akl exist based simply on unitary invariance, and thus, more general calculations of the DFT and its various quantities are easily extended to arbitrary basis sets. Some advantages in calculation may be possible in these other general basis sets and we will reserve this topic for a future set of notes. For now, we examine a specific basis set that follows by solution to the eigen-problem as discussed below.


Before examining the specifics of the eigen-problem for the DFT, some generalities are discussed. The eigenvalue-eigenvector problem seeks solutions to the following set equations:












T


m
_


n





v
a

n


=


σ
a




v
a


m
_




,




(
83
)







for the vectors







v
a

n




of a general rank-2 quantity, Tmn. The scalar values,







σ
a

,




are the eigenvalues and the







v
a

n




are the eigenvectors. The “a” index below the kernel object, in this case






,

v
a





and






σ
a




does not refer to a component, but rather, the a-th enigenvector,






v
a




corresponding to the a-th eigenvalue,






σ
a




as separate and distinct solutions. We therefore choose indices from the set, {a, b, c, d}, to label the kernel objects corresponding to distinct eigen-solutions, and then reserve the index range, {k, l, m, n}, to label components of a given basis.


Continuing to the DFT application we substitute Wmn of (26) into (83):












W


m
_


n





v
a

n


=


σ
a




v
a


m
_




,




(
84
)







and make special note that we have switched to a common kernel symbol for the Fourier transform pair, (Fm, fn)→(vm, vn), with the substitutions:






v
n
≡f
n and Fm≡vm.   (85)


Looking ahead, we mention now that the solution to (84) results in a transformation of the Wmn into a diagonal basis by use of the similarity transform. In the context of the present application, and noting expression (9) for the Fourier transform, (84) then has an interesting geometrical interpretation as solving for the Fourier transform pair, (vm, vn), that are “parallel” to one another, or equivalently, left invariant under the action of the DFT (aside from the scaling factor σ). To give greater emphasis to this last comment, consider the overall structure of the DFT transformation in a diagonal basis for the 4×4 case (for now using the notation of (9) to emphasize this last point):














W


m
_


n





v
a

n


=



σ
a




v
a


m
_






(




σ
1



0


0


0




0



σ
2



0


0




0


0



σ
3



0




0


0


0



σ
4




)



(




f

1








f

2








f

3








f

4






)









=

(





σ
1



f

1










σ
2



f

2










σ
3



f

2










σ
4



f

2







)








=


σ
a



(




F

1








F

2








F

3








F

4






)



,







(
86
)







where the primes on indices denote the data and its transform in the diagonal basis as discussed further below. Thus as mentioned above, the Fourier transform pair are co-linear when calculated in the diagonal basis:











(




F

1








F

2








F

3








F

4






)

=

(





(


σ
1

/

σ
a


)



f

1










(


σ
2

/

σ
a


)



f

2










(


σ
3

/

σ
a


)



f

3










(


σ
4

/

σ
a


)



f

4







)


,




(
87
)







differing only by a scalar factor. This example illustrates the geometrical significance of the DFT transform in a diagonal basis and motivates using the same kernel symbol for the Fourier transform pair in (85), since the quantities are co-linear. This difference in notation is to be contrasted with the approach taken so far where the kernel symbols have been kept distinct, and the W·nm transformation was regarded as a point transformation.


Continuing with the eigen-problem calculation we use the metric tensor to show that (84) is equivalently expressed in the form:












(


W


m
_


n


-


σ
a



a


m
_


n




)




v
a

n


=
0

,




(
88
)







or also as











(


W

·
n

m

-


σ
a



δ

·
n

m



)




v
a

n


=
0.




(
89
)







And thus the eigen-problem, when cast in the form of (89), does not depend on a specific choice for the metric. Thus we solve the DFT eigen-problem in the form of (89). The eigenvalues for the W·nm are then solutions to










det


(


W

·
n

m

-


σ
a



δ

·
n

m



)


=
0.




(
90
)







Eigen-Problem for the DFT Matrix

The eigenvector-eigenvalue problem for the DFT matrix in the form of (89) has been examined extensively and so we review some of these basic results before examining the diagonal DFT calculation in detail. Since





W4=I,  (91)


the eigenvalues for the DFT matrix for N>3 are non-unique and are multiples of the so-called “roots of unity:”












σ
a

=



π






a
/
2




;

a
=
1


,





,



4
;



σ
a





{


±
1

,

±
i


}

.






(
92
)







Specifically, (92) then becomes











σ
1

=
1

,


σ
2

=

-
1


,


σ
3

=
i

,


σ
4

=

-

i
.







(
93
)







The eigenvalues of the DFT transform are not all distinct for N≧4, so there are many possible sets of eigenvectors satisfying (89). As a consequence of having these non-unique eigenvalues, the corresponding eigenvectors for these degenerate eigenvalues are not necessarily orthogonal. The importance of having an orthogonal set of eigenvectors for the present application is not known, but should this turn out to be an issue, the Gramm-Schmidt procedure can always be applied or an orthogonal set can be obtained. Further consequences and examples of this analysis for higher dimensional DFTs will be discussed in a future set of notes. For now we examine a low-dimensional example to map out some of the basic features.


Eigenvalues

Considering M=N=4, the DFT transform and its inverse is given in (68) which is reproduced below:








W

·
k

m

=


1
2



(



1


1


1


1




1



-
i




-
1



i




1



-
1



1



-
1





1


i



-
1




-
i




)



;








W

-
1



·
k

m

=


W






%

·
k

m


=



W
_

k

·
m


=


1
2




(



1


1


1


1




1


i



-
1




-
i





1



-
1



1



-
1





1



-
i




-
1



i



)

.








Considering solutions of (90) we have








det


(


W

·
n

m

-


σ
a



δ

·
n

m



)


=
0

,




and for the 4×4 example has the solution:













(


σ
a

-
1

)

2



(


σ
a

+
1

)



(


σ
a

+
i

)


=

0


{



σ
1

=
1

,


σ
2

=

-
1


,


σ
3

=
1

,


σ
4

=

-
i



}



,




(
94
)







with one degenerate eigenvalue,







σ
1

=


σ
3

=
1.





Multiplicity of Eigenvalues for the DFT Matrix


As discussed above, the significance of (91) for the DFT eigenvalue problem is that degenerate eigenvalues appear for N≧4. The “multiplicities” of these eigenvalues are defined as the number of times each eigenvalue appears. For the 4×4 example in (94) we therefore have






m
1=2,m2′=1,m3=1,m4=0,   (95)


where the mk ordering corresponds to the ordering of (93). These results can be generalized to higher dimensions by the following Table of multiplicities for each eigenvalue:













TABLE 1





M = N
m1forσ1=1:
m2forσ2=-1:
m3forσ3=i:
m4forσ4=-i:







4m
m + 1
m
m
m − 1


4m + 1
m + 1
m
m
m


4m + 2
m + 1
m + 1
m
m


4m + 3
m + 1
m + 1
m + 1
m









By inspection of Table 1, the 4×4 case corresponds to the m=1 value in the top row giving the multiplicities of eigenvalues listed in (95). Similarly, it can be shown that for the 5×5 case, the eigenvalues are solutions of













(


σ
a

-
1

)

2



(


σ
a

+
1

)



(



σ
2

a

+
1

)


=

0


{



σ
1

=
1

,


σ
2

=

-
1


,


σ
3

=
i

,


σ
4

=

-
i


,


σ
5

=
1


}



,




(
96
)







corresponding to the 2nd row of the Table with m=1. From (96) (or equivalently Table 1) the multiplicities for the 5×5 case are thus






m
1=2,m2=1,m3=1,m4=1.  (97)


To illustrate a higher dimensional example, consider say M=N=131=4.32+3, corresponding to the third row of the Table giving the following multiplicities:






m
1=33,m2=22,m3=22,m4=32.   (98)


Eigenvectors

Given the eigenvalues in (94) the eigenvectors corresponding to each eigenvalue are constructed using (89). It is instructive to consider this process for the eigenvectors corresponding to the degenerate first and third eigenvalues,







σ
1

=


σ
3

=
1.





The first eigenvector is obtained as a solution to the set of linear equations:













(


W

·
n

m

-


σ
a



δ

·
n

m



)




v
a

n






a

1



=


0



(




1
-

σ
1




1


1


1




1




-
i

-

σ
1





-
1



i




1



-
1




1
-

σ
1





-
1





1


i



-
1





-
i

-

σ
1





)



(





v
1

1







v
1

2







v
1

3







v
1

4




)



=

(



0




0




0




0



)



,




(
99
)







which simplifies to










(





-


v
1

1


+


v
1

2

+


v
1

3

+


v
1

4









v
1

1

-


(

2
+
i

)




v
1

2


-



v
1

3


i



v
1

4










v
1

1

-


v
1

2

-


v
1

3

-


v
1

4









v
1

1

+

i



v
1

2


-


v
1

3

-


(

2
+
i

)




v
1

4






)

=


(



0




0




0




0



)

.





(
100
)







The solution to (100) for







v
1

n




is therefore given in terms of two arbitrary components, say








v
1

3






and







v
1

4



:













(



v
1

1

,


v
1

2

,


v
1

3

,


v
1

4


)

=


(




v
1

3

+

2



v
1

4



,


v
1

4

,


v
1

3

,


v
1

4


)

.





(
101
)







We point out that two components are required because of the two-fold degeneracy of the







σ
1

=


σ
3

=
1





eigenvalue. In general, the eigenvectors corresponding to unique eigenvalues are specified in terms of a single component. Continuing we arbitrarily choose








v
1

4

=
1




and









v
1

3

=
0

,




to obtain a realization for the first eigenvector:













(



v
1

1

,


v
1

2

,


v
1

3

,


v
1

4


)

=


(




v
1

3

+

2



v
1

4



,


v
1

4

,


v
1

3

,


v
1

4


)








v
1

4


1




v
1

3


0








v
n

1









=


(

2
,
1
,
0
,
1

)

.








(
102
)







Note also that since,








σ
1

=


σ
3

=
1


,




are degenerate, the solution to (99) is trivially the same as (100). Therefore, a solution for







v
3

n




follows immediately by choosing complementary but arbitrary numerical values for the







v
1

3




and







v
1

4




in (102), e.g.,








v
3

4

=
0




and











v
3

3

=


1
:





(



v
3

1

,


v
3

2

,


v
3

3

,


v
3

4


)


=



(




v
3

3

+

2



v
3

4



,


v
3

4

,


v
3

3

,


v
3

4


)







v
1

3


1


?







v
3

n



=



(

1
,
0
,
1
,
0

)

.





?




indicates text missing or illegible when filed








(
103
)







Similarly, we can solve for the remaining eigenvectors corresponding to their eigenvalues,







σ
2

=

-
1





and







σ
4

=

-

i
.






In summary, a set of engenvectors (but not the only set) for the 4×4 DFT transformation is summarized below:












v
1

n

=

(



2




1




0




1



)


,



v
2

n

=

(




-
1





1




1




1



)


,



v
3

n

=

(



1




0




1




0



)


,



v
4

n

=


(



0





-
1





0




1



)

.






(
104
)







Since two of the eigenvalues are degenerate, (102) and (104) collectively do not form an orthogonal system as can be verified by forming the inner product between any two vectors using the “base” solution,








a
1



k
_


l




=
*



δ


k
_


l






of (71):














v
m



v
n




=




a
1



k
_


l






v
m

_


k
_





v
n

l


=



δ


k
_


l






v
m

_


k
_





v
n

l


=

(



3


0


1


0




0


2


0


0




1


0


1


0




0


0


0


1



)




,




(
105
)







where the off-diagonal entries correspond to the







σ
1

=


σ
3

=
1





value. For reference, we can still construct an orthonormal set from (104):












u
1

n

=


1

6




(



2




1




0




1



)



,



u
2

n

=


-

1
2




(




-
1





1




1




1



)



,



u
3

n

=


1

2


3





(



1





-
1





3





-
1




)



,



u
4

n

=


1

2





(



0





-
1





0




1



)

.







(
106
)







and verify as expected that











δ


k
_


l






u
m

_


k
_





u
n

l


=


δ
mn

.





(
107
)







Similarity Transform


Given the eigenvectors of (104) we form the “similarity” transform using the eigenvectors as columns. We denote this as a transformation of basis and we find by construction that the similarity transform, Smm′, and its inverse,








S

-
1


m

m



,




are given by











S
m

m



=

(



2



-
1



1


0




1


1


0



-
1





0


1


1


0




1


1


0


1



)


;



S

-
1


m

m



=


1
4




(



1


1



-
1



1





-
1



1


1


1




1



-
1



3



-
1





0



-
2



0


2



)

.







(
108
)







When using the kernel-index method for transformations of basis, say xm′=Amm′xm. that the Amm′ quantities do not use dots above or below an index to mark index placement. This differs from the notation for point transformations, which does use the dots e.g., Fm=W·nmfn. The reason for this is to discourage the “raising” or “lowering” of indices on the Amm′ to avoid an inconsistency when using the metric tensor. This inconsistency can occur because the metric akl also transforms under the Amm′, and thus has different components in the new coordinate system. For point transformations however, no coordinate transformation occurs and thus indices can be raised or lowered using a single (unambiguous) metric.


We then verify that applying the similarity transformation to the W·nm gives as expected:











SWS

-
1





S
m

m





W

·
n

m




S

-
1



n


n



=


W

·

n




m



=


(




σ
1



0


0


0




0



σ
2



0


0




0


0



σ
3



0




0


0


0



σ
4




)

=


(



1


0


0


0




0



-
1



0


0




0


0


1


0




0


0


0



-
i




)

.







(
109
)







As a check on index ordering we run the following line of Mathematica code with Si defined as the inverse of the Sin (108):





Table [Sum[S[[m,ump]]W[[m,n]]Si[[lnp,n]], {n,l,M}, {m,l,M}], {ump,l,M}, {lnp,l,M}]  (110)


where ump denotes “upper-m-prime” and lnp is “lower-m-prime.”


It is of further interest to point out that if the similarity transform is constructed using the orthonormal eigenvectors of (106) as columns (and correspondingly it inverse), then the application of the “orthonormal” similarity transform to the W·nm also gives (109). Therefore, the similarity transform that leaves the W·nm in diagonal form is certainly not unique, so many possible choices can be made for the Snm′. So as indicated by calculations thus far, it does not seem to matter whether or not the eigenvectors forming the Snm′ are orthogonal, and this can greatly simplify the initial calculations needed for calculating in the diagonal basis.


DFT Calculation in Diagonal Form

The diagonal form of the DFT in (109) shows that the DFT calculation can be performed both efficiently and elegantly as simply a multiplication of the entries on the main diagonal (the eigenvalues), to the corresponding data components that have been pre-multiplied using the similarity matrix. This means that after pre-multiplying the data, the DFT transform is reduced to a vector product consisting of N operations rather than a matrix multiply with N2 operations (or N log N when using the FFT). In reality though, this reduction in computation cost is only partly true in terms of the performance gains, i.e., the diagonal DFT representation does obviously lead to a speedy calculation for one part of the DFT operation, but in general, this gain is offset by the added computational cost of having to pre-transform the data and then back-transform to bring the Fourier transform into the original basis as illustrated below. But the key is that these latter operations may not need to be performed at every step in an iterative transform application, such as with phase retrieval. Additional details on this calculation approach are outlined below.


To begin, we write down the DFT calculation in the diagonal basis after switching the notation back to distinct kernel symbols for the data and its transform:






f
m
′=W
·n
m
′f
n′,   (111)


where the m′ represents the transformation of basis under the action of the similarity transform as illustrated above in (109). Since the only non-zero values for the W·n′m′ are along the diagonal, we form a vector,







σ

m



,




form the eigenvalues (i.e., the diagonal entries of the W·n′m′) as in (94). Continuing with the calculation of (111, the starting data, fk, are pre-multiplied using the inverse similarity transform,








S

-
1


n

n



,




to give










f

n



=



S

-
1


n

n






f
n

.






(
112
)







The diagonal DFT is then the product of the N components of the






σ

m






and the fn′ defined by (112):











F

m



=


σ

m



·

f

m





,




(
113
)







where no sum is implied over the (m′,m′) pair. To complete the calculation, the “diagonal” DFT components in (113) need to be transformed back to the original DFT basis, Fm. To obtain this result we express the LHS of (111) in terms of components in the original basis:











F

m



=



S

-
1


m

m





F
m



,




(
114
)







and therefore the original Fourier transform components are readily obtained by back-transforming the Fm′ of (114) using the similarity transform:










F
n

=



S

n


n



F

n




=



S

n


n




S

-
1


m

n





F
m


=


δ
m
n




F
m

.








(
115
)







The inverse DFT is also calculated in a diagonal basis since the inverse of a diagonal matrix is found by populating its diagonal entries with the inverse of the individual components. Therefore, given the W·nm′,the inverse is simply (no sum is implied on the RHS over the (m′, m′) pair):












W

-
1










n




m



=


(

1
/

σ

m




)



δ

n



m





,




(
116
)







and then inverse DFT transformation in the diagonal basis gives as expected:












(

1
/

σ

m




)

·

F

m




=



(

1
/

σ

m




)

·

σ

m



·

f

m




=

f

m





,




(
117
)







The overall point is that for applications where it is feasible to perform the DFT in a diagonal basis, the similarity transform and its inverse can be pre-calculated. The multiplication of the similarity transform with the data can also be pre-calculated. Then at an appropriate point, the DFT results can be transformed back to the original basis using the inverse similarity transform, preferably at the end of an iterative loop. The approach is summarized in FIG. 2.


Example: 4×4 Diagonal DFT Calculation


The steps above are illustrated for the 4×4 example. Consider the general complex data vector of (74):






f
n=(z1,z2,z3,z4).  (118)


The inverse similarity transform of (108) times the data vector (118) is given by











f

m



=




S

-
1


m

m





f
m


=


1
4



(





z
1

+

z
2

-

z
3

+

z
4








-

z
1


+

z
2

+

z
3

+

z
4








z
1

-

z
2

+

3


z
3


-

z
4







2


(


-

z
2


+

z
4


)





)




,




(
119
)







and then from (94):










σ

m



=


(

1
,

-
1

,
1
,

-
i


)

.





(
120
)







The DFT in the transformed basis follows by multiplication of the eigenvalues in (120) with the fm′ components in (119) to give the Fourier transform in the diagonal basis:










F

m



=



σ

m



·

f
m


=


1
4




(





z
1

+

z
2

-

z
3

+

z
4








z
1

-

z
2

-

z
3

-

z
4








z
1

-

z
2

+

3


z
3


-

z
4








-
2



i


(


-

z
2


+

z
4


)






)

.







(
121
)







Finally, we back-transform the Fm′ components in (121) by multiplying with the similarity transformation of (108) to give the DFT result in the original non-diagonal basis:













F
m

=




S

m


m



F

m










=




(



2



-
1



1


0




1


1


0



-
1





0


1


1


0




1


1


0


1



)



1
4



(





z
1

+

z
2

-

z
3

+

z
4








z
1

-

z
2

-

z
3

-

z
4








z
1

-

z
2

+

3


z
3


-

z
4








-
2



i


(


-

z
2


+

z
4


)






)









=




1
2



(





z
1

+

z
2

+

z
3

+

z
4








z
1

-

iz

2






-

z
3

+

iz
4








z
1

-

z
2

+

z
3

-

z
4








z
1

+

iz
2

-

z
3

-

iz
4





)



,







(
122
)







in agreement with the original DFT transform, Fm=W·nm fn of (75). Similarly, the inverse transformation can be calculated in the diagonal basis as discussed above using (117).


The diagonal form of the DFT in (113) shows that the calculation can be performed, efficiently and elegantly as simply a vector product. This means that the DFT can be reduced to N multiplications of the data with the eigenvalues. But as emphasized above, this observation is only partly true, i.e., although the performance gains from the diagonal representation are indeed realized, these are offset by the computational cost of setting up the data in a transformed basis as illustrated above. But even with this in mind, certain applications spend most of the time in the forward and inverse transform part of the calculations (see FIG. 2), so the main point is to explore applications where the initial and final data transformations are needed only once, and then subsequent DFT calculations can proceed as simply N multiplications of the entries along the main diagonal. The performance gain for a single DFT calculation can then be approximately log(N) times faster. Assuming for simplicity a power of 2 for the data size (N=2n), we can estimate a performance improvement for large n to be n log(2)˜101, or about an order of magnitude faster than the equivalent FFT calculation, and for arbitrary frequency spacing.



FIG. 3 shows an example embodiment of the present invention and is thus directed to an image-based wavefront sensing method of and means for focusing by determining phase difference between received images, removing that phase difference by calculating a DFT for each signal and iteratively multiplying the resultant until a convergence is achieved by accepting point-source stellar objects to recover embedded optical phase information, receiving data from a sensor array wherein the data includes embedded phase information, calculating a similarity transform and pre-transforming the data once at the beginning of the calculation.



FIG. 3 further shows, in an example embodiment of the present invention, an image-based wavefront sensing method of and means for iteratively processing the data using arbitrary frequency spacing in a diagonal fashion until a wavefront convergence is achieved to determine a phase difference by implementing the DFT in a 1 dimensional linear array using just N operations, where 0<N<N×log(N)<∞, wherein N×log(N) is the number of operations of an FFT counterpart, back-transforming the data at the end of the calculation in a single step and focusing the received images based on the determined phase.


While the invention is described with reference to an exemplary embodiment, it will be understood by those skilled in the art that various changes may be made and equivalence may be substituted for elements thereof without departing from the scope of the invention. In addition, many modifications may be made to the teachings of the invention to adapt to a particular situation without departing from the scope thereof. Therefore, it is intended that the invention not be limited the embodiments disclosed for carrying out this invention, but that the invention includes all embodiments falling with the scope of the appended claims. Moreover, the use of the terms first, second, etc. does not denote any order of importance, but rather the terms first, second, etc. are used to distinguish one element from another.

Claims
  • 1. An image-based wavefront sensing method of focusing by determining phase difference between received images, removing that phase difference by calculating a discrete Fourier transform (DFT) for each signal and iteratively multiplying the resultant until a convergence is achieved comprising the steps of: accepting point-source images to recover embedded phase information:receiving data from an external source wherein the data includes embedded phase information;calculating a similarity transform;iteratively processing the data in a diagonal fashion until a wavefront convergence is achieved to determine a phase difference; andfocusing the received images based on the determined phase.
  • 2. The wavefront sensing method of claim 1 wherein calculating a similarity transform further includes pre-transforming the data once at the beginning of the calculation.
  • 3. The wavefront sensing method of claim 2 wherein iteratively processing further includes a single step of back-transforming the data at the end of the calculation,
  • 4. The wavefront sensing method of claim 3 wherein said point source images include stellar objects.
  • 5. The wavefront sensing method of claim 4 wherein said embedded phase information is of an optical nature.
  • 6. The wavefront sensing method of claim 5 wherein said external source includes a sensor array.
  • 7. The wavefront sensing method of claim 6 wherein the step of iteratively processing further includes the step of determining the phase difference by utilizing a discrete Fourier transform (DFT).
  • 8. The wavefront sensing method of claim 7 including implementing said DFT in a 1 dimensional linear array using just N operations, where 0<N<N×log(N)<∞, wherein N×log(N) is the number of operations of an FFT counterpart and N is an integer.
  • 9. The wavefront sensing method of claim 8 including iteratively processing using arbitrary frequency spacing.
  • 10. An image-based wavefront apparatus for focusing by determining phase difference between received images, removing that phase difference by calculating a discrete Fourier transform (DFT) for each signal and iteratively multiplying the resultant until a convergence is achieved comprising: means for accepting point-source images to recover embedded phase information:means for receiving data from an external source wherein the data includes embedded phase information;means for calculating a similarity transform;means for iteratively processing the data in a diagonal fashion until a wavefront convergence is achieved to determine a phase difference; andmeans for focusing the received images based on the determined phase.
  • 11. The wavefront sensing apparatus of claim 10 wherein said means for calculating a similarity transform further includes means for pre-transforming the data once at the beginning of the calculation.
  • 12. The wavefront sensing apparatus of claim 11 wherein said means for iteratively processing further includes means for of back-transforming the data at the end of the calculation in a single step,
  • 13. The wavefront sensing apparatus of claim 12 wherein said point source images include stellar objects.
  • 14. The wavefront sensing apparatus of claim 13 wherein said embedded phase information is of an optical nature.
  • 15. The wavefront sensing apparatus of claim 14 wherein said external source includes a sensor array.
  • 16. The wavefront sensing apparatus of claim 15 wherein the means for iteratively processing further includes means for determining the phase difference by utilizing a discrete Fourier transform (DFT).
  • 17. The wavefront sensing apparatus of claim 16 further including means for implementing said DFT in a 1 dimensional linear array using just N operations, where 0<N<N×log(N)<∞, wherein N×log(N) is the number of operations of an FFT counterpart.
  • 18. The wavefront sensing apparatus of claim 17 further including means for iteratively processing using arbitrary frequency spacing.
PRIORITY CLAIM

This application claims priority under 35 U.S.C. §119 to U.S. Provisional Application Ser. No. 61/311,909 entitled “DISCRETE FOURIER TRANSFORM IN A COMPLEX VECTOR SPACE” filed on Mar. 9, 2010, the entire contents of which are hereby incorporated by reference.

ORIGIN OF THE INVENTION

The invention described herein was made by an employee of the United States Government, and may be manufactured and used by or for the Government for governmental purposes without the payment of any royalties thereon or therefore.

Provisional Applications (1)
Number Date Country
61311909 Mar 2010 US