Embodiments of this invention relate to the generation of a signal indicative of the output of a discrete time filter to allow for simpler and more realistic simulation of the same.
Circuit designers of multi-Gigabit systems face a number of challenges as advances in technology mandate increased performance in high-speed components. For example, chip-to-chip data rates have traditionally been constrained by the bandwidth of input/output (I/O) circuitry in each component. However, process enhancements (e.g., transistor bandwidth) and innovations in I/O circuitry have forced designers to also consider the effects of the transmission channels between the chips on which data is sent.
At a basic level, data transmission between components within a single semiconductor device or between two devices on a printed circuit board may be represented by the system 10 shown in
However, real transmitters and real transmission channels do not exhibit ideal characteristics, and as mentioned above, the effects of transmission channels are becoming increasingly important in high-speed circuit design. Due to a number of factors, including, for example, the limited conductivity of copper traces, the dielectric medium of the printed circuit board (PCB), and the discontinuities introduced by vias, the initially well-defined digital pulse will tend to spread or disperse as it passes through the channel 16. This is shown in
One known means for neutralizing the deleterious effects of channel-induced ISI comprises the use of a Discrete Time Filter (DTF) 13 on the transmitter 12 side of the system. The DTF 13 essentially pre-processes the data stream 11 of bits prior to the bits being driven onto the channel 16. Ideally, the DTF 13 has a transfer function, 1/H(z), which is the inverse of the transfer function H(z) of the channel 16. If the DTF's transfer function 1/H(z) is truly an exact inverse of the channel's transfer function H(z), then the DTF 13 will cancel the effects of the channel 16, and the data will be received at the receiver 14 without any distortion or ISI.
An exemplary DTF 13 is shown in
While the tap delay typically corresponds to the unit interval of the signal, that is not a requirement. In many cases, the tap delay is set to a fraction of the unit interval. While such “fractionally-spaced” filtering adds complexity to the design, and generally increases the number of taps, it also provides better control of the filtering operation. Other modifications include variable tap delay.
That said, the most common form of DTF is a simple two-tap, unit-interval-spaced filter, wherein the first tap 221 is associated with the pulse peak or as illustrated in waveform 105b of
It is also possible for ISI to occur on the front edge of the pulse, and this can also be canceled by the DTF topology under consideration, a concept best understood by returning to
It should also be noted that there need not be a unity gain tap weight. For example, when it is anticipated that the received pulse will be severely degraded in amplitude due to channel losses, then the tap which corresponds to the main pulse may be given a weight greater than one to boost the pulse height.
While DTFs can be a useful means to precondition data signals to combat channel-induced ISI, a DTF can be difficult to design. That is, it is not always clear the exact number of taps 22 or the corresponding weight values that should be used to compensate for a given channel. Accordingly, before one engages in constructing the DTF 13 at the transmitter 12, it is usually desirable to model and simulate the DTF 13 in light of the expected channel characteristics, with tap number and weight values determined through trial and error.
When designing such a pre-distorting filter for low-speed applications, the task of determining the optimal number of taps and the associated tap weights is simplified. This is because in such cases it is not uncommon for the channel itself to be modeled as a DTF with a finite number of taps. In this situation, designing the corresponding filter, exhibiting the inverse transfer function, is a somewhat trivial matter. Even when the channel model is more complex, as long as timing is less of a concern as it is in low-speed designs, the process of designing the optimal DTF remains relatively simple and is often carried out in mathematical tools like Matlab, independent of any component-level simulation.
High-speed systems are a different matter, in that the full analog, continuous-time nature of the signal, the channel, and the filter are all critical in the derivation of the optimal filter configuration. In addition, verifying the impact of the filter on the link performance requires circuit-level simulation to ascertain whether or not the filter has enabled error free communication, and this of course requires a waveform suitable for simulation in an industry standard simulator.
Unfortunately, modeling and simulation of the DTF is difficult. Even if the DTF is to be merely simulated, it is generally necessary to define the DTF in a layout simulator such as SPICE™. This requires transistors, resistors, and other discrete components to be electronically considered, even if they are not actually yet constructed or laid out. Such component-level consideration takes time and effort, which is particularly undesirable in an application in which one might be frequently changing the number of taps as well as the associated tap weights to try and find the most ideal transfer function 1/H(z) for the DTF to compensate for a given channel.
Furthermore, modeling and simulation may not provide a suitably accurate picture of how the DTF will process signals deviating from the ideal. Realistic data signals will not be ideal, but instead will suffer from various sources of amplitude noise and timing jitter, which noise and jitter may vary randomly between the unit intervals of the data. Regardless of the source or type of noise or jitter, it is difficult to quickly and efficiently simulate the effects of noise or jitter in the context of a DTF circuit. This inability to handle noise and jitter during simulation of the DTF circuit is especially problematic, because DTF circuits are particularly susceptible to noise and jitter, a point which is easy to understand when one considers that noise or jitter is in a sense multiplied by the various taps in the DTF.
The disclosed computer-implementable method allows for the fast creation of a multi-unit-interval vector suitable for simulation. The created vector represents the output of an otherwise ideal Discrete Time Filter (DTF) circuit, and the quick creation of the vector merely requires a designer to input into a computer system the number of taps and their weights without the need of laying out or considering the circuitry of the DTF. Specifically, a matrix is created in the computer system based on a given (preferably though not exclusively randomized) data stream of bits, and the number of taps and weights, which matrix is processed as disclosed herein to create the multi-unit-interval vector. Noise and jitter can be incorporated into the created vector such that it now realistically reflects non-idealities common to actual systems. Once created, the vector can then be simulated using standard computer-based simulation techniques, such as SPICE™. For example, the transmission of the created vector can be simulated down a channel having a particular transfer function, H(z). If the DTF parameters (number of taps and associated weight values) used to create the signal were designed to counter this transfer function (1/H(z)), the simulation can reveal how appropriate the original DTF parameters were. If the effects of the channel were not suitably countered, the number and weights of the taps of the DTF can be adjusted, the matrix re-processed to produce another vector for simulation, and simulation can occur again. This allows the DTF to be quickly modeled and simulated for a particular application without the need of actually laying out the DTF prior to the simulation or otherwise considering the DTF's specific circuit elements. This ultimately hastens the design and improves the accuracy of the DTF circuit to be built.
One implementation of the technique is illustrated starting with
Once the input waveform 100 has been chosen, the designer next inputs the number of taps 22 to be used in the DTF 13, and their weights, W, into the computer system. As illustrated in
From this initial design assumption (number and weights of taps) for the design of the DTF 13, a matrix 110 is populated in the computer system as an intermediary step in the formation of the multi-unit-interval vector to be simulated. The matrix 110 comprises rows and columns, in which the number of columns M equals the number of UIs (bits) in the input waveform 100 (seven in this example), and the number of rows N equals the number of taps assumed for the DTF's design.
To make the illustration simple, it is assumed that the logic state ‘0’ comprises 0 Volts, and that a logic state ‘1’ comprises 1 Volt. This would be the likely scenario in a system 10 which had a power supply voltage (i.e., Vcc) of 1 Volt. This is merely exemplary, and other voltage values could be used for the two logic states and populated into the matrix 110, though a more consistent approach would be to employ the assumption just described and then scale the bit values to the desired or true system voltages just prior to the waveform generation process.
The first row 120a is populated with the voltages of the various bits in the input waveform 100 scaled by the weight W1 of the first DTF tap. In this example W1=1, so the row values equal the original bit values. The second row 120b comprises a UI-shifted version of the voltages in row 120a as further scaled by the weight W2 of the second DTF tap. Thus, it can be seen that 1 Volt in the first column of row 120a has become −0.5 Volts in the second column of row 120b, and so on. The third row 120c comprises a double UI-shifted version of the voltages in row 120a as further scaled by the weight W3 of the third DTF tap. Thus, it can be seen that 1 Volt in the first column of row 120a has become +0.2 Volts in the third column of row 120b, and so on. If there were further taps, still other rows would be added, with their entries scaled by the corresponding tap's weight, and likewise shifted by a number of UIs. To be more explicit, if each Xth tap in the DTF being modeled is delayed by (N−X) unit intervals as previously described, then the Xth row in the matrix 110 comprises the sequential series of voltages (waveform 100) scaled by the Xth tap's weight and shifted by (X−1) columns.
Because each of rows 120b, 120c, and so on, are shifted by an increasing number of UIs and the bit values preceding the example sequence are unknown, the initial columns in each of those rows are populated with zeros 125 as shown.
The next processing step is to use the computer system to sum the elements in each of the columns from matrix 110 to create a vector 160, as shown in
The resulting vector 160 in
With vector 160/waveform 165 derived as just discussed, that vector/waveform can now be simulated to assess the DTF's ability (at least, as initially contemplated, with three taps weighted at W1=+1.0, W2=−0.5, and W3=+0.2) to negate ISI caused by the channel 16. However, prior to the use of vector 160/waveform 165 in a simulation of this sort, it preferable to undertake further processing steps.
For example, in
This technique is also easily modified to allow for the addition of amplitude noise or timing jitter, as shown in
Additionally, periodic jitter (i.e., jitter that varies predictably from cycle to cycle) can also be added to the vector 170/waveform 175 to form the vector 180/waveform 185, as disclosed in U.S. patent application Ser. No. 11/738,193, filed Apr. 20, 2007, which is hereby incorporated by reference in its entirety. To briefly review one embodiment of the technique disclosed in the '193 application, a method implementable in a computer system for generating a multi-cycle signal vector suitable for use as the input to a circuit to be simulated in a simulation program is disclosed. The method first determines in the computer system a time shift value for each of a plurality of cycles of a signal to be simulated, in which the time shift values vary periodically between the plurality of cycles, and wherein the time shift values are further phase shifted by a phase shift in each of the cycles. Next each determined time shift value is applied to create a time shifted vector for each of the plurality of cycles, wherein each time shifted vector comprises a sequence of voltage values each separated by a time step. Finally, the plurality of time shifted vectors are concatenated to create the multi-cycle signal vector.
Regardless of the technique used, a time-step-based vector 180 complete with random noise and jitter is created from otherwise-ideal vector 170/waveform 175. The result is a simulatable vector 180 which is highly realistic, and which truly allows for accurate simulation and modeling of the DTF 13. Note that the techniques disclosed in the '646 and '193 applications are not the only way to add noise or jitter to the vector 170/waveform 175 to form vector 180/waveform 185, and previous or future methods for doing so could also be used.
An alternative embodiment of the disclosed technique is shown in
As before, the matrix 210 is constructed of N rows, where N equals the number of taps assumed for the DTF design. And as before, row 120a is populated with the voltage values for the time-step-based waveform 200 scaled by the weight W1, which, because in this example W1=1, essentially comprises the time-step-based vector for the waveform 200. Subsequent rows (e.g., 120b and 120c) are once again populated with shifted versions of the original voltages as further scaled by the remaining weights of the DTF. However, as applied to matrix 210, each row is still shifted by full unit intervals (UI), with row 120b being shifted by one UI, row 120c shifted by two UIs, etc. Generically, speaking, each Xth row comprises the time-step-based waveform scaled by the Xth tap's weight shifted by a fixed number of time steps times (X−1).
Because there will be a number of time steps in each unit interval, in reality this means that the data for the subsequent rows 120b, 120c, etc. may need to be shifted by many columns. However, as shown in
From this point, matrix 210 is otherwise processed as described previously, with the elements in each column summed to form a vector 215. Because the initial matrix 210 was already time-step based, the time-step conversion step of
Noise and/or jitter can also easily be added to the processing even when an expanded time-step-based matrix 210 is used. Such noise or jitter can be added either before or after processing of the matrix 210.
It should be noted that vectors 215′ (FIG. 8A) and 215″ (
While the methods above all pertain to unit-interval-spaced filtering, they are easily extended to fractions of unit-interval-spaced filtering. This can be accomplished by simply scaling the number of bits and the final time step appropriately in either the unit-interval-based or the time-step-based approaches.
For example, if a half-unit-interval-spaced DTF were desired, the first modification would be to repeat every bit value in the original data stream once (e.g., ‘0101100’ would become ‘00110011110000’), which essentially amounts to a coarse unit-interval-based to time-step-based conversion. Now when the matrix 110 is populated (see
The processes described herein may be further extended to automate the filter design within a computer system. Previously it was mentioned that the designer would likely vary the number and weights of the filter taps manually, and through trial and error converge to the filter configuration that best counters the impact of the transmission channel. If an error metric can be established and measured from within the simulation (e.g., residual ISI, etc.), then it is possible to let the simulator vary the number and weights of the filter taps autonomously, with the only input from the designer being the initial guess. While the process for doing so will not be discussed here, those skilled in the art recognize that the process of in-situ DTF filter adaptation has been well understood for decades. See, e.g., R. W. Lucky et al., “Automatic equalization for digital communication,” in Proc. IEEE, vol. 53, no. 1, pp. 96-97 (January 1965) (incorporated above).
Finally, it should also be noted that while similar filtering of clock signals is not a standard procedure, the methods described above apply not only to random or pseudo-random data signals, but to periodic clock signal modeling as well.
One skilled in the art will realize that the disclosed techniques are usefully implemented as software running on a computer system, and ultimately stored in a computerized-readable media, such as a disk, semiconductor memory, or other media discussed below. Such a computer system can be broadly construed as any machine or system of machines capable or useful in reading and executing instructions in the software program and making the various computations embodiments of the disclosed techniques require. Usually, embodiments of the disclosed techniques would be implemented as programs installable on a circuit designer's workstation or work server. Moreover, embodiments of the disclosed techniques can easily be incorporated into pre-existing circuit simulation software packages, such as those mentioned previously.
The exemplary computer system 300 includes a processor 302 (e.g., a central processing unit (CPU), a graphics processing unit (GPU) or both), a main memory 304 and a static memory 306, which communicate with each other via a bus 308. The computer system 300 may further include a video display unit 310 (e.g., a liquid crystal display (LCD) or a cathode ray tube (CRT)). The computer system 300 also includes an alphanumeric input device 312 (e.g., a keyboard), a user interface (UI) navigation device 314 (e.g., a mouse), a disk drive unit 316, a signal generation device 318 (e.g., a speaker) and a network interface device 320.
The disk drive unit 316 includes a computer-readable medium 322 on which is stored one or more sets of instructions and/or data structures (e.g., software 324) embodying embodiment of the various techniques disclosed herein. The software 324 may also reside, completely or at least partially, within the main memory 304 and/or within the processor 302 during execution thereof by the computer system 300, the main memory 304 and the processor 302 also constituting computer-readable media.
The software 324 and/or its associated data may further be transmitted or received over a network 326 via the network interface device 320 utilizing any one of a number of well-known transfer protocols (e.g., HTTP).
While the computer-readable medium 322 is shown in an exemplary embodiment to be a single medium, the term “computer-readable medium” should be taken to include a single medium or multiple media (e.g., a centralized or distributed database, and/or associated caches and servers) that store the one or more sets of instructions. The term “computer-readable medium” shall also be taken to include any medium that is capable of storing, encoding or carrying a set of instructions for execution by the machine and that cause the machine to perform any one or more of the methodologies of the disclosed techniques, or that is capable of storing, encoding or carrying data structures utilized by or associated with such a set of instructions. The term “computer-readable medium” shall accordingly be taken to include, but not be limited to, solid-state memories, optical and magnetic media such as discs, and carrier wave signals.
Embodiments of the disclosed techniques can also be implemented in digital electronic circuitry, in computer hardware, in firmware, in special purpose logic circuitry such as an FPGA (field programmable gate array) or an ASIC (application-specific integrated circuit), in software, or in combinations of them, which again all comprise examples of “computer-readable media.” When implemented as software, such software can be written in any form of programming language, including compiled or interpreted languages, and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment. A computer program can be deployed to be executed on one computer or on multiple computers at one site or distributed across multiple sites and interconnected by a communication network.
Processors 302 suitable for the execution of a computer program include, by way of example, both general and special purpose microprocessors, and any one or more processors of any kind of digital computer. Generally, a processor will receive instructions and data from a read-only memory or a random access memory or both.
To provide for interaction with a user, the invention can be implemented on a computer having a video display 310 for displaying information to the user and a keyboard and a pointing device such as a mouse or a trackball by which the user can provide input to the computer. Other kinds of devices can be used to provide for interaction with a user as well. For example, feedback provided to the user can be any form of sensory feedback, such as visual feedback, auditory feedback, or tactile feedback; and input from the user can be received in any form, including acoustic, speech, or tactile input.
Aspects of the disclose techniques can employ any form of communication network. Examples of communication networks 326 include a local area network (“LAN”), a wide area network (“WAN”), and the Internet.
It should be understood that the disclosed techniques can be implemented in many different ways to the same useful ends as described herein. In short, it should be understood that the inventive concepts disclosed herein are capable of many modifications. To the extent such modifications fall within the scope of the appended claims and their equivalents, they are intended to be covered by this patent.
This application is a continuation of U.S. patent application Ser. No. 11/847,543, filed Aug. 30, 2007, to which priority is claimed and which is hereby incorporated by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
Parent | 11847543 | Aug 2007 | US |
Child | 13112036 | US |