Information
-
Patent Application
-
20040044811
-
Publication Number
20040044811
-
Date Filed
August 30, 200222 years ago
-
Date Published
March 04, 200420 years ago
-
CPC
-
US Classifications
-
International Classifications
Abstract
System and method for transferring data to a device using double buffered data transfers. A host computer system couples to a data acquisition device. The device includes a first read buffer and a second read buffer for storing output data received from the host computer. The device reads first data from the computer and stores it in the first read buffer. The first data is transferred out from the first read buffer while the device reads second data from the computer and stores it in the second read buffer. The second data is transferred out from the second read buffer (after the transfer of the first data) while the device reads third data from the host computer and stores the third data in the first read buffer. Thus, the data acquisition device successively reads data into one read buffer concurrently with transferring data out from the other buffer, respectively.
Description
BACKGROUND OF THE INVENTION
[0001] 1. Field of the Invention
[0002] This invention relates to data communications and data delivery over communication media, and, more particularly, to host computer based data acquisition systems.
[0003] 2. Description of the Relevant Art
[0004] IEEE 1394 is an international standard, low-cost digital interface that integrates entertainment, communication, and computing electronics into devices such as multimedia devices. Originated by Apple Computer as a desktop LAN and developed by the IEEE 1394 working group, IEEE 1394 is a hardware and software standard for transporting data at 100, 200, 400, or 800 megabits per second (Mbps). Maximum packet sizes are 512, 1024, 2048, and 4096 bytes depending on the transfer speed. 1394 provides 64-bit addressing—The 16MSb's (most significant bits) are used for determining source/destination bus/node. As used herein, the terms “node” and “device” may be used interchangeably to denote a node on the 1394 bus.
[0005] There can be up to 1023 buses each with up to 63 nodes. The 48 LSb's (least significant bits) are used to access locations within a device's addressing space. 1394 provides for Direct Memory Access (DMA). DMA is the most powerful feature of the bus for the data acquisition purposes since it allows a device to transfer data from/into computer memory without microprocessor intervention, thus, making it very similar to the PCI bus.
[0006] IEEE 1394 also defines a digital interface—there is no need to convert digital data into analog and tolerate a loss of data integrity. 1394 is easy to use in that there is no need for terminators, device IDs, or elaborate setup. Another benefit of 1394 is that it is “hot pluggable”, meaning users can add or remove 1394 devices with the bus active. IEEE 1394 has a scaleable architecture, allowing users to mix 100, 200, 400, and 800 Mbps devices on a bus. IEEE 1394 also provides a flexible topology in that it supports daisy chaining and branching for true peer-to-peer communication between 1394 devices. In addition to asynchronous data transfer, 1394 provides isochronous data transfer, which guarantees delivery of time critical data, reducing costly buffer requirements.
[0007] Serial Bus Management provides overall configuration control of the serial bus in the form of optimizing arbitration timing, guarantee of adequate electrical power for all devices on the bus, assignment of which IEEE 1394 device is the cycle master, assignment of isochronous DMA controller ID, and notification of errors. Bus management is built upon IEEE 1212 standard register architecture. It should be noted that 1394 error notification is limited to general error detection. When an error has occurred, it may not be known when or where the error occurred, and so the delivery status of transmitted data may also be unknown.
[0008] There are two types of IEEE 1394 data transfer: asynchronous and isochronous. Asynchronous transport is the traditional computer memory-mapped, load and store interface. Data requests are sent to a specific address and an acknowledgment is returned. In addition to an architecture that scales with silicon technology, IEEE 1394 features a unique isochronous data DMA controller interface. Isochronous data DMA controllers provide guaranteed data transport at a pre-determined rate. This is especially important for time-critical multimedia data where just-in-time delivery eliminates the need for costly buffering.
[0009] Much like LANs and WANs, IEEE 1394 is defined by the high level application interfaces that use it, not a single physical implementation. Therefore as new silicon technologies allow high higher speeds, longer distances, and alternate media, IEEE 1394 will scale to enable new applications.
[0010] Perhaps most important for use as the digital interface for executer electronics is that IEEE 1394 is a peer-to-peer interface. This allows not only dubbing from one camcorder to another without a computer, but allows multiple computers to share a given camcorder without any special support in the camcorders or computers.
[0011] The IEEE 1394 bus was primarily intended for computer multimedia peripherals such as audio and video devices. One potential application for the IEEE 1394 bus is remote data acquisition and test and measurement. For example, the IEEE 1394 bus could be used to connect a remote data acquisition device or measurement device to a host computer. However, improved methods are desired for transferring data from a host computer system to a device, such as over an IEEE 1394 bus.
SUMMARY OF THE INVENTION
[0012] The present invention comprises various embodiments of a system and method for transferring data over a communications medium using double buffered data transfers. A host computer system may be coupled through a communication medium to a device, such as a data acquisition device or instrument, which may be further coupled to a unit under test (UUT). The device may comprise a first read buffer and a second read buffer for storing output data received from the host computer. The host computer may be operable to provide output data to the device, such as for analog output to the UUT, in a double buffered fashion for improved performance. The device may also use multiple DMA controllers and/or multiple DMA channels and pre-fetch mechanisms for improved performance.
[0013] In one embodiment, the method may comprise the device reading first data from the host computer and storing the first data in the first read buffer. The first data may then be transferred out from the first read buffer, e.g., after the data has been stored in the first read buffer. The device may then read second data from the host computer and store the second data in the second read buffer concurrently with the transfer of the first data out from the first read buffer. The second data may then be transferred out from the second read buffer after completion of the transfer of the first data out from the first read buffer. Further, the device may then read third data from the host computer and store the third data in the first read buffer concurrently with the transfer of the second data out from the second read buffer. The above operations may then continue in a double buffered fashion as set out above, wherein the data acquisition device reads data into one of the first read buffer and the second read buffer concurrently with transferring data out from the other one of the second read buffer and the first read buffer, respectively.
[0014] In one embodiment, the data acquisition device includes a first direct memory access (DMA) channel and a second DMA channel. In this embodiment, the first DMA channel reads data into one of the first read buffer and the second read buffer concurrently with the second DMA channel transferring data out from the other one of the second read buffer and the first read buffer, respectively. Also, the first DMA channel may be operable to read requested data as well as pre-fetch data to provide for a more continuous and uninterrupted flow of data in the system.
[0015] In one embodiment, after the first DMA channel reads data into one of the first read buffer and the second read buffer concurrently with the second DMA channel transferring data out from the other one of the second read buffer and the first read buffer, the method may synchronize the first DMA channel with the second DMA channel. For example, each DMA channel may enter a synchronization point, issue a continue command to the other DMA channel, issue a pause command to itself, then issue another continue command to the other DMA channel. In this manner, both DMA channels may then proceed with the data transfer in a synchronous manner. Other synchronizing approaches using the pause and continue command are also contemplated.
BRIEF DESCRIPTION OF THE DRAWINGS
[0016] Other advantages and details of the invention will become apparent upon reading the following detailed description and upon reference to the accompanying drawings in which:
[0017]
FIG. 1 illustrates a data acquisition system according to one embodiment;
[0018]
FIG. 2A illustrates a 1394/PCI data acquisition system according to one embodiment;
[0019]
FIG. 2B is a block diagram of a 1394/PCI data acquisition system according to one embodiment;
[0020]
FIG. 3 is a block diagram of a 1394/PCI data acquisition system according to one embodiment;
[0021]
FIG. 4 is a block diagram of a software architecture of the system according to one embodiment;
[0022]
FIG. 5 is a block diagram of a double buffered data transfer system according to one embodiment;
[0023]
FIG. 6 is a diagram of a double buffered process, according to one embodiment;
[0024]
FIGS. 7 and 8 are flowcharts of two embodiments of a data transfer process; and
[0025] FIGS. 9A-9E illustrate various embodiments of a method to perform DMA channel synchronization.
[0026] While the invention is susceptible to various modifications and alternative forms, specific embodiments thereof are shown by way of example in the drawings and will herein be described in detail. It should be understood, however, that the drawings and detailed description thereto are not intended to limit the invention to the particular form disclosed, but on the contrary, the intention is to cover all modifications, equivalents and alternatives falling within the spirit and scope of the present invention as defined by the appended claims.
DETAILED DESCRIPTION OF THE EMBODIMENTS
[0027] Incorporation by Reference
[0028] U.S. Pat. No. 5,875,313 titled “PCI Bus to IEEE 1394 Bus Translator Employing Write Pipe-Lining and Sequential Write Combining”, whose inventors are Glen O. Sescila III, Brian K. Odom, and Kevin L. Schultz, and which issued on Feb. 23, 1999, is hereby incorporated by reference in its entirety as though fully and completely set forth herein.
[0029] U.S. patent application Ser. No. 09/659,914 titled “System and Method for Transferring Data Over A Communication Medium Using Double-Buffered Data Transfer Links”, whose inventors are David W. Madden and Aljosa Vrancic, and which was filed on Sep. 11, 2000, is hereby incorporated by reference in its entirety as though fully and completely set forth herein.
[0030]
FIG. 1—A Data Acquisition System
[0031]
FIG. 1 illustrates a system according to one embodiment. It is noted that the present invention may be used in various types of systems wherein a host computer communicates with an external device. Exemplary systems include test and measurement systems, industrial automation systems, process control systems, robotics systems, machine vision and image acquisition systems, and other types of systems. In the embodiment described below, the device is an instrument or data acquisition (DAQ) device, and the system is a computer-based measurement or DAQ system.
[0032] As FIG. 1 shows, a host computer system 108 may be coupled through a communication medium 220 to a data acquisition device or instrument 110, which may be further coupled to a sensor (or actuator) 112. In a preferred embodiment, the communication medium 220 may be a serial bus, such as an IEEE 1394 bus, described in the current or future IEEE 1394 protocol specifications, although in other embodiments the bus may implement other protocols such as Ethernet, USB, or any other communication protocol.
[0033] The sensor 112 may be any type of transducer which is operable to detect environmental conditions and send sensor data to the instrument 110. The sensor 112 may also be operable to receive data from the instrument 110. The instrument 110 may be a data acquisition (DAQ) device, which combined with the sensor 112, may be operable to collect data concerning any of various phenomena, such as pressure, temperature, chemical content, current, resistance, voltage, or any other detectable attribute. The instrument or DAQ device 110 may also include data generation capabilities. The host computer system 108 may be operable to control the instrument 110 by sending requests to read from or write to the instrument's memory registers. The host computer system 108 may be further operable to obtain data from the instrument 110 for storage and analysis on the host computer system 108, either by issuing read requests or by programming the instrument 110 to send data to the memory of the host computer 108. Additionally, the host computer system 108 may be operable to send data, such as waveform data, to the device 110 for various purposes, such as for use in stimulating a unit under test (UUT), either by issuing write requests or by programming the instrument 110 to read data from the memory of the host computer 108. The host computer 108 preferably includes a memory medium which may include a software architecture similar to that shown in FIG. 4.
[0034]
FIG. 2A: A 1394/PCI Data Acquisition System
[0035]
FIG. 2A illustrates one embodiment of a 1394/PCI data acquisition system. As shown in FIG. 2A, host computer system 108 may be coupled to a PCI instrument 110A through serial bus 220, such as an IEEE 1394 bus.
[0036] In one embodiment, as shown in FIG. 2A, the instrument 110A may include a PCI device 208 which may be coupled to a PCI/1394 translator 204 (also referred to as a PCI/1394 Interface) through a PCI bus 210. In one embodiment, the translator 204 may include a National Instruments FirePHLI™, which provides translation between the IEEE 1394 protocol and PCI. The host computer system 108 may be operable to communicate with the PCI device 208 through the 1394 bus 220 via the 1394/PCI translator 204. The 1394/PCI translator 204 may be operable to translate between the 1394 and PCI address spaces, allowing the host computer system 108 to send 1394 requests to and receive 1394 responses from the PCI device 208. The 1394/PCI translator thus allows existing PCI devices to be used in an IEEE 1394 system. For more information on the 1394/PCI translator 204, please see U.S. Pat. No. 5,875,313 titled “PCI Bus to IEEE 1394 Bus Translator Employing Write Pipe-Lining and Sequential Write Combining”, which was incorporated by reference above.
[0037]
FIG. 2B: A 1394/PCI Data Acquisition System
[0038]
FIG. 2B is a block diagram of the data acquisition system of FIG. 2A, according to one embodiment. As FIG. 2B shows, host 108 may be communicatively coupled to PCI instrument 208 through 1394 bus 220 and 1394/PCI translator 204, described above with reference to FIG. 2A. Host 108 may be connected to the 1394 bus 220 via a 1394 interface 230.
[0039]
FIG. 3: A 1394 Data Acquisition System
[0040]
FIG. 3 is a block diagram of a 1394 data acquisition system, according to one embodiment. As shown in FIG. 3, host computer 108 may be communicatively coupled to a 1394-compliant instrument 110A through 1394 bus 220. The host 108 may include a CPU 310, and a memory 312 which may be operable to store programs and data 314. In one embodiment, the instrument 110A may be configured with a PCI instrument card 208 which may be operable to accept and manage sensor data. The instrument 110A may include a Direct Memory Access (DMA) Controller 320 which, in one embodiment, comprises two DMA channels. In another embodiment, the instrument 110A may include two DMA controllers, wherein each DMA controller supports one DMA channel. The instrument 110A may also include a 1394/PCI bridge or translator 204, such as a National Instruments FirePHLI™, which may provide translation between the IEEE 1394 protocol and PCI, as mentioned above. Finally, as can be seen in FIG. 3, the instrument 110A may also include read buffers 322A and 322B which may be coupled to the DMA Controller(s) 320 and the 1394/PCI translator 204, and which may be operable to store data transferred from the host 108, as well as memory 324, which may be coupled to the DMA Controller(s) 320, and which may be operable to store data transferred from the host computer, or data slated for transfer to the host computer, such as data acquired from a sensor. Memory 324 may comprise temporary storage locations Temp A 340 and Temp B 341. The use of temporary storage locations Temp A 340 and Temp B 341 is described below with reference of FIG. 8.
[0041] Thus, although in the embodiments described below the system includes a single DMA controller operating two DMA channels, in other embodiments of the invention, there may be multiple DMA controllers, e.g., one DMA controller per DMA channel. In either approach, the techniques described herein are applicable. In other words, in the approaches described herein, the terms “DMA controller” and “DMA channel” may be used interchangeably.
[0042]
FIG. 4: Software Architecture
[0043]
FIG. 4 is a block diagram of the software architecture of the system, according to one embodiment. As FIG. 4 shows, the top layer of the software architecture is application software 402. The application software 402 may be any software program which is operable to provide an interface for control and/or display of a data acquisition (DAQ) process. In one embodiment, the software application 402 may include a program developed in National Instrument's LabVIEW™ or LabWindows/CVI development environments. A driver program 404 may be below the application software 402. The driver 404 may be a DAQ driver 404, such as National Instrument's NI-DAQ driver program. The next software layer may optionally be a platform abstraction layer (PAL) driver 406, such as National Instrument's NI-PAL driver program. The PAL 406 may operate to abstract the internal communication bus and operating system to a common API. A 1394 platform abstraction layer firewire (PAL-FW) 1394 driver 408, such as National Instrument's NI-PAL F/W driver program may be below the NI-PAL driver 406. This software may manage the data transmission process according to one embodiment of the present invention, described below with reference to FIG. 5. A 1394D host interface 410 is below the NI-PAL F/W driver 408, such as provided by Microsoft Corporation, which abstracts the driver layer. The 1394D host interface 410 provides an interface to 1394 chipset driver software, such as OHCI 1394 driver software, which interfaces with the relevant hardware.
[0044] FIGS. 5: A Double Buffered Data Acquisition System
[0045]
FIG. 5 is a block diagram of a double buffered data acquisition system according to one embodiment. As FIG. 5 shows, a host memory 520 may be coupled through 1394 bus 220 to instrument 110A. Instrument 110A may comprise a PCI/1394 Translator 204, a Memory 324, a first DMA channel or controller 321A, a second DMA channel or controller 321B, and Data Acquisition (DAQ) Hardware 540. PCI/1394 Translator 204 may be coupled to the Memory 324 and the DMA channels 321A and 321B, and may comprise read buffer 1322A and read buffer 2322B. As noted above, memory 324 may comprise temporary storage locations Temp A 340 and Temp B 341. The use of temporary storage locations Temp A 340 and Temp B 341 is described below with reference of FIG. 8. DAQ hardware 540 may be coupled to the DMA channels 321A and 321B, and may comprise a First In-First Out (FIFO) buffer 550.
[0046] In one embodiment, host memory 520 may comprise an ordered series of memory blocks 521-530 (whose number and labels are for illustration purposes only). In one embodiment the host memory 520 may comprise a virtual memory buffer in the form of a linked list of nodes describing successive blocks of contiguous physical memory residing on the host computer. During a data output operation to the device 110, e.g., an “analog out” operation, the Translator 204 may be operable to pre-fetch additional data from the successive blocks of host memory 520 in response to data reads requested by DMA channel 1321A, and to store both the requested data and the pre-fetched data in one of the read buffers 322. In one embodiment, the DMA channels 321A and 321B may be operable to perform tasks in parallel. For example, DMA channel 1321A may request a read from host memory 520, which may trigger a pre-fetch of data from the host computer to read buffer 1322A, while DMA channel 2321B consumes previously pre-fetched data from the Translator's read buffer 2322B. In other words, while DMA channel 2321B is consuming the pre-fetched data from the Translator's read buffer 1322A, the Translator may be pre-fetching a next block of data from the host memory 520 and storing the next block of data into the Translator's read buffer 2322B, i.e., transfers data from the read buffer 2322B out to the FIFO 550. In one embodiment, DMA channel 1321A may be operable to program DMA channel 2321B to consume the pre-fetched data from the Translator's read buffer 322, providing transfer information to DMA channel 2321B indicating memory locations from which data is to be read (consumed). In one embodiment, DMA channel 2321B consuming pre-fetched data from the Translator's read buffer 322 comprises DMA channel 2321B making successive data reads from the Translator's read buffer 322 and storing the data in the DAQ hardware's FIFO 550.
[0047] In one embodiment data transfer instructions may be provided to the device by the host computer system 108 in the form of a linked-list of transfer nodes which may be transferred to a remote heap on the device in a double buffered manner as described in U.S. patent application Ser. No. 09/659,914 titled “System and Method for Transferring Data Over A Communication Medium Using Double-Buffered Data Transfer Links”, which was incorporated by reference above. Further descriptions of this parallel double buffered data transfer are presented as flow charts in FIGS. 7 and 8, described below.
[0048]
FIG. 6: Double Buffering
[0049]
FIG. 6 illustrates the process of double buffering data in a parallel manner. As FIG. 6 A shows, a first process (DMA channel 1321A) may read data 602A from host memory 520 into read buffer 1322A while a second process (DMA channel 2321) consumes data 604A from read buffer 2322B. When all desired data from read buffer 2561 have been consumed, the buffers may be switched, and the first process (DMA channel 1321A) may then read data 602B from host memory 520 into read buffer 2322B while the second process (DMA channel 2321) consumes data 604B from read buffer 1322A. This double buffering data transfer may continue until there are no more data to transfer.
[0050]
FIG. 7: A Double Buffered Data Transfer Process
[0051]
FIG. 7 is a flowchart of a double buffered data transfer process in which a host computer system is coupled through a communication medium to a data acquisition device which includes a first read buffer and a second read buffer. FIG. 7 illustrates a data output operation to the device 110, e.g., an “analog out” operation.
[0052] As FIG. 7 shows, in 702 the data acquisition device may read first data from the host computer and store the first data in the first read buffer. In 704 the first data may be transferred out from the first read buffer, such as to the FIFO 550. In one embodiment the data may be analog waveform data, which is transferred out to a device under test to provide a stimulation signal to the device as part of a test procedure. As indicated in FIG. 7, while the first data are being transferred out from the first read buffer 322A, in 706 the data acquisition device may read second data from the host computer and store the second data in the second read buffer, i.e., the reading of the second data is preferably performed concurrently with the transfer of the first data out from the first read buffer. Performing the reads and writes to and from the two read buffers concurrently may improve the performance of the system substantially.
[0053] Then in 708, the second data may be transferred from the second read buffer concurrently with the data acquisition device reading third data from the host computer and storing the third data in the first read buffer, as indicated in 710. It should be noted that the transfer of the second data out from the second read buffer preferably occurs after completion of the transfer of the first data out from the first read buffer. In other words, the process may only maintain one output stream of data to the FIFO 550, and so data may be read only from one read buffer at a time.
[0054] Thus, as long as there are data to be read from the host computer system, the process may read to and write from the two read buffers in a concurrent manner to effect a double buffered data transfer scheme. Such a scheme may as much as double the performance of the system.
[0055]
FIG. 8: A Double Buffered Data Transfer Process
[0056]
FIG. 8 is a detailed flow chart of the double-buffered data acquisition process performed by the system according to one embodiment. In 802 the host computer 108 may configure the device (instrument) 110 for an I/O operation, such as a read operation wherein data is transferred from host memory 520 to DAQ hardware 540 on the device 110A. In 804 the host 108 may initiate the I/O operation. In an alternate embodiment, the device may initiate the I/O operation. Then in 806 DMA channel 1321A (i.e., the data acquisition device 110A) may request a read from host memory 520. In various embodiments, the read may be for 1, 2, or 4 bytes or more, depending upon the data transfer rates of the transmission protocol. For purposes of illustration, the requested read is for 4 bytes. The read for 4 bytes requested by the DMA channel 1321A may trigger the PCI/1394 Translator 204 to transfer a greater amount of data from the host computer using a pre-fetch method, such as 2K (2048 bytes) of data, to read buffer 1322A, as indicated in 808. In other embodiments, the PCI/1394 Translator 204 may read 1K (bytes) or 512 bytes from the host memory 520, depending upon the packet size of the transmission protocol. In one embodiment, after the 2K of data is transferred to read buffer 1322A, the initial 4 bytes requested by DMA channel 1321A may be transferred from the read buffer 1 and stored into temporary memory location Temp A 340 in order to satisfy the read request of DMA channel 1321A. Thus the 2K of data transferred to the read buffer 1322A may comprise requested data (4 bytes) and pre-fetched data (2K−4 bytes). In one embodiment, after the 4 bytes of data are transferred to Temp A 340, the DMA channel 1321A may program DMA channel 2321B to consume the pre-fetched data in read buffer 1322A, i.e., to transfer the data out from the read buffer 1322A.
[0057] In one embodiment, after the Translator 204 pre-fetches the data, the two DMA channels 321A and 321B may synchronize before proceeding with the data transfer process. This event in the process is referred to as a sync point. In one embodiment, the DMA channel synchronization may operate according to the following rules: DMA channel 1321A may not initiate the next read/pre-fetch into read buffer 1322A (or 2321) until DMA channel 2321B has finished consuming the pre-fetched data from read buffer 1322A (or 2322B); and DMA channel 2321B may not begin consuming the pre-fetched data from read buffer 1322A (or 2322B) until the DMA channel 1321A initiated transfer of data into read buffer 1322A (or 2322B) has been completed. In this way, conflicts between data transfer operations on a particular read buffer may be avoided. The synchronization process is described in more detail below with reference to FIGS. 9A-9E.
[0058] After the Translator 204 pre-fetches the data, DMA channel 2321B may begin consuming the data in read buffer 1322A, as indicated by 811. In the embodiment described above in which the requested read data is stored in the temporary memory location Temp A 340, the DMA channel 2321B may read (consume) the requested read data from Temp A 340 before reading (consuming) the data in read buffer 1322A. Meanwhile, DMA channel 1321A may request another read for 4 (or 2 or 1) bytes of data from the host memory 520, as shown in 810. As described above, the read requested by DMA channel 1321A may trigger the translator to pre-fetch 2K of data from the host memory 520 to read buffer 2322B, as indicated by 812 (and transfer the requested read data to Temp B 322B, in one embodiment). Thus, new data may be pre-fetched into read buffer 2322B while previously fetched data is consumed (read) from read buffer 1322A.
[0059] In one embodiment, after 811 and 812, the two DMA channels 321A and 321B may synchronize again, as described above, and as described in detail below with reference to FIGS. 9A-9E. In 813 a determination may be made whether there are more data to be transferred in the I/O operation. If no more data are to be transferred, the process may end. If there are more data to be transferred, then the read buffers may be switched and DMA channel 2321B may begin consuming the pre-fetched data in read buffer 2322B, as indicated by 815. Again, in one embodiment, the requested read data may be read from Temp B 322B first. Meanwhile, DMA channel 1321A may request another read for 4 (or 2 or 1) bytes of data from the host memory 520, as shown in 814. The read requested by DMA channel 1321A may trigger the translator to pre-fetch 2K of data from the host memory 520 to read buffer 1322A, as indicated by 816. Thus, new data may be pre-fetched into read buffer 1322A while previously fetched data is consumed (read) from read buffer 2322B.
[0060] In one embodiment, after 815 and 816, the two DMA channels 321A and 321B may synchronize again, as described above. Then in 818 a determination may be made whether there are more data to be transferred in the I/O operation. If no more data are to be transferred, the process may end. Otherwise, as FIG. 8 shows in the ‘yes’ branch of decision 818, the process described above may be repeated until the I/O operation is completed.
[0061] FIGS. 9A-9E—DMA Channel Synchronization
[0062] FIGS. 9A-9E illustrate various embodiments of the invention as applied to the synchronization of DMA channels. As noted above, DMA channels involved in the data transfer may control their own and/or each other's execution. As also mentioned above in the description of FIG. 8, in one embodiment, the decision point 813 may also be used as a synchronization point, where the DMA channels may synchronize their operations before proceeding with subsequent tasks.
[0063] For example, in an embodiment in which a guarantee can be made that DMA channel 1320 will always reach the synchronization point before DMA channel 2321, the synchronization of the two DMA channels may be achieved by decomposing the synchronization point 813, as shown in FIG. 9A. As FIG. 9A shows, as soon as DMA channel 1320 enters the synchronization point 813, it may pause itself by issuing a pause command 901. Then, when DMA channel 2321 reaches the same synchronization point it may awaken DMA channel 1320, e.g., by a continue command 902. Both channels may then proceed to the decision point 903. In the described embodiment, the synchronization may be achieved using only pause and continue commands that may be easily implemented in hardware. A similar approach may be used in the embodiment shown in FIG. 9B, where the synchronization point 813 is always reached first by the DMA channel 2321.
[0064] If no guarantees can be made which of the DMA channels will reach synchronization point 813 first, a more complex algorithm may be required, such as that shown in FIG. 9C. When any of the DMA channels enters the synchronization point, it may issue continue command 911 or 912 on the other channel, and then pause itself, e.g., by pause command 913 or 914. Once a DMA channel is awakened, it may re-issue the continue command 915 or 916 to the other DMA channel. For example, in the case that DMA channel 1320 reaches the synchronization point first, it may first issue continue command 911 and then pause itself 913. Since DMA channel 2 is running, the continue command 911 will have no effect. Once DMA channel 2 reaches the synchronization point it may issue continue command 912 and then pause itself 914. The continue command 912 may awaken DMA channel 1, which in turn may execute continue command 915 and proceed to run. The continue command 915 may awaken the DMA channel 1, which may then proceed to run. The final result is that both DMA channels may continue running after they rendezvous at the synchronization point. Again, the entire process has been achieved by only using continue and pause commands.
[0065] In one particular embodiment, the execution may proceed as follows: DMA channel 1 may reach the continue command 911 first. Since DMA channel 2 is running, the command will have no effect. Next, DMA channel 2 may execute the continue command 912. Since DMA channel 1 is running the command again will have no effect. DMA channel 2 may then pause itself by executing 914. Finally, DMA channel 1 may pause itself by executing 913. Since both DMA channels are paused, a deadlock state is reached. To prevent deadlocks, an algorithm such as that shown in FIG. 9D may be used. As FIG. 9D shows, the commands 911 and 913, and 912 and 914 may be combined into single commands 920 and 921 that may be executed atomically, i.e., even though the pause subcommands 913 and 914 are executed, a first DMA channel does not check its state and pause itself if requested until all subcommands in the atomic command are executed. It should be noted that if the second DMA channel issues a continue command on the first DMA channel before the first DMA channel completes its atomic command, then the first DMA channel's pause command will have no effect (and vice versa).
[0066] Another solution may be to combine commands 911 and 913, and 912 and 914 into single atomic execution commands 930 and 931, as shown in FIG. 9E, and to impose the restriction that while any atomic command is being executed by a DMA channel no other atomic commands from other DMA channels may be executed.
[0067] It is noted that the examples presented above can easily be extended to other synchronization points of FIG. 8, or in other embodiments of the methods presented herein.
[0068] While the present invention has been described with reference to particular embodiments, it will be understood that the embodiments are illustrative and that the invention scope is not so limited. Any variations, modifications, additions, and improvements to the embodiments described are possible. These variations, modifications, additions, and improvements may fall within the scope of the inventions as detailed within the following claims.
Claims
- 1. A method for transferring data in a system including a host computer system coupled through a communication medium to a data acquisition device, wherein the data acquisition device comprises a first read buffer and a second read buffer, the method comprising:
the data acquisition device reading first data from the host computer and storing the first data in the first read buffer; transferring the first data out from the first read buffer; the data acquisition device reading second data from the host computer and storing the second data in the second read buffer concurrently with said transferring the first data out from the first read buffer.
- 2. The method of claim 1, further comprising:
determining if there are more data to transfer; if there are no more data to transfer, then terminating said transferring data.
- 3. The method of claim 1, further comprising:
transferring the second data out from the second read buffer after completion of said transferring the first data out from the first read buffer.
- 4. The method of claim 3, further comprising:
the data acquisition device reading third data from the host computer and storing the third data in the first read buffer concurrently with said transferring the second data out from the second read buffer.
- 5. The method of claim 4, further comprising:
after said data acquisition device reading third data from the host computer and storing the third data in the first read buffer concurrently with said transferring the second data out from the second read buffer, determining if there are more data to transfer; if there are no more data to transfer, then terminating said transferring data.
- 6. The method of claim 4, wherein the data acquisition device includes a first direct memory access (DMA) channel and a second DMA channel;
wherein the first DMA channel performs said reading first data from the host computer and storing the first data in the first read buffer, reading second data from the host computer and storing the second data in the second read buffer, and reading third data from the host computer and storing the third data in the first read buffer; and wherein the second DMA channel performs said transferring the first data out from the first read buffer and said transferring the second data out from the second read buffer.
- 7. The method of claim 4, wherein the data acquisition device reads data into one of the first read buffer and the second read buffer concurrently with transferring data out from the other one of the second read buffer and the first read buffer, respectively, in a double buffered fashion.
- 8. The method of claim 7,
wherein the data acquisition device includes a first direct memory access (DMA) channel and a second DMA channel; wherein the first DMA channel reads data into one of the first read buffer and the second read buffer concurrently with the second DMA channel transferring data out from the other one of the second read buffer and the first read buffer, respectively.
- 9. The method of claim 8, the method further comprising:
synchronizing the first DMA channel and the second DMA channel after said first DMA channel reads data into one of the first read buffer and the second read buffer concurrently with the second DMA channel transferring data out from the other one of the second read buffer and the first read buffer.
- 10. The method of claim 9, wherein said synchronizing comprises:
the first DMA channel entering a synchronization point; the first DMA channel issuing a pause command, thereby pausing itself; the second DMA channel entering the synchronization point; and the second DMA channel issuing a continue command, thereby awakening the first DMA channel; wherein, after said second DMA channel issuing the continue command, the first DMA channel and the second DMA channel are operable to proceed with said transferring data in a synchronous manner.
- 11. The method of claim 9, wherein said synchronizing comprises:
the first DMA channel entering a synchronization point; the first DMA channel issuing a continue command, thereby awakening the second DMA channel if the second DMA channel is paused; the first DMA channel issuing a pause command, thereby pausing itself; the second DMA channel entering the synchronization point; and the second DMA channel issuing a continue command, thereby awakening the first DMA channel; the second DMA channel issuing a pause command, thereby pausing itself; the first DMA channel issuing a continue command, thereby awakening the second DMA channel; wherein, after said first DMA channel issuing the continue command, the first DMA channel and the second DMA channel are operable to proceed with said transferring data in a synchronous manner.
- 12. The method of claim 9, wherein said synchronizing comprises:
the first DMA channel entering a synchronization point; the first DMA channel issuing a single atomic command, comprising:
issuing a pause command; and issuing a continue command, thereby awakening the second DMA channel if the second DMA channel is paused; wherein the first DMA channel does not check its state and pause itself if requested until said issued single atomic command is executed; the second DMA channel entering the synchronization point; the second DMA channel issuing a single atomic command, comprising:
issuing a pause command; and issuing a continue command, thereby awakening the first DMA channel if the first DMA channel is paused; wherein the second DMA channel does not check its state and pause itself if requested until said issued single atomic command is executed; the first DMA channel issuing a second continue command, thereby awakening the second DMA channel if the second DMA channel is paused; the second DMA channel issuing a second continue command, thereby awakening the first DMA channel if the first DMA channel is paused; wherein, after said first DMA channel issuing the second continue command, and the second DMA channel issuing a second continue command, the first DMA channel and the second DMA channel are operable to proceed with said transferring data in a synchronous manner.
- 13. The method of claim 9, wherein said synchronizing comprises:
the first DMA channel entering a synchronization point; the first DMA channel issuing single atomic command, comprising:
issuing a continue command, thereby awakening the second DMA channel if the second DMA channel is paused; and issuing a pause command, thereby pausing itself; wherein, during said issuing the single atomic execution command, the second DMA channel is excluded from issuing other atomic commands; the second DMA channel entering the synchronization point; the second DMA channel issuing single atomic command, comprising:
issuing a continue command, thereby awakening the first DMA channel if the first DMA channel is paused; and issuing a pause command, thereby pausing itself; wherein, during said issuing the single atomic execution command, the first DMA channel is excluded from issuing other atomic commands; the first DMA channel issuing a second continue command, thereby awakening the second DMA channel if the second DMA channel is paused; the second DMA channel issuing a second continue command, thereby awakening the first DMA channel if the first DMA channel is paused; wherein, after said first DMA channel issuing the second continue command, and the second DMA channel issuing a second continue command, the first DMA channel and the second DMA channel are operable to proceed with said transferring data in a synchronous manner.
- 14. The method of claim 4,
wherein the data acquisition device reading first data comprises the data acquisition device reading first requested data and first pre-fetch data; wherein the data acquisition device reading second data comprises the data acquisition device reading second requested data and second pre-fetch data; wherein the data acquisition device reading third data comprises the data acquisition device reading third requested data and third pre-fetch data.
- 15. The method of claim 14, further comprising:
transferring the first requested data from the first read buffer to a first temporary memory after reading the first data into the first read buffer, wherein said transferring the first requested data from the first read buffer to the first temporary memory location operates to satisfy a first read request; transferring the second requested data from the second read buffer to a second temporary memory location after reading the second data into the second read buffer, wherein said transferring the second requested data from the second read buffer to the second temporary memory location operates to satisfy a second read request, and transferring the third requested data from the first read buffer to the first temporary memory location after reading the third data into the first read buffer, wherein said transferring the third requested data from the first read buffer to the first temporary memory location operates to satisfy a third read request.
- 16. The method of claim 3,
wherein said transferring the first data out from the first read buffer comprises transferring the first data from the first read buffer to a FIFO memory; and wherein said transferring the second data out from the second read buffer comprises transferring the second data from the second read buffer to a FIFO memory.
- 17. The method of claim 1, wherein said transferring the first data out from the first read buffer is performed after said reading the first data from the host computer and storing the first data in the first read buffer.
- 18. A method for transferring data in a system including a host computer system coupled through a communication medium to a data acquisition device, wherein the data acquisition device comprises a first read buffer and a second read buffer, wherein the first read buffer stores first data, the method comprising:
transferring the first data out from the first read buffer; the data acquisition device reading second data from the host computer and storing the second data in the second read buffer concurrently with said transferring the first data out from the first read buffer; transferring the second data out from the second read buffer after completion of said transferring the first data out from the first read buffer; and the data acquisition device reading third data from the host computer and storing the third data in the first read buffer concurrently with said transferring the second data out from the first second buffer.
- 19. The method of claim 18, further comprising:
after said data acquisition device reading second data from the host computer and storing the second data in the second read buffer concurrently with said transferring the first data out from the first read buffer, determining if there are more data to transfer; and if there are no more data to transfer, then terminating said transferring data.
- 20. The method of claim 19,
wherein the data acquisition device includes a first direct memory access (DMA) channel and a second DMA channel; wherein the first DMA channel performs said reading second data from the host computer and storing the second data in the second read buffer, and reading third data from the host computer and storing the third data in the first read buffer; and wherein the second DMA channel performs said transferring the first data out from the first read buffer and said transferring the second data out from the second read buffer.
- 21. The method of claim 18,
wherein the data acquisition device reads data into one of the first read buffer and the second read buffer concurrently with transferring data out from the other one of the second read buffer and the first read buffer, respectively, in a double buffered fashion.
- 22. The method of claim 21,
wherein the data acquisition device includes a first direct memory access (DMA) channel and a second DMA channel; and wherein the first DMA channel reads data into one of the first read buffer and the second read buffer concurrently with the second DMA channel transferring data out from the other one of the second read buffer and the first read buffer, respectively.
- 23. The method of claim 22, the method further comprising:
synchronizing the first DMA channel and the second DMA channel after said first DMA channel reads data into one of the first read buffer and the second read buffer concurrently with the second DMA channel transferring data out from the other one of the second read buffer and the first read buffer.
- 24. The method of claim 23, wherein said synchronizing comprises:
the first DMA channel entering a synchronization point; the first DMA channel issuing a pause command, thereby pausing itself; the second DMA channel entering the synchronization point; and the second DMA channel issuing a continue command, thereby awakening the first DMA channel; wherein, after said second DMA channel issuing the continue command, the first DMA channel and the second DMA channel are operable to proceed with said transferring data in a synchronous manner.
- 25. The method of claim 23, wherein said synchronizing comprises:
the first DMA channel entering a synchronization point; the first DMA channel issuing a continue command, thereby awakening the second DMA channel if the second DMA channel is paused; the first DMA channel issuing a pause command, thereby pausing itself; the second DMA channel entering the synchronization point; and the second DMA channel issuing a continue command, thereby awakening the first DMA channel; the second DMA channel issuing a pause command, thereby pausing itself; the first DMA channel issuing a continue command, thereby awakening the second DMA channel; wherein, after said first DMA channel issuing the continue command, the first DMA channel and the second DMA channel are operable to proceed with said transferring data in a synchronous manner.
- 26. The method of claim 23, wherein said synchronizing comprises:
the first DMA channel entering a synchronization point; the first DMA channel issuing single atomic command, comprising:
issuing a pause command; and issuing a continue command, thereby awakening the second DMA channel if the second DMA channel is paused; wherein, the first DMA channel does not check its state and pause itself if requested until said issued the single atomic command is executed; the second DMA channel entering the synchronization point; the second DMA channel issuing single atomic command, comprising:
issuing a pause command; and issuing a continue command, thereby awakening the first DMA channel if the first DMA channel is paused; wherein, the second DMA channel does not check its state and pause itself if requested until said issued the single atomic command is executed; the first DMA channel issuing a second continue command, thereby awakening the second DMA channel if the second DMA channel is paused; the second DMA channel issuing a second continue command, thereby awakening the first DMA channel if the first DMA channel is paused; wherein, after said first DMA channel issuing the second continue command, and the second DMA channel issuing a second continue command, the first DMA channel and the second DMA channel are operable to proceed with said transferring data in a synchronous manner.
- 27. The method of claim 23, wherein said synchronizing comprises:
the first DMA channel entering a synchronization point; the first DMA channel issuing single atomic command, comprising:
issuing a continue command, thereby awakening the second DMA channel if the second DMA channel is paused; and issuing a pause command, thereby pausing itself; wherein, during said issuing the single atomic execution command, the second DMA channel is excluded from issuing other atomic commands; the second DMA channel entering the synchronization point; the second DMA channel issuing single atomic command, comprising:
issuing a continue command, thereby awakening the first DMA channel if the first DMA channel is paused; and issuing a pause command, thereby pausing itself; wherein, during said issuing the single atomic execution command, the first DMA channel is excluded from issuing other atomic commands; the first DMA channel issuing a second continue command, thereby awakening the second DMA channel if the second DMA channel is paused; the second DMA channel issuing a second continue command, thereby awakening the first DMA channel if the first DMA channel is paused; wherein, after said first DMA channel issuing the second continue command, and the second DMA channel issuing a second continue command, the first DMA channel and the second DMA channel are operable to proceed with said transferring data in a synchronous manner.
- 28. A system for transferring data over a communication medium, the system comprising:
a communication medium; a data acquisition device coupled to a first end of the communication medium, wherein the data acquisition device comprises a first read buffer and a second read buffer; and a host computer system coupled to a second end of the communication medium, wherein the host computer system is operable to communicate through the communication medium to the data acquisition device; wherein the data acquisition device is operable to:
read first data from the host computer and store the first data in the first read buffer; transfer the first data out from the first read buffer; and read second data from the host computer and store the second data in the second read buffer concurrently with the transfer of the first data out from the first read buffer.
- 29. The system of claim 28, wherein the data acquisition device is further operable to:
determine if there are more data to transfer; and if there are no more data to transfer, terminate said transferring data.
- 30. The system of claim 28, wherein the data acquisition device is further operable to transfer the second data out from the second read buffer after completion of said transfer of the first data out from the first read buffer.
- 31. The system of claim 30, wherein the data acquisition device is further operable to read third data from the host computer and store the third data in the first read buffer concurrently with said transfer of the second data out from the second read buffer.
- 32. The system of claim 31, wherein the data acquisition device is further operable to:
after said data acquisition device reading third data from the host computer and storing the third data in the first read buffer concurrently with said transferring the second data out from the second read buffer, determine if there are more data to transfer; and if there are no more data to transfer, terminate said transferring data.
- 33. The system of claim 31,
wherein the first data comprise first requested data and first pre-fetch data; wherein the second data comprise second requested data and second pre-fetch data; and wherein the third data comprise third requested data and third pre-fetch data.
- 34. The system of claim 33,
wherein the data acquisition device being operable to transfer the first data out from the first read buffer comprises the data acquisition device being operable to transfer the first requested data from the first read buffer to a first temporary memory, wherein said transferring the first requested data from the first read buffer to the first temporary memory location operates to satisfy a first read request; and wherein the data acquisition device being operable to transfer the second data out from the second read buffer comprises the data acquisition device being operable to transfer the second data from the second read buffer to a second temporary memory location, wherein said transferring the second requested data from the second read buffer to the second temporary memory location operates to satisfy a second read request.
- 35. The system of claim 28,
wherein the data acquisition device comprises a first direct memory access (DMA) channel and a second DMA channel; and wherein the first DMA channel reads data into one of the first read buffer and the second read buffer concurrently with the second DMA channel transferring data out from the other one of the second read buffer and the first read buffer, respectively.
- 36. The system of claim 35, wherein, after said first DMA channel is operable to read data into one of the first read buffer and the second read buffer concurrently with the second DMA channel transferring data out from the other one of the second read buffer and the first read buffer, respectively, the data acquisition device is further operable to:
determine if there are more data to transfer; and if there are no more data to transfer, terminate said transferring data.
- 37. The system of claim 35, wherein, after said first DMA channel reads data into one of the first read buffer and the second read buffer concurrently with the second DMA channel transferring data out from the other one of the second read buffer and the first read buffer, the data acquisition device is further operable to:
synchronize the first DMA channel and the second DMA channel.
- 38. A method for transferring data in a system including a host computer system coupled through a communication medium to a data acquisition device, the method comprising:
configuring a data acquisition device for a data input/output (I/O) operation, wherein the data acquisition device comprises a first read buffer, a second read buffer, a FIFO buffer, a first direct memory access (DMA) channel and a second DMA channel; the data acquisition device issuing a first read request for first data from the host computer; reading second data from the host computer into the first read buffer in response to the data acquisition device issuing the first read request for the first data, wherein the second data includes the first data; the first DMA channel programming the second DMA channel to read data from the first read buffer into the FIFO buffer; the second DMA channel reading data from the first read buffer into the FIFO buffer after said programming; while the second DMA channel is reading data from the first read buffer into the FIFO buffer,
the data acquisition device issuing a second read request for third data from the host computer; reading fourth data from the host computer into the second read buffer in response to the data acquisition device issuing the second read request for the third data, wherein the fourth data includes the third data; the first DMA channel programming the second DMA channel to read data from the second read buffer into the FIFO buffer; when the second DMA channel has completed reading the data from the first read buffer into the FIFO buffer,
the second DMA channel reading data from the second read buffer into the FIFO buffer; the data acquisition device issuing a third read request for fifth data from the host computer; the first DMA channel reading sixth data from the host computer into the first read buffer in response to the data acquisition device issuing the third read request for the fifth data, wherein the sixth data includes the fifth data.
- 39. The method of claim 38,
wherein the data acquisition device reading first data comprises the data acquisition device reading first requested data and first pre-fetch data; wherein the data acquisition device reading second data comprises the data acquisition device reading second requested data and second pre-fetch data; wherein the data acquisition device reading third data comprises the data acquisition device reading third requested data and third pre-fetch data.
- 40. The method of claim 38,
transferring the first requested data from the first read buffer to a first temporary memory after reading the first data into the first read buffer, wherein said transferring the first requested data from the first read buffer to the first temporary memory location operates to satisfy the first read request; transferring the second data from the second read buffer to a second temporary memory location after reading the second data into the second read buffer, wherein said transferring the second requested data from the second read buffer to the second temporary memory location operates to satisfy the second read request, and transferring the third data from the first read buffer to the first temporary memory location after reading the third data into the first read buffer, wherein said transferring the third requested data from the first read buffer to the first temporary memory location operates to satisfy the third read request.