This disclosure relates generally to data handling systems, and more specifically, to storage systems, methods, and apparatuses for fast input/output (TO) on PCIE devices.
Data handling systems, such as computer systems, generally include a host system with several core components, including a central processing unit (CPU), a memory device, IO devices, and a device interconnect or root complex for transferring information between components of the host system. One interconnect protocol is PCIe, which defines modes of data transfer between host system components.
Direct memory access (DMA), direct cache access (DCA), and data direct I/O (DDIO) are existing techniques to transfer data between host systems (memory or caches) and IO devices. DMA transfers data between IO devices and host system memory. DDIO transfers data between IO devices and the last level cache (LLC) of a host CPU. DCA transfers data between IO devices and host system memory similar to DMA, but after the data transfer is complete, DCA prefetches the data to the CPU caches from memory.
Host systems often include multiple data handling ports, each of which may be configured to transfer data by one of DMA, DCA or DDIO protocols. Such ports are referred to herein as “multi-ports.” A common system may have some multi-ports configured for DMA, some multi-ports configured for DDIO, and possibly some multi-ports configured for DCA. Existing systems send an IO request to devices using one of these techniques (DMA, DCA, or DDIO). However, there are currently no methods to combine functional aspects of DMA, DCA, § or DDIO protocols, or to dynamically switch between protocols for reduced latency or improved throughput, depending on the applications IO request.
Embodiments of storage systems, methods, and apparatuses for fast IO on PCIE devices are described. In an embodiment, a method may include receiving an IO request from a user or application, the IO request comprising instructions for communicating data with a host system, the host system comprising a processing device and a memory device. The method may also include analyzing information from the IO request in an IO block analyzer to select one of a plurality of communication paths for communicating the data with the host system. Additionally, the method may include defining a routing instruction in a transfer routing information transmitter in response to the selected communication path. Also, the method may include communicating the routing instruction in a Transaction Layer Packet (TLP) to an integrated IO (ITO) module of the host system. The method may additionally include routing the data from the peripheral device to either the processing device or the memory device according to the routing instruction with a data transfer router.
An embodiment of a system includes a peripheral device comprising an IO manager configured to: receive an IO request, the IO request comprising instructions for communicating data with a host system, the host system comprising a processing device and a memory device, analyze information from the IO request to select one of a plurality of communication paths for communicating the data with the host system, define a routing instruction in response to the selected communication path. The system may also include a data transfer device coupled to the peripheral device, and configured to communicate the routing instruction in a TLP to an IIO module of the host system.
An apparatus may include a data transfer device coupled to a peripheral device, and configured to communicate a routing instruction in a TLP to an IIO module of a host system. In such an embodiment, the routing instruction may include selection of one of a plurality of communication paths for communicating the data with the host system, the selection being determined in response to instructions included in an IO request.
The present invention(s) is/are illustrated by way of example and is/are not limited by the accompanying figures, in which like references indicate similar elements. Elements in the figures are illustrated for simplicity and clarity, and have not necessarily been drawn to scale.
Embodiments of systems and methods are disclosed for fast IO on PCIe devices. In an embodiment, a PCIe device may include multiple consistent IO ports for redundancy, performance, and QoS. Additionally, the PCIe device may include a set of routing scratch pads and a packet builder. The set of routing scratch pads may be used to store routing information for each port. The packet builder may dynamically build PCIe TLPs that include routing information set in the routing scratch pad.
The present embodiments also provide an IO manager, which may be defined by system software or firmware, and that may include a transfer routing decision maker comprising an IO block analyzer and a routing information sender. The IO block analyzer may generate routing information based on data characteristics. The routing information sender may send the information to the routing scratch pad in the PCIe device via memory mapped IO (MMIO). Additionally, the PCIe device may inform the CPU to set the routing control registers using routing information dynamically.
Further disclosed herein is an integrated input/output (IIO) module that may include a routing demultiplexer to change the data paths dynamically between memory and cache, based on the routing information, and methods that control the IO flow among device, memory, and cache.
In an embodiment, the host system 102 may include a root complex 104 which forms a data communication fabric for communicating data between the CPU 106, the memory 108 and one or more internal endpoints 112 or external endpoints 114 coupled to the root complex 104 via a physical interface 110. The root complex 104 may operate according to a data communication protocol, such as PCIe. In an embodiment, the root complex 104 and/or the physical interface 110 may include a plurality of data communication ports as illustrated in
One of ordinary skill will also recognize that the functional blocks of the method 300 illustrated in
As used herein, the term “data characteristic” may include an explicit tag, indicator, setting, field value, user input, or the like, which indicates a level of urgency of the IO request. Alternatively, a data characteristic may be an inherent property of the IO request, such as identification and/or classification of the software application 404 that sent the IO request, the status of the LLC 414, port availability, or the like.
In an embodiment, the routing information is transferred before transfer of the DMA request from the DMA request generator 512 to the Rx/Tx DMA register 518 in the FastIO PCIe device 514. In one embodiment, a routing scratch pad 516 in the transfer routing information transmitter 522 may store the transfer routing information received from the routing information sender. When the FastIO PCIe device 514 builds a Transaction Layer Packet (TLP) 520, the transfer routing information may be embedded in the TLP and communicated to the IIO module 410.
In an embodiment, the IIO module 410 initializes a routing control register 528 in the data transfer router 526 using the routing information embedded in the TLP. The purpose of a TLP is to transfer DMA information such as memory address, size, and so on to the host system 102 for each DMA request. So, the DMA engine 524 sends data to IIO module 526 following TLP transfer. The data is transferred to the destination which is stored in the routing control register by the routing demultiplexer (demux) 530, which is controlled by the routing information from the TLP stored in the routing control register 528. The data may be communicated to either the LLC 414 or to the memory 108 via the memory controller 412.
The following discussion is intended to provide a brief, general description of a suitable machine or machines in which certain aspects of the inventive concept can be implemented. Typically, the machine or machines include a system bus to which is attached processors, memory, e.g., random access memory (RAM), read-only memory (ROM), or other state preserving medium, storage devices, a video interface, and input/output interface ports. The machine or machines can be controlled, at least in part, by input from conventional input devices, such as keyboards, mice, etc., as well as by directives received from another machine, interaction with a virtual reality (VR) environment, biometric feedback, or other input signal. As used herein, the term “machine” is intended to broadly encompass a single machine, a virtual machine, or a system of communicatively coupled machines, virtual machines, or devices operating together. Exemplary machines include computing devices such as personal computers, workstations, servers, portable computers, handheld devices, telephones, tablets, etc., as well as transportation devices, such as private or public transportation, e.g., automobiles, trains, cabs, etc.
The machine or machines can include embedded controllers, such as programmable or non-programmable logic devices or arrays, Application Specific Integrated Circuits (ASICs), embedded computers, smart cards, and the like. The machine or machines can utilize one or more connections to one or more remote machines, such as through a network interface, modem, or other communicative coupling. Machines can be interconnected by way of a physical and/or logical network, such as an intranet, the Internet, local area networks, wide area networks, etc. One skilled in the art will appreciate that network communication can utilize various wired and/or wireless short range or long range carriers and protocols, including radio frequency (RF), satellite, microwave, Institute of Electrical and Electronics Engineers (IEEE) 545.11, Bluetooth®, optical, infrared, cable, laser, etc.
Embodiments of the present inventive concept can be described by reference to or in conjunction with associated data including functions, procedures, data structures, application programs, etc. which when accessed by a machine results in the machine performing tasks or defining abstract data types or low-level hardware contexts. Associated data can be stored in, for example, the volatile and/or non-volatile memory, e.g., RAM, ROM, etc., or in other storage devices and their associated storage media, including hard-drives, floppy-disks, optical storage, tapes, flash memory, memory sticks, digital video disks, biological storage, etc. Associated data can be delivered over transmission environments, including the physical and/or logical network, in the form of packets, serial data, parallel data, propagated signals, etc., and can be used in a compressed or encrypted format. Associated data can be used in a distributed environment, and stored locally and/or remotely for machine access.
Having described and illustrated the principles of the inventive concept with reference to illustrated embodiments, it will be recognized that the illustrated embodiments can be modified in arrangement and detail without departing from such principles, and can be combined in any desired manner. And although the foregoing discussion has focused on particular embodiments, other configurations are contemplated. In particular, even though expressions such as “according to an embodiment of the inventive concept” or the like are used herein, these phrases are meant to generally reference embodiment possibilities, and are not intended to limit the inventive concept to particular embodiment configurations. As used herein, these terms can reference the same or different embodiments that are combinable into other embodiments.
Embodiments of the inventive concept may include a non-transitory machine-readable medium comprising instructions executable by one or more processors, the instructions comprising instructions to perform the elements of the inventive concepts as described herein.
The foregoing illustrative embodiments are not to be construed as limiting the inventive concept thereof. Although a few embodiments have been described, those skilled in the art will readily appreciate that many modifications are possible to those embodiments without materially departing from the novel teachings and advantages of the present disclosure. Accordingly, all such modifications are intended to be included within the scope of this inventive concept as defined in the claims.
This application is a continuation of U.S. patent application Ser. No. 15/227,961 filed Aug. 4, 2016 which is incorporated by reference and which claims the priority benefit under 35 U.S.C. § 119(e) of U.S. Patent Application Ser. No. 62/341,604, filed May 25, 2016, which is hereby incorporated by reference.
Number | Name | Date | Kind |
---|---|---|---|
6424621 | Ramaswamy et al. | Jul 2002 | B1 |
6510164 | Ramaswamy et al. | Jan 2003 | B1 |
6651131 | Chong et al. | Nov 2003 | B1 |
8745292 | Lais et al. | Jun 2014 | B2 |
8788873 | Galles et al. | Jul 2014 | B2 |
8819304 | Shapiro et al. | Aug 2014 | B2 |
8874833 | Gole | Oct 2014 | B1 |
9003077 | Ye et al. | Apr 2015 | B2 |
9098423 | Cho | Aug 2015 | B2 |
9164679 | Smith | Oct 2015 | B2 |
9432298 | Smith | Aug 2016 | B1 |
9977754 | Kim | May 2018 | B2 |
10198185 | Park | Feb 2019 | B2 |
10248328 | Lee | Apr 2019 | B2 |
10387343 | Ching | Aug 2019 | B2 |
10642764 | Gerhart | May 2020 | B1 |
11029749 | Yang | Jun 2021 | B2 |
11126575 | Aslanidis | Sep 2021 | B1 |
11137486 | Ralston | Oct 2021 | B2 |
11184267 | Iorga | Nov 2021 | B2 |
11256644 | Mekad | Feb 2022 | B2 |
20010049744 | Hussey et al. | Dec 2001 | A1 |
20020188875 | Hwang et al. | Dec 2002 | A1 |
20030097481 | Richter | May 2003 | A1 |
20070064725 | Minami et al. | Mar 2007 | A1 |
20110022818 | Kegel et al. | Jan 2011 | A1 |
20110246686 | Cavanagh, Jr. et al. | Oct 2011 | A1 |
20130311817 | Kim | Nov 2013 | A1 |
20140189427 | Jayaprakash Bharadwaj | Jul 2014 | A1 |
20140198790 | Christenson | Jul 2014 | A1 |
20140359044 | Davis | Dec 2014 | A1 |
20150180782 | Rimmer | Jun 2015 | A1 |
20160095110 | Li | Mar 2016 | A1 |
20160154756 | Dodson | Jun 2016 | A1 |
20160378545 | Ho | Dec 2016 | A1 |
20170091108 | Arellano | Mar 2017 | A1 |
20170206166 | Kumar | Jul 2017 | A1 |
20180063555 | Raduchel et al. | Mar 2018 | A1 |
Entry |
---|
Final Office Action for U.S. Appl. No. 15/227,961 dated May 16, 2019. |
Liao, Guangdeng et al., “A New Server I/O Architecture for High Speed Networks,” High Performance Computer Architecture (HPCA), 2011 IEEE 17th International Symposium on High Performance Computer Architecture, 2011, 11 pages. |
Notice of Allowance for U.S. Appl. No. 15/227,961 dated Apr. 22, 2020. |
Office Action for U.S. Appl. No. 15/227,961 dated Oct. 31, 2018. |
Office Action for U.S. Appl. No. 15/227,961 dated Oct. 4, 2019. |
Number | Date | Country | |
---|---|---|---|
20200364176 A1 | Nov 2020 | US |
Number | Date | Country | |
---|---|---|---|
62341604 | May 2016 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 15227961 | Aug 2016 | US |
Child | 16986240 | US |