The described technology relates to a processing-in-memory (PIM). and more specifically, to a memory controller inside a PIM unit. In particular, the described technology relates to a memory management method for minimizing a transaction time of a memory inside a PIM and a hardware structure of a PIM for the memory control method
A processing-in-memory (PIM) is a memory integrated with a processing unit as one chip and improves system performance and power efficiency by reducing the frequency with which a processor accesses the memory to perform operations Unlike the traditional memory approach, the PIM approach uses instruction signals for operation in addition to using the address and data signals. The processing tout of the PIM processes tut instruction received from a processor and thus has complicated hardware. Accordingly, the memory performance of the PIM may be degraded. Therefore, since the memory performance of the PIM should not be significantly degraded, the processing unit requires as low hardware complexity as possible and thus is designed to support only relatively simple operations, for PIM, a variety of memories (e.g, dynamic random access memory (DRAM)) may be used as a memory element.
One aspect is a memory management method for maximizing processing-in-memory (PIM) performance and is directed to improving PIM performance by reducing PIM's unnecessary time to access DRAM.
In some embodiments, by changing the processing order of memory addresses explicitly or implicitly included in PIM instructions (or PIM operations), it is possible to improve the open page hit rate of the internal memory (e.g.. DRAM). To this end, when processing a PIM instruction packet, an instruction processing unit secondarily processes a request for access to a destination address at which read and write actions of an internal memory are likely to be sequentially performed. By secondarily requesting the destination address, a row address of an open page of the internal memory may match a row address to which a PIM instruction packet processing result is written back.
Also, according to the described technology, by the instruction processing unit pre-determining memory addresses explicitly or implicitly included in the PIM instruction and informing a memory controller of the memory addresses, it is possible to minimize performance loss that has occurred in an open-page miss situation of the internal memory. To this end, the instruction processing unit inside the PIM maintains memory write and read addresses that base been previously requested. The instruction processing unit compares the address of a packet to be processed to the maintained previous memory address and informs a memory controller about the comparison result through a page closing signal. The memory controller determines whether to maintain or close the page of the internal memory through this page closing signal.
The detailed configuration and interaction of the described technology will become more apparent through specific embodiments described below with reference to the drawings.
According to the described technology, by changing the processing order of memory addresses explicitly and implicitly included in PIM instructions (or PIM operations), it is possible to improve the open page hit rate of the DRAM.
According to the described technology, it is possible to reduce performance loss due to page miss having occurred when the management of the DRAM inside the PIM is operated in open-page mode.
In the case of DRAM, due to a unique action scheme different from those of other conventional memories, a required memory transaction time vanes greatly depending on the management method. In traditional computer systems, system performance has been improved through various memory management methods using open-page mode and close-page mode. Recently, computer systems of various structures have been developed due to social demands, and each computer system employs various memory scheduling methods that meet requirements. However, PIM is a computer structure that is newly studied to maximize the performance of a computer system, but a technique proposed for memory management is insufficient.
Advantages and features of the described technology and methods of achieving the same will become apparent with reference to some embodiments described in detail together with the accompanying drawings. However, the described technology is not limited to the following embodiments and may be implemented in various forms. The embodiments are merely provided to completely disclose the described technology and to fully inform those skilled in the art about the scope of the described technology, and the described technology is defined by the appended claims.
Also, terms used herein are only for describing the embodiments while not limiting the described technology. Herein, the singular forms “a,” “an,” and “one” include the plural unless the context clearly indicates otherwise. Herein, the terms “comprises” and/or “comprising” are used to specify the presence of stated elements, steps, actions, and/or components, but do not preclude the presence or addition of one or more other elements, steps, actions, and/or components.
Hereinafter, embodiments of the described technology will be described in detail with reference to the accompanying drawings. At this time, detailed descriptions of well-known elements or functions will be omitted if the descriptions may obscure the gist of the described technology.
<Basic Descripuon>
Processing-In-Memory (PIM)
A processing-in-memory (PIM) is an intelligent semiconductor that is packaged by integrating a memory and a processing unit into a single chip. Using a PIM can maximize the performance of a computer system by eliminating the memory bottleneck of traditional computers. Various memories may be used as the memory for PIM, and the described technology relates to a PIM that uses dynamic random access memory (DRAM) as an internal memory.
Page Policy for DRAM
In order to write data in the DRAM or to read data from the DRAM, a requested address is divided into a row address and a column address to be sequentially processed. A precharge action is performed before processing the row address. Since the row-address processing and the precharge action take a long time, i.e.. take up most of the response time of the DRAM, a conventional computer system mainly uses an open-page policy in which unnecessary row-address processing raid precharge action are omitted. As shown in
PIM Instruction Packet
A PIM performs an operation function other than a general read function and write function of a memory. Accordingly, a PIM and a computer system transmits and receives, for example, an instruction packet 110 that is configurable as shown in
A process of processing the instruction packet in the PIM has an atomic operation in which a memory read and write function and an operation function are sequentially executed as shown m
The memory management method of the PIM according to the described technology includes the following two types of PIM instruction packet processing methods in order to improve the response speed and throughput of the PIM. Through this processing scheme, it is possible to improve a page hit rate when the DRAM inside the PIM uses the open-page policy.
(1) Secondary Request for Destination Address
Types of PIM instruction packets include an instruction packet in which tin operation to be performed should use, as an operand, data of a destination address as well as a source address. The PIM memory management method according to the described technology includes a scheme of accessing the source address first and then accessing the destination address when the instruction processing unit 230 (see
Referring to
This method proposed by the described technology matches a row address of an open page of the internal DRAM 250 (see
(2) Page Closing Signal
The PIM structure diagram of
The other actions of
In this way, unlike the conventional DRAM controllers using open-page policy, the DRAM controller 240 determines whether to maintain the page according to the page closing signal 234 of the instruction processing unit 230 after the column-address processing 280. According to this method, it is possible to reduce performance loss by performing, in advance, a precharge action performed by conventional DRAM controller upon page mismatching.
An instruction analyzer 340 analyzes a PIM instruction packet to extract an instruction 341 and a destination address 342 and extract an operand address 343 or direct data 344 depending on the type of instruction.
When the instruction 341 to be processed is a memory write instruction, the finite state machine 350 performs a memory write action using the direct data 344 as write data. When the instruction 341 to be performed is a memory read, the finite state machine 350 requests data of the destination address from the memory controller 360 find transmits the requested data as read response data 361.
In the case of an operation instruction (I-type) using direct data the finite state machine 350 requests an operand from the destination address 342, delivers the operand to the processing unit 370 along with die direct data 344, and writes the operation result 371 back to the destination address 342. On the other hand, in the case of an operation instruction (R-type) using an operand address, the finite state machine 350 first requests an operand address 343 from die memory controller 360 and then requests data of the destination address 342. Also, the finite state machine 350 delivers collected operands to the processing unit 370 and writes the operation result back to the destination address 342.
While analyzing die PIM instruction packet, the instruction analyzer 340 compares the destination address of the previous instruction to the operand address of the current instruction to inspect whether the row addresses match (see
The described technology has been described in detail with reference to some embodiments, but those skilled in he art can understand that the described technology may be carried out in specific forms different from those described herein without changing the technical spirit or essential features of the described technology. Therefore, the above-described embodiments are to be regarded as illustrative rather than restrictive. Also, the scope of the described technology is defined not by the detailed description but by the appended claims, and all changes or modifications derived from the claims and their equivalents will be construed as being included in the technical scope of the described technology.
Number | Date | Country | Kind |
---|---|---|---|
10-2020-0085573 | Jul 2020 | KR | national |
This is a continuation application of International Patent Application No. PCT/KR2020/009163, filed on Jul. 13, 2020, which claims priority to Korean Patent Application No. 10-2020-0085573 filed on Jul. 10, 2020, contents of both of which are incorporated herein by reference in their entireties
Number | Date | Country | |
---|---|---|---|
Parent | PCT/KR2020/009163 | Jul 2020 | US |
Child | 17137003 | US |