This application claims the benefit of China application Serial No. CN202010990240.6, filed on Sep. 18, 2020, the subject matter of which is incorporated herein by reference.
The disclosure relates to a memory access technique, and more particularly to a memory access method and an intelligent processing apparatus.
Current electronic devices such as smartphones and personal computers all include memories, for example, random access memories (RAM), for storing data. Data stored in a memory is usually access by an electronic device in a sequential manner during access, that is, consecutive addresses in a memory are sequentially accessed in a single access mode, in a way that the transmission efficiency and flexibility of the memory are quite limited
In view of the issues of the prior art, it is an object of the disclosure to provide a memory access method and an intelligent processing apparatus so as to improve the prior art.
The disclosure provides a memory access method applied to an intelligent processing apparatus. The memory access method includes: determining a memory access mode; obtaining a data bit length parameter according to the memory access mode, the data bit length parameter including a data read/write bit length and a data transmission bit length; obtaining a memory access parameter of a memory space, the memory access parameter including a data transmission length and an address hop length; and accessing the memory space according to the data read/write bit length, the data transmission bit length, the data transmission length and the address hop length.
The disclosure further provides an intelligent processing apparatus. The intelligent processing apparatus includes: a processor, generating a read command to read data from a memory; and a direct memory access (DMA) controller, reading data from the memory according to the read command. The DMA controller includes: a configuration circuit, providing a first configuration parameter according to a first operation mode; a signal port, receiving the read command; an address generation circuit, generating a plurality of first address signals to the memory according to the first configuration parameter and the read command; a data processing circuit, receiving data that the memory outputs in response to the first address signals, and performs a data interception process on the data received from the memory according to the read command and the first configuration parameter; and a data port, outputting the data processed by the data processing circuit to the processor. The first address signals include a plurality of non-consecutive address signals such that the data read from the memory and processed by the data processing circuit corresponds to n-dimensional data blocks, where n is a positive integer greater than 1.
The disclosure further provides a memory access method applied to an intelligent processing apparatus. The intelligent processing apparatus includes a processor. The method access method includes: receiving a read command from the processor to read data from the memory; providing a first configuration parameter according to a first operation mode; generating a plurality of first address signals according to the first configuration parameter and the read command; and receiving data that the memory outputs in response to the first address signals, and performing a data interception process on the data received from the memory according to the read command and the first configuration parameter to output to the processor. The first address signals include a plurality of non-consecutive address signals such that the data read from the memory and processed by the data processing circuit corresponds to n-dimensional data blocks, where n is a positive integer greater than 1.
Features, implementations and effects of the disclosure are described in detail in preferred embodiments with the accompanying drawings below.
The technical solutions of the embodiments of the disclosure are clearly and comprehensively described in combination with the accompanying drawings of the embodiments of the disclosure. It is obvious that the described embodiments are merely some embodiments of the disclosure but not all possible embodiments. On the basis of the embodiments of the disclosure, all other embodiments arrived by a person skilled in the art without involving inventive skills are to be encompassed within the scope of protection of the disclosure.
Refer to
In one embodiment, the IPU 11 is integrated with a direct memory access (DMA) controller, and the IPU 11 accesses the memory 13 through the DMA controller. The DMA controller can control access or transfer of data stored in the memory 13, for example, control transfer of the data from one storage space to another storage space, or control transfer of the data from the memory 13 to another memory (such as an internal memory).
The memory 13 may include multiple types of memories, for example, random access memory (RAM), static random access memory (SRAM), and dynamic random access memory (DRAM).
Refer to
Refer to
In step 210, an access mode of the memory is determined. The DMA controller 14 is configured therein with multiple memory access modes in advance, that is, including multiple operation modes, each memory access mode corresponds to one set of configuration parameters, and the DMA controller 14 operates according to the configuration parameters. For example, a non-byte access mode and a byte access mode may be configured in advance. A corresponding mode message may be provided for each memory access mode, and the mode message may be recorded in a register, and includes, for example, an access mode command code byte_mode. For example, when the memory access mode is a non-byte access mode, byte_mode may be configured as 0; when the memory access mode is a byte access mode, byte_mode may be configured as 1.
In one embodiment, the DMA controller 14 may be further provided therein with a mode message of a memory access pattern, for example, an access pattern command code extract mode. Extract_mode=0 represents a sequential address mode, and extract_mode=1 represents a hop address access mode. The DMA controller 14 may first obtain the mode message of the memory access pattern at the beginning of execution of memory access. When the obtained mode message of the memory access pattern is extract_mode=0, the memory 13 is accessed according to a common sequential address access pattern; in this mode, an address generation circuit 142 correspondingly generates a plurality of consecutive addresses so as to access the memory 13. When the obtained mode message of the memory access pattern is extract_mode=1, the memory 13 is accessed according to a hop address access pattern; in this mode, the address generation circuit 142 correspondingly generates a plurality of address signals, which include a plurality of non-consecutive addresses. In this embodiment, the mode message of the memory access mode and the mode message of the memory access pattern determine the operation mode of the DMA controller 14, and determine the corresponding configuration parameters, and the DMA controller 14 then operates according to the configuration parameters. In practice, the DMA controller 14 includes therein a configuration circuit 141. The configuration circuit 141 provides the corresponding configuration parameters according to the current operation mode, that is, the configuration circuit 141 may provide the corresponding configuration parameters according to the mode messages of the memory access mode and the memory access pattern.
In one embodiment, the configuration parameters may include parameters such as a data bit length parameter, a data transmission length and an address hop length. The data bit length parameter includes information such as a data read/write bit length and a data transmission bit length. The data read/write bit length represents a bit length for reading a data line, and the data transmission bit length represents a bit length of data transmission during access of the memory 13.
For example, for the non-byte access mode, the data read/write bit length and the data transmission bit length are both configured as m bytes; for the byte access mode, the data read/write bit length is configured as m bytes, and the data transmission bit length is configured as one byte, where m is an integer greater than 1.
In practice, the data transmission length may include one parameter, or may include one set of parameters. Similarly, the address hop length may include one parameter, or may include one set of parameters. In one embodiment, the data transmission length includes l0, which represents the data length of one transmission; the address hop length includes s0, which represents the address length that needs to be hopped between one access and a next access of the memory space, where l0 and s0 are both positive integers.
In step 220, multiple first address signals are generated according to the configuration parameters and the read command. When the IPU core 110 is to read the data in the memory 13, the IPU core 110 generates a read command to a signal port 144 of the DMA controller 14, and the signal port 144 translates and transmits the received read command to the address generation circuit 142. The address generation circuit 142 generates multiple address signals according to the configuration parameters provided by the configuration circuit 141, and outputs the address signals to the memory 13, so as to read the corresponding data in the memory 13.
When the mode message of the memory access pattern is extract_mode=1 and the operation is in the non-byte access mode, the multiple address signals generated by the address generation circuit 142 include multiple non-consecutive address signals so as to read data corresponding to n-dimensional data blocks from the memory 13, where n is a positive integer greater than 1. Associated details are given below.
In practice, the address generation circuit 142 generates the corresponding multiple address signals according to the parameters including the data read/write bit length, the data transmission bit length, the data transmission length and the address hop length, and then according to the read command.
Refer to
In another embodiment, refer to
In some embodiments, refer to
When the DMA controller 14 performs a write operation on the memory space, data of one one-dimensional line l0 is transmitted each time and one one-dimensional line l0 is written, and writing of the next one-dimensional line is performed in a way of hopping s0 addresses each time after writing one one-dimensional line. Thus, l0×m bytes are transmitted and written to l0×m byte addresses of the memory space each time, and the next write is performed in a way of hopping s0×m byte addresses each time after writing l0×m bytes.
In step 230, data interception or complement is performed on the data received from the memory according to the configuration parameters and the read command. The data processing circuit 143 receives the data that the memory 13 outputs in response to the multiple first address signals generated by the address generation circuit 142, and performs a data interception or complement process on the data received from the memory according to the configuration parameters and the read command. The data processed by the data processing circuit 143 may be transmitted to the IPU core 110 through the data port 145.
In one embodiment, refer to both
Refer to
It is understandable that because l0 is less than m, the m byte addresses need to be complemented each time after writing l0 bytes into the m byte addresses of the memory space. Thus, after transmitting and writing l0 bytes to the m byte addresses of the memory space each time, a corresponding identifier code needs to be provided while writing each set of l0 bytes, and the byte addresses not written with any bytes in each set of m byte addresses of the memory space are complemented.
The DMA 14 may provide a corresponding identifier code each time when writing l0 bytes to the m byte addresses of the memory space. It should be noted that, to configure the identifier code, the bytes may be first written and then the identifier code is provided, the identifier code may be first provided and then the bytes are written, or the identifier code may be provided while writing the bytes. The identifier code indicates which byte addresses of the m byte addresses are written with valid data, and which byte addresses are not yet written with valid data. The identifier code may be an m-bit binary character, a binary character of “1” represents that the located byte address is written with valid data, and a binary character of “0” represents that the locate byte address is not written with valid data. For example, as shown in
Moreover, the DMA controller 14 may complement the byte addresses not written with bytes in each set of the m byte addresses of the memory space, that is, complementing the byte addresses having the identifier code as the binary character 0, such as complementing to 0, or complementing to other value as desired. It is understandable that, since the binary characters of the complemented byte addresses are 0, it also means that the data at the complemented byte addresses is invalid data, and any desired value used for the complement does not affect the data written to the m byte addresses.
In some embodiments, for the byte access mode, for example, when byte_mode=1, the data read/write bit length is m bytes, and the data transmission bit length is one byte. When the data in the memory space corresponds to a three-dimensional space, the three-dimensional space includes l2 two-dimensional spaces, with each two-dimensional space including l1 one-dimensional lines, and the length of each one-dimensional line being l0. The address hop length include s0 and s1, s0 and s1 are a set of parameters, each one-dimensional line and a next one-dimensional line are spaced by s0 byte addresses, and each two-dimensional space and a next two-dimensional space are space by s1 byte addresses.
At this point, when l0 is less than m, the sum of l0 and s0 is equal to m and s0 is less than s1, the operation performed by the DMA controller 14 according to the configuration parameters of the operation mode includes: reading l1×m bytes in the memory space each time, performing the next read in a way of hopping l1×m (s1−s0) byte addresses each time after reading l1×m bytes, and transmitting l1×m bytes each time after reading l1×m bytes.
It is understandable that, in an actual application, to perform a read operation or a write operation on a double data rate (DDR) memory, because the bus structure is an advanced extensible interface (AXI), such bus structure is capable of reading/writing multiple m bytes, and multiple read/write requests may exist simultaneously on the bus although a result of a first read/write request sent out is not yet received. Due to a delay in a read/write operation of such DDR memory, the utilization efficiency is significantly reduced if one set of m bytes is read/written each time.
Further, it is understandable that, both l0 and s0 are small when l0 is less than m, the sum of l0 and s0 is equal to m and s0 is less than s1, and so if the memory space is accessed according to the byte access mode, a read operation or a write operation needs to be performed multiple times on the same m byte addresses, further leading to lowered access efficiency. When expanded to a multi-dimensional memory space, in a situation where one set of m bytes includes valid bytes at k0, k1, k2, . . . and kx sections, the address hop lengths are p0, p1, p2, . . . and px−1, respectively, and k0+p0+k1+p1+k2+p2+px−1+kx+s0 is m, the m bytes need to be accessed for x times, and hence it is evident that the access efficiency is extremely low.
Therefore, the access mode is modified at this point to the non-byte access mode, and the length of the one-dimensional lines is modified to Ii, the number of one-dimensional lines is modified to l2, the number of two-dimensional spaces is modified to 1, the hop address between each one-dimensional line to a next one-dimensional line is modified to s1−s0, the hop address between each two-dimensional space and a next two-dimensional space is modified to 0, and access is performed according to the non-byte access mode, so as to enhance the access efficiency.
That is, l1×m bytes in the memory space are read each time, the next read is performed in a way of hopping (s1−s0)×m byte addresses each time after reading l1×m bytes, and l1×m bytes are transmitted each time after reading l1×m bytes.
Similarly, when l0 is less than m, the sum of l0 and s0 is equal to m and s0 is less than s1, data is transmitted in a unit of the data transmission bit length according to the data transmission length, and a write operation is performed on the memory space in a unit of the data read/write bit length according to the data transmission length and the address hop length, the operation including: transmitting and writing l1×m bytes to l1×m byte addresses of the memory space each time, and performing the next read in a way of hopping (s1−s0)×m byte addresses each time after reading l1×m bytes.
Accordingly, access of a three-dimensional space can be hopped to access of two dimensional spaces, and access can be performed according to the non-byte mode, thereby enhancing the access efficiency of the memory space.
In one embodiment, after reading l1×m bytes of the memory space each time, the operation further includes: determining a start byte and an end byte of each set of m bytes, wherein a byte length between the start byte and the end byte of each set of m bytes is l0; the transmitting and writing l1×m bytes to l1×m byte addresses of the memory space each time includes: determining a start byte and an end byte of each set of m bytes of the memory space; transmitting l0×m bytes each time; sequentially writing each set of l0 bytes to each set of m byte addresses, and providing a corresponding identifier code; and complementing the byte addresses not written with bytes in each set of m byte addresses.
It is understandable that, since the access of the three-dimensional space is hopped to the access of the two-dimensional spaces, to perform a read operation, a start byte (start_byte) and an end byte (end_byte) of each set of m bytes need to be determined each time after reading l1×m bytes of the memory space, so as to ensure the data read by the read operation, data transmitted, data written by the write operation are accurately aligned, further ensuring validity of data.
Correspondingly, to perform a write operation, the start bye and the end byte of each set of m bytes need to be first determined, l0×m bytes are then transmitted each time, and each set of l0×m bytes are sequentially written to one set of m byte addresses.
Moreover, a corresponding identifier code needs to be provided while writing each set of l0 bytes, and the byte addresses not written with any bytes in each set of m byte addresses of the memory space are complemented. It should be noted that, to configure the identifier code, the bytes may be first written and then the identifier code is provided, the identifier code may be first provided and then the bytes are written, or the identifier code may be provided while writing the bytes.
Details of providing the identifier code and complementing the byte addresses may be referred from the description of the embodiments above, and are omitted herein for brevity.
In an actual implementation, the disclosure is not limited by the execution sequences of the various steps described, and without producing any conflict, some of the steps may also be performed in other sequences or performed simultaneously.
It is known from the description above, in the memory access method provided by the embodiments of the disclosure, multiple operation modes can be configured for a DMA controller, wherein the different operation modes correspond to different configuration parameters. Further, in conjunction with specific read commands, the operation mode may be set according to different memory access requirements, and a memory space may be accessed by adopting different parameters, thereby providing the memory space with more flexible and diversified accessed modes. In addition, the object of corresponding to data of n-dimensional data blocks can be achieved by involving only one read command, hence saving command resources during access of the memory space.
The memory access method and the intelligent processing apparatus provided according to the embodiments of the disclosure are as described in detail above. The principle and implementations of the disclosure are described by way of applying specific embodiments in the description, and the description of the embodiments are for better understanding of the method and core concepts of the disclosure. Further, modifications may be made to the specific implementations and application ranges by a person skilled in the art according to the concepts of the disclosure. In conclusion, the contents of the description should not be construed as limitations to the disclosure.
Number | Date | Country | Kind |
---|---|---|---|
202010990240.6 | Sep 2020 | CN | national |