Technical Field
The present disclosure relates to controlling access to a memory device.
Description of the Prior Art
It is known to provide a data processing apparatus having a memory device which has more than one storage unit in which a data item (having an associated memory address) can be stored. For example, where the memory device is a set-associative cache, a given data item can be stored in any of the multiple ways of the set-associative cache.
In this situation, it is further known for the purposes of reducing the overall access time for the memory device, to initiate a read procedure in each of the multiple storage units of the memory device before it is definitively known in which of those storage units the requested data item is stored. For example, in the context of a set-associative cache for example, it is known to perform a read procedure from all was of the cache and to rely on a tag hit (a match between the memory address of the requested data item and a stored tag portion of the memory address) to select the correct data item from each of the data items read out from the multiple ways. This technique however has the disadvantage that it comes at the price of the additional energy expenditure in performing the read out procedure from the “other” storage units (in which the requested data item is not in fact) in addition to the read out procedure performed for the storage unit in which the data item is in fact stored.
In order to seek to reduce the power consumption of a memory device having multiple storage units, it is further known to seek to predict the storage unit in which a requested data item is stored, so that only that storage unit is then accessed and additional power associated with accessing other storage units in which the data item is not stored is not expended. For example, in the context of the above mentioned set-associative cache, various “way prediction” techniques are known which determine a way in which the requested data item is stored and then only initiate the access with respect to that way. However, these techniques have the disadvantage that the way prediction then adds to the overall time taken for the retrieval of data from the memory, since additional time for the way prediction to complete can delay the start of the memory access.
Viewed from a first aspect a memory device is provided comprising: a plurality of storage units; and access control circuitry configured to receive an access request and in response to the access request to initiate an access procedure in each of the plurality of storage units, wherein the access control circuitry is configured to receive an access kill signal after the access procedure has been initiated, and the access control circuitry is configured, in response to the access kill signal, to initiate an access suppression to suppress the access procedure in at least one of the plurality of storage units.
The present techniques provide a memory device which facilitates fast access to the memory device by configuring the access control circuitry to initiate an access procedure in each (all) of the multiple storage units in response to an access request that is received (e.g. from a processor seeking access to a data item stored in the memory device). The access request may be a read access request or may be a write access request. However, the dynamic power consumed by the memory device can also be reduced because the access control circuitry can receive an access kill signal after the access procedure has already been initiated and in response thereto suppress the access procedure in at least one of the multiple storage units. Thus, although some dynamic power will already be consumed by the memory device by virtue of the fact that the access procedure has already started, by suppression of the access procedure in at least one of the multiple storage units, for example in all but one of the storage units in which the requested data item is subsequently recognised not to be stored when the storage unit in which the data item is stored has been identified, dynamic power can be saved. Note that the access control circuitry may form a separate component of the memory device from the plurality of storage units providing access control to all storage units or may be distributed across the plurality of storage units providing specific access control to each storage unit.
The suppression of the access procedure could be implemented in a variety of ways, but in some embodiments each of the plurality of storage units comprises wordline circuitry, the wordline circuitry configured to activate a selected wordline in response to the access request as part of the access procedure, and the memory device further comprises wordline suppression circuitry configured to suppress the selected wordline in response to the access kill signal. Hence, where the access procedure is dependent on a selected wordline being activated (e.g. ramped up to a predetermined voltage) further progress of the access procedure can be suppressed by suppressing that selected wordline (e.g. by bringing the selected wordline back to an inactive state (e.g. to a logical zero value)). Suppression of the selected wordline could be implemented in a number of ways, but in some embodiments the wordline suppression circuitry is configured to connect the selected wordline to a fixed voltage in response to the access kill signal. For example, the wordline suppression circuitry can comprise a selectable connection to a voltage rail in the memory device, e.g. the ground (VSS) rail.
Alternatively, or in addition, the suppression of the access procedure can be carried out with respect to sense amplifier circuitry in the memory device and in some embodiments each of the plurality of storage units comprises sense amplifier circuitry, and the memory device is configured to activate the sense amplifier circuitry in response to the access request to perform the access procedure, and the memory device further comprises sense amplifier suppression circuitry configured to suppress the sense amplifier circuitry in response to the access kill signal. The data read out from each storage unit in response to a read access request is typically generated by sense amplifier circuitry which is usually activated once the bit lines in the storage unit have discharged to the point at which it is possible for the sense amplifier circuitry to determine a stored data value from a voltage difference between those bitlines, and use of the sense amplifier circuitry to generate the output data value generally represents a significant proportion of the dynamic power consumed by the storage unit in carrying out the access procedure (in part as well because the changing data value also causes output drivers to toggle). Suppression of the sense amplifier circuitry thus can avoid the expenditure of this dynamic power, in particular if an access kill signal can be acted upon before the sense amplifier circuitry is even activated.
The sense amplifier suppression circuitry could suppress the sense amplifier circuitry in a number of ways, but in some embodiments the sense amplifier circuitry is configured to be activated when a sense amplifier enable signal is asserted, and the sense amplifier suppression circuitry is configured to connect the sense amplifier enable signal to a fixed voltage in response to the access kill signal. Where the sense amplifier circuitry requires the enable signal to be asserted in order to be active, coupling the enable signal to a fixed voltage (e.g. to ground when the sense amplifier enable signal is active high or to a logical high value when the sense amplifier enable signal is active low) can effectively disable the sense amplifier circuitry and prevent any significant further dynamic power being expended by the operation of the sense amplifier circuitry.
The memory device could take a variety of forms, but in some embodiments the memory device is a first level cache providing in association with the data processor. It is typically required for a response from a first level cache to be received very quickly and it is beneficial therefore for this responsiveness not to be delayed by uncertainty about the precise storage location (e.g. in which way of a set associative cache) of the accessed data item, as is supported by the present techniques.
The plurality of storage units could take a number of forms, but in some embodiments the memory device is a multi-way set-associative cache and the plurality of storage units are a plurality of ways of the multi-way set-associative cache. Storage of a data item in a memory device which is multi-way set-associative in this manner could take place in any one of the multiple ways, but which way a particular data item is stored in is generally not known when the access request is first received by the memory device, unless provision of the access request has been delayed whilst some variety of way prediction is carried out. The present techniques enable an undelayed access request to be provided to the memory device, but nevertheless to benefit from a late kill signal received later by the memory device (such as a predicted way signal), and therefore nevertheless to reduce the dynamic power consumption of the memory device.
In a multi-way set-associative cache the access suppression could be implemented in a number ways, but in some embodiments each way of the plurality of ways comprises a tag storage unit and a data storage unit and the access control circuitry is configured to initiate the access suppression in at least one tag storage unit of the plurality of storage units. Accordingly, where tag retrieval and comparison (against at least a portion of the memory address of the access request to be carried out) is performed, a dynamic power saving can be achieved by suppressing the part of the access procedure carried out in at least one tag storage unit.
Alternatively, or in addition, in some embodiments each way of the plurality of ways comprises a tag storage unit and a data storage unit and the access control circuitry is configured to initiate the access suppression in at least one data storage unit of the plurality of storage units. Where data items from a number of was are initially retrieved as part of the access procedure, dynamic power can be saved by suppressing the retrieval of a data item from at least one data storage unit.
In some embodiments the access control circuitry is configured to generate an internal clock signal in each of the plurality of storage units to control timing of the access procedure, and the access control circuitry is configured to initiate the access suppression after a first edge of the internal clock signal has been generated. Where an internal clock signal is generated for each storage unit to control the timing of the access procedure, and for example a first edge (e.g. rising edge) of the internal clock signal typically signals the start of the access procedure in the storage unit, it is beneficial if the access suppression can begin even after this point (i.e. that the access procedure initiated after the first edge of the internal clock signal is not atomic and can in fact be interrupted) since this provides the opportunity for dynamic power saving even at this relatively late stage.
In some embodiments each of the plurality of storage units comprises bitline precharge circuitry, wherein the bit line precharge circuitry is configured to precharge bitlines in that storage unit before the access procedure is initiated, and the access procedure comprises discharging the bitlines, and the access control circuitry is configured to initiate the access suppression after discharging the bitlines has started. It is also beneficial for the access suppression to be able to be initiated even after the discharging of the bit lines has started, in that although a small amount of dynamic power will be consumed by this partial bit line discharge, further dynamic power can be saved by being able to intervene to initiate the access suppression after this has already started and thus prevent further discharge of the bit lines and the dynamic power associated therewith.
In some embodiments the access control circuitry is configured to receive a chip enable signal for the memory device, the access request comprises assertion of the chip enable signal and the access control circuitry is configured to initiate the access procedure in response to assertion of the chip enable signal. The use of a chip enable signal, in particular asserted by a processor as part of the access request, may represent a critical path within a CPU which comprises the processor and the memory device (e.g. a level 1 cache) and initiating the access procedure (directly) in response to the assertion of the chip enable signal prevents this CPU critical path timing from being affected.
The access kill signal could take a number of forms, but in some embodiments, where the memory device is a multi-way set-associative cache, the access control circuitry is configured to receive, as the access kill signal, a way prediction signal indicating a subset of the ways, and the access control circuitry is configured to initiate the access suppression to suppress the access procedure in the subset of was indicated by the way prediction signal. The way prediction signal may be a positive identification of a way in which the requested data item is expected to be stored, in which case the access procedure is then suppressed in all other ways, or alternatively the way prediction signal could comprise an indication of one or more of the ways in which it is known that the requested data item is not stored and in this case the access procedure is suspended in those identified ways.
The access kill signal may directly indicate one or more of the storage units in which the access suppression should take place (either by the storage unit deselection signal directly indicating the storage units in which the access suppression should take place or by indicating the storage units in which the access procedure should continue (and by inference indicating that the access should be suppressed in the remainder)). Accordingly, in some embodiments, the access control circuitry is configured to receive, as the access kill signal, a storage unit deselection signal indicating a subset of the plurality of storage units, and the access control circuitry is configured to initiate the access suppression to suppress the access procedure in the subset of the plurality of storage units indicated by the storage unit deselection signal.
In some embodiments the access request is a read access request and the access procedure is a read access procedure.
In some embodiments the access request is a write access request and the access procedure is a write access procedure.
Viewed from a second aspect a method of operating a memory device is provided comprising:
storing data in a plurality of storage units;
receiving an access request; in response to the access request, initiating an access procedure in each of the plurality of storage units; receiving an access kill signal after the access procedure has been initiated; and in response to the access kill signal, suppressing the access procedure in at least one of the plurality of storage units.
Viewed from a third aspect a memory device is provided comprising a plurality of means for storing data; means for receiving an access request; means for, in response to the access request, initiating an access procedure in each of the plurality of means for storing data; means for receiving an access kill signal after the access procedure has been initiated; and means for, in response to the access kill signal, suppressing the access procedure in at least one of the plurality of means for storing data.
The present invention will be described further, by way of example only, with reference to embodiments thereof as illustrated in the accompanying drawings, in which:
When the processor core 14 requires access to a data item, whether that access is a read access or a write access, it issues an access request (a read access request or a write access request) which is first received by the L1 cache 16. If the requested data item is not present in the L1 cache 16 the access request is forwarded to the L2 cache 18 and if it is not stored there it is forwarded to the external memory 20. Again, one of ordinary skill in the art will be familiar with this use of a memory hierarchy. When the data item is stored in the L1 cache 16 the data item is accessed under control of the access control circuitry 22. In this embodiment the L1 cache 16 is a set-associative cache comprising four ways 24, 26, 28, 30. In order to keep the overall access time of the L1 cache 16 low, the access control circuitry 22 is configured to initiate the access (whether a read access or a write access) to the ways 24, 26, 28, 30 directly in response to the reception of the access request. However, the L1 cache 16 is also configured to receive an addition signal from the core 14, after the access request has already been sent, namely the “late kill signal”, in response to which the access control circuitry 22 is configured to suppress (i.e. stop) the access procedure which has already been initiated in one of the ways 24, 26, 28, 30. This is described in more detail with reference to the following figures.
The processor core 14 is configured to generate the late kill signal on the basis of a way prediction generated by way prediction 32. It should be noted that in this embodiment the way prediction 32 represents a function carried out by the core 14 rather than a specific physical sub-component within the core 14, although such a configuration is also contemplated. The way prediction generated by way prediction 32 and determining the late kill signal may be an explicit indication of the way in which the requested data item is expected to be stored, in which case the access control circuitry 22 is configured to cause the access procedure already initiated in all the other ways to be suppressed, or the way prediction may be an indication of at least one way in which it is predicted that the requested data item is not stored, in which case the access control circuitry 22 is configured to suppress the access procedure already initiated in those ways. It should also be noted that although the way prediction 32 may in principle be capable, with sufficient time available, to accurately predict which way of the L1 cache 16 of the requested data item is stored in, in order to maintain the overall access time of the L1 cache 16 at the above-mentioned low level, the processor core 15 can be configured to generate the late kill signal before the way prediction 32 has completed it prediction process, so that the access suppression within the L1 cache 16 can be triggered earlier enough to gain the benefit of a dynamic power reduction, even though this may then involve the access suppression being carried out in only a limited number of the ways (i.e. in fewer than N−1 ways where N is the total number of ways in the L1 cache 16). This may for example be appropriate in the context of a complex instruction being executed by the processor core 14 which requires a memory access to be carried out, but where the complexity of the instruction (for example requiring multiple derivative items of information) is such that the way prediction will take commensurately longer to be carried out.
The access control circuitry 42 is configured to receive the access request, which in this illustrated example is a read access request, where the access request comprises both the address of the requested data item, and a chip enable signal CE. The access control circuitry 42 is configured to initiate the access procedure in the storage unit 44 (and in the other storage units provided in the memory device 40) directly upon receipt of the asserted CE signal. Accordingly, the usual timing of this CE signal can be maintained and as such (generation and assertion of this CE signal being a critical path for the CPU) the timing of the issuance of the read access request is not affected by the present techniques. The access control circuit 42 is configured to cause the bit line precharge circuitry 56 to pre-charge the bit lines BL 50 and BLB 52 and indeed this may take place before the access request is received such that the time associated with precharging these bit lines is not incurred as part of the access latency. When the access procedure is initiated by the access control circuitry 42, the access control circuitry 42 causes the wordline driver circuitry 54 to assert the wordline 48 and the voltage of the wordline 48 thus begins rising. The access control circuitry 42 also generates an internal clock signal which initiates the access procedure in a storage unit 44. This is dependent on an external clock signal CLK for the memory device which is received by both the access control circuitry 42 and the storage unit 44 to control and coordinate the overall timing of these components.
Once the wordline driver circuitry 44 has brought the wordline 44 up to a sufficient voltage, the coupling that this asserted wordline 48 causes between the bit cell 46 and the bit lines BL 50 and BLB 52 causes the bit lines to begin discharging. Once this discharge has had time to progress significantly then the access control circuitry 42 is configured to assert a sense amplifier enable signal SAE (which in this illustrated embodiment is active high) in order to cause the sense amplifier 58 to determine the value stored in the bit cell 46 by comparison of the relative voltages on the two bit lines. However, the access control circuitry 44 is also configured to receive a late kill signal from the processor which issues the access request and in response to this to generate a the KILL signal which is passed to the storage unit 44. This kill signal KILL provides the switching signal for both the gate 68 coupled to the wordline 48 and to the gate 70 coupled to the SAE path. When the kill signal is asserted then both the wordline 48 and the SAE path are rapidly pulled down to VSS (ground) such that both are forced into an inactive state. This suppresses the access procedure in the storage unit 44. The late kill signal may be for example be a way prediction signal, when the storage unit 44 is a way of a set associative cache, as in the example illustrated in
However, when the late kill signal is received by the access control circuitry 22 after the access procedure has already been initiated in the tag RAMs and data RAMs, the access control circuitry 22 is configured to assert a kill signal in the deselected way or ways, this being provided by at least one of the respective kill signals kill0, kill1, kill2, kill3. In this manner the access procedure in the respective tag RAM and data RAM of a deselected way is suppressed. This can be implemented as shown in
In brief overall summary therefore the present techniques provide a memory device and a method of operating the memory device. The memory device comprises a plurality of storage units and access control circuitry. The access control is configured to receive an access request and in response to the access request to initiate an access procedure in each of the plurality of storage units. The access control circuitry is configured to receive an access kill signal after the access procedure has been initiated and, in response to the access kill signal, to initiate an access suppression to suppress the access procedure in at least one of the plurality of storage units. Hence, by initiating the access procedures in all storage units in response to the access request the access latency, e.g. without waiting for a further indication of a specific storage unit in which to carry out the access procedure, the overall access time for the memory device kept low, but by enabling at least one of the access procedures later to be suppressed in response to the access kill signal dynamic power consumption of the memory device can be reduced.
Although illustrative embodiments of the invention have been described in detail herein with reference to the accompanying drawings, it is to be understood that the invention is not limited to those precise embodiments, and that various changes, additions and modifications can be effected therein by one skilled in the art without departing from the scope and spirit of the invention as defined by the appended claims. For example, various combinations of the features of the dependent claims could be made with the features of the independent claims without departing from the scope of the present invention.
Number | Name | Date | Kind |
---|---|---|---|
5367654 | Furukawa | Nov 1994 | A |
5392443 | Sakakibara | Feb 1995 | A |
5701434 | Nakagawa | Dec 1997 | A |
6021461 | Dhong et al. | Feb 2000 | A |
7251710 | Isaac | Jul 2007 | B1 |
7443759 | Rowlands | Oct 2008 | B1 |
7647452 | Moll | Jan 2010 | B1 |
7873788 | Moll | Jan 2011 | B1 |
7934054 | Moll | Apr 2011 | B1 |
8848412 | Yeung | Sep 2014 | B1 |
20010026465 | Choi et al. | Oct 2001 | A1 |
20030061446 | Lee | Mar 2003 | A1 |
20030131196 | Park | Jul 2003 | A1 |
20040024792 | Chauvel | Feb 2004 | A1 |
20050240731 | Steely | Oct 2005 | A1 |
20060087882 | Kinsley | Apr 2006 | A1 |
20060226869 | Chong | Oct 2006 | A1 |
20060291314 | Miyamoto | Dec 2006 | A1 |
20060294291 | Kinsley | Dec 2006 | A1 |
20070025172 | Miyamoto | Feb 2007 | A1 |
20070288776 | Dement et al. | Dec 2007 | A1 |
20070297259 | Miyamoto | Dec 2007 | A1 |
20090019540 | Itsik | Jan 2009 | A1 |
20090144834 | Mochizuki | Jun 2009 | A1 |
20090216979 | Balasubramanian | Aug 2009 | A1 |
20110072323 | Chong | Mar 2011 | A1 |
20110149674 | Yeung | Jun 2011 | A1 |
20110158021 | Wiatrowski | Jun 2011 | A1 |
20110199844 | Miki et al. | Aug 2011 | A1 |
20120002499 | Kinkade | Jan 2012 | A1 |
20120124318 | Bivens | May 2012 | A1 |
20120155194 | Chong | Jun 2012 | A1 |
20120159076 | Tanpure et al. | Jun 2012 | A1 |
20120170390 | Idgunji | Jul 2012 | A1 |
20120215979 | Hunt | Aug 2012 | A1 |
20120290780 | Kinter | Nov 2012 | A1 |
20130166860 | Owaki | Jun 2013 | A1 |
20130182484 | Chong | Jul 2013 | A1 |
20130191583 | Fujimoto | Jul 2013 | A1 |
20130318530 | Hooker | Nov 2013 | A1 |
20140063926 | Lee et al. | Mar 2014 | A1 |
20140181407 | Crum et al. | Jun 2014 | A1 |
20140241036 | Toda | Aug 2014 | A1 |
20150149742 | Richter | May 2015 | A1 |
20150155044 | Yin | Jun 2015 | A1 |
Entry |
---|
UKIPO Search Report; GB 1511055.4; Nov. 25, 2015. |
Number | Date | Country | |
---|---|---|---|
20160034403 A1 | Feb 2016 | US |