The present invention relates to the field of processing devices, and, more particularly, to processing devices with secure external memory and related methods.
A typical wireless communications device includes a memory, a processor cooperating with the memory, and a wireless transceiver cooperating with the processor for transmitting and receiving transmissions. The memory may store data to be processed or program code for execution by the processor. As computational demands on the typical wireless communications device have increased, the speed of the processor may be increased to improve performance. Another approach to increasing wireless communications device performance is to reduce the time taken by the processor to access the memory, i.e. to reduce memory access time.
An approach to reducing memory access time is to provide several types of memory, each with a different memory access time, for storing data. For example, the memory types may include long-term memory and short-term memory, such as a cache. More specifically, the cache, which has a relatively quick access time, may be used to store data that is frequently accessed. Once the data is stored in the cache, future use can be made by accessing the cached copy rather than re-fetching or re-computing the original data, so that the average access time is shorter. On the other hand, the long-term memory is typically substantially larger than the cache but also has a substantially greater memory access time.
Physically, within the typical wireless communications device, the processor and memory are typically separated, i.e. off-chip. In other words, the processor and memory are coupled together via a communication line, typically a data communications bus. In certain applications, this communications line between the processor and the memory presents a potential security risk to the computer system. For example, an unauthorized user may eavesdrop on the communications line in an attempt to perceive transmitted data from the memory, or the unauthorized user may compromise the memory and data stored therein.
An approach to this potential security risk is to encrypt all data transmitted on this communications line between the memory and the processor. For example, as disclosed in U.S. Pat. No. 6,523,118 to Buer, a computing system includes a processor, a memory subsystem storing encrypted data, and a secure cache controller coupled between the memory and the processor. When the processor needs data stored in the memory subsystem, the processor communicates with the secure cache controller, which requests the encrypted data from the memory subsystem and subsequently decrypts the data for the processor. A potential drawback to this design is the decrease in device performance since the processor no longer directly accesses the memory subsystem.
In view of the foregoing background, it is therefore an object of the present invention to provide a secure processing device that accesses external memory efficiently.
This and other objects, features, and advantages in accordance with the present invention are provided by a secure processing device comprising an external memory storing encrypted data, and at least one processor cooperating with the external memory. The at least one processor may be configured to generate a plurality of address requests for the encrypted data in the external memory, cache a plurality of keystreams based upon an encryption key, and generate decrypted plaintext based upon the cached plurality of keystreams and the encrypted data requested from the external memory. Advantageously, this secure processing device efficiently accesses encrypted external memory using a cache of keystreams.
More specifically, the at least one processor may be further configured to predict a plurality of future address requests, and the plurality of future address requests may be respectively associated with the cached plurality of keystreams. The at least one processor may also predict the plurality of future address requests based upon at least one of a current address request and a past address request.
In some embodiments, the at least one processor may comprise a plurality thereof operating in parallel. Further, in these embodiments, the secure processing device may further comprise at least one data cross-checker cooperating with the plurality of processors operating in parallel.
Additionally, the at least one processor may comprise an address bus, a keystream cache coupled to the address bus, and a keystream generator upstream from the keystream cache. Also, the keystream generator may comprise an expander coupled to the address bus, and an encryption engine coupled to the expander and having a key input thereto to generate the plurality of keystreams.
For example, the encryption engine may comprise an advanced encryption standard (AES) engine. The at least one processor may also be configured to generate each address request comprising at least one of a key stream index value, an index value, a tag value, and a memory page value. The at least one processor may be further configured to operate based upon a direct mapped cache protocol.
Another aspect is directed to a method of operating a secure processing device including an external memory storing encrypted data, and at least one processor cooperating with the external memory. The method may comprise using the at least one processor to generate a plurality of address requests for the encrypted data in the external memory, using the at least one processor to cache a plurality of keystreams based upon an encryption key, and using the at least one processor to generate decrypted plaintext based upon the cached plurality of keystreams and the encrypted data requested from the external memory.
The present invention will now be described more fully hereinafter with reference to the accompanying drawings, in which preferred embodiments of the invention are shown. This invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art. Like numbers refer to like elements throughout, and prime/multiple prime notations are used to indicate similar elements in alternative embodiments.
Referring initially to
The processor 11 illustratively includes a central processing unit (CPU) 13, and a keystream cache 14 cooperating with the CPU and the external memory 12. As will be appreciated by those skilled in the art, the CPU 13 and the keystream cache 14 are on-chip, i.e. on the same integrated circuit, whereas the external memory 12 is illustratively off-chip.
The external memory 12 may comprise, for example, random access memory. The CPU 13 may comprise, for example, a bus master processor, or the illustrated processing device with local cache 30, i.e. level-1/2 cache. As will be appreciated by those skilled in the art, the secure wireless communications device 10 may be implemented in Type-1 National Security Agency (NSA), North Atlantic Treaty Organization (NATO), Federal Information Processing Standard (FIPS) Publication 140-2, and Common Criteria for Information Technology Security Evaluation applications.
The processor 11 is configured to generate a plurality of address requests for the encrypted data in the external memory 12. As will be appreciated by those skilled in the art, the address requests are associated with corresponding memory addresses in the external memory 12, i.e. the processor 11 is requesting the data stored in the memory addresses.
The processor 11 is configured to cache a plurality of keystreams based upon an encryption key. For example, the processor 11 illustratively stores n+m keystreams (
The processor 11 is illustratively configured to generate each address request comprising at least one of a key stream index value, an index value, a tag value, and a memory page value. The processor 11 is illustratively configured to operate based upon a direct mapped cache protocol. In other embodiments, the processor 11 may be configured to operate based upon other caching schemes, for example, two-way associative and four-way associative.
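By way of illustration only, a direct mapped lookup of cached keystreams may be sketched as follows. The field widths (`OFFSET_BITS`, `INDEX_BITS`), the `Line` record, and the class and function names are hypothetical and are not taken from the disclosure; the sketch merely shows how a tag value and an index value derived from an address request may select and validate a cached keystream.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Assumed field widths (hypothetical, chosen only for this sketch).
OFFSET_BITS = 4   # one 128-bit keystream is assumed to cover 16 bytes
INDEX_BITS = 6    # 64 cache lines


@dataclass
class Line:
    tag: int          # upper address bits identifying the cached block
    keystream: bytes  # 128-bit keystream for that block


def split_address(addr: int) -> Tuple[int, int, int]:
    """Split a 32-bit address request into (tag, index, offset)."""
    offset = addr & ((1 << OFFSET_BITS) - 1)
    index = (addr >> OFFSET_BITS) & ((1 << INDEX_BITS) - 1)
    tag = addr >> (OFFSET_BITS + INDEX_BITS)
    return tag, index, offset


class DirectMappedKeystreamCache:
    """Each address maps to exactly one line; a fill evicts the old line."""

    def __init__(self) -> None:
        self.lines: List[Optional[Line]] = [None] * (1 << INDEX_BITS)

    def lookup(self, addr: int) -> Optional[bytes]:
        """Return the cached keystream on a hit, or None on a miss."""
        tag, index, _ = split_address(addr)
        line = self.lines[index]
        if line is not None and line.tag == tag:
            return line.keystream
        return None

    def fill(self, addr: int, keystream: bytes) -> None:
        """Store a keystream in the single line selected by the index."""
        tag, index, _ = split_address(addr)
        self.lines[index] = Line(tag, keystream)
```

In this sketch, two addresses that share an index value but differ in tag value compete for the same line, which is the characteristic trade-off of a direct mapped scheme relative to the two-way and four-way associative schemes mentioned above.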
More specifically, the processor 11 is further configured to predict a plurality of future address requests. The future address requests may be respectively associated with the cached plurality of keystreams. The processor 11 also predicts the future address requests based upon at least one of a current address request and a past address request.
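The disclosure does not specify a particular prediction scheme. As a minimal sketch under stated assumptions, a simple stride predictor may derive future address requests from the current address request and one past address request. The block size `BLOCK_BYTES`, the function name, and the stride heuristic itself are hypothetical and are offered only as one possible instance.

```python
from typing import List, Optional

BLOCK_BYTES = 16  # assumed: one keystream covers one 16-byte block


def predict_future_addresses(
    current: int, past: Optional[int], count: int = 4
) -> List[int]:
    """Guess the next block addresses from the current and one past request.

    With no usable history the predictor assumes sequential access
    (stride of one block); otherwise it repeats the last observed stride.
    """
    cur_block = current // BLOCK_BYTES
    stride = 1
    if past is not None:
        delta = cur_block - past // BLOCK_BYTES
        if delta != 0:
            stride = delta
    predictions = []
    for step in range(1, count + 1):
        addr = (cur_block + step * stride) * BLOCK_BYTES
        # Keep only predictions that fit in a 32-bit address space.
        if 0 <= addr < 2**32:
            predictions.append(addr)
    return predictions
```

Keystreams for the returned addresses could then be generated ahead of time so that they are already cached when the corresponding address requests arrive.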
Another aspect is directed to a method of operating a secure processing device 10 including an external memory 12 storing encrypted data, and at least one processor 11 cooperating with the external memory. The method may comprise using the at least one processor 11 to generate a plurality of address requests for the encrypted data in the external memory 12, using the at least one processor to cache a plurality of keystreams based upon an encryption key, and using the at least one processor to generate decrypted plaintext based upon the cached plurality of keystreams and the encrypted data requested from the external memory.
As will be appreciated by those skilled in the art, the secure wireless communications device 10 provides a significant performance benefit over the typical secure memory approach. More specifically, with the adaptive caching of the keystreams, the CPU 13 can achieve greater speed, and can experience bursts, when a greater number of the keystreams for the address requests are already in the keystream cache 14 rather than accessing the external memory 12. Moreover, unlike typical secure memory devices that decrypt the entire memory at boot up, this secure wireless communications device 10 does not have long boot-up times. Furthermore, the secure wireless communications device 10 keeps decrypted portions of memory to a minimum, thereby enhancing security.
Referring now to
Also, the keystream generator 18′ illustratively includes an expander 16′ coupled to the address bus 15′ and for expanding the 32-bit address request to 128 bits, and an encryption engine 17′ coupled to the expander and having a key input thereto to generate the plurality of keystreams (illustratively 128 bits wide). For example, the encryption engine 17′ illustratively includes an advanced encryption standard (AES) engine. In other embodiments, the encryption engine 17′ may use other encryption regimes, for example, Data Encryption Standard (DES), RSA, and MEDLEY encryption standard.
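The expander and encryption engine arrangement may be sketched as follows, purely for illustration. Two assumptions are made loudly here: the expansion is taken to be zero-padding of the 32-bit address to 128 bits, which the description does not specify, and the encryption engine is replaced by a stand-in keyed function (HMAC-SHA256 truncated to 128 bits) because the Python standard library provides no AES primitive. The stand-in is not AES, and the function names are hypothetical.

```python
import hashlib
import hmac


def expand_address(addr: int) -> bytes:
    """Expand a 32-bit address request to a 128-bit block.

    Assumption: zero-padded, big-endian. The source only states that the
    expander widens the 32-bit request to 128 bits.
    """
    if not 0 <= addr < 2**32:
        raise ValueError("address must fit in 32 bits")
    return addr.to_bytes(16, "big")


def generate_keystream(key: bytes, addr: int) -> bytes:
    """Derive a 128-bit keystream from the key and the expanded address.

    Stand-in only: HMAC-SHA256 truncated to 16 bytes takes the place of
    the AES engine, which would encrypt the expanded block under the key.
    """
    block = expand_address(addr)
    return hmac.new(key, block, hashlib.sha256).digest()[:16]
```

Because the keystream depends only on the key and the address, it can be computed before the encrypted data arrives from the external memory, which is what makes caching and pre-fetching of keystreams possible.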
Moreover, the processor 11′ also illustratively includes a cipher text data bus 20′ (illustratively 16 bits wide) communicating between the external memory (not shown) and the keystream cache 14′. The processor 11′ also illustratively includes a plaintext data bus 21′ (illustratively 16 bits wide) for transmitting plain text from the keystream cache 14′ to the CPU 13′. The processor 11′ also illustratively includes a keystream cache controller 31′ cooperating with the expander 16′, the encryption engine 17′, and the keystream cache 14′ to provide post-fetching/pre-fetching of keystreams and other management of the keystream cache system.
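As a hedged illustration of how plaintext may be produced from a cached keystream, the sketch below combines one 16-bit cipher text word with a 16-bit slice of a 128-bit keystream using an exclusive OR. The XOR combination, the division of each keystream into eight 16-bit words, the big-endian byte order, and the names `decrypt_word` and `word_index` are assumptions made for this example and are not stated in the description.

```python
def decrypt_word(ciphertext_word: int, keystream: bytes, word_index: int) -> int:
    """XOR one 16-bit cipher text word with its 16-bit keystream slice.

    word_index selects which of the eight 16-bit slices of the 128-bit
    keystream applies (assumed to come from the low address bits).
    """
    if len(keystream) != 16:
        raise ValueError("keystream must be 128 bits (16 bytes)")
    if not 0 <= word_index < 8:
        raise ValueError("word_index must be in the range 0..7")
    if not 0 <= ciphertext_word <= 0xFFFF:
        raise ValueError("cipher text word must fit in 16 bits")
    start = 2 * word_index
    ks_word = int.from_bytes(keystream[start:start + 2], "big")
    return ciphertext_word ^ ks_word
```

Since XOR is its own inverse, the same function also models encryption of a plaintext word in this sketch: applying it twice with the same keystream slice returns the original word.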
As discussed above with regard to the embodiment illustrated in
Referring now to
Many modifications and other embodiments of the invention will come to the mind of one skilled in the art having the benefit of the teachings presented in the foregoing descriptions and the associated drawings. Therefore, it is understood that the invention is not to be limited to the specific embodiments disclosed, and that modifications and embodiments are intended to be included within the scope of the appended claims.
Number | Name | Date | Kind |
---|---|---|---|
5444781 | Lynn et al. | Aug 1995 | A |
6345359 | Bianco | Feb 2002 | B1 |
6523118 | Buer | Feb 2003 | B1 |
7095850 | McGrew | Aug 2006 | B1 |
7469338 | Buer | Dec 2008 | B2 |
7505588 | Mironov et al. | Mar 2009 | B2 |
7653196 | Koshy et al. | Jan 2010 | B2 |
7773754 | Buer et al. | Aug 2010 | B2 |
20030091185 | Swindlehurst et al. | May 2003 | A1 |
20030149869 | Gleichauf | Aug 2003 | A1 |
20050021986 | Graunke et al. | Jan 2005 | A1 |
20050220302 | Mironov et al. | Oct 2005 | A1 |
20050223175 | Hepner et al. | Oct 2005 | A1 |
20050240764 | Koshy et al. | Oct 2005 | A1 |
20060179239 | Fluhr et al. | Aug 2006 | A1 |
20070192632 | Botzum et al. | Aug 2007 | A1 |
20070204108 | Griswell et al. | Aug 2007 | A1 |
20070260838 | Schwemmlein | Nov 2007 | A1 |
20080095370 | Rose et al. | Apr 2008 | A1 |
20080279371 | Lee et al. | Nov 2008 | A1 |
20090183161 | Kolinummi et al. | Jul 2009 | A1 |
Entry |
---|
Yang et al., “Improving memory encryption performance in secure processors”, IEEE Transactions on Computers, vol. 54, No. 5, May 2005, pp. 630-640. |
Platte, Jorg et al., “A Cache Design for a Security Architecture for Microprocessors (SAM),” Robotics Research Institute: Section Information Technology, University of Dortmund, 2006, pp. 1-15. |
Duca, Nathaniel et al., “Stream Caching: Optimizing Data Flow within Commodity Visualization Clusters,” 2002 pp. 1-4. |
Number | Date | Country |
---|---|---|
20100299537 A1 | Nov 2010 | US |