Resistive random-access memory (RRAM) is a type of non-volatile (NV) random-access (RAM) computer memory that operates by changing the resistance across a dielectric solid-state material. An example of an RRAM is a memristor. RRAM may be used to store data and to perform various operations related to the stored data.
Features of the present disclosure are illustrated by way of example and not limited in the following figure(s), in which like numerals indicate like elements.
For simplicity and illustrative purposes, the present disclosure is described by referring mainly to examples. In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present disclosure. It will be readily apparent, however, that the present disclosure may be practiced without limitation to these specific details. In other instances, some methods and structures have not been described in detail so as not to unnecessarily obscure the present disclosure.
Throughout the present disclosure, the terms “a” and “an” are intended to denote at least one of a particular element. As used herein, the term “includes” means includes but not limited to, and the term “including” means including but not limited to. The term “based on” means based at least in part on.
When an analog value (e.g., a scalar) is quantized to a digital representation, the error between the original analog value and the digital representation may be denoted as quantization error. Collectively, as when a discrete-time analog signal (e.g., a vector) is converted to a digital sequence, the resulting sequence of quantization errors may be denoted as quantization noise. According to an example, a total quantization noise energy may be proportional to the least significant bit (LSB) size of the digital representation (e.g., a smaller LSB step corresponds to a lower total quantization noise energy). The quantized signal may be expressed as the sum of the original (noise-less) analog signal plus the quantization noise signal.
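The relationship between the quantized signal, the original signal, and the quantization noise may be illustrated by the following minimal sketch. The sample values, the LSB sizes, and the function names are hypothetical and are chosen for illustration only; rounding to the nearest LSB multiple is one possible quantizer and is assumed here.

```python
def quantize(value, lsb):
    """Round an analog value to the nearest multiple of the LSB step (assumed quantizer)."""
    return round(value / lsb) * lsb


def quantization_noise(signal, lsb):
    """Return (quantized, noise) such that quantized[i] == signal[i] + noise[i]."""
    quantized = [quantize(v, lsb) for v in signal]
    noise = [q - v for q, v in zip(quantized, signal)]
    return quantized, noise


def energy(sequence):
    """Sum of squared samples."""
    return sum(v * v for v in sequence)


# Hypothetical analog samples, for illustration only.
signal = [0.13, 0.42, 0.77, 0.58, 0.21, -0.34, -0.69, -0.45]

coarse_quantized, coarse_noise = quantization_noise(signal, lsb=0.25)
fine_quantized, fine_noise = quantization_noise(signal, lsb=0.125)
```

For these particular samples, the noise energy obtained with the smaller LSB step is lower than the noise energy obtained with the larger LSB step, and each noise sample is bounded by one half of the LSB step.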
Oversampling discrete-time signals may increase the bandwidth of the associated discrete-time signal. In this respect, frequencies between the Nyquist limit of the original sampled signal, and the Nyquist limit of the new oversampled signal, may become viable.
Since quantization noise energy is a function of the digital LSB (i.e., quantization noise energy is fixed, and independent of bandwidth), oversampling a discrete-time (digitized) sequence may cause the quantization noise energy to spread uniformly across a wider bandwidth, reducing the amount of in-band quantization noise energy. This aspect of quantization noise energy provides for a 3 dB improvement in signal-to-noise ratio (SNR) for each doubling of the sampling rate (i.e., the noise spectral density decreases by a factor of two for each doubling of the bandwidth).
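The 3 dB figure may be checked with the following short sketch, which assumes that a fixed noise energy is spread uniformly over the oversampled bandwidth and that no noise shaping is applied. The function names are hypothetical.

```python
import math


def in_band_noise_fraction(oversampling_ratio):
    """Fraction of a fixed, uniformly spread noise energy that remains in the original band."""
    return 1.0 / oversampling_ratio


def snr_gain_db(oversampling_ratio):
    """SNR improvement, in dB, from oversampling alone (no noise shaping assumed)."""
    return 10.0 * math.log10(oversampling_ratio)
```

For example, `snr_gain_db(2)` evaluates to approximately 3.01 dB and `snr_gain_db(4)` to approximately 6.02 dB, consistent with a 3 dB improvement for each doubling of the sampling rate.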
A mechanism for displacing quantization noise energy from the baseband may include noise shaping. In this regard, oversampling may be used to provide a disposal area for the noise energy, but additional manipulations may be imposed on the noise signal such that the noise signal is attenuated in-band at the expense of amplifying the noise signal in the higher (disposal) frequencies made available by oversampling. In order to implement noise shaping, as an analog sample is quantized, the quantization error at the point of quantization may be measured, and the quantization error value may be taken into consideration during subsequent samples' quantization steps. In this regard, noise shaping processes may differentiate the noise by averaging the noise out at low frequencies (a low frequency signal has nearly the same value from point to point, thus differentiating may leave the relatively small point-to-point change), while accentuating the noise at high frequencies (high frequencies may differ from point to point, thus differentiating may produce relatively large point-to-point changes).
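The effect of differentiation on low frequency and high frequency content may be illustrated by the following sketch. The two test sequences and the function names are hypothetical, and the sample preceding the first sample is assumed to be zero.

```python
def first_difference(sequence):
    """Return y[n] = x[n] - x[n-1], taking the sample before the first as 0 (assumption)."""
    previous = 0.0
    out = []
    for x in sequence:
        out.append(x - previous)
        previous = x
    return out


def energy(sequence):
    """Sum of squared samples."""
    return sum(v * v for v in sequence)


# Hypothetical sequences: a constant (lowest frequency) and an alternating
# (highest frequency) sequence, both with the same energy before differentiation.
slow = [1.0] * 8
fast = [1.0, -1.0] * 4

slow_diff = first_difference(slow)   # small point-to-point changes after the first sample
fast_diff = first_difference(fast)   # large point-to-point changes
```

In this example, both sequences start with an energy of 8. After differentiation, the energy of the constant sequence falls to 1, while the energy of the alternating sequence rises to 29.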
With respect to Multiply-Accumulate (MAC) operations, since convolution in the time-domain equates to multiplication in the frequency domain (and vice-versa), convolution of a noise shaped signal with either another noise shaped signal or a noise-free signal produces a normal (i.e., noise-less, or at least low-noise, depending on the efficacy of the noise shaping) result in-band. Further, with respect to convolution of a noise shaped signal with either another noise shaped signal or a noise-free signal, what occurs in the disposal band (noise operating on noise) may not be of interest.
With respect to discrete-time filtering, one of the two convolution input signals may include a finite impulse response (i.e., a non-infinite sequence of values that is relatively shorter than the other input signal) that is repeatedly used in the convolution determination as the finite impulse response is swept along the entirety of the other input signal. This shorter sequence may be denoted as the convolution kernel. The convolution kernel may represent a matrix of weights in a convolution.
A discrete-time convolution may be performed as a sequence of MAC operations, with each value of the kernel multiplied by a value in the input signal, and the products being summed to produce one sample in an output sequence. For a next sample in the output sequence, the entire kernel may be shifted along the input sequence by one value, and the MAC may be repeated. This process may be repeated until the entire output sequence has been determined.
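The sequence of MAC operations described above may be sketched as follows. The function names are hypothetical. The kernel is reversed before it is swept along the input, which follows the textbook definition of convolution and is an assumption here, and only output samples for which the kernel fully overlaps the input are produced.

```python
def mac(kernel, window):
    """Multiply each kernel value by the aligned input value and accumulate the products."""
    acc = 0.0
    for k, x in zip(kernel, window):
        acc += k * x
    return acc


def convolve_valid(signal, kernel):
    """Shift the kernel along the signal by one value per output sample, one MAC each."""
    flipped = kernel[::-1]  # convolution reverses the kernel; correlation would not
    n_out = len(signal) - len(kernel) + 1
    return [mac(flipped, signal[i:i + len(kernel)]) for i in range(n_out)]
```

For example, `convolve_valid([1, 2, 3, 4], [1, 1])` uses three MAC operations and returns `[3.0, 5.0, 7.0]`.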
Resistive random-access memory (RRAM) cells are capable of storing analog values. However, under certain conditions, control over the written value may be limited. The limitations on control over the written value may interfere with the use of memristors in analog computations. For example, when an analog signal is sampled and the samples are inaccurately represented (e.g., in certain cases, storing an analog signal on an array of memristors), the corrupted signal may be analyzed as the sum of the original (noise-less) analog signal, plus the error signal. Under certain conditions, this error signal may be noise shaped out of the baseband, and into a disposal band generated by oversampling.
In order to address the aforementioned aspects related to RRAM cells, a discrete-time analog filtering apparatus and a method for discrete-time analog filtering are disclosed herein. For the apparatus and method disclosed herein, any arbitrary level of accuracy in a computation (e.g., an analog computation) may be obtained based on oversampling and noise shaping. Thus, the accuracy may be increased based on the use of further RRAM cells. For the apparatus and method disclosed herein, errors may be noise shaped out of the baseband and into the disposal band, where the errors may remain for the duration of a computation.
The apparatus and method disclosed herein may further include a circuit to provide for efficient and accurate determination of a MAC (i.e., a dot product) using memristor cells, and noise shaping of the values related to the determination of the MAC. The apparatus and method disclosed herein may thus provide for efficient and accurate determination of a MAC (e.g., an analog MAC) by using memristor cells whose stored values are oversampled and noise shaped in order to retain accuracy in the MAC computation.
The operation control module 108 may include an oversampling module 114 to perform the operation 112 on the sampled input signal values 106 by oversampling the input signal 104 and weight values 116 stored in the RRAM cells 110. The weight values 116 may represent coefficients for a convolution related to the MAC operation. In this regard, the weight values 116 may correspond to kernel values for the MAC operation. Further, the operation control module 108 may include a noise shaping module 118 to perform the operation 112 on the weight values 116 by noise shaping the weight values 116 stored in the RRAM cells 110. In this regard, the noise shaping module 118 may also noise shape the input signal values 106.
As described herein, the coefficients of a convolution kernel may be multiplied with the input signal 104 represented by the voltage values Vin[0], Vin[1], etc. For each of the RRAM cells R0,0-R0,3, a current, for example, for Vin[0] and R0,0, may be determined as Vin[0]*G0,0, where G0,0 may represent the conductance for R0,0, and the current is proportional to the kernel value for R0,0 (other currents at R0,1-R0,3 may be similarly determined). The currents related to the input signal including Vin[0]-Vin[3], and the RRAM cells R0,0-R0,3 may be respectively summed at the negative input node of the operational amplifier 202, amplified, and output as Vout[0].
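The current summation for one row of RRAM cells may be modeled by the following sketch. The ideal inverting configuration with a feedback resistance `r_feedback` is an assumption that is not stated above, and the function name and numeric values are hypothetical.

```python
def row_output_voltage(v_in, conductances, r_feedback):
    """Model one row of cells feeding an ideal inverting summing amplifier (assumed)."""
    if len(v_in) != len(conductances):
        raise ValueError("one conductance is needed per input voltage")
    # Current through each cell, e.g., Vin[0] * G0,0 for the first cell.
    currents = [v * g for v, g in zip(v_in, conductances)]
    # The currents are summed at the negative input node and converted to a voltage.
    return -r_feedback * sum(currents)


# Hypothetical values: volts, siemens, and ohms.
v_out = row_output_voltage(
    v_in=[0.1, 0.2, 0.3, 0.4],
    conductances=[1e-3, 2e-3, 3e-3, 4e-3],
    r_feedback=1000.0,
)
```

With these hypothetical values, the summed current is 3 mA and the modeled output is approximately -3.0 V; the sign follows from the assumed inverting configuration.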
After oversampling, for example, by adding intermediate samples to the sequence, and low pass filtering to determine the values of the new samples, the noise shaping module 118 may perform the noise shaping, for example, for the oversampled kernel values stored in the RRAM cells 302.
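Adding intermediate samples to a sequence of kernel values may be sketched as follows. Linear interpolation is used here as a simple stand-in for the low pass filtering step; the filter actually used is not specified above, so this choice and the function name are assumptions.

```python
def oversample_linear(values, factor):
    """Insert factor - 1 intermediate samples between neighbors by linear interpolation."""
    if factor < 1:
        raise ValueError("factor must be at least 1")
    if not values:
        return []
    out = []
    for a, b in zip(values, values[1:]):
        for step in range(factor):
            # step == 0 reproduces the original sample; later steps are new samples.
            out.append(a + (b - a) * step / factor)
    out.append(values[-1])
    return out
```

For example, `oversample_linear([0.0, 1.0, 0.0], 2)` returns `[0.0, 0.5, 1.0, 0.5, 0.0]`, in which the original samples are retained and one new sample is placed between each pair of neighbors.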
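$$\left[R_{0,0}+R_{0,1}+R_{0,2}+R_{0,3}+R_{0,4}+R_{0,5}+R_{0,6}+R_{0,7}\right]+\left[e_{0,0}+(e_{0,1}-e_{0,0})+(e_{0,2}-e_{0,1})+(e_{0,3}-e_{0,2})+(e_{0,4}-e_{0,3})+(e_{0,5}-e_{0,4})+(e_{0,6}-e_{0,5})+(e_{0,7}-e_{0,6})\right] \qquad \text{Equation (1)}$$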
The resulting sequence of Equation (1) may include the form of the original (noise-free) signal and the noise shaped error signal, where the noise shaping module 118 may apply first order differentiation. The noise shaping module 118 may apply a variety of other noise shaping techniques.
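As a worked illustration of the first order differentiation in Equation (1), and assuming that e0,j denotes the error associated with the cell R0,j (an interpretation that is not stated explicitly above), the bracketed error terms may be seen to telescope when the cell values are summed:

```latex
\begin{aligned}
e_{0,0} + \sum_{j=1}^{7}\left(e_{0,j} - e_{0,j-1}\right)
  &= e_{0,0} + \left(e_{0,7} - e_{0,0}\right) \\
  &= e_{0,7}
\end{aligned}
```

Under this assumption, the error in the summed result may be limited to the single term e0,7, rather than an accumulation of eight separate error terms.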
The apparatus 100 may be used, for example, with respect to MAC operations related to signal processing areas such as discrete-time Fourier transform, discrete-time wavelet transform, a finite impulse response (FIR) or infinite impulse response (IIR) digital filter, cross-correlation, etc. The apparatus 100 may also be used, for example, with respect to other types of MAC operations, where dot-product operations are used. For example, dot-products may be used in Neural Networks (NN), where the k points in a layer's computation may be specified as follows:
$$y_k[n]=f\left(\sum_i w_i\cdot x_i[n]+\theta_k\right) \qquad \text{Equation (2)}$$
For Equation (2), θ may represent an offset (constant), and f(x) may represent a non-linear operation (e.g., an operation used to avoid clipping y or other behaviors when the sum becomes very large or very small, instead implementing a gradual, smooth approach to the limits of its range).
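Equation (2) may be evaluated as in the following sketch. The hyperbolic tangent is used as one example of a non-linear operation that approaches its limits gradually; this choice, the function name, and the parameter names are assumptions.

```python
import math


def neuron_output(weights, inputs, offset, activation=math.tanh):
    """Evaluate f(sum of w_i * x_i + offset) for one output at one time step."""
    dot_product = sum(w * x for w, x in zip(weights, inputs))  # the MAC portion
    return activation(dot_product + offset)
```

With the assumed hyperbolic tangent, the output remains within the range of -1 to 1 even when the sum becomes very large or very small.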
The modules and other elements of the apparatus 100 may be machine readable instructions stored on a non-transitory computer readable medium. In this regard, the apparatus 100 may include or be a non-transitory computer readable medium. In addition, or alternatively, the modules and other elements of the apparatus 100 may be hardware or a combination of machine readable instructions and hardware.
At block 1104, the method may include sampling the input signal 104 to determine sampled input signal values 106 related to the input signal 104.
At block 1106, the method may include using a plurality of RRAM cells 110 on which are stored kernel values to perform a MAC operation on the sampled input signal values 106 by oversampling the input signal 104 and kernel values stored in the RRAM cells 110, and noise shaping the kernel values stored in the RRAM cells 110.
According to an example, for the method 1100, using the plurality of RRAM cells 110 on which are stored kernel values to perform the MAC operation on the sampled input signal values 106 may further include determining, for each of the sampled input signal values 106 and corresponding kernel values stored in the RRAM cells 110, a product of a sample input signal value of the sampled input signal values 106 and a corresponding kernel value of the corresponding kernel values, and determining a sum of the products of each of the sampled input signal values 106 and the corresponding kernel values.
According to an example, for the method 1100, using the plurality of RRAM cells 110 on which are stored kernel values to perform the MAC operation on the sampled input signal values 106 by oversampling the input signal 104 and the kernel values stored in the RRAM cells 110 may further include oversampling the input signal 104 and the kernel values stored in the RRAM cells 110 by a predetermined oversampling value to increase an available bandwidth corresponding to the oversampling value.
At block 1204, the method may include sampling the input signal 104 to determine sampled input signal values 106 related to the input signal 104.
At block 1206, the method may include using a plurality of memristors on which are stored weight values to perform an operation on the sampled input signal values 106 by oversampling the input signal 104 and weight values 116 stored in the memristors, and noise shaping the weight values 116 stored in the memristors.
According to an example, for the method 1200, the operation may include a MAC operation, and the weight values 116 may represent coefficients for a convolution related to the MAC operation.
At block 1304, the method may include sampling the input signal 104 to determine sampled input signal values 106 related to the input signal 104.
At block 1306, the method may include using a plurality of RRAM cells 110 on which are stored weight values 116 to perform an operation on the sampled input signal values 106 by noise shaping weight values 116 stored in the RRAM cells 110 by writing an intended weight value of the weight values 116 into an RRAM cell of the RRAM cells 110, and determining an error between the intended weight value and a written weight value from the RRAM cell.
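The write-and-measure step may be sketched as follows, under the assumption that the error determined for one cell is subtracted from the value targeted for the next cell (a first order error feedback). The device model `coarse_cell`, its step size, and all function names are hypothetical and do not represent an actual RRAM interface.

```python
def write_with_noise_shaping(intended_weights, write_cell):
    """Write weights one cell at a time, carrying each write error into the next write.

    write_cell(target) is a hypothetical callable that returns the value actually stored.
    """
    written = []
    carried_error = 0.0
    for intended in intended_weights:
        target = intended - carried_error   # compensate for the previous cell's error
        stored = write_cell(target)         # write, then read back the written value
        carried_error = stored - target     # error between targeted and written values
        written.append(stored)
    return written


def coarse_cell(target, step=0.1):
    """Toy device model (assumption): a cell settles only on multiples of `step`."""
    return round(target / step) * step


intended = [0.23] * 8                       # hypothetical oversampled weight values
shaped = write_with_noise_shaping(intended, coarse_cell)
naive = [coarse_cell(w) for w in intended]  # each cell written independently
```

In this toy model, the sum of the noise shaped cell values stays within one half step of the sum of the intended values, whereas the independently written cells accumulate an error of approximately 0.24.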
The computer system 1400 may include a processor 1402 that may implement or execute machine readable instructions performing some or all of the methods, functions and other processes described herein. Commands and data from the processor 1402 may be communicated over a communication bus 1404. The computer system may also include a main memory 1406, such as a random access memory (RAM), where the machine readable instructions and data for the processor 1402 may reside during runtime, and a secondary data storage 1408, which may be non-volatile and may store machine readable instructions and data. The memory and data storage are examples of computer readable media. The memory 1406 may include a discrete-time analog filtering module 1420 including machine readable instructions residing in the memory 1406 during runtime and executed by the processor 1402. The discrete-time analog filtering module 1420 may include the modules of the apparatus 100.
The computer system 1400 may include an I/O device 1410, such as a keyboard, a mouse, a display, etc. The computer system may include a network interface 1412 for connecting to a network. Other known electronic components may be added or substituted in the computer system.
What has been described and illustrated herein is an example along with some of its variations. The terms, descriptions and figures used herein are set forth by way of illustration only and are not meant as limitations. Many variations are possible within the spirit and scope of the subject matter, which is intended to be defined by the following claims—and their equivalents—in which all terms are meant in their broadest reasonable sense unless otherwise indicated.
This invention was made with Government support. The Government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2015/028208 | 4/29/2015 | WO | 00 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2016/175781 | 11/3/2016 | WO | A |
Number | Date | Country | |
---|---|---|---|
20170221579 A1 | Aug 2017 | US |