The present invention relates to software and execution environments loaded in a digital apparatus and, in particular, a method of dynamic protection of data, in particular in view of attacks due to a fault, during the execution of an application compiled in intermediate language on a digital apparatus equipped with a virtual execution machine.
The invention applies to any type of digital apparatus, portable or not, such as a computer but also any apparatus equipped with a microcontroller comprising a processor and storage devices, such as a smart card.
The desire to create inter-operable applications has resulted in the development of intermediate programming languages. The main aim of said languages is thus to make their software independent of the hardware on which they must be executed. The software is therefore planned for being executed in the form of an intermediate code independent of the underlying architecture.
The programmers are thus globally freed of the constraints linked to specific hardware. The intermediate languages such as the Java bytecode, obtained following compilation of the Java source language, have thus undergone considerable development. The MSIL (acronym for “Microsoft intermediate language”) intermediate language can also be cited: it is deployed within the framework of the Net or DotNet (registered trademark) environment, obtained following compilation of various possible source languages such as C++ or C#.
The intermediate code therefore conventionally corresponds to a compiled form of the software. Said software, compiled in Java, or in other intermediate languages, such as Net, cannot be executed as such by the processor of the apparatus on which one wishes to execute a program compiled in the form of intermediate code. It is necessary to introduce a software layer having for main function of interpreting the intermediate code in instructions that can be executed by the processor of the host apparatus. Said software layer is known as a “virtual machine”. For example, the JAVA virtual machine allows Java software to be executed on a given platform on which it is implemented.
Java software is conventionally distributed in the form of a set of modules consisting of class files, corresponding to a compiled form of the software. Each compiled file corresponds to a class type data structure and comprises in as much the information for said class: i.e., the description of the elements of the class (its constants, its fields, its methods), the description of the elements used by the class and defined in other classes (fields and methods), the code of the methods of the class in the form of instructions that can be interpreted (bytecode) by the interpreter of the Java virtual machine.
In a simplified manner, a class file has for example the following structure:
Thus, by way of example, within the framework of a Java electronic purse application, a class known as “Purse” may be defined, with its “balance” field and its “decrementBalance( )” method. The class file may have the following structure:
Thus, according to said example, the execution of the method of the “Purse” class consists in removing the value 1 at the current instance of the balance field.
The execution of the corresponding Java software is carried out by the Java virtual machine installed on the digital apparatus.
Said virtual machine transforms the information from the ClassFile into work storage data structures which are specific to the virtual machine and which allow said virtual machine to interpret and execute the software. Said data structures are conventionally organised in the form of stacks, wherein the data are stored one on top of the other in the order wherein they arrive.
The virtual machine has a set of specific instructions, each instruction being coded on one or more octets. The set of instructions of the virtual machine comprises for example a certain number of conventional instructions such as the arithmetic, logic operations and jumps.
Currently, the execution of an application by the virtual machine is not entirely secured, in particular in relation to attacks in general and, in particular, attacks due to a fault. Thus, the sensitive value in the abovementioned example, shown in the “balance” field, may be modified following the injection of a fault during the manipulation of said value for the interpretation of the method for updating said field by the virtual machine, resulting in setting a final value in the execution stack for the balance field different to the one normally expected. For example, the balance field may be forced to its maximum value.
A known manner of combating said type of attacks due to a fault consists of adding a protection device at the level of the application, in the form of a redundant control code inserted within same of the code of the application having to be executed by the virtual machine.
For example, in order to protect the preceding code, it is possible to use in Java, as a checksum, the complemented value of the balance field, noted ˜balance, which sends the complement to 1 of the binary balance value, and to update said checksum value at the same time with the update of the balance value during the execution of the method by the virtual machine. The comparison of the result of the two update calculations carried out at the same time, on one hand for the balance field and, on the other hand, the complemented ˜balance field, therefore allows the integrity of the data to be verified which were used to establish the calculations. The java code modified accordingly therefore has the following structure:
The disadvantage of such a method for securing the execution of the Java code is that it is highly resource and calculation time consuming, which is unfavourable in the constraint environments such as the smart cards.
Furthermore, said solution implies that the code developers are familiarised and are conscious, at the time of the development, of the manner of protecting the sensitive parts of the code. Therefore, in reality it is restrictive for the developers to add code in order to protect their application.
The invention aims to resolve one or more of said disadvantages.
The object of the invention is thus a method for protecting the data of an application compiled in intermediate code planned for being executed on a digital apparatus equipped with a virtual machine managing the execution of the code via an execution stack defining at least one stack frame corresponding to a method called during the execution of the code.
According to the invention, the method includes the application of a secured execution mode of the code, comprising:
According to one particular embodiment, the method comprises the division of each stack frame into a plurality of areas and the determination of a plurality of global checksums (gck0, gck1, gck2, gck3) respectively associated with each area of the stack frame.
According to said particular embodiment, the local checksum associated with the manipulated datum comprises the determination beforehand of the global checksum associated with the area of the stack frame wherein the manipulated datum is stored.
Advantageously, the calculation of the local checksum associated with the manipulated datum is performed on one hand, based on the global checksum associated with the area of the stack frame wherein the manipulated datum is stored and, on the other hand, on other data of said area of the stack frame.
Preferably, each stack frame is broken down into two areas, respectively corresponding to a local variable area of the method called and an operand stack area of said method.
According to one embodiment, the method is implemented in a software manner.
According to another embodiment, the method is implemented in a hardware manner.
The invention also concerns a virtual machine for executing an application compiled in intermediate code on a digital apparatus, characterised in that it is likely to be stored in a non-volatile storage device of the apparatus and likely to implement the method according to the invention.
The invention again concerns a digital apparatus comprising a non-volatile storage device storing in memory the virtual machine according to the invention.
Other special features and advantages of the invention will become apparent from reading the description made by way of non-limitative example and in relation to the following figures wherein:
a and 2b illustrate a first example of embodiment of the invention;
a to 3d illustrate a second example of embodiment of the invention;
The invention therefore aims to protect the execution of an application compiled in intermediate code on a digital apparatus equipped with a virtual execution machine for the interpretation of the code of the application.
The protection mechanism according the present invention acts, as will be seen later in greater detail, at the level of the stack frame, showing a reserved storage block, in the execution stack of the virtual machine, during the execution of a method of the code.
The general principle of the invention therefore consists of adding a safety mechanism at the level of each stack frame of the virtual machine, with the objective of being able to guarantee at any time the integrity of the data maintained in the associated stack frame during the execution of a method.
For this, with each stack frame of the virtual machine such as illustrated in
Then, each time that a datum of a stack frame, i.e. an element of the operand stack 13 or local variables 11, is unstacked for any operation, an associated local checksum may therefore be calculated dynamically, using on one hand, the global checksum associated with the stack frame concerned, calculated beforehand and maintained in the stack frame and, on the other hand, the other data of the stack frame concerned.
It will be advisable, each time that a datum of a stack frame is manipulated by the virtual execution machine, to update accordingly the global checksum associated with the stack frame.
The general principles explained above on which the invention is based will now be described in greater detail. For this, are first provided hereafter definitions useful for the description of the calculation functions of the global checksums, associated with each stack frame, and local checksums, likely to being associated with each datum of a stack frame based on the associated global checksum.
Indeed, for the purpose of the invention, it is necessary to define two types of function, one of which is planned for acting globally on a stack frame, whereas the other allows the calculations executed by the virtual machine on one or more data to be monitored locally, in particular based on the global checksum associated with the stack frame corresponding to the data concerned. By using such a combination of functions, it will be seen that it is therefore possible to guarantee the integrity of the data manipulated by the virtual machine at any time of the execution of the code.
Either V the space of values on the s long registers. For example, for the 32-bit long registers, the following is obtained:
V=[−2147483648, 2147483647]=[−231, 231−1]
The GCK global checksum calculation function associated with a stack frame is defined in the following manner:
GCK: V×V× . . . ×V--------->V
S0, S1, . . . , Sn−1, being all of the data constituting a given stack frame.
The LCK local checksum calculation function associated with the S0 datum is defined in the following manner:
LCK: V---------->V
The GCK function must have the following attribute:
Either the GCK and LCK function such as above defined, we will consider any F function such as, for any data set (S0, . . . , Sn−1):
GCK(S0, S1, . . . , Sn−1)=F(LCK(S0), . . . , LCK(Sn−1))
By way of example, by using the ˜ complement operator applied to the sum of all of the elements of a given stack frame, the following is defined:
GCK: V×V× . . . ×V---------->V
LCK: V------------>V
Or the F function defined by:
F: V×V× . . . ×V----->V
Therefore:
GCK(S0, S1, . . . , Sn−1)=˜(S0+ . . . +Sn−1)=˜S0+ . . . +˜Sn−1+n−1=F (˜S0, . . . , ˜Sn−1)=F(LCK(S0), . . . , LCK(Sn−1)).
From there, if the gck value of the global checksum associated with a stack frame (gck=GCK(S0, S1, . . . , Sn−1)) is known, it is therefore easy to calculate the LCK(Si) local checksum associated with an Si datum of the stack frame accessed by the virtual machine, based on the gck value associated with the stack frame concerned and the remaining Sj data of the stack frame (j≠i[[i]]), by using the following operation:
LCK(Si)=˜Si=GCK(S0, S1, . . . , Sn−1)−(˜S0+ . . . +˜Si−1+˜Si+1+ . . . +˜Sn−1+n−1)
According to the example, in order to determine the local checksum associated with an Si datum of the stack frame, the complemented sum of the other data constituting the stack frame is therefore removed from the gck global checksum value.
This shows that the LCK(Si) local checksum associated with the Si datum, may be calculated without explicitly using the value of the Si datum, but only bringing into play the gck global checksum associated with the stack frame corresponding to the Si datum and the other data of the stack frame.
Said attribute is particularly advantageous in view of reinforcing the safety of the execution of applications within the context of the invention. Indeed, even if an attacker is capable of modifying the value of a datum of a stack frame at the time wherein said latter is read by the virtual machine, the associated local checksum not being calculated by using the value of the datum in question, it will therefore be possible to detect that the datum in question has been modified by comparing it with the value originating from the checksum.
a and 2b allow said matter to be illustrated more accurately. Let us presume that the content of a stack frame 10 is described by stacking cells from
The gck global checksum associated with said stack frame may be calculated as being the sum complemented with the elements of the stack frame, by using the example of the GCK function provided above based on the ˜complement operator applied to the sum of all of the elements of the stack frame:
gck=˜(a+b+c+d)
Let us now presume that the a datum is unstacked from the top of the stack frame 10 as illustrated in
By applying the attributes linked to the ˜ operator, the following has been obtained:
gck=˜a+˜b+˜c+˜d+3
As already explained, the LCK(a) local checksum associated with the current manipulated a datum may therefore be calculated based on the gck global checksum and the data remaining in the stack frame, such that the following is obtained:
LCK(a)=gck−˜(b+c+d)−1=˜a
The LCK(a) local checksum may for example be calculated if the system requests a verification of the manipulated a datum. In this case, the value of the manipulated datum will be compared with the value of the datum derived from the ˜a associated local checksum. As the a datum, in the example, has been modified into a′ following an external attack, the comparison with the value obtained of ˜a therefore indicates to the system that an error has occurred, therefore allowing it to react.
Said verification step of the integrity may be implemented optionally. Thus, it may be envisaged to defer it, to perform it at predetermined or even random times.
The virtual machine is also capable of guaranteeing the integrity of the data itself during a calculation. Said mechanism is more specifically illustrated by the example of
Let us presume a stack frame 10, constituted by stacking the b, a, c and d data, as illustrated in
gck=˜(a+b+c+d)
Let us presume that the execution of the method involves the d+c calculation. The first argument of the addition constituted by the d datum is therefore initially unstacked and, by using the abovementioned principles, its associated LCK(d) local checksum may be calculated based on the gck global checksum associated with the stack frame corresponding to the data remaining in the stack frame:
LCK(d)=gck−˜(b+a+c)−1=˜d
Then, the gck global checksum associated with the stack frame is updated before continuing the calculation operation, in order to reflect the new state of the stack frame, the d datum having been unstacked. For this, the value of the local checksum associated with the d datum is removed from the current value of the global checksum:
gck=gck−(˜d)−1=˜(a+b+c)
The calculation operation is therefore continued and, as illustrated in
LCK(c)=gck−˜(b+a)−1=˜c
In the same way as before, the global checksum associated with the stack frame must be updated in order to reflect the new state of the stack frame, wherein the c datum has been unstacked. Therefore the value of the local checksum associated with the c datum is removed from the current value of the global checksum:
gck=gck−(˜c)−1=˜(a+b)
At said stage, the following four data are therefore manipulated by the virtual machine: c, ˜c, d and ˜d, which therefore allows it to be checked that the data brought into play in the calculation have not been modified at the time wherein they are read, in the same manner as explained in reference with the example of
The (c+d) addition operation is therefore performed. The integrity of said operation may be checked by using the LCK(c) and LCK(d) local checksums, respectively associated with the c and d data. Indeed, the corresponding ˜(c+d) checking operation is written: ˜c+˜d+1, according to the arithmetic attributes of the ˜ complement operator. Thus, advantageously the LCK(c+d) local checksum associated with the result of the (c+d) operation may be calculated, without having to use the values of the c and d data directly, but only by using the LCK(c) and, LCK(d) local checksums previously calculated and associated with the data intervening in the operation. Furthermore, as already seen, the LCK(c) and LCK(d) local checksums have themselves been calculated based on the gck global checksum, without having to use the values of the c and d data.
Finally, as illustrated in
gck=gck+(˜(c+d)+1)=˜(a+b)+˜(c+d)+1=˜(a+b+c+d)
In the abovementioned examples, the ˜ complement operator was used for calculating the local checksums. The invention may however be generalised by using the following generalisation of the ˜ operator:
Either O the V set, with the following conventional arithmetic and logic operations (+, −, *, /, &, |).
Or P the V set with all of the operations symbolised in the following manner (⊙, ♦, □, ◯).
A Chk checksum operator is defined in the following manner:
Chk: O-------------→P, such that for any op operation in the set (+, −, *, /, &, |), there is a cop corresponding operation in the (⊙, ♦, □, ◯) set such that:
Chk(a op b)=Chk(a) cop Chk(b)
The ˜ operator previously described of the complement at 1 is therefore a specific case of the Chk function:
˜: (V, (+, −,*, /, &, |))----------→(V, (⊙♦□◯))
with
˜(a+b)=˜a˜b=˜a+˜b+1
Mathematically, the Chk operator may be defined as a group morphism of the O space towards the P space, respecting all of the operations in P.
The principle of the invention therefore consists of associating with each stack frame of the virtual execution machine a global checksum and, each time that a datum is accessed, calculating a local checksum associated with said datum by using the global checksum associated with the stack frame concerned.
In the examples of embodiments provided above, the calculation of the global checksum associated with a stack frame consists of calculating the complemented sum of all of the elements of the stack frame. In addition, in secured mode, each time that a datum of the stack frame is accessed, its associated checksum is calculated, by using the global checksum to which is subtracted the complemented sum of the other elements of the stack frame. Also, if the length of the stack frame is substantial, said calculations may prove to be unfavourable in terms of performances (slowing down of the execution of the code in secured mode) in environments restricted in calculation resources.
Also, in the aim of reducing the number of operations required for calculating a local checksum based on a global checksum or for calculating the update of the global checksum, an improvement of the invention consists of using a plurality of global checksums associated with the same stack frame.
According to a first variant illustrated in
According to another variant presented in
According to said embodiment, each time that a datum of the stack frame is manipulated (read or added), it is first necessary to determine which area of the stack frame is concerned, in order to use the global checksum associated with said area in order to deduce the local checksum associated with the manipulated datum.
Each global checksum associated with an area of the stack frame is in fact identified by an index (0 to 3 according to the example in
index=(jSP−base)/n
Wherein jSP represents the pointer on the current element of the stack frame, the base represents the pointer on the base of the stack frame and n represents the number of elements of an area of fixed length of the stack frame.
In reference to the example of
The calculation of the local checksum associated with the a datum is therefore carried out as previously explained, by using the gck1 global checksum previously determined and by advantageously taking into account only the data of the area concerned, in comparison with all of the data of the stack frame.
The calculation of a local checksum meets the following formula:
the Si symbolising the elements of the stack frame.
The calculations of the local and, global checksums such as they have been described may be carried out in a software manner. However, it has been seen that when the stack frames corresponding to the methods called during the execution of the code are of a considerable length, said calculations may prove to be excessively long. The abovementioned variant, consisting of dividing the stack frame into a plurality of areas with each one of which is associated a global checksum, therefore advantageously allows the calculation time to be reduced.
Another solution is based on a hardware implementation of the calculation of checksums according to the invention. One example of hardware implementation is described in
The logic circuitry 30 is for example wired in order to execute the sum of all of the registers selected. The automatic determination of a checksum via the wired logic thus allows the rapidity of the calculations to be significantly increased in relation to a purely software implementation when the n number is high.
The set of r1 to rn registers is therefore used as a storage device allowing the corresponding data of a stack frame which is mapped on the storage area 20 to be stacked and unstacked. An F set of F1 to Fn indicators is further planned, wherein each F1 to Fn indicator is respectively associated with an r1 to rn register, allowing an update of the stack frame to be indicated. For example, when a datum is placed on the stack, the bit of the corresponding indicator is placed at 1 and when a datum is unstacked, the bit of the corresponding indicator is placed at 0. Thus, on each change detected in the set of F indicators, the checksum is recalculated via the logic circuitry 30 based on the content of all of the ri to rn registers of which the bit of the corresponding indicator is placed at 1.
The hardware set 20, 30, 40, 50 cooperates with an interface device 60 connected to a processing unit of the component on which the code is executed. The operations of adding and removing data on a stack frame may therefore be performed, checked via the hardware configuration described.
Thus, the data set of a stack frame being mapped in the storage area 20, when a datum must be removed, the current checksum, known as CHK1, associated with the stack frame is first read from the register 40. Then, a ‘read’ signal indicating to the interface device 60 that a datum is removed, said latter commands the updating of registers 20 and indicators 50 on the database of ADR address and DAT data signals received by the processing unit and an updated checksum value, known as CHK2, is automatically calculated and placed in the check register 40. Thus, based on the CHK1 and CHK2 checksums automatically calculated, the system may determine the local checksum associated with the datum removed in order to verify its integrity.
In the same way, when a datum must be added to the stack, the current checksum, known as CHK1, associated with the stack frame is first read from the register 40. A ‘write’ signal therefore indicates to the interface device 60 that a datum is added, thus commanding the update of the registers 20 and indicators 50 on the database of ADR address and DAT data signals received by the processing unit. Once the a datum has been loaded in the corresponding register of the storage area 20, the checksum is automatically recalculated and the updated CHK2 value is placed in the check register 40. Thus, based on the CHK1 and CHK2 checksums automatically calculated, the system may determine the local checksum associated with the datum added in order to verify its integrity.
Number | Date | Country | Kind |
---|---|---|---|
06291183 | Jul 2006 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP2007/057417 | 7/18/2007 | WO | 00 | 1/16/2009 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2008/009697 | 1/24/2008 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
4524416 | Stanley et al. | Jun 1985 | A |
4697233 | Scheuneman et al. | Sep 1987 | A |
4866599 | Morganti et al. | Sep 1989 | A |
5471487 | Andrighetti | Nov 1995 | A |
5644709 | Austin | Jul 1997 | A |
5805920 | Sprenkle et al. | Sep 1998 | A |
6477702 | Yellin et al. | Nov 2002 | B1 |
6742081 | Talagala et al. | May 2004 | B2 |
6785886 | Lim et al. | Aug 2004 | B1 |
7117485 | Wilkinson et al. | Oct 2006 | B2 |
7272748 | Conover et al. | Sep 2007 | B1 |
7302550 | Merck | Nov 2007 | B1 |
7392404 | Montgomery et al. | Jun 2008 | B2 |
7426721 | Saulpaugh et al. | Sep 2008 | B1 |
7430670 | Horning et al. | Sep 2008 | B1 |
7444644 | Slaughter et al. | Oct 2008 | B1 |
7581089 | White | Aug 2009 | B1 |
7716555 | Andou | May 2010 | B2 |
7945953 | Salinas et al. | May 2011 | B1 |
20010044912 | Francis et al. | Nov 2001 | A1 |
20020112227 | Kramskoy et al. | Aug 2002 | A1 |
20020144115 | Lemay et al. | Oct 2002 | A1 |
20020161972 | Talagala et al. | Oct 2002 | A1 |
20030028742 | Hameau et al. | Feb 2003 | A1 |
20030177435 | Budd et al. | Sep 2003 | A1 |
20030217277 | Narayanan | Nov 2003 | A1 |
20030226022 | Schmidt et al. | Dec 2003 | A1 |
20040107200 | Sjogren et al. | Jun 2004 | A1 |
20040255146 | Asher et al. | Dec 2004 | A1 |
20050076226 | Boivie et al. | Apr 2005 | A1 |
20050246511 | Willman et al. | Nov 2005 | A1 |
20050283770 | Karp et al. | Dec 2005 | A1 |
20060136720 | Armstrong et al. | Jun 2006 | A1 |
20060179483 | Rozas | Aug 2006 | A1 |
20060195824 | Iwamoto | Aug 2006 | A1 |
20070005992 | Schluessler et al. | Jan 2007 | A1 |
20070101236 | Bauerle et al. | May 2007 | A1 |
20070165937 | Markov et al. | Jul 2007 | A1 |
20070285271 | Erlingsson et al. | Dec 2007 | A1 |
20070294496 | Goss et al. | Dec 2007 | A1 |
20080146178 | Lu et al. | Jun 2008 | A1 |
20080232582 | Chevallier-Mames et al. | Sep 2008 | A1 |
20090144561 | Davidson et al. | Jun 2009 | A1 |
20120310892 | Dam et al. | Dec 2012 | A1 |
Number | Date | Country |
---|---|---|
WO 0188705 | Nov 2001 | WO |
WO 2005101725 | Oct 2005 | WO |
WO2007085785 | Aug 2007 | WO |
Entry |
---|
Gelernter, D., “Truth, Beauty, and the Virtual Machine”, Discovery Magazine, Sep. 1, 1997, all pages, http://discovermagazine.com/1997/sep/truthbeautyandth1217/article—print. |
Shavit et. al., “Definition: Virtual Machine (VM)”, Jul. 2007, all pages, http://searchservervirtualization.techtarget.com/definition/virtual-machine. |
“Virtual Memory Tutorial”, Jul. 1, 2006, all pages, http://www.cs.gmu.edu/cne/modules/vm/green/defn.html. |
Regev, H., “Definition: Virtual Memory”, May 1998, www.searchstoragetechtarget.com, all pages, http://searchstorage.techtarget.com/definition/virtual-memory. |
Josef Neliβen, “Buffer Overflows for Dummies”, May 1, 2002, all pages. |
Form PCT/ISA/210 (International Search Report) dated Sep. 6, 2007. |
Form PCT/ISA/237 (Written Opinion of the International Searching Authority) dated Sep. 6, 2007. |
Gilles Grimaud et al., “FACADE: A Typed Intermediate Language Dedicated to Smart Cards”, Software Engineering Notes, Sep. 6, 1999, vol. 24, No. 6, pp. 476-493, New York, New York, U.S., XP-002161892. |
Inria, “Verificard at INRIA”, INRIA Online, Sep. 19, 2003, 4 pages, XP-002418434. Retrieved from the Internet: URL:http://www-sop.inria.fr/lemme/verificard>. |
Simon Moore et al., “Balanced Self-Checking Asynchronous Logic for Smart Card Applications”, Microprocessors and Microsystems, 2003, vol. 27, pp. 421-430, XP-002418435. |
Number | Date | Country | |
---|---|---|---|
20090328231 A1 | Dec 2009 | US |