EP1576443A2

EP1576443A2 - Method of securing computer systems comprising a code interpretation module

Info

Publication number: EP1576443A2
Application number: EP03799637A
Authority: EP
Inventors: Patrice Hameau; Daniel Le Metayer
Original assignee: Trusted Logic SAS
Current assignee: Trusted Logic SAS
Priority date: 2002-12-24
Filing date: 2003-12-18
Publication date: 2005-09-21
Also published as: WO2004061622A3; FR2849232A1; FR2849232B1; US20060048230A1; WO2004061622A2; AU2003299355A1

Abstract

The invention relates to a method of securing computer systems comprising at least one code interpretation module and memory capacity for storing the code to be interpreted. For said purpose, the invention consists in making more difficult attacks involving physical measures and/or requiring a synchronisation with the interpreted code, by introducing variants into the interpreted code runtimes and the measurable physical prints.

Description

METHOD FOR SECURING COMPUTER SYSTEMS INCORPORATING A CODE INTERPRETATION MODULE.

The present invention relates to securing computer systems comprising at least one code interpretation module and memory storage capacities for the code to be interpreted.

Its purpose is more particularly to solve the problems of securing computer systems comprising at least one code interpretation module (a code being defined as a structured set of instructions) which will be called simply "interpreter" thereafter ( hardware interpreter: microcontroller, microprocessor or software: virtual machine) and memory storage capacities of the code to be interpreted (or "interpreted code").

Said code can be written directly by a programmer, be obtained automatically (what will be called "code generation") from a "source code" in a language which is generally of a higher level or even result a combination of automatic production and manual intervention.

In general, we know that most of the attacks recorded against such computer systems are based on physical measures (electromagnetic emission, etc.) during execution and require synchronization with the interpreted code. In other words, it is necessary for the intruder to determine when the interpreter is executing certain features of the code. Among these best known techniques, we can cite those developed to find a key in cryptographic algorithms by passive spying on the physical emission of a circuit: attacks of SPA type ("Simple Power Analysis") and DPA ("Differential Power Analysis ") in particular have been successfully used to discover DES (" Data Encryption Standard ") keys. For example, on an embedded Java platform ("Java Card", "JEFF", "J2ME", ...), these attacks can be used in an attempt to obtain information on secrets manipulated by the Java virtual machine. These secrets can concern confidential data as well as the Java code itself.

The invention therefore more particularly aims to eliminate these drawbacks.

It proposes, to this end, to make more difficult attacks based on physical measures and / or requiring synchronization with the interpreted code by introducing variants in the execution times of the interpreted codes and physical fingerprints (for example, and non-exclusive, electromagnetic emission, etc.) measurable.

According to the invention, this method essentially involves two types of variants in the execution times of the interpreted codes, in the following manner:

- by causing in certain places of an interpreted code derivations towards new portions of code (which do not belong to the original code) intended to complicate the synchronization and the physical imprint of the execution, or - by proposing a plurality of implementations of certain instructions, each requiring a different execution time and / or presenting a different physical fingerprint and providing an identical result, ensuring that two executions of this instruction within the same code can be carried out by two different implementations.

Thus, by introducing distortions in execution times and by modifying the physical effect of execution, the two types of variants above will make any attempt to correlate between the observed physical manifestations of an interpreted code and more difficult. its functionality.

Advantageously, this method will make the apparently executed code different on each execution, and will therefore make it more difficult to discover the actual code of the application.

This process may involve: • for the first variant:

- two methods of introducing "derivation codes",

- four embodiments of "derivation codes", • for the second variant:

- two methods of introducing "multiple implementations" of certain instructions,

- three embodiments of "alternative codes" with physical footprint and variable duration.

Concerning the first variant, the first mode of introducing "derivation codes" consists of introducing a specific instruction (s), called "derivation" (s) in certain particular places of the code. This introduction can be done either manually or automatically during code generation. In the latter case, the code generator can be guided, to produce these instructions, by annotations inserted by the programmer in the source code and making it possible to designate sensitive portions of code (for example, and in a nonlimiting manner, procedures of encryption or verification of access rights). The execution of a derivation instruction by the interpreter causes a connection to an associated derivation code. This first method can also be improved by attaching different levels of security to the bypass instructions and by associating them with bypass codes that are all the more complex (or defensive against security attacks described above) as their level of security is high.

Concerning the first variant, the second mode of introducing "derivation codes" consists in introducing the derivation code into the implementation of the interpreter itself: between the execution of two consecutive instructions of the code, the interpreter executes the derivation code, either systematically, selectively or randomly. It can for example execute this code only when calling certain sensitive methods (typically from libraries, called API "Application Program Interface").

The advantage of the first mode is that it makes it possible to selectively introduce derivative code executions, which leads to less penalty in terms of execution time if the number of such derivations is small. It also allows the implementation of so-called "discretionary" security policies, that is to say at the discretion of the applications.

On the other hand, the second mode will be more advantageous if the number of derivations desired is significant since the implementation of the method in the interpreter itself can then be optimized. Furthermore, it allows the implementation of security policies called "proxies" where controls are imposed uniformly on all applications.

The two preceding methods of introduction require the introduction of a derivation code. The invention provides four modes for achieving these bypass codes so that they introduce variations in execution times and measurable physical footprints.

Concerning the first variant, the first embodiment of "derivation codes" with physical footprint and variable duration consists in performing a so-called "superfluous" calculation depending on data known at runtime (which can therefore differ on each runtime). This superfluous calculation must have no effect on the final result of the execution of the interpreter. A simple example of such a calculation is a parity test of dynamic data (known at runtime) which can lead either to an empty action, or to the addition of an element from a stack followed by its immediate removal . It should be noted that the number of possible actions is not necessarily limited to two. A large number of possible actions will lead to significant variability in the execution time and the physical footprint of the derivation code.

The second embodiment of "derivation codes" improves the first mode by providing it with a random drawing of additional data during the execution of the superfluous calculation, said additional data being used in the calculation carried out by the code of derivation (for example in a test of said code). This random draw adds a new variable element and makes the execution time and the physical footprint of the derivation code even less predictable.

The third embodiment of "derivation codes" improves the efficiency of the two previous ones by replacing the test allowing the decision of the next action by a connection in a so-called indirection table, that is to say containing the addresses of possible actions, to an index calculated from variable elements (dynamic data and / or result of a random draw).

The fourth embodiment of "derivation codes" improves the first mode (and therefore the other three) by considering a superfluous calculation which, while remaining without effect on the final result, presents the external characteristics (physical imprint) of a particular sensitive calculation (for example encryption or decryption) unrelated to the actual code of the application. Such a superfluous calculation makes it possible to deceive an attacker who would try to deduce secrets by measuring the physical effect of the execution of the application. Such a process can be qualified as "software decoy" since its purpose is to mislead attackers by making them believe in the presence of said sensitive calculation in the effective code of the application. This mode can be carried out simply by implementing the sensitive calculation in question without retaining its result.

With regard to the second variant, the first mode of introducing "multiple implementations" of certain instructions consists in enriching all of the instructions recognized by the interpreter with a plurality of implementations for a given instruction. These implementations will be carried out so as to have different physical fingerprints and execution times while producing an identical result. Any of these implementations can be used interchangeably in the code. This use can be done either manually, by programming, or automatically during code generation. In the latter case, the code generator can be guided, to produce these instructions, by annotations inserted by the programmer in the source code and making it possible to designate sensitive portions of code (for example, and without limitation, procedures encryption or verification of access rights). This first mode can also be improved by attaching different levels of security to the implementations of instructions and by associating them with implementations which are all the more complex (or defensive with respect to security attacks) as their level of security. is high.

Concerning the second variant, the second method of introducing "multiple implementations" of certain instructions consists in including in the implementation the instruction itself implements a connection to a portion of alternative code which will dynamically determine the implementation to be executed.

The advantage of the first mode is to minimize the additional cost in terms of execution time since the choice of the implementation of the instruction to be applied is determined before execution. It also allows the implementation of so-called "discretionary" security policies, that is to say at the discretion of the applications.

The advantage of the second mode is to further complicate attacks requiring synchronization with the code since two consecutive executions of the same instruction (at the same location in the code) will be likely to take different execution times and offer fingerprints different physical. Furthermore, this second mode allows the implementation of security policies called "proxies" where controls are imposed uniformly on all applications.

The two modes are not mutually exclusive: an implementation can include a multiplicity of implementations for a given instruction, some of them (or all) being implemented by connecting to a portion of alternative code dynamically determining the implementation to execute.

The above second mode of the second variant requires the introduction of an alternative code associated with an instruction. The invention proposes three modes for producing this alternative code so that it introduces different implementations into the execution times and the physical footprint measured.

Concerning the second variant, the first embodiment of "alternative codes" with physical footprint and variable duration consists in proposing a plurality of different implementations of the instruction and in conditioning the choice of the version executed in a dynamic test, that is to say dependent on data known at runtime. A simple example of such a calculation is a parity test of dynamic data (known at runtime). A large number of implementations will lead to significant variability in the execution time and in the physical footprint of the alternative code.

The second embodiment of "alternative codes" improves the first mode by providing it with a random drawing of a data item which is then used for carrying out the test leading to the dynamic choice of the version executed. This random draw adds a new variable element and makes the execution time and the physical footprint of the alternative code even less predictable.

The third embodiment of "alternative codes" improves the efficiency of the two preceding ones by replacing the test making it possible to decide on the version chosen by a connection in an indirection table (containing the addresses of the versions available) to an index calculated at from variable elements (dynamic data and / or result of a random draw).

Thus, the introduction of variants in the execution times of the interpreted codes and therefore the physical fingerprints makes it more difficult to attack based on said physical fingerprints, by ensuring that a coded action in the implementation of the the application can have different electronic signatures and occur at variable execution times.

The implementation of the above interpreted codes will be carried out on software code interpretation modules such as virtual machines of the JAVA family, and on physical code interpretation modules of the microcontroller or microprocessor type.

Claims

1. Method for securing computer systems comprising at least one code interpretation module and memory capacities for storing the interpreted code having measurable physical fingerprints, characterized in that in order to make attacks based on measurements more difficult physical or requiring synchronization with the above interpreted code, it consists in introducing variants of execution of the interpreted code, said variants having an effect on the execution times of the interpreted code or its measurable physical fingerprints.

2. Method according to claim 1, characterized in that it comprises branches to new portions of code, called "branch codes", which do not belong to the original code.

3. Method according to claim 1, characterized in that it comprises a plurality of implementations of certain instructions, each claiming a different execution time or having a different physical footprint while providing an identical result.

4. Method according to claim 2, characterized in that it comprises a first mode of introduction of "derivation codes" consisting in introducing one (or more) instruction (s) specific (s) at certain particular places of the code, either manually or automatically when generating the above code.

5. Method according to claim 4, characterized in that the derivation instructions are associated with security levels which correspond to degrees of complexity of their derivation code, the most complex being considered as the most defensive with regard to security attacks requiring synchronization with the code or a measurement of its physical footprint.

6. Method according to claim 2, characterized in that it comprises a second mode of introduction of "derivation codes" consisting in introducing the derivation code in the implementation of the interpreter itself.

7. Method according to claim 6, characterized in that the derivation code introduced in the implementation of the interpreter is executed either systematically by the interpreter, either selectively or randomly.

8. Method according to claim 2, characterized in that it comprises a first embodiment of "derivation codes" consisting of performing a so-called "superfluous" calculation depending on data known at runtime.

9. Method according to claim 2, characterized in that it comprises a second embodiment of "derivation codes" consisting in providing the aforesaid first mode with a random drawing of additional data during the execution of the calculation superfluous, said additional data being used in the calculation carried out by the derivation code.

10. Method according to claim 8, characterized in that the aforesaid first embodiment of "derivation codes" is improved by attaching different levels of security to the implementations of instructions and by associating them with implementations of as much more complex.

11. Method according to claim 2, characterized in that it comprises a third embodiment of "derivation codes" consisting in replacing in the above first and second mode the test making it possible to decide the next action by a connection in an indirection table containing the addresses of the possible actions to an index calculated from the variable elements (dynamic data and / or result of a random draw).

12. Method according to claim 2, characterized in that it comprises a fourth embodiment of "derivation codes" consisting in performing a superfluous calculation having the external characteristics of a particular sensitive calculation.

13. Method according to claim 3, characterized in that it comprises a first mode of introduction of a plurality of implementations of certain instructions consisting in enriching all of the instructions recognized by the interpreter with a plurality of bets implemented for a given instruction; the above instructions are carried out either manually, by programming, or automatically when generating the code.

14. Method according to claim 3, characterized in that it comprises a second mode of introduction of the aforesaid plurality of implementations of certain instructions consisting in including in the implementation itself of the instruction a connection to a portion of at least one alternative code with physical footprint or variable duration which dynamically determines the implementation to be executed.

15. Method according to claim 14, characterized in that it comprises a first embodiment of the aforementioned alternative code consisting in proposing a plurality of implementations different from the instruction and by conditioning the choice of the version executed on a dynamic test, that is to say dependent on data known at execution.

16. Method according to claim 14, characterized in that it comprises a second embodiment of the aforementioned alternative code consisting in improving the aforesaid first embodiment of "alternative codes" by providing it with a random draw for the realization of the test leading to the dynamic choice of the version executed.

17. Method according to claim 14, characterized in that it comprises a third embodiment of the aforementioned alternative code consisting in improving the aforesaid first and second embodiments of "alternative codes" consisting in replacing the test making it possible to decide on the version chosen by connection in an indirection table containing the addresses of the versions available to an index calculated from the variable elements.

18. The method of claim 1, characterized in that it is implemented on a software code interpretation module said virtual machine.

19. The method of claim 18, characterized in that said virtual machine is a Java platform.

20. Method according to claim 1, characterized in that it is implemented on a physical code interpretation module.

21. Method according to claim 1, characterized in that it is implemented on an on-board system and on an interpretation module of the micro-controller or micro-processor type.