The present invention relates to a watermark insertion apparatus that inserts a watermark in a program in order to prevent and suppress illegal use and distribution of the program, and a watermark extraction apparatus.
With the advance of computer networks, it has become common for computer programs to be distributed via networks. As a computer program can easily be duplicated, there is a possibility of illegal secondary distribution of program duplicates, and theft of or tampering with algorithms in programs. There is thus a need to protect programs from such illegal use.
One example of a conventional program protection technology is a method whereby an electronic watermark is inserted in a program. With this method, a program is distributed with different watermark embedded for each distribution destination. Then, in the event of illegal use, watermark is extracted from the illegal user's program, and that watermark is analyzed. By this means, the source of circulation can easily be detected.
An actual watermark insertion method is disclosed, for example, in Unexamined Japanese Patent Publication No.2000-76064 (pages 3-4,
With this method, code with no dependency relationship to the order of execution is first detected. Next, a dummy variable operation is inserted in the detected part. Then the order of execution of the detected part containing the dummy variable operation is switched around randomly.
By performing such processing, a mechanism is implemented that changes this order of execution as electronic watermark for each distribution destination.
However, a problem with the conventional method of inserting an electronic watermark in a program is that it is easy to alter or delete a water mark based on collusion attack. “Collusion attack” is an attack method whereby watermark data insertion locations are identified by finding differences in a plurality of programs in which watermarks have been inserted.
When different watermark is inserted in a program for each distribution destination, if differences between programs distributed to each distribution destination are found, only the locations at which watermarks have been inserted will surface as differences. There is thus a problem in that watermark insertion locations can easily be identified and watermark can easily be deleted or altered.
It is an object of the present invention to prevent easy generation of a program that does not have a watermark and operates normally, by inserting a watermark in such a way that the watermark insertion location cannot be identified.
The present invention generates watermark from ID information that uniquely identifies a program distribution destination, inserts the generated watermark in a program, and prevents the program from operating correctly if the watermark is tampered with, and also inserts the same watermark verification code to examine whether the watermark is tampered with in a program regardless of the distribution destination.
By this means, it is possible to prevent detection of watermark verification code constituting a watermark by means of collusion attack. As a result, a distribution destination cannot generate a program that does not have a watermark and operates normally, and thus is not able to circulate a program illegally.
(Embodiment 1)
An illegal program distribution prevention system comprising a watermark insertion apparatus and watermark extraction apparatus according to Embodiment 1 of the present invention will now be explained with reference to the accompanying drawings.
First, at the time of program distribution, the distribution source 10 performs distribution with a different watermark inserted by means of a watermark insertion apparatus 20 for each of distribution destinations 40a and 40b (it is assumed that secondary distribution by distribution destinations is not authorized).
By performing distribution with watermarks embedded in this way, in the event of program circulation via illegal secondary distribution, for example, distribution source 10 can confirm the distribution destination by extracting, by means of a watermark extraction apparatus 30, the watermark from the program circulated to the circulation destination 50, and identify circulation source (distribution destination) 40a or 40b.
Furthermore, distribution destinations 40a and 40b will fear identification as the circulation source, and will refrain from illegal secondary distribution.
In this way, the illegal distribution prevention system suppresses illegal distribution by means of watermarks.
Next, watermark insertion apparatus 20 according to Embodiment 1 will be described using
Watermark insertion apparatus 20 is provided with a program input section 201. Program input section 201 is a means of inputting program code that inputs a watermark. Program input section 201 outputs program code to a watermark insertion section 202.
Watermark insertion section 202 is a means of generating a watermark to be actually embedded in a program from ID information generated by an ID information generation section 205, and inputting the watermark to program code output from program input section 201. If the program code output by program input section 201 is source code, watermark insertion section 202 compiles the source code and passes the watermark input location to a watermark information storage section 206 as an assembler code line number.
A program output section 203 is a means whereby watermark insertion section 202 outputs input program code.
A watermark data input section 204 inputs watermark data. The input watermark data is information that uniquely specifies a distribution destination, comprising the distribution destination address, telephone number, company, e-mail address, and so on. Distribution source information may also be input in the watermark data.
ID information generation section 205 generates ID information that can be uniquely determined from watermark data input by watermark data input section 204. ID information may be the input data itself, or may be data obtained by encryption of the input data. Also, ID information may be an ID for uniquely specifying watermark data in a database holding watermark data.
In embodiments of the present invention, a mode is used whereby watermark is generated based on ID information, but it is not absolutely necessary for watermark to be generated based on ID information, and it is sufficient to be able to specify a distribution destination uniquely from watermark. For example, it is also acceptable to enable a distribution destination to be uniquely specified by inserting a sequence number 1 to N in software as watermark, distributing sequence number i software to distribution destination A, distributing sequence number j software to distribution destination B, and so forth.
Watermark information storage section 206 is a means of storing an insertion location of a watermark inserted by watermark insertion section 202. Specifically, the assembly code line number of the code in which a watermark is inserted is stored.
Next, watermark extraction apparatus 30 according to Embodiment 1 will be described using
Program input section 301 is a means of inputting a program in which a watermark is input.
A watermark detection section 302 disassembles a program output from program input section 301, and extracts an input watermark from the watermark insertion location (assembler code line number) obtained from a watermark information storage section 305. Watermark detection section 302 then generates ID information from the extracted watermark, and passes this ID information to an ID information storage section 304.
ID information storage section 304 is a means of generating distribution destination information from ID information obtained from watermark detection section 302. When ID information is a database data ID, ID information storage section 304 obtains distribution destination information by extracting data from the ID. When ID information is distribution destination information encryption data, ID information storage section 304 obtains distribution destination information by performing decryption.
Watermark information storage section 305 is a means of storing a watermark insertion location of a distributed program. Watermark insertion location information is obtained from watermark information storage section 206 of watermark insertion apparatus 20.
Output section 303 is a means of outputting obtained distribution destination information.
Next, watermark insertion section 202 according to Embodiment 1 will be described using
First, watermark insertion section 202 generates, by means of generation function F(1) , watermark X1 and X2 to be actually inserted into the program from ID information I generated from distribution destination 40 information (step 401).
Next, watermark insertion section 202 configures function F21 that outputs constant C1 and function F22 that outputs constant C2 when watermark X1 and X2 is used as input (step 402).
Watermark insertion section 202 then embeds in the program code an expression that assigns watermark X1 and X2 to variables val1 and val2 (step 403).
Watermark insertion section 202 then embeds in the program code an expression that assigns F21(val1, val2) to variable val3, and F22(val1, val2) to variable val4 (step 404).
Next, watermark insertion section 202 embeds in the program code as watermark verification code a conditional branch that determines whether variable val3 and constant C1 are equal and halts the program if they are not equal, and a conditional branch that determines whether variable val4 and constant C2 are equal and halts the program if they are not equal (step 405).
Watermark insertion section 202 then stores the locations at which watermark and watermark verification code were inserted in step 403 through step 405 in watermark information storage section 206 (step 406).
In this way, watermark insertion section 202 inserts in the program watermark and watermark verification code.
Watermark insertion section 202 inputs the expressions and conditional branches (watermark verification code) inserted in step 403 through step 405 in the order of execution of the program. A condition for F1 is that there should be an inverse function of F1 that generates I uniquely from X1 and X2, and a condition for F21 and F22 is that F21 (X1, X2)==C1 and F22(X1, X2) ==C2 should not hold other than for X1 and X2 (“==” indicates that the values are equal).
For example, a case will be considered in which ID information=12345678, F1 is a function that divides an 8-digit value into two values from the 4th digit, F21 (x, y) and F22(x, y) are 2-variable linear functions ax+by, C1=2345, and C2=5678.
In this case, watermark X1=1234 and X2 =5678 is first generated from F1. F21 and F22 are configured by finding a1, a2, b1, and b2 that satisfy the conditions a1×1234+b1×5678 =2345 and a2×1234+b2×5678 =5678. For example, values of a1=1, a2 =0.195667, a2=3.700972, and b2=0.195667 satisfy the conditions.
An example of program code generated when Embodiment 1 is applied is shown in
In
First, in step 403, watermark insertion section 202 inputs watermark X1a (1234) , X1b (5678) and X2a (1111), X2b (2222) generated from different ID information Ia (12345678) and Ib (11112222) into programs 500b and 500c (part indicated by reference numeral 501 in the drawing)
Next, in step 404, watermark insertion section 202 inserts mutually differing F21 and F22 respectively into watermark insertion programs 500b and 500c (part indicated by reference numeral 502 in the drawing).
Then, in step 405, watermark insertion section 202 embeds in the program code, as watermark verification code, a conditional branch that determines whether variable val3 and constant C1 (2345) are equal and halts the program if they are not equal (assert(0)), and a conditional branch that determines whether variable val4 and constant C2 (5678) are equal and halts the program if they are not equal (assert(0)) (part indicated by reference numeral 503 in the drawing).
The point to be noted here is that, when the differences between the two programs 500b and 500c are identified, parts 501 and 502 constituting watermark are detected, but conditional branches 503 constituting watermark verification code are not detected. Consequently, even if a watermark input location is detected by means of collusion attack on programs 500b and 500c, and alteration or deletion of the detected part is carried out, alteration or deletion cannot be performed on the conditional branches 503 constituting watermark verification code. Therefore, the watermark verification code part 503 does not meet the conditions, and the program no longer operates.
Thus, in the case of a simple method of altering or deleting only a location detected by means of collusion attack, it is possible to prevent acquisition of a program that operates normally when all watermarks are deleted.
For the sake of clarity, source code is used in
Also, in Embodiment 1, two items of watermark are generated from ID information, but it is also possible to generate three or more items of watermark.
Next, watermark detection section 302 according to Embodiment 1 will be described using
First, watermark detection section 302 disassembles program execution code (step 1001).
Next, watermark detection section 302 refers to watermark information storage section 305, obtains stored information in which the watermark insertion location in the program is stored (that is, a line number indicating the insertion location), and based on this, specifies the input location of watermark X1 and X2. Watermark detection section 302 then extracts watermark X1 and X2 from the program (step 1002).
Next, watermark detection section 302 generates ID information using the inverse function of function Fl used when generating watermark X1 and X2 (step 1003).
In this way, watermark detection section 302 obtains ID information and performs specification of distribution destination 40.
With the above method, if the code execution order is switched around by means of optimization or “obfuscating” (that makes reading more difficult) by an execution code distribution destination or circulation destination, it is possible that the assembler line number of a watermark input location will be changed, preventing acquisition of the watermark. In consideration of such a possibility, the processing in step 1002 may be changed to processing whereby an assignment instruction is sought in lines around the assembler line number indicating the insertion location, and the operand part of the assignment instruction is extracted.
As described above, according to Embodiment 1, watermark verification code (part 503 in
A mode is also possible in which the processing performed by watermark insertion apparatus 20 and watermark extraction apparatus 30 is in the form of a program and is executed by a general-purpose computer.
(Embodiment 2)
Embodiment 2 provides for a case where a person intending to distribute a program illegally attempts to alter or delete watermark verification code of Embodiment 1 by detecting watermark by means of collusion attack, detecting a location at which a variable generated by a function used in the detected watermark is used (part indicated by reference numeral 503 in
Specifically, watermark is used, and watermark verification code necessary to operate a program normally is inserted in the program.
By this means it is possible to prevent a program from being operated normally when watermark verification code using watermark is detected and altered or deleted by means of the above-described procedure.
Embodiment 2 is described in detail below. The difference between the watermark insertion apparatus in Embodiment 2 and watermark insertion apparatus 20 in Embodiment 1 lies in the operation of watermark insertion section 202.
Next, the operation of the watermark insertion section of Embodiment 2 will be described using
The operations in step 601 and step 602 are the same as the operations in step 401 and step 402 described in Embodiment 1, and therefore descriptions thereof are omitted here.
Next, the watermark insertion section generates function F3 that generates C3 so that C1+C2 +C3 =0 from watermark X1 and X2 (step 603).
The watermark insertion section then embeds in the program code an expression that assigns watermark X1 and X2 to variables val1 and val2 (step 604).
The watermark insertion section then embeds in the program code an expression that assigns F21 (val1, val2) to variable val3, and F22(val1, val2) to variable val4 (step 605).
Next, the watermark insertion section embeds in the program code as watermark verification code a conditional branch that determines whether variable val3 and constant C1 are equal and halts the program if they are not equal, and a conditional branch that determines whether variable val4 and constant C2 are equal and halts the program if they are not equal (step 606).
The watermark insertion section then embeds an expression that assigns F3(val1, val2) to variable val5 (step 607).
Then the watermark insertion section inserts in the program, as watermark verification code, code that adds val3+val4+val5 to a decision statement that determines original code 0 (step 608).
Watermark insertion section 202 then stores the locations at which watermark and watermark verification code were inserted in step 604 through step 608 in watermark information storage section 206 (step 609).
In this way, watermark insertion section 202 inserts a watermark in the program.
The points to be noted here are that variables val3, val4, and val5 detected by collusion attack are included in val3+val4+val5 inserted in step 608, and that val3+val4+val5 is inserted in the 0 part of the decision statement related to program operation. As a result, if an illegal user attempts to detect variables (val3, val4, val5) by means of collusion attack, and alter or delete a location using variables generated by a function using the detected variables, a decision statement related to program operation will also be altered or deleted. Thus, the program will not operate normally, and cannot be used illegally.
Next, program code generated by a watermark insertion section according to Embodiment 2 will be described using
In
In program 800b, watermark is inserted in the part indicated by reference numeral 701 in step 604, and calculation expressions (code) for watermark verification are inserted in the part indicated by reference numeral 702.
Then, in program 800b, the processing result of step 608 is inserted in the part indicated by reference numeral 703. Also, in program 800b, watermark verification code is inserted in the part indicated by reference numeral 704 in step 606.
The result of generating program 800b in this way is that, if a person attempting illegal use detects watermark verification code 703 from program 800b by means of collusion attack and alters or deletes the watermark verification code, since watermark verification code 703 is code related to the specifications (related to program input/output in the original code) , the program will not operate normally if this code is deleted.
In order to change only the watermark verification code 703 decision statement within the watermark, it is necessary to understand the program specifications and know that watermark verification code 703 is specification related code. It takes time to understand the structure of a program, and watermark deletion cannot be performed by means of mechanical processing.
The condition C1+C2+F3=0 need not apply. In this case, C1+C2+F3 can be inserted in a decision statement that uses the value obtained from C1+C2+F3. For example, if C1+C2+F3=1, 1 of a decision statement determining 1 is switched with C1+C2+F3.
As described above, according to Embodiment 2, if a location (part 703 shown in
(Embodiment 3)
Embodiment 3 alters code around a location at which watermark and watermark verification code are input, or all code, by performing processing such as “obfuscating.” Consequently, code other than a watermark is detected by collusion attack, thus enabling watermark alteration or deletion based on collusion attack to be prevented with certainty.
Embodiment 3 is described in detail below. The difference between the watermark insertion apparatus in Embodiment 3 and watermark insertion apparatus 20 in Embodiment 1 lies in the operation of watermark insertion section 202.
Next, the operation of watermark insertion section 202 of Embodiment 3 will be described using
First, watermark insertion section 202 assigns an initial value of 1 to variable i (step 800). Then the watermark insertion section divides ID information into n items of information, and generates watermark X(1), X(2) . . . (X)n (step 801).
Next, watermark insertion section 202 detects a loop section (while, for statements) in the program source code (step 802), and inserts watermark X(i) within the loop (step 803).
Watermark insertion section 202 then “obfuscates” the insertion location loop section by applying the method described in “Method for Scrambling Programs Containing Loops” (Monden et al., Technical Report of IEICE D-I, Vol. J80-D-I, No.7, pp.644-652, July 1997) (step 804). At this time, there are a number of variations in the program obfuscating method, and the variation is selected at random (or so as not to duplicate obfuscating executed on a program distributed in the past).
Then, watermark insertion section 202 determines whether variable i is less than or equal to the number of items of watermark n (step 805), and if variable i is less than or equal to n, increments variable i (step 806) and proceeds to the processing in step 802. If, on the other hand, variable i is determined not to be less than or equal to n in step 805-that is, if all watermark has been input-watermark insertion section 202 next compiles the source code, stores the assembler code line numbers at which watermark was input, outputs the program, and terminates processing (step 807).
Next, program code generated by watermark insertion section 202 according to Embodiment 3 will be described using
In programs 900b and 900c, implementation differs according to obfuscating, but the specifications (relationship to program input/output) are not changed. When the differences between programs 900b and 900c are identified, since program code has also been modified at locations other than the watermark location, non-watermark parts 902a and 902b are also detected as differences.
Therefore, in order to alter or delete the watermarks of programs 900b and 900c, it is necessary to analyze the programs and find out which parts are watermarks unrelated to the program specifications. Since determining whether a part is unrelated to the program specifications requires an understanding of the program specifications, it is difficult to mechanically delete a watermark embedded using this method.
As described above, according to Embodiment 3, the watermark insertion section also operates as an alteration means that performs obfuscating processing so that program specifications are not affected in parts other than a location at which a program watermark is inserted, so that non-watermark code in parts related to program specifications is detected by collusion attack. It is thus difficult to identify a watermark insertion location based on collusion attack. As a result, it is possible to prevent watermark alteration or deletion with certainty, and to prevent illegal program circulation. (Embodiment 4)
In Embodiment 4, a watermark insertion apparatus is provided at a distribution destination, and a watermark is given to a distributed program at the distribution destination.
The configuration of an illegal distribution prevention system according to Embodiment 4 is described below using
In this system, distribution source 1100 first distributes to distribution destinations 1110 and 1120 respectively ID information 1101 and ID information 1102 that uniquely determine distribution destinations 1110 and 1120 respectively.
In response to this, distribution destinations 1110 and 1120 store ID information 1101 and 1102 in watermark insertion apparatuses 20a and 20b.
Next, distribution source 1100 distributes a program 1103 to distribution destinations 1110 and 1120.
In response to this, distribution destinations 1110 and 1120 generate programs 1111 and 1121 in which watermarks are inserted in distributed program 1103 using watermark insertion apparatuses 20a and 20b.
Watermark insertion apparatuses 20a and 20b may be watermark insertion apparatuses according to any one of Embodiment 1 through Embodiment 3.
Thereafter, insertion apparatuses 20a and 20b transmit storage information 1104 and 1105 to distribution source 1100, and distribution source 1100 holds storage information 1104 and 1105.
If distribution destination 1110 performs illegal secondary distribution to circulation destination 1130, distribution source 1100 obtains the circulated program 1112, and inputs it together with storage information 1104 and 1105 to watermark extraction apparatus 30. Distribution source 1100 then acquires ID information 1107 specifying distribution destination 1110 or 1120 by means of watermark extraction apparatus 30. Distribution source 1100 next compares ID information 1101 and 1102 distributed to distribution destinations 1110 and 1120 with acquired ID information 1107, and identifies distribution destination 1110 or 1120 that illegally circulated the program.
As described above, according to Embodiment 4, it is possible to easily distribute a program to an unspecified number of distribution destinations, and insert watermarks at distribution destinations. This kind of mode is effective when applied to a system in which it is desirable to simply distribute programs only, such as program distribution using digital broadcasting, multicasting or broadcasting via an IP network, and so forth.
(Embodiment 5)
Embodiment 5 alters a program by adding dummy code that does not affect program specifications at a location at which a watermark insertion method or other method is implemented. As a result, code other than a watermark is detected at different locations when collusion attack is executed, thus enabling watermark alteration or deletion based on collusion attack to be prevented with certainty.
Next, a watermark insertion apparatus 1200 according to Embodiment 5 will be described using
The operation of program input section 201 of watermark insertion apparatus 1200 according to Embodiment 5 is identical to that of program input section 201 of watermark insertion apparatus 20 in other embodiments.
Watermark insertion apparatus 1200 is provided with a dummy method input section 1203 that inputs a redundant dummy method that does not affect execution of a program output by program input section 201. Dummy method input section 1203 outputs an input dummy method to a dummy method insertion section 1201.
Dummy method insertion section 1201 is a means of adding a dummy method input by dummy method input section 1203 as an area for embedding a watermark. Dummy method insertion section 1201 outputs a program to which a dummy method has been added to a dummy code insertion section 1202.
Dummy code insertion section 1202 is an alteration means of performing alteration without changing program specifications by inserting a dummy code pair not necessary for program execution results at locations at which all program methods (all methods including the dummy method) are implemented without affecting program execution. An example of dummy code that could be inserted is the PUSH/POP pair.
Watermark insertion section 202, program output section 203, watermark data input section 204, and ID information generation section 205 are means identical, respectively, to watermark insertion section 202, program output section 203, watermark data input section 204, and ID information generation section 205 of watermark insertion apparatus 20 in other embodiments.
Watermark information storage section 1204 stores information on the correspondence between characters, numeric values, and symbols used in watermarks and bit strings, and information on the correspondence between bit strings and instruction codes, for watermarks inserted by watermark insertion section 202. Watermark information storage section 1204 also holds a method name and line number as identification information for a dummy method used for watermark insertion. Moreover, when encrypted data is used as watermark data, watermark information storage section 1204 also stores key information for decryption of the data.
In this way, a watermark insertion location can easily be identified using identification information, and watermark can easily be detected.
Next, a watermark extraction apparatus 30 according to Embodiment 5 will be described. The difference between watermark extraction apparatus 30 according to Embodiment 5 and watermark extraction apparatus 30 in other embodiments lies in the operation of watermark information storage section 305.
Watermark detection section 302 acquires identification information of a method used for watermark insertion obtained from watermark information storage section 305 in a program output from program input section 301, and checks the method indicated by the identification information.
Next, watermark detection section 302 extracts watermark inserted in the program by performing conversion from instruction code to bit string, and from bit string to character, numeric value, or symbol, using the correspondence between characters, numeric values, and symbols used in watermarks and bit strings, and the correspondence between bit strings and instruction codes, obtained from the same watermark information storage section 305.
Watermark detection section 302 generates ID information from an extracted watermark, and outputs it to ID information storage section 304.
Watermark information storage section 305 is a means of holding identification information of a method in which a watermark is inserted. Watermark information storage section 305 also stores the correspondence between characters numeric values, and symbols used in a watermark of a distributed program and bit strings, and the correspondence between bit strings and instruction codes. Moreover, when inserted watermark is encrypted, watermark information storage section 305 also holds the key for decryption of the data. Watermark information storage section 305 obtains the correspondence between characters, numeric values, and symbols and bit strings, the correspondence between bit strings and instruction codes, identification information for a method in which a watermark is inserted, and a key for decryption of encrypted data, from watermark information storage section 1204.
Next, the operation of dummy code insertion section 1202 and watermark insertion section 202 of Embodiment 5 will be described using
First, dummy code insertion section 1202 assigns an initial value of 1 to variable i (step 1300). Then watermark insertion section 202 generates watermark S from ID information, using the correspondence between characters, numeric values, and symbols and bit strings (step 1301).
Dummy code insertion section 1202 then detects a method section (location at which a method is implemented) in the program (step 1302) and determines whether variable i is less than or equal to the total number of methods in the program (step 1303), and if variable i is less than or equal to the total number of methods, inserts essentially unnecessary dummy code that does not affect the program specifications (step 1304).
There are a number of variations of the dummy code inserted at this time, and the variation is selected at random, or so as not to duplicate dummy code inserted in a program distributed in the past. That is to say, dummy code is inserted in such a way that the dummy code will be extracted by collusion attack.
Next, watermark insertion section 202 determines whether the detected method section is a dummy method (step 1305), and if it is a dummy method, inserts watermark S by applying the method described in “A Watermarking Method for Computer Program” (Monden et al., 1998 Symposium on Cryptography and Information Security, SCIS ′98-9.2.A, January 1998) (step 1306).
At this time, watermark insertion section 202 also retains dummy method identification information (step 1307).
Watermark insertion section 202 then increments variable i (step 1308) and proceeds to the processing in step 1302.
If, on the other hand, variable i is determined not to be less than or equal to the total number of methods in step 1303—that is, if dummy code has been inserted in all methods, and watermark has been inserted in a dummy method thereamong—watermark insertion section 202 outputs a program in which watermark has been embedded (step 1309).
With a program generated by watermark insertion section 202 according to Embodiment 5, implementation differs according to dummy code insertion, but the specifications (relationship to program input/output) are not changed. Also, since different dummy codes are inserted by the respective programs, when differences between programs are identified in order to identify a watermark insertion method, methods other than a method in which a watermark is inserted are also detected as differences.
Therefore, in order to alter or delete a program watermark, it is necessary to analyze the program and find out which method is a dummy method for watermark insertion unrelated to the program specifications. Since determining whether a part is unrelated to the program specifications requires an understanding of the program specifications, it is difficult to mechanically delete a watermark embedded using this method.
An example of program code generated when Embodiment 5 is applied is shown in
The program indicated by reference numeral 1600a in
Program 1600c and program 1600d are programs in which different watermarks and dummy code are inserted. In programs 1600a through 1600d, method A2 denotes a dummy method, and the numeral before each instruction mnemonic indicates the line number.
First, in step 1301, watermark insertion section 202 generates watermark S1 (100111 001101 101000 000000 000001) and S2 (100111 001101 101000 000000 000010) with 6 bits per character from different ID information I1 ((C) 01) and I2 ((C) 02) ,respectively, for use by watermark insertion programs 1600c and 1600d.
Next, in step 1302, dummy code insertion section 1202 detects a method section in watermark insertion program 1600b, and instep 1304 inserts mutually differing dummy code in A1, which is not a dummy method (part indicated by reference numeral 1601 in
Furthermore, when the method is dummy method A2, in step 1306 watermark insertion section 202 embeds as watermark in watermark insertion program 1600b only the number of bits allocated to the instructions subject to embedding from watermark information S1 and S2.
In this example, iconst_0 in method A2 of program 1600b is an instruction subject to embedding and 2-bit information is allocated thereto, and embedding is performed by extracting 2 bits from S1 and S2 (part indicated by reference numeral 1602 in
At this time, watermark insertion section 202 performs extraction from the low-order bits of each character, and when extraction is completed for one entire character, performs extraction from the low-order bits of the next character.
Dummy code insertion section 1202 also performs the same kind of dummy code insertion for method A2 as for method Al (part indicated by reference numeral 1603 in
If the distribution destinations of programs 1600c and 1600d are in collusion, and find differences between the programs in order to identify the watermark information insertion location, the parts indicated by reference numerals 1601 and 1603, which are not watermark, will also be detected together with watermark 1602, making it difficult for a watermark insertion location to be identified based on collusion attack.
It is thus possible to prevent mechanical alteration or deletion of watermark, and to prevent illegal program circulation.
Next, the operation of watermark detection section 302 according to Embodiment 5 will be described using
First, watermark detection section 302 acquires dummy method identification information from watermark information storage section 305 (step 1500).
Then watermark detection section 302 detects a dummy method section in which a dummy method is implemented and a method section in the program using the acquired identification information (step 1501), and extracts watermark S from the dummy method section using the correspondence between bit strings and instruction codes stored in watermark information storage section 305 (step 1502).
Watermark detection section 302 generates ID information uniquely identifying the program distribution destination from information stored in ID information storage section 304 and extracted watermark S (step 1503), outputs the ID information (step 1504), and terminates processing.
Thus, watermark detection section 302 can easily detect a dummy method section and method section by using identification information, and can identify the program distribution destination by extracting watermark S from the dummy method section. As a result, illegal program circulation can be prevented.
As described above, according to Embodiment 5, it is possible to insert in a program not only watermark but also dummy code, comprising an execution code pair, that does not affect the specifications. As a result, non-watermark code is detected in different places when collusion attack is performed, making it possible to prevent with certainty watermark alteration or deletion based on collusion attack.
(Embodiment 6)
Embodiment 6 alters a program by switching around the order of parts other than a watermark insertion location, or the code of the entire program. As a result, non-watermark code is detected in different places when collusion attack is performed, making it possible to prevent with certainty watermark alteration or deletion based on collusion attack.
Embodiment 6 is described in detail below. The difference between the watermark insertion apparatus in Embodiment 6 and watermark insertion apparatus 20 in Embodiment 1 lies in the operation of watermark insertion section 202.
Next, the operation of watermark insertion section 202 of Embodiment 6 will be described using
First, watermark insertion section 202 assigns an initial value of 1 to variable i (step 1601). Then watermark insertion section 202 generates, from ID information, watermark S (different for each distribution destination) for embedding in the code (program) (step 1602).
Next, watermark insertion section 202 extracts code parts, within the entire program, that will not affect the specifications-that is, that will allow the specifications to be maintained-even if their order is switched around (step 1603). A code part here means a part of a program composed of a plurality of codes.
Watermark insertion section 202 then determines whether variable i is less than or equal to the number (N) of code parts that allow the specifications to be maintained even if their order is switched around (step 1604), and if variable i is less than or equal to N, switches around the order of the code contained in that code part (step 1605) , increments i (step 1606) , and proceeds to step 1604.
If variable i is not less than or equal to N, watermark insertion section 202 inserts watermark S in the code (step 1607), compiles the source code, stores the assembler code line number at which watermark S was input, outputs the program, and terminates processing (step 1608).
In this way, watermark insertion section 202 converts parts other than a location at which watermark is input, while maintaining the specifications, by switching around the order of code parts that allow the specifications to be maintained even if their order is switched around.
Next, program code generated by watermark insertion section 202 according to Embodiment 6 will be described using
Programs 1700b and 1700c contain code parts 1701b and 1701c, and 1703b and 1703c. Code parts 1701b and 1701c, and 1703b and 1703c, are not code parts for insertion of a watermark contained in original program 1700a, and in these code parts 1701b and 1701c, and 1703b and 1703c, the code sequence of code parts 1701a and 1701b that allow the specifications to be maintained even if their code is switched around is changed.
Thus, program 1700b and program 1700c have been converted to different instruction sequences with respect to program 1700a, but the overall specifications have not changed. That is to say, in program 1700b and program 1700c, program 1700a has been converted while maintaining its specifications. When the differences between programs 1700b and 1700c are identified, since program code has also been modified at locations other than the watermark location, non-watermark code parts 1701b, 1701c, 1703b, and 1703c are also detected as differences.
Therefore, in order to alter or delete the watermarks of programs 1700b and 1700c, it is necessary to analyze the programs and find out which parts are watermarks that do not affect the program specifications. Since determining whether a part does not affect the program specifications requires an understanding of the program specifications, it is difficult to mechanically delete a watermark embedded using this method.
As described above, according to Embodiment 6, watermark insertion section 202 operates as a conversion means that detects, from among program parts other than locations at which a program watermark is inserted, program parts that allow the specifications to be maintained even if the instruction sequence is switched around, and performs sequence conversion of program parts whose instruction sequence can be switched around without affecting the program specifications—that is to say, while maintaining the specifications. Consequently, program parts that do not affect program specifications, comprising non-watermark code, are detected by collusion attack. As a result, it is possible to prevent watermark alteration or deletion with certainty, and to prevent illegal program circulation.
Also, according to Embodiment 6, with regard to sequence conversion of program parts for which switching around of the instruction sequence presents no problem, permutations of instruction statements within a program part are found, and conversion is performed in accordance with a permutation selected so as to be different for each distribution destination. As a result, the instruction sequences of program parts for which switching around of the instruction sequence presents no problem are different for each distribution destination. It is thus difficult to identify a program part for which switching around of the instruction sequence presents no problem, and it is possible to prevent watermark alteration or deletion with certainty.
As a method other than that of conversion in accordance with instruction sequence permutations, the order of program parts for which switching around of the instruction sequence presents no problem may be made different for each distribution destination by being converted randomly.
It is also possible to hold historical information on sequence conversion of code contained in code parts for which sequence switching presents no problem, and to use this historical information to perform conversion of code parts for which sequence switching presents no problem so as to be different for each distribution destination.
By this means, sequence conversion of code contained in code parts for which sequence switching presents no problem can be made different for each distribution destination reliably and easily.
This application is based on Japanese Patent Application No.2002-311815 filed on Oct. 25, 2002, Japanese Patent Application No.2003-133566 filed on May 12, 2003, and Japanese Patent Application No.2003-324805 filed on Sep. 17, 2003, entire contents of which are expressly incorporated by reference herein.
As described above, according to the present invention it is possible to insert a watermark so that it is difficult to identify the watermark insertion location, and therefore the present invention is applicable over a wide range including the circulation of computer programs using a network.
Number | Date | Country | Kind |
---|---|---|---|
2002-311815 | Oct 2002 | JP | national |
2003-133566 | May 2003 | JP | national |
2003-324805 | Sep 2003 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP03/13405 | 10/21/2003 | WO | 1/21/2005 |