WO2016107082A1

WO2016107082A1 - Processor instruction check method during program execution

Info

Publication number: WO2016107082A1
Application number: PCT/CN2015/081034
Authority: WO
Inventors: 邹鹏; 吕雅帅; 吴玲达; 赵忠文; 王宇; 杨超; 郭静
Original assignee: 中国人民解放军装备学院
Priority date: 2014-12-30
Filing date: 2015-06-09
Publication date: 2016-07-07
Also published as: CN104504329A

Abstract

A processor instruction check method during program execution, mainly comprising the following contents: before program execution, calculating an address of each correct instruction in all the code segments of a program, then converting the address of the correct instruction into address check data, and finally, in the program execution process, when a processor calculates a branch target address of a branch transfer instruction and does not submit a result, checking the calculated branch target address. The method can be used for stopping a malicious code constructed by using an abnormal instruction sequence of a program from being executed on a processor.

Description

Program instruction verification method for program runtime

Technical field

The present invention relates to the field of computer security and can be used to prevent malicious code constructed using a program abnormal sequence of instructions from being executed on a processor.

Background technique

The use of software vulnerabilities for code injection attacks is the most popular type of malicious code attack for a long time. However, in recent years, computer systems have introduced a combination of software and hardware storage protection mechanism.

This protection mechanism makes code injection attacks very difficult.

The mechanism stipulates that the writing and execution of a memory page of the operating system are mutually exclusive, that is, the memory page storing the executable code cannot be written, and the content that can be written into the memory page cannot be executed. This means that even if malicious code is embedded in a memory page, the malicious code will not work because the memory page cannot be executed.

However, the code reuse attack technology that has emerged in recent years has become a new means of exploiting software vulnerabilities for malicious code attacks. Code reuse attacks do not require code injection, but reorganize the existing code of the program (including the code in the system runtime) through a series of branch transfer instructions (including jump instructions, function call instructions, function return instructions, etc.) Malicious code sequences are attacked. Because the code reuse attack does not write code data to the memory page, but uses the code already in the program, so

The mechanism is completely ineffective for code reuse attacks.

Code reuse attacks pose a much greater threat to CISC instruction set processors (such as the currently widely used x86 family of processors) than RISC instruction set processors. This is because in the RISC processor, all instruction lengths are consistent, and the starting address of each instruction is an integral multiple of the instruction length. The RISC processor does not accept instructions starting at an integer multiple of the non-instruction length when executing the instruction. However, the instruction length of the CISC processor is not fixed. Taking the x86 processor as an example, the instruction length can be from 1 byte to 16 bytes. This means that any address in the code segment may be the starting address of a new instruction. Therefore, constructing malicious code on a CISC processor is easier than on a RISC processor. Currently published research shows that more than 90% of the malicious code used by code reuse attacks for x86 processors consists of a program's abnormal sequence of instructions. Thus, if the execution of these abnormal instruction sequences can be prevented, the risk of the computer system being attacked by the code reuse will be greatly reduced.

Currently, computer systems that use the CISC instruction set (such as computers using x86 series processors) do not have any protection mechanisms to prevent the execution of such abnormal instruction sequences. Although some academic researches have proposed methods to prevent the execution of abnormal x86 instructions, they are all implemented by software. These methods require that the application must run in a software virtual machine environment, resulting in greatly reduced application performance. The invention proposes a protection mechanism combining software and hardware to prevent the processor from executing an abnormal instruction sequence. The mechanism does not depend on the virtual machine environment, the application does not need to be modified, and the program running performance loss is very small. The method of the present invention can be applied to x86 and other CISC processor families. So far, a similar approach to implementing methods to prevent the execution of abnormal instructions has not been reported.

Summary of the invention

It is an object of the present invention to reduce the risk of computer systems being coded for reuse and other malicious code attacks using abnormal instructions.

The invention is a program runtime processor instruction verification method, characterized in that the verification method steps are as follows:

(1) Before the program runs, calculate the address of each correct instruction in all code segments of the program by:

For each code segment, the code segment information is used to obtain the address of the first instruction at the beginning of the code segment, and then the code segment is disassembled from the first instruction, and all the code segments are obtained through disassembly. The length of the instruction, after which the address of each instruction is the address of the previous instruction plus the previous instruction Length of the order;

(2) Convert the address of the correct instruction into address verification data by:

Combine consecutive code segments on the address into one code segment for processing. Let the initial load address of the merged code segment be a ₀ and the end address be a ₁ . For each address in the range from a ₀ to a ₁ , The address is the starting address of a correct instruction, which is represented by a bit "1". Otherwise, it is represented by a "0", or a "0" indicates the starting address of a correct instruction, one "1""No, during the running of the program, the address check data is placed in the memory for query when the processor verifies the instruction. The start load address, the code segment size and the address check data of all the merged code segments. The address is recorded in a table for query use when the program is running;

(3) During the running of the program, when the processor calculates the branch target address of a branch transfer instruction and fails to submit the result, the calculated branch target address is verified, and the branch target address to be verified is set to a _t The verification method is:

1) by a branch target address of a _t the parity check cache, the branch target buffer stored branch target address of the branch instruction just executed successfully is, when the branch address to be verified branch target buffer with a _t If one of the addresses is the same, the verification is successful, and the processor continues to execute normally. Otherwise, 2) is required.

2) First find the code segment corresponding to the address a _t , that is, to know the address a _t is in the address range of the code segment, when the corresponding code segment is not found, the address a _t is an illegal address, and the processing should be triggered at this time. abnormality, calibration process terminates; when you find the code section address corresponding to a _T, assuming a start address of the first instruction of the code segment is a _0, then the address _T a ₀ offset with respect to a search The corresponding address check data is obtained by setting the address check data of the code segment to be stored at the address a _c and fetching the address.

One byte at which the operation

Denotes the largest integer not exceeding n, the second byte _{_{(a t -a 0)% 8}} + 1 bit is the parity of the data address a _t, where% indicates modulo operation; when the check data indicates that address a _t For the start address of a correct instruction, the verification is successful, the processor continues to execute normally, and the branch target check cache is updated with the address a _t , otherwise the verification fails, causing a processor exception.

This method can completely block the execution of the application code and the abnormal instructions in the system library code. After applying this method, the risk of the code reusing attack of the CISC processor can be greatly reduced.

detailed description

In the following, a computer system using an x86 series processor and an operating system of Linux is taken as an example to describe a specific embodiment of the present invention. The memory addresses mentioned in the following description all refer to virtual addresses.

1. After the operating system loads the application or the dynamic shared library, calculate the address of each correct instruction in each code segment:

The ELF format is an executable file format adopted by the current Linux operating system. When an executable program or a shared library of the ELF format is loaded into the memory, the operating system can know the starting address of the first instruction of each code segment in the memory. . According to the encoding rules of the x86 instruction set, the binary data of the entire code segment can be disassembled into a legal x86 instruction sequence in turn from the first instruction, and the length of each instruction according to the first instruction address and disassembly. , you can calculate the address of each instruction in memory in turn.

2. Convert the correct instruction address calculated in step 1 into address verification data:

Combine multiple consecutive code segments on the address into one code segment for processing. It is assumed that the initial load address of a code segment after the merge is A, and the end address is B. For each address in the range of A to B, if The address is the starting address of a correct instruction, which is represented by a bit "1", otherwise it is represented by a "0". Because the address check data is read-only, the data is placed in the code segment space of the program, and the processor accesses the cache access by means of an instruction cache or a separate instruction. In addition, the start load address of all merged code segments, the size of the code segment, and the address of the address check data are recorded into a table, which is recorded as a code segment check query table, as part of the operating system process data structure. Program operation Line-time queries are used.

3. During the running of the program, when the processor calculates the branch target address of a branch transfer instruction but has not submitted the result, the calculated branch target address is verified:

When the pipeline commit unit is to submit the result of a non-direct branch instruction, the branch target address of the branch instruction is verified. If the branch target address is valid, the processor continues to execute normally, otherwise a processor exception is raised. The verification process is as follows:

(1) First, the branch target address is verified by a branch target check buffer, and the branch target check cache stores the branch target address of the branch instruction that has been successfully executed recently. In hardware implementation, the branch target check cache is set near the submit unit of the processor, and the access speed is faster. When the branch target address to be verified hits the cache, it indicates that the branch target is legal, and the processor can continue to execute normally. If there is no hit, then step (2) is required;

(2) The current mainstream processor usually has at least one branch predictor, because the branch predictor is usually located near the fetch unit of the processor, so the delay of accessing the branch predictor is usually greater than the branch target near the direct access commit unit. Cache is checked, but the branch target check cache can set fewer check items to save hardware overhead. The BTB (branch target cache) component exists in the branch predictor. When the target address of the branch to be verified does not hit the branch target check buffer near the commit unit, the BTB in the branch predictor can be queried. The branch target is legal, and the processor can continue to execute normally, otherwise step (3) is required;

(3) Check the query table cache by the code segment to find the code segment where the branch target address is located. The code segment check query table cache is a partial cache image of the code segment check query table stored in the memory in the processor, and stores the recently accessed code segment information table entry. If the branch target address to be verified finds the corresponding code segment information in the code segment check query table cache, step (4) is performed, otherwise an interrupt is generated, and the operating system checks the query in the complete code segment during the interrupt process. Find in the table to be verified The code segment corresponding to the branch target address, if the search fails, indicates that the branch target address is illegal, causing a processor exception. If the corresponding code segment is found, the code segment information is used to update the code segment check query table cache, and the interrupt is returned. , proceed to step (4);

(4) Assuming that the branch target address to be verified is a _t , the first instruction address of the code segment queried in step (3) is a ₀ , and the address check data of the code segment is stored at the memory address a _c , in the memory address

The (a _t -a ₀ )%8+1 bits of the byte at which it is located are the parity data of the address a _t . In order to properly access the address, the address will need to be

The ITLB (Instruction Translation Lookaside Buffer) is converted into a physical address, and then the verification data at the address is accessed through the level 1 instruction cache and the secondary cache as if the instruction data in the memory is accessed. In order to improve the verification performance, a separate check cache can be set instead of the first level instruction cache. After obtaining calibration data, if the check data is "1" in the address a _t the start address of a proper instruction, the processor may continue to perform normally, with the address and a _t updated branch target cache check, or validation fails , causing a processor exception.

The above is only a specific embodiment of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily think of changes or substitutions within the technical scope of the present invention. All should be covered by the scope of the present invention. Therefore, the scope of protection of the present invention should be determined by the scope of the claims.

Claims

A program runtime processor instruction verification method, characterized in that the verification method steps are as follows:

(1) Before the program runs, calculate the address of each correct instruction in all code segments of the program by:

For each code segment, the code segment information is used to obtain the address of the first instruction at the beginning of the code segment, and then the code segment is disassembled from the first instruction, and all the code segments are obtained through disassembly. The length of the instruction, after which the address of each instruction is the address of the previous instruction plus the length of the previous instruction;

(2) Convert the address of the correct instruction into address verification data by:

Combine consecutive code segments on the address into one code segment for processing. Let the initial load address of the merged code segment be a 0 and the end address be a 1 . For each address in the range from a 0 to a 1 , The address is the starting address of a correct instruction, which is represented by a bit "1". Otherwise, it is represented by a "0", or a "0" indicates the starting address of a correct instruction, one "1""No, during the running of the program, the address check data is placed in the memory for query when the processor verifies the instruction. The start load address, the code segment size and the address check data of all the merged code segments. The address is recorded in a table for query use when the program is running;

(3) During the running of the program, when the processor calculates the branch target address of a branch transfer instruction and fails to submit the result, the calculated branch target address is verified, and the branch target address to be verified is set to a t The verification method is:

1) by a branch target address of a t the parity check cache, the branch target buffer stored branch target address of the branch instruction just executed successfully is, when the branch address to be verified branch target buffer with a t If one of the addresses is the same, the verification is successful, and the processor continues to execute normally. Otherwise, 2) is required.

2) First find the code segment corresponding to the address a t , that is, to know the address a t is in the address range of the code segment, when the corresponding code segment is not found, the address a t is an illegal address, and the processing should be triggered at this time. abnormality, calibration process terminates; when you find the code section address corresponding to a T, assuming a start address of the first instruction of the code segment is a 0, then the address T a 0 offset with respect to a search The corresponding address check data is obtained by setting the address check data of the code segment to be stored at the address a c and fetching the address.
One byte at which the operation
Denotes the largest integer not exceeding n, the second byte (a t -a 0)% 8 + 1 bit parity data is the address of a t, wherein% denotes a modulo operation; when the check data address indicating a t For the start address of a correct instruction, the verification is successful, the processor continues to execute normally, and the branch target check cache is updated with the address a t , otherwise the verification fails, causing a processor exception.