This application claims priority to Chinese Application No. 202110716015.8 filed on Jun. 25, 2021, which is incorporated herein by reference in its entirety.
The present disclosure relates to a field of computer technology, and in particular to a field of Internet of Vehicles and a field of information security technology.
With the comprehensive promotion of “Internet+”, the application of information technology in social and economic development has become increasingly extensive. Correspondingly, new network security threats have become more prominent, and traditional network security defense systems based on “protection” face great challenges. In the face of these new threats, network security defense systems will pay more attention to network security monitoring and response capabilities in the future.
The present disclosure provides a method of processing security information, a device, and a storage medium.
According to an aspect of the present disclosure, a method of processing security information is provided, including standardizing a security alarm information for a target device to obtain standardization data; determining a similarity between the standardization data and attack data in an attack behavior knowledge base; and updating a security information of the target device according to the similarity.
Another aspect of the present disclosure provides an electronic device, including: at least one processor; and a memory communicatively connected to the at least one processor, wherein the memory stores instructions executable by the at least one processor, and the instructions, when executed by the at least one processor, cause the at least one processor to execute the method shown in embodiments of the present disclosure.
According to another aspect of the embodiments of the present disclosure, a non-transitory computer-readable storage medium having computer instructions stored thereon is provided, wherein the computer instructions are configured to cause a computer to execute the method shown in the embodiments of the present disclosure.
It should be understood that content described in this section is not intended to identify key or important features in the embodiments of the present disclosure, nor is it intended to limit the scope of the present disclosure. Other features of the present disclosure will be easily understood through the following description.
The accompanying drawings are used to provide a better understanding of the present disclosure and do not constitute a limitation to the present disclosure, in which:
Exemplary embodiments of the present disclosure will be described below with reference to the accompanying drawings, which include various details of the embodiments of the present disclosure to facilitate understanding and should be considered as merely exemplary. Therefore, those of ordinary skill in the art should realize that various changes and modifications may be made to the embodiments described herein without departing from the scope and spirit of the present disclosure. Likewise, for clarity and conciseness, descriptions of well-known functions and structures are omitted in the following description.
As shown in
According to an embodiment of the present disclosure, when the terminal devices 101, 102, and 103 are under a network attack, the terminal devices 101, 102, and 103 generate corresponding security alarm information according to an attack behavior of the network attack. A security data probe may be deployed in the terminal devices 101, 102 and 103 for collecting the security alarm information of the terminal devices 101, 102 and 103, and reporting the security alarm information to the threat analysis platform 105 through the network 104.
The terminal devices 101, 102, and 103 may be various electronic devices that support a network communication, including but not limited to an intelligent vehicle-mounted system, a vehicle-mounted sensor, an intelligent transportation terminal device, etc.
The threat analysis platform 105 may be configured to process (e.g., analyze) data such as the received security alarm information, and to feed back a processing result (e.g., an analysis result generated according to the security alarm information) to the terminal device.
The threat analysis platform 105 may be deployed in a server or a server cluster consisting of a plurality of servers. The server may be a cloud server, also known as a cloud computing server or a cloud host, which is a host product in a cloud computing service system and addresses the difficult management and weak business scalability of traditional physical hosts and virtual private server (VPS) services. The server may further be a server of a distributed system, or a server combined with a blockchain.
According to the embodiments of the present disclosure, the attack behavior knowledge base 106 may be established from information on attack behaviors of attackers observed and collected in the real world, and may be configured to reflect an attack life cycle of an attacker and the techniques and means the attacker uses. The attack behavior knowledge base 106 may be based on an adversarial tactics, techniques, and common knowledge (ATT&CK) model. The ATT&CK model has a low level of abstraction, may effectively help a user associate tactical strategies, and may more clearly reflect which phase of the life cycle a current attack is in.
It should be noted that, the method of processing security information provided by the embodiments of the present disclosure may be executed by the threat analysis platform 105. Correspondingly, the apparatus of processing security information provided by the embodiments of the present disclosure may be disposed in the threat analysis platform 105. The method of processing security information provided by the embodiments of the present disclosure may further be executed by a server or a server cluster different from the threat analysis platform 105 and capable of communicating with the terminal devices 101, 102, 103 and/or the threat analysis platform 105. Correspondingly, the apparatus of processing security information provided by the embodiments of the present disclosure may further be disposed in the server or the server cluster different from the threat analysis platform 105 and capable of communicating with the terminal devices 101, 102, 103 and/or the threat analysis platform 105.
It should be understood that the numbers of the terminal device, the network, the threat analysis platform and the attack behavior knowledge base in
As shown in
In operation S210, a security alarm information for a target device is standardized to obtain standardization data.
Then, in operation S220, a similarity between the standardization data and attack data in the attack behavior knowledge base is determined.
In operation S230, a security information of the target device is updated according to the similarity.
According to the embodiments of the present disclosure, the security data probe may be deployed in the terminal device in advance. Therefore, a security alarm information of the corresponding terminal device may be collected by the security data probe deployed in each terminal device, and then uploaded to the threat analysis platform. According to other embodiments of the present disclosure, the threat analysis platform may also access a third-party threat analysis platform to obtain a security alarm information from the third-party threat analysis platform.
According to the embodiments of the present disclosure, the security alarm information may be parsed to obtain at least one target field. Then the at least one target field is converted into a field in the standardization data according to a preset format. The preset format may include, for example, a Structured Threat Information eXpression (STIX). The STIX is a language configured to express relevance and coverage of an event, which may be used to express a structural network threat information.
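The parse-then-convert step described above can be sketched as follows. This is a minimal illustration only: the raw alarm layout, the `FIELD_MAP` table, and the output field names are hypothetical, and the resulting record is merely STIX-inspired rather than a validated STIX object.

```python
import json
from datetime import datetime, timezone

# Hypothetical mapping from raw alarm keys to standardized field names.
FIELD_MAP = {
    "dev_id": "device_id",
    "msg": "description",
    "ts": "created",
}


def standardize_alarm(raw_alarm: str) -> dict:
    """Parse a raw JSON alarm and convert its target fields to a preset format."""
    parsed = json.loads(raw_alarm)
    # "type" is an illustrative tag; this is not a complete STIX object.
    record = {"type": "indicator"}
    for raw_key, std_key in FIELD_MAP.items():
        if raw_key in parsed:
            record[std_key] = parsed[raw_key]
    # Normalize an epoch timestamp (seconds) to an ISO 8601 UTC string.
    created = record.get("created")
    if isinstance(created, (int, float)):
        record["created"] = datetime.fromtimestamp(
            created, tz=timezone.utc
        ).strftime("%Y-%m-%dT%H:%M:%SZ")
    return record


raw = (
    '{"dev_id": "vehicle-001", '
    '"msg": "DDOS attack against a network server", '
    '"ts": 0, "extra": "ignored"}'
)
standardized = standardize_alarm(raw)
print(standardized)
```

Fields that are not listed in the mapping (such as `extra` here) are dropped, so every downstream step sees the same set of field names regardless of which probe produced the alarm.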
According to the embodiments of the present disclosure, the attack data is data configured to describe the network attack, and may include, for example, a procedures field, a techniques field, and a tactics field, wherein the tactics field may be configured to describe why the attacker attacks, reflecting an attack intent of the attacker. A corresponding attack phase may be determined by the tactics field. For example, the tactics field may include initial access, execution, persistence, privilege escalation, defense evasion, credential access, discovery, lateral movement, collection, exfiltration, command and control, etc. Exemplarily, in this embodiment, each tactic may correspond to one or more techniques, and the techniques field may be configured to describe what the attacker has done to complete the tactic. In this embodiment, each technique may correspond to one or more procedures, and the procedures field may be configured to describe a procedure of the attack.
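The tactic, technique, and procedure hierarchy described above might be represented as flat records that carry all three fields. The following sketch assumes such a layout; the `AttackData` class, the helper functions, and the sample entries are illustrative and are not taken from any actual knowledge base.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class AttackData:
    tactic: str     # why the attacker acts (attack phase)
    technique: str  # what the attacker does to complete the tactic
    procedure: str  # how the technique is carried out


# Illustrative entries only.
KNOWLEDGE_BASE = [
    AttackData("initial access", "phishing",
               "attacker sends an email with a malicious attachment"),
    AttackData("initial access", "exploit public-facing application",
               "attacker exploits a flaw in a web server"),
    AttackData("exfiltration", "exfiltration over network",
               "attacker uploads collected files to a remote host"),
]


def techniques_for(tactic: str) -> List[str]:
    """Return the distinct techniques recorded under a tactic, in order."""
    seen: List[str] = []
    for entry in KNOWLEDGE_BASE:
        if entry.tactic == tactic and entry.technique not in seen:
            seen.append(entry.technique)
    return seen


def tactic_of(procedure: str) -> Optional[str]:
    """Walk from a procedure up to the tactic it belongs to."""
    for entry in KNOWLEDGE_BASE:
        if entry.procedure == procedure:
            return entry.tactic
    return None


print(techniques_for("initial access"))
print(tactic_of("attacker uploads collected files to a remote host"))
```

Storing the tactic on every record keeps the upward lookup from a matched procedure to its attack phase a single scan, at the cost of repeating the tactic name across entries.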
According to the embodiments of the present disclosure, the security information of the target device may reflect a security status of the target device, and updating the security information of the target device may help the user or an analysis system to extract and understand the attack intent of the attacker.
According to the embodiment of the present disclosure, the security information may include, for example, an attack chain. The attack chain may reflect an attack phase experienced by the device when being attacked. A form of the attack chain may take into account both machine readability and human readability. Based on this, when updating the security information, a tactics identifier corresponding to the tactics field in the target attack data may be added to the attack chain to update the attack chain.
For example, the standardization data contains a unique identifier of the device, and a current attack chain status of the device may be found through the unique identifier of the device. If the device has not been attacked before, the attack chain is empty. In this case, a current tactics identifier may be added as a data item to the attack chain. If the device has been attacked before, a current attack chain is not empty. In this case, the tactics identifier may be added to a tail of the attack chain, and the tactics identifier added during a previous attack may be set to point to the currently added tactics identifier in order to indicate a direction of the attack.
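The attack chain update described above can be sketched with an ordered list per device, where the position in the list stands in for the "points to" relation between consecutive tactics identifiers. The in-memory store, the function names, and the identifier strings below are assumptions made for illustration.

```python
from typing import Dict, List

# Hypothetical in-memory store: device identifier -> ordered tactics identifiers.
attack_chains: Dict[str, List[str]] = {}


def update_attack_chain(device_id: str, tactics_id: str) -> List[str]:
    """Append a tactics identifier to the tail of the device's attack chain.

    An unseen device starts with an empty chain, so the first identifier
    becomes the only data item. Later identifiers are appended at the tail,
    and list order encodes that each item points to the next one.
    """
    chain = attack_chains.setdefault(device_id, [])
    chain.append(tactics_id)
    return chain


def render_chain(chain: List[str]) -> str:
    """Human-readable form of the chain showing the direction of the attack."""
    return " -> ".join(chain)


update_attack_chain("vehicle-001", "initial-access")
update_attack_chain("vehicle-001", "execution")
print(render_chain(attack_chains["vehicle-001"]))
```

A plain list keeps the chain easy to serialize for machines while `render_chain` gives a readable view for analysts; an explicit linked structure would serve equally well if branching chains were needed.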
The method of processing security information according to the embodiments of the present disclosure may improve an efficiency of extracting effective threat behavior information from massive security alarm information, thereby improving an efficiency of discovering and responding to a security threat, so as to detect a network security threat behavior in time and respond to the threat quickly.
The operation of determining the similarity between the standardization data and the attack data in the attack behavior knowledge base will be further described below combined with
As shown in
In operation S321, at least one first keyword in the standardization data is determined.
According to the embodiments of the present disclosure, the standardization data includes a field configured to describe the attack behavior, and a word segmentation processing may be performed on the field to obtain the at least one first keyword.
Then, in operation S322, for each attack data in the attack behavior knowledge base, at least one second keyword in a procedures field of the attack data is determined.
According to the embodiments of the present disclosure, word segmentation may be performed on the procedures field in the attack data to obtain the at least one second keyword.
In operation S323, the similarity between the standardization data and the attack data is determined according to the at least one first keyword and the at least one second keyword.
According to the embodiments of the present disclosure, the similarity between the standardization data and the attack data may be determined according to a similarity between the at least one first keyword and the at least one second keyword.
Exemplarily, in this embodiment, the at least one first keyword may be combined with the at least one second keyword to obtain a keyword set. Then a word frequency of each keyword of the keyword set in the standardization data is determined to obtain a first word frequency feature vector. A word frequency of each keyword of the keyword set in the attack data is determined to obtain a second word frequency feature vector. A cosine similarity between the first word frequency feature vector and the second word frequency feature vector is calculated as the similarity between the standardization data and the attack data.
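The keyword-set and word-frequency procedure above can be sketched as follows. Whitespace splitting is used here as a stand-in for word segmentation, which is an assumption; a real deployment would use a segmenter suited to the language of the alarm text. The function names and sample sentences are illustrative.

```python
import math
from collections import Counter
from typing import List


def segment(text: str) -> List[str]:
    """Stand-in for word segmentation: lowercase and split on whitespace."""
    return text.lower().split()


def cosine_similarity(first_keywords: List[str], second_keywords: List[str]) -> float:
    """Cosine similarity of word-frequency vectors over the merged keyword set."""
    # Combine both keyword lists into one deduplicated keyword set.
    keyword_set = sorted(set(first_keywords) | set(second_keywords))
    first_counts = Counter(first_keywords)
    second_counts = Counter(second_keywords)
    # One vector element per keyword: its frequency in each text.
    a = [first_counts[k] for k in keyword_set]
    b = [second_counts[k] for k in keyword_set]
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


s1 = "ddos attack against network server"
s2 = "attack ddos target network server"
sim = cosine_similarity(segment(s1), segment(s2))
print(round(sim, 3))
```

Because both vectors are built over the same merged keyword set, they always have equal length, and a keyword absent from one text simply contributes a zero to that text's vector.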
For example, a field s1 configured to describe the attack behavior in the standardization data is a sentence meaning “This attack is a DDOS attack against a network server”. A procedures field s2 in the attack data is a sentence meaning “This attack is used for a DDOS attack, and the target is a network server”.
Word segmentation is performed on the above-described s1 and s2, respectively, to obtain word vectors S1 and S2, each of which lists the words of the corresponding field (both contain the word “DDOS”).
Then, all the words in S1 and S2 are counted. All the words appearing in S1 and S2 are deduplicated and merged to obtain a keyword set, which includes “DDOS” among its keywords.
For each keyword in the above keyword set, a frequency of occurrence of each keyword in the S1 and the S2 is determined respectively, and following word frequency feature vectors A and B are obtained. Each element in the word frequency feature vector represents a frequency of occurrence of a corresponding keyword in the word vector.
A: [1, 2, 1, 0, 1, 1, 0, 1, 1, 0]
B: [1, 2, 1, 1, 0, 1, 1, 0, 1, 1]
Next, a cosine similarity between A and B may be calculated according to the following formula.
$$\text{similarity} = \cos(\theta) = \frac{A \cdot B}{\lVert A \rVert \, \lVert B \rVert} = \frac{\sum_{i=1}^{n} A_i B_i}{\sqrt{\sum_{i=1}^{n} A_i^2} \, \sqrt{\sum_{i=1}^{n} B_i^2}}$$
where similarity is the cosine similarity between A and B, θ is an angle between A and B, A_i is the i-th element in A, B_i is the i-th element in B, and n is a total number of elements in A (or B). In this embodiment, a range of the cosine similarity is [−1, 1]. The closer the cosine similarity between two vectors is to 1, the greater the similarity between the two vectors.
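As a worked check of the cosine similarity, assume the word frequency feature vectors above are read as ten single-digit counts, that is, A = (1, 2, 1, 0, 1, 1, 0, 1, 1, 0) and B = (1, 2, 1, 1, 0, 1, 1, 0, 1, 1). Under that reading:

$$\sum_{i=1}^{10} A_i B_i = 1 + 4 + 1 + 0 + 0 + 1 + 0 + 0 + 1 + 0 = 8$$

$$\sqrt{\sum_{i=1}^{10} A_i^2} = \sqrt{10}, \qquad \sqrt{\sum_{i=1}^{10} B_i^2} = \sqrt{11}$$

$$\text{similarity} = \frac{8}{\sqrt{10}\,\sqrt{11}} = \frac{8}{\sqrt{110}} \approx 0.763$$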
According to the embodiments of the present disclosure, after determining the similarity, a target attack data with the greatest similarity to the standardization data may be determined in the attack behavior knowledge base. Then, the security information of the target device is updated according to the target attack data in response to the similarity of the target attack data being greater than a similarity threshold. The similarity threshold may be set as desired in practice, and a specific value of the similarity threshold is not specifically limited in the present disclosure. Exemplarily, in this embodiment, the similarity threshold may be 0.5.
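The selection rule above (take the attack data with the greatest similarity, then accept it only if that similarity exceeds the threshold) can be sketched as below. The function name and the candidate representation as (item, similarity) pairs are assumptions; the default of 0.5 follows the exemplary value given in this embodiment.

```python
from typing import Optional, Sequence, Tuple, TypeVar

T = TypeVar("T")


def select_target(
    candidates: Sequence[Tuple[T, float]], threshold: float = 0.5
) -> Optional[T]:
    """Return the candidate with the greatest similarity if it exceeds the threshold.

    `candidates` pairs each attack data item with its precomputed similarity.
    Returns None when there are no candidates or the best one does not
    strictly exceed the threshold, in which case no update is made.
    """
    if not candidates:
        return None
    best_item, best_score = max(candidates, key=lambda pair: pair[1])
    return best_item if best_score > threshold else None


print(select_target([("procedure-a", 0.30), ("procedure-b", 0.76)]))
print(select_target([("procedure-a", 0.30), ("procedure-c", 0.45)]))
```

The comparison is strict, matching the wording "greater than a similarity threshold", so a best score exactly equal to the threshold leaves the security information unchanged.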
According to the embodiments of the present disclosure, by determining the similarity between the standardization data and the attack data in the attack behavior knowledge base, an attack data corresponding to the standardization data may be obtained, that is, information such as an attack method of the attacker, the attack intent of the attacker and the like may be obtained. Thus a security status of the device may be reflected more clearly and accurately.
The method of processing security information shown above will be further described below with reference to
As shown in
After obtaining the security alarm information 41, the threat analysis platform may perform data standardization on the security alarm information 41 according to a STIX format to obtain a standardization data 42. It may be understood that, if the obtained security alarm information 41 is already in the STIX format, data standardization is not required.
A target field used to describe the attack behavior in the standardization data 42 is then matched with a procedures field of a plurality of attack data 43 in an ATT&CK knowledge base. A similarity between the target field and the procedures field in each attack data is calculated to obtain a similarity between the target field and each procedures field. For a procedures field with a highest similarity among all the procedures fields, it is determined whether the similarity is greater than the similarity threshold. If the similarity is greater than the similarity threshold, a techniques field to which the procedures field belongs is determined, and then a tactics field to which the techniques field belongs is determined, thereby obtaining a target attack data 44.
After the target attack data 44 is obtained, a tactics identifier 45 of the target attack data may be obtained, an attack chain 46 corresponding to the device may be obtained, and then the tactics identifier 45 may be added to an original attack chain 46 of the device. Exemplarily, in this embodiment, the original attack chain 46 of the device is not empty. Accordingly, the tactics identifier 45 may be added to a tail of the attack chain 46, and the tactics identifier added during the previous attack may be set to point to the currently added tactics identifier to indicate the direction of the attack.
As shown in
The standardization module 510 may be configured to standardize the security alarm information for the target device to obtain the standardization data.
The similarity determination module 520 may be configured to determine the similarity between the standardization data and the attack data in the attack behavior knowledge base.
The update module 530 may be configured to update the security information of the target device according to the similarity.
According to the embodiments of the present disclosure, the standardization module may include a parsing sub-module and a conversion sub-module. The parsing sub-module may be configured to parse the security alarm information to determine the at least one target field. The conversion sub-module may be configured to convert the at least one target field into the field in the standardization data according to the preset format.
According to the embodiments of the present disclosure, the preset format may include the STIX.
According to the embodiments of the present disclosure, the similarity determination module may include a first determination sub-module, a second determination sub-module, and a third determination sub-module. The first determination sub-module may be configured to determine the at least one first keyword in the standardization data. The second determination sub-module may be configured to determine the at least one second keyword in the attack data for each attack data in the attack behavior knowledge base. The third determination sub-module may be configured to determine the similarity between the standardization data and the attack data according to the at least one first keyword and the at least one second keyword.
According to the embodiments of the present disclosure, the third determination sub-module may include a combination unit, a first determination unit, a second determination unit and a calculation unit. The combination unit may be configured to combine the at least one first keyword with the at least one second keyword to obtain the keyword set. The first determination unit may be configured to determine the word frequency of each keyword of the keyword set in the standardization data to obtain the first word frequency feature vector. The second determination unit may be configured to determine the word frequency of each keyword of the keyword set in the attack data to obtain the second word frequency feature vector. The calculation unit may be configured to calculate the cosine similarity between the first word frequency feature vector and the second word frequency feature vector as the similarity between the standardization data and the attack data.
According to the embodiments of the present disclosure, the update module may include a fourth determination sub-module and an update sub-module. The fourth determination sub-module may be configured to determine, in the attack behavior knowledge base, the target attack data with the greatest similarity to the standardization data. The update sub-module may be configured to update the security information of the target device according to the target attack data in response to the similarity of the target attack data being greater than the similarity threshold.
According to the embodiments of the present disclosure, the attack data may include the tactics field, and the security information includes the attack chain.
According to the embodiments of the present disclosure, the update sub-module includes an addition unit, which may be configured to add the tactics identifier corresponding to the tactics field in the target attack data to the attack chain.
It should be noted that, collecting, storing, using, processing, transmitting, providing, and disclosing etc. of the personal information of the user involved in the present disclosure all comply with the relevant laws and regulations, are protected by essential security measures, and do not violate the public order and morals. According to the present disclosure, personal information of the user is acquired or collected after such acquirement or collection is authorized or permitted by the user.
According to the embodiments of the present disclosure, the present disclosure further provides an electronic device, a readable storage medium and a computer program product.
As shown in
Various components in the device 600, including an input unit 606 such as a keyboard, a mouse, etc., an output unit 607 such as various types of displays, speakers, etc., a storage unit 608 such as a magnetic disk, an optical disk, etc., and a communication unit 609 such as a network card, a modem, a wireless communication transceiver, etc., are connected to the I/O interface 605. The communication unit 609 allows the device 600 to exchange information/data with other devices through a computer network such as the Internet and/or various telecommunication networks.
The computing unit 601 may be various general-purpose and/or special-purpose processing components with processing and computing capabilities. Some examples of the computing unit 601 include but are not limited to a central processing unit (CPU), a graphics processing unit (GPU), various dedicated artificial intelligence (AI) computing chips, various computing units running machine learning model algorithms, a digital signal processor (DSP), and any appropriate processor, controller, microcontroller, and so on. The computing unit 601 may perform the various methods and processes described above, such as the method of processing security information. For example, in some embodiments, the method of processing security information may be implemented as a computer software program that is tangibly contained on a machine-readable medium, such as the storage unit 608. In some embodiments, part or all of a computer program may be loaded and/or installed on the electronic device 600 via the ROM 602 and/or the communication unit 609. When the computer program is loaded into the RAM 603 and executed by the computing unit 601, one or more steps of the method of processing security information described above may be performed. Alternatively, in other embodiments, the computing unit 601 may be configured to perform the method of processing security information in any other appropriate way (for example, by means of firmware).
Various embodiments of the systems and technologies described herein may be implemented in a digital electronic circuit system, an integrated circuit system, a field programmable gate array (FPGA), an application specific integrated circuit (ASIC), an application specific standard product (ASSP), a system on chip (SOC), a complex programmable logic device (CPLD), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may be implemented by one or more computer programs executable and/or interpretable on a programmable system including at least one programmable processor. The programmable processor may be a special-purpose or general-purpose programmable processor, which may receive data and instructions from a storage system, at least one input device and at least one output device, and may transmit the data and instructions to the storage system, the at least one input device, and the at least one output device.
Program codes for implementing the method of the present disclosure may be written in any combination of one or more programming languages. These program codes may be provided to a processor or a controller of a general-purpose computer, a special-purpose computer, or other programmable data processing devices, so that when the program codes are executed by the processor or the controller, the functions/operations specified in the flowchart and/or block diagram may be implemented. The program codes may be executed completely on a machine, partly on the machine, partly on the machine and partly on a remote machine as an independent software package, or completely on the remote machine or a server.
In the context of the present disclosure, the machine readable medium may be a tangible medium that may contain or store programs for use by or in combination with an instruction execution system, device or apparatus. The machine readable medium may be a machine-readable signal medium or a machine-readable storage medium. The machine readable medium may include, but not be limited to, electronic, magnetic, optical, electromagnetic, infrared or semiconductor systems, devices or apparatuses, or any suitable combination of the above. More specific examples of the machine readable storage medium may include electrical connections based on one or more wires, portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above.
In order to provide interaction with users, the systems and techniques described here may be implemented on a computer including a display device (for example, a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to the user, and a keyboard and a pointing device (for example, a mouse or a trackball) through which the user may provide input to the computer. Other types of devices may also be used to provide interaction with users. For example, feedback provided to the user may be any form of sensory feedback (for example, visual feedback, auditory feedback, or tactile feedback), and the input from the user may be received in any form (including acoustic input, voice input or tactile input).
The systems and technologies described herein may be implemented in a computing system including back-end components (for example, a data server), or a computing system including middleware components (for example, an application server), or a computing system including front-end components (for example, a user computer having a graphical user interface or web browser through which the user may interact with the implementation of the system and technology described herein), or a computing system including any combination of such back-end components, middleware components or front-end components. The components of the system may be connected to each other by digital data communication (for example, a communication network) in any form or through any medium. Examples of the communication network include a local area network (LAN), a wide area network (WAN), and the Internet.
The computer system may include a client and a server. The client and the server are generally far away from each other and usually interact through a communication network. The relationship between the client and the server is generated through computer programs running on the corresponding computers and having a client-server relationship with each other.
It should be understood that steps of the processes illustrated above may be reordered, added or deleted in various manners. For example, the steps described in the present disclosure may be performed in parallel, sequentially, or in a different order, as long as a desired result of the technical solution of the present disclosure may be achieved. This is not limited in the present disclosure.
The above-mentioned specific embodiments do not constitute a limitation on the scope of protection of the present disclosure. Those skilled in the art should understand that various modifications, combinations, sub-combinations and substitutions may be made according to design requirements and other factors. Any modifications, equivalent replacements and improvements made within the spirit and principles of the present disclosure shall be contained in the scope of protection of the present disclosure.
| Number | Date | Country | Kind |
|---|---|---|---|
| 202110716015.8 | Jun 2021 | CN | national |