Big knowledge-based semantic correlation for detecting slow and low-level advanced persistent threats

Lajevardi, Amir Mohammadzade; Amini, Morteza

doi:10.1186/s40537-021-00532-9

Research
Open access
Published: 27 November 2021

Journal of Big Data volume 8, Article number: 148 (2021)

Abstract

Targeted cyber attacks, known today as Advanced Persistent Threats (APTs), use low and slow patterns to bypass intrusion detection and alert correlation systems. Since most attack detection approaches use a short time-window, slow APTs abuse this weakness to evade detection. In these situations, the intruders extend the duration of the attack and move as slowly as possible, using tricks such as sleep and wake-up functions, which makes detection difficult. In addition, low APTs use trusted subjects or agents to conceal any footprint and abnormality in the victim system, using tricks such as code injection and stolen digital certificates. In this paper, a new solution is proposed for detecting both low and slow APTs. The proposed approach uses low-level interception, a knowledge-based system, a system ontology, and semantic correlation to detect low-level attacks. Since semantic correlation alone is not applicable to detecting slow attacks due to its significant processing overhead, we propose a scalable knowledge-based system that reduces the time complexity using three concepts: (1) a flexible sliding window called the Vermiform window, which analyzes and correlates system events instead of a fixed-size time-window, (2) effective inference using a scalable inference engine called SANSA, and (3) data reduction by ontology-based data abstraction. The approach can detect slow APTs whose attack duration is several months. Evaluation of the proposed approach on a dataset containing many APT scenarios shows a sensitivity of 84.21% and a specificity of 82.16%.

Introduction

Since the term “Advanced Persistent Threat” was coined by the United States Air Force (USAF) [1,2,3] in 2006, various general definitions [3,4,5,6,7,8,9,10,11,12,13,14] have been proposed to describe advanced persistent threat (APT) attacks, but these definitions are often far from real APT scenarios such as Stuxnet, Flame, Project Sauron, Shamoon, and WannaCry [15]. According to our survey of the behavior and anatomy of nearly 70 real APTs reported in the Kaspersky Targeted Cyberattacks Logbook [15], these attacks use low and slow patterns that make them difficult to detect. Low APTs use tricks such as code injection to perform malicious activities through a trusted process and to conceal any footprint and abnormality. Detecting such malicious activities requires fine-grained event interception in the endpoint systems. Since fine-grained interception produces a huge number of events, detection approaches use a short time-window to correlate the events. Slow APTs use tricks to extend the duration of the attack and escape the short time-window of intrusion detection and alert correlation systems. Hence, security information and event management approaches, which collect and correlate the logs and alerts generated by different tools (e.g., antivirus, firewall, UTM, network intrusion detection system, network device, and operating system), are vulnerable to APT attacks for the following reasons:

Due to processing limitations, the generated and correlated logs are mostly coarse-grained. Coarse-grained logs lead to data loss and prevent both precise correlation between low-level operating system events and network events and the rebuilding of attack vectors.
Due to processing limitations, the available solutions use short time-windows for correlating the alerts, and hence they fail to detect slow attacks and long-term attack vectors.

The purpose of this paper is therefore to solve these problems in practice by proposing a big knowledge-based semantic correlation engine for detecting slow and low-level APTs, which are the most sophisticated APTs. To this aim, we enhance our previous solution proposed in [16] to detect slow APTs in addition to low-level and hybrid APTs. The contributions of our proposed approach are as follows:

Since detecting low-level APT attacks requires processing large event logs, and detecting slow APTs makes the processing problem much harder, one of our contributions is an approach that detects both low-level and slow APT attacks.
Using a long sliding window for detecting slow APTs. We propose a Vermiform sliding window to analyze and correlate system events instead of using a fixed-size time-window.
Using the Scalable Semantic Analytics Stack (SANSA) [17] as a big inference engine based on Spark for scalable semantic correlation.
Although SANSA is a good inference engine for processing a huge number of events, its processing power is limited. We use the event abstraction concept to reduce the number of events, to speed up inference, and to detect very slow APTs (whose attack duration is several years instead of several months). By abstracting the old events, we keep them as a history in the detection process instead of discarding them as the time window moves.

The rest of the paper is organized as follows. “Preliminaries” section describes the necessary preliminaries which are used in the paper. The characteristics of APTs, related works, and the formal definition of the problem are described in “Background and problem statement” section. The proposed approach is discussed in “Proposed approach” section. Evaluation and result analysis are reported in “Evaluation” section. Finally, the paper is concluded in “Conclusion” section.

Preliminaries

In this section, we define the basic terms and concepts that are used in other sections of this paper. The most basic terms of this section are retrieved from our previous work [16], which proposes an approach to detect hybrid and low-level APTs. The summary of the defined symbols in this paper is presented in “List of the symbols used in the paper” section.

Since the proposed approach in this paper employs description logic [18] and Web Ontology Language Description Logic (OWL-DL), the syntax and semantics of the relevant part of description logic are given in the “Syntax and semantics of description logic” section for readers who are not familiar with these concepts.

Definition 1

An event occurs when a subject acts on an object in a specific time or period [16].

More formally, event $e_i \in Event^{\mathcal {I}}$ is defined by a quadruple as follows:

$$\begin{aligned} \forall e_i \in Event^{\mathcal {I}}, e_i=\langle s_i,o_i,a_i, t_i \rangle , \nonumber \\&s_i\in Subject^{\mathcal {I}}, \nonumber \\&o_i \in Object^{\mathcal {I}}, \nonumber \\&a_i \in Action^{\mathcal {I}}, \nonumber \\&t_i \in {\mathbb {N}}, \end{aligned}$$

(1)

where $s_i$ is a subject such as a user or a process, $o_i$ is an object such as a socket or file, $a_i$ is an action such as reading (R) or writing (W), and $t_i$ is the timestamp of the event occurrence.

Since in this paper event is considered as a concept in system ontology and the languages provided for ontology specification allow only using unary and binary predicates, we should specify the properties of an event $e_i$ with four binary relations as follows [16].

$$\begin{aligned} e_i=\langle s_i,o_i,a_i, t_i\rangle \Longleftrightarrow \langle e_i,s_i\rangle ,\langle e_i,o_i\rangle ,\langle e_i,a_i\rangle , \langle e_i,t_i\rangle . \end{aligned}$$

In the rest of the paper, for the sake of simplicity, we define the event and its properties as a quadruple.
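
The equivalence between the quadruple and its four binary relations can be illustrated with a minimal sketch. The property names (`hasSubject`, `hasObject`, `hasAction`, `hasTime`) and the event identifier `"e1"` are hypothetical and are not taken from the ontology of [16]; they only show how one quadruple splits into four binary assertions and back.

```python
from typing import NamedTuple


class Event(NamedTuple):
    """An event quadruple <s, o, a, t> as in Definition 1."""
    subject: str
    obj: str
    action: str
    time: int


# Hypothetical property names; the actual ontology may name them differently.
PROPERTIES = ("hasSubject", "hasObject", "hasAction", "hasTime")


def to_binary_relations(event_id, event):
    """Split one quadruple into four (event, property, value) assertions."""
    return [(event_id, prop, value) for prop, value in zip(PROPERTIES, event)]


def from_binary_relations(event_id, triples):
    """Rebuild the quadruple of `event_id` from its four binary relations."""
    values = {prop: value for eid, prop, value in triples if eid == event_id}
    return Event(*(values[prop] for prop in PROPERTIES))


e1 = Event("s1", "o1", "R", 10)
triples = to_binary_relations("e1", e1)
restored = from_binary_relations("e1", triples)
```

The round trip loses no information, which is why the quadruple notation can be used as shorthand in the rest of the text.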

According to the ontology specified in the section “Semantic correlation”, concept Subject includes four subject types Thread, Process, User, and Host as follows:

$$\begin{aligned} Subject=Host \sqcup User \sqcup Process\sqcup Thread. \end{aligned}$$

(2)

Also, function $time: Event^{\mathcal {I}} \longrightarrow {\mathbb {N}}$ specifies the timestamp of an event, and is defined as follows:

$$\begin{aligned} \forall \ e_i \in Event^{\mathcal {I}}, \ e_i=\langle s_i, o_i, a_i, t_i \rangle \rightarrow time(e_i)=t_i. \end{aligned}$$

(3)

Similarly functions subject, object, and action specify the subject, the object, and the action of an event respectively as follows:

$$\begin{aligned}&\forall \ e_i \in Event^{\mathcal {I}}, \ e_i=\langle s_i, o_i, a_i, t_i \rangle \nonumber \\&\quad \rightarrow subject(e_i)=s_i, object(e_i)=o_i, action(e_i)=a_i. \end{aligned}$$

(4)

Definition 2

Frame $f:{\mathcal {P}}(Event^{\mathcal {I}}) \times {\mathbb {N}} \rightarrow {\mathbb {N}}$ specifies the number of events in a specific event set that have a specific timestamp.

In other words, $f(E_i, t_i)=n_i$ means that the number of events $e_j \in E_i$ with $time(e_j)=t_i$ is equal to $n_i$.

For example, if $E_i=\{\langle s_1, o_1, a_1,t_1\rangle , \langle s_2, o_2, a_2,t_2 \rangle ,$ $\langle s_3, o_3, a_3,t_1 \rangle \}$ then $f (E_i,t_1)=2$ and $f(E_i,t_2)=1$.
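
The frame function can be sketched in a few lines. This is an illustrative sketch only: the concrete timestamps 1 and 2 stand in for $t_1$ and $t_2$, and the `Event` type is an assumed representation of the quadruple from Definition 1.

```python
from typing import NamedTuple


class Event(NamedTuple):
    """An event quadruple <s, o, a, t> as in Definition 1."""
    subject: str
    obj: str
    action: str
    time: int


def frame(events, t):
    """f(E, t): the number of events in E whose timestamp equals t."""
    return sum(1 for e in events if e.time == t)


# The example event set, with t1 = 1 and t2 = 2 as assumed timestamps.
E = {
    Event("s1", "o1", "a1", 1),
    Event("s2", "o2", "a2", 2),
    Event("s3", "o3", "a3", 1),
}
count_t1 = frame(E, 1)
count_t2 = frame(E, 2)
```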

Definition 3

Two events $e_i$ and $e_j$ are related to each other, denoted by $e_i \sim e_j$, if there are specific relations between their properties and $t_i \le t_j$ [16]. This relation can be modeled by a directed acyclic graph, as shown in Fig. 1.

To detect malicious activities, it is necessary to define the system security policy. The security policy is defined as follows.

Definition 4

(Security policy [16]) Security policy (SP) is defined as $SP \subseteq Subject^{{\mathcal {I}}} \times Object^{{\mathcal {I}}} \times Action^{{\mathcal {I}}}$, which determines the set of all unauthorized events in the system. Any policy rule $p_i=\langle s_i,o_i,a_i \rangle$ in SP shows that subject $s_i$ is not authorized to do action $a_i$ on object $o_i$ at any time.

Definition 5

(Explicit violation [16]) The occurrence of an event set ES ($ES \subseteq Event^{\mathcal {I}}$) in a system causes the explicit violation of security with regard to security policy SP, if and only if,

$$\begin{aligned}&\exists e_i, p_j, e_i \in ES \wedge p_j \in SP \wedge subject(e_i)=subject(p_j) \wedge \nonumber \\&\quad object(e_i)=object(p_j) \wedge action(e_i)=action(p_j). \end{aligned}$$

(5)
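
Equation (5) amounts to a membership test of each event's subject, object, and action against the policy set. The following sketch assumes a tuple-based representation of events and policy rules; the names `Event`, `PolicyRule`, and `explicit_violations` are illustrative.

```python
from typing import NamedTuple


class Event(NamedTuple):
    subject: str
    obj: str
    action: str
    time: int


class PolicyRule(NamedTuple):
    """A rule <s, o, a>: subject s must never perform action a on object o."""
    subject: str
    obj: str
    action: str


def explicit_violations(events, policy):
    """Return the events whose (subject, object, action) matches a policy rule."""
    forbidden = {tuple(rule) for rule in policy}
    return [e for e in events if (e.subject, e.obj, e.action) in forbidden]


ES = [Event("s1", "o1", "R", 1), Event("s2", "o1", "R", 2)]
SP = {PolicyRule("s2", "o1", "R")}
violations = explicit_violations(ES, SP)
```

The event set violates the policy explicitly exactly when the returned list is non-empty. The timestamp plays no role because a policy rule holds at any time.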

Definition 6

(Implicit violation [16]) The occurrence of an event set ES ($ES \subseteq Event^{\mathcal {I}}$) in a system causes the implicit violation of security with regard to security policy SP, if and only if,

$$\begin{aligned}&\exists e_i, p_j, e_i \in I(ES) \wedge p_j \in SP \wedge subject(e_i)=subject(p_j) \nonumber \\&\quad \wedge object(e_i)=object(p_j) \wedge action(e_i) =action(p_j) \wedge \nonumber \\&\not \exists e_k, e_k \in ES \wedge subject(e_k)=subject(e_i) \wedge \nonumber \\&\quad object(e_k)=object(e_i) \wedge action(e_k)=action(e_i), \end{aligned}$$

(6)

where function $I: {\mathcal {P}}(Event^{\mathcal {I}}) \longrightarrow {\mathcal {P}}(Event^{\mathcal {I}})$ specifies the set of all events that occur implicitly following the execution of another event set. As an example, for two untrusted subjects $s_1$, $s_2$, if $ES:=\{\langle s_1, o_1, R, t_1\rangle , \langle s_1, o_2, W, t_2 \rangle , \langle s_2,o_2, R, t_3 \rangle \}$ and $SP:=\{\langle s_2, o_1, R \rangle \}$, then the implicitly occurring event set is $I (ES):= \{\langle s_2,o_1, R,t_3 \rangle \}$, which violates the policy. The schematic description of this example is shown in Fig. 2. As shown in this figure, the occurrence of the sequence of events ES causes object $o_1$ to be read by subject $s_2$ indirectly. In other words, event set ES causes event $e_k=\langle s_2, o_1, R, t_3 \rangle$ to occur implicitly. The process of indirect access detection is discussed in the “Expanding: knowledge-based inference” section.
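
The example can be reproduced with a toy stand-in for $I$. This sketch assumes a single, simplified propagation rule (content read by a subject flows into every object it later writes, and from there to later readers); the actual indirect access rules are defined later in the paper and are richer than this. The timestamps 1, 2, 3 stand in for $t_1$, $t_2$, $t_3$.

```python
from collections import defaultdict
from typing import NamedTuple


class Event(NamedTuple):
    subject: str
    obj: str
    action: str  # "R" (read) or "W" (write)
    time: int


def implied_events(events):
    """Toy stand-in for I(ES): reads implied by read -> write -> read flows."""
    known = defaultdict(set)    # subject -> objects whose content it has seen
    content = defaultdict(set)  # object -> other objects whose content it holds
    explicit = {(e.subject, e.obj, e.action) for e in events}
    implied = set()
    for e in sorted(events, key=lambda ev: ev.time):
        if e.action == "R":
            # Reading an object also exposes whatever flowed into it earlier.
            for source in content[e.obj]:
                if (e.subject, source, "R") not in explicit:
                    implied.add(Event(e.subject, source, "R", e.time))
            known[e.subject] |= {e.obj} | content[e.obj]
        elif e.action == "W":
            # Writing may leak everything the subject has seen so far.
            content[e.obj] |= known[e.subject]
    return implied


def implicit_violations(events, policy):
    """Implied events that match a policy rule (subject, object, action)."""
    return {e for e in implied_events(events)
            if (e.subject, e.obj, e.action) in policy}


ES = {Event("s1", "o1", "R", 1), Event("s1", "o2", "W", 2), Event("s2", "o2", "R", 3)}
SP = {("s2", "o1", "R")}
I_ES = implied_events(ES)
violations = implicit_violations(ES, SP)
```

Implied events that already occur explicitly are skipped, which mirrors the $\not \exists e_k$ clause of Eq. (6).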

Definition 7

(Attack vector) An attack vector $\nu _i$ is a set of events ($\nu _i \subseteq Event^{\mathcal {I}}$) that has the following three characteristics:

Malicious: An attack vector $\nu _i$ is malicious if it violates the security policies implicitly or explicitly.
Minimal: An attack vector $\nu _i$ is minimal if the exclusion of any event $e_i$ from $\nu _i$ reduces the maliciousness of $\nu _i$ [16]. Suppose that function $\zeta :{\mathcal {P}}(Event^{\mathcal {I}}) \rightarrow {\mathbb {N}}$ gives the maliciousness value of an event set; then $E_i \subseteq Event^{\mathcal {I}}$ is more malicious than $E_j \subseteq Event^{\mathcal {I}}$ if and only if $\zeta (E_i)>\zeta (E_j)$ [16]. In other words, minimality means:
$$\begin{aligned} \forall e_i \in \nu _i, \zeta (\nu _i - \{e_i\}) < \zeta (\nu _i). \end{aligned}$$
(7)
Connected: An attack vector $\nu _i$ is connected, if the relations between all the events of $\nu _i$ construct a connected directed acyclic graph. In other words:
$$\begin{aligned}&(\nu _i,\sim ) \ is\ Partial\ Order \wedge \not \exists \nu _1, \nu _2, \nu _1 \subseteq \nu _i \wedge \nonumber \\&\quad \nu _2 \subseteq \nu _i \wedge \nu _i=[\nu _1 \cup \nu _2]~ \wedge ~[\nu _1 \cap \nu _2] =\varnothing ~ \wedge \nonumber \\&\quad \not \exists e_1, e_2, e_1 \in \nu _1 \wedge e_2 \in \nu _2 \wedge (e_1 \sim e_2 \vee e_2 \sim e_1). \end{aligned}$$
(8)

Set $\nu$ is defined as the set of all attack vectors.
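
The minimality and connectedness conditions of Definition 7 can be checked mechanically once $\zeta$ and $\sim$ are available. In the sketch below both are hypothetical stand-ins: `related` treats two events as related when they share a subject or object and are ordered in time, and `zeta` scores an event set as malicious only if it contains a fixed chain. Only the connectivity part of Eq. (8) is checked, not the partial-order part.

```python
from itertools import combinations
from typing import NamedTuple


class Event(NamedTuple):
    subject: str
    obj: str
    action: str
    time: int


def related(a, b):
    """Toy stand-in for a ~ b: shared subject or object, and t_a <= t_b."""
    return a.time <= b.time and (a.subject == b.subject or a.obj == b.obj)


def is_connected(vector, rel=related):
    """True if the events cannot be split into two mutually unrelated parts."""
    events = list(vector)
    parent = {e: e for e in events}

    def find(e):
        while parent[e] != e:
            parent[e] = parent[parent[e]]  # path halving
            e = parent[e]
        return e

    for a, b in combinations(events, 2):
        if rel(a, b) or rel(b, a):
            parent[find(a)] = find(b)
    # An empty set yields zero components and is not treated as connected.
    return len({find(e) for e in events}) == 1


def is_minimal(vector, zeta):
    """Eq. (7): removing any single event strictly lowers the maliciousness."""
    vector = set(vector)
    return all(zeta(vector - {e}) < zeta(vector) for e in vector)


chain = {Event("s1", "o1", "R", 1), Event("s1", "o2", "W", 2), Event("s2", "o2", "R", 3)}
noise = Event("s9", "o9", "R", 2)


def zeta(events):
    """Hypothetical score: 1 only if the whole chain is present, else 0."""
    return int(chain <= set(events))
```

With these stand-ins, `chain` is both connected and minimal, while `chain | {noise}` is neither, because the unrelated event can be removed without lowering the score.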

Background and problem statement

As discussed in the introduction, there are various definitions to describe the APTs. In this section, the characteristics of the APTs and the problem to be solved in this paper are defined.

APT characteristics

According to our survey on the behavior and anatomy of nearly 70 real APTs, which are reported by Kaspersky Targeted Cyberattacks Logbook [15], the APTs can be defined by the following characteristics:

Special-purpose: Since the intruders have sensitive information about the victim’s infrastructure, the behaviors of APT attacks are somewhat intelligent. This characteristic means that an APT that is malicious in one infrastructure might be completely benign in another. For example, Stuxnet [19] is an instance of a special-purpose APT, which is malicious after satisfying certain conditions in the victim’s infrastructure (e.g., detecting special patterns in centrifuge falls of victim’s industrial infrastructure), but it is nearly benign in the system of a normal user.

Slow: Since existing security mechanisms use short time-windows (about a few minutes), some APTs (e.g., ProjectSauron APT [20]) abuse this weakness to bypass the detection methods. In this case, the intruders take advantage of some tricks such as using wake-up and sleep functions to distribute their attack vectors in several time-windows (about several months). Note that, in real conditions, the attack duration cannot last very long (e.g., several years); because the software migration in the victim’s infrastructure can cause the attack to fail.

Low-level: In low-level APTs, the explicit violation of security policy is not probable and the attacker usually violates the security policy implicitly by some methods, including:

Using trusted events and agents to perform malicious activities: this method takes advantage of techniques such as malicious code injection into trusted applications (e.g., Gauss APT [15]), stolen digital certificates (e.g., Stuxnet APT [15]), genuine recognized removable media that bypass the data loss prevention (DLP) system (e.g., Project Sauron APT [20]), and human errors to infiltrate the victim’s system.
Performing the malicious actions gradually: some APTs (e.g., Carbanak APT [15]), especially malware that exfiltrates data, steal the sensitive data gradually to hide from intrusion and anomaly detection systems. For example, to exfiltrate 1 GB of data from the victim’s system, the malware breaks the data into many tiny parts (e.g., less than 1 MB each) and exfiltrates them slowly over several days.

Multi-step: In multi-step APTs (e.g., Flame APT [15]), the attack vector is divided into several steps, and activation of each step depends on the success of the previous steps. In these cases, the main challenge is detecting the relations of the steps and constructing the primary attack vector.

Distribution: In such threats, the intruders split the malware attack vector into several sub-vectors, which are executed by different subjects (e.g., different processes, and in some cases different hosts). In such cases, communication between the malware subjects is established through inter-process communication (IPC). Such malware also tries to obfuscate the dependencies between the sub-vectors by mixing fake and unrelated events with the actual events. The main challenge is to identify the actual events semantically, remove the fake events, and summarize the behavior.

Hybrid: Since most intrusion detection and alert correlation systems do not correlate operating system events with network events, the intruders use a combination of both event types to bypass the detection mechanisms. For example, for lateral movement in air-gapped networks, some APTs (e.g., Stuxnet, Hacking Team RCS, and ProjectSauron APTs [15]) use removable media to spread the malware from the Internet to local networks.

It is important to note that most APTs have only some of the six mentioned characteristics (especially low and slow features) and a few sophisticated APTs (e.g., Project Sauron APT [20]) have all of the six characteristics.
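
The gradual exfiltration example given under the low-level characteristic (1 GB split into parts of less than 1 MB) can be made concrete with a little arithmetic. The sketch assumes binary units, a duration of five days for “several days”, and a five-minute detection window for “a few minutes”; all three values are illustrative assumptions.

```python
MB = 1024 ** 2
GB = 1024 ** 3


def exfiltration_profile(total_bytes, chunk_bytes, duration_days, window_minutes):
    """Return (number of chunks, average chunks per detection window)."""
    chunks = -(-total_bytes // chunk_bytes)  # ceiling division
    windows = duration_days * 24 * 60 / window_minutes
    return chunks, chunks / windows


# Assumed values: 1 MB chunks (the upper bound from the text), 5 days, 5-minute window.
chunks, per_window = exfiltration_profile(GB, MB, duration_days=5, window_minutes=5)
```

Under these assumptions the attacker needs at least 1024 transfers spread over 1440 windows, so on average a short window contains less than one small transfer, which is why per-window analysis sees nothing unusual.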

Related works

In recent years, several methods have been proposed to detect APT attacks (e.g., see [12,13,14, 21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38]). These methods suffer from an inability to detect low APTs, a lack of evaluation against well-known APTs such as Stuxnet and Flame, an inability to detect hybrid APTs, and, in most cases, a lack of dependable theoretical foundations. Nevertheless, a few notable works have been published recently, as follows.

Brogi et al. [39] proposed an APT detector called TerminAPTor, which tracks the information flows between the operating system processes. This approach intercepts events of a network system for two months and collects 3.2 billion events and 7.4 attacks per day. The main drawbacks of this approach are the lack of evaluation against well-known APTs and a high number of false positive alerts.

Ghafir et al. [40] proposed a machine learning-based system called MLAPT, which can detect APTs in real-time in three phases. In the first phase, the system analyzes the network traffic and generates some alerts based on some malicious patterns. In the next phase, the generated alerts of the first phase are correlated, and in the third phase, a machine-learning-based prediction is used for APT prediction. Again this approach has not been evaluated against the well-known APTs and it cannot detect the hybrid APTs.

In [16], a general approach is proposed to detect multi-step, hybrid, and low-level APTs. This approach is based on a knowledge-based system, i.e., the ontology of the operating system and network entities, low-level interception, and inference over the security policies and event relationships. The correlation between the operating system and network events in this approach is based on the semantic relations of the entities, which are defined in the system ontology. In this approach, malicious behaviors and implicit violations of security policies are detected by deduction based on the existing knowledge of the occurred events and the various relations between the entities of the machines and network. The main drawback of this approach is its weakness in detecting slow APTs. Since this approach is based on event correlation (instead of alert correlation) and uses an ontology and an inference engine, it suffers from high processing overhead. However, we believe this approach is the best available solution for detecting low-level and hybrid APTs.

Mohamed et al. [38] proposed an approach based on the Adversarial Tactics, Techniques, and Common Knowledge (ATT&CK) matrix for detecting advanced persistent attacks. This approach focuses on detecting APT attacks in the first steps of their malicious activities, and it reduces the detection time of the attack from several months to several minutes.

Problem statement

As discussed in the introduction, attack vectors of APTs have several characteristics such as special-purpose, low-level, hybrid, multi-step, slow, or distributed. Since these characteristics do not necessarily all hold in one APT, we focus on the two most important characteristics, which make detection more difficult: low-level and slow. Therefore, the problem is finding an approach $\varphi : {\mathcal {P}}(Event^{\mathcal {I}}) \rightarrow {\mathcal {P}}(\nu )$ to detect attack vectors such as $\nu _i$ from a set of intercepted events such as $ES \subseteq Event^{\mathcal {I}}$, in which the following conditions hold:

1. $\nu _i$ is low-level: Since APTs try to hide their malicious activities, explicit violation of the security policy is not common in such attacks. Instead, APTs violate the security policy implicitly, and at least one implicit violation of the security policy is caused by attack vector $\nu _i$. In other words [16]:
$$\begin{aligned}&\exists e_i, p_j, ~e_i \in I(\nu _i) \wedge p_j \in SP \wedge object(e_i)=object(p_j) \nonumber \\&\quad \wedge subject(e_i)=subject(p_j) \wedge action(e_i)=action(p_j) \nonumber \\&\quad \wedge \not \exists e_k, e_k \in \nu _i \wedge subject(e_k)=subject(e_i) \wedge \nonumber \\&\quad object(e_k)=object(e_i) \wedge action(e_k)=action(e_i). \end{aligned}$$
(9)
2. $\nu _i$ is slow: As mentioned in the “Introduction” section, APT attacks infiltrate/exfiltrate data to/from the target system slowly in order to hide their malicious behavior. Our research shows that each APT attack lasts from several days [10, 41] to several months [4, 10]. However, the attack cannot last very long (e.g., several years), because software migration or upgrade in the victim’s infrastructure can cause the attack to fail.
3. The complexity of $\varphi$ should be acceptable: Since detecting low-level APT attacks requires processing large event logs, and detecting slow APTs makes the processing problem much harder, proposing an approach that detects both low-level and slow APT attacks in an acceptable time is another main challenge.

Proposed approach

As mentioned in previous sections, the most sophisticated APT attacks are low-level and slow, and these two characteristics make detection difficult for intrusion detection and alert correlation systems. The purpose of this paper is to detect this type of APT attack. In our approach, we enhance our previous solution proposed in [16] to detect slow APT attacks in addition to the low-level ones. Our approach takes advantage of event correlation (instead of alert correlation) and uses the ontology of operating system and network entities, which is specified in this section (“Semantic correlation” section). Since the number of events and event relations increases significantly over time, using semantic correlation leads to massive processing overhead for detecting slow attacks. The other purpose of this paper is to solve this problem.

The architecture of the proposed approach and the process of detecting malicious attack vectors like $\nu _i$ are shown in Figs. 3 and 4, respectively. In our approach, on the client side, the operating system and network events are deeply intercepted, normalized, and sent to the server side. Afterward, in step 3, on the server side, we detect the low-level attacks by using the ABox, TBox, RBox, and an inference engine. Since the number of intercepted events is very large for slow APTs, we cannot use Protégé-OWL [42], as used in [16], for processing and inference. To overcome this problem, we use a scalable inference engine called SANSA [17], which can analyze a large ABox and TBox using Spark. Although SANSA is a good inference engine for processing large event sets, its processing power is limited. Therefore, in step 3, we also use the Event Abstraction concept to reduce the number of events, speed up inference, and detect very slow APTs (whose attack duration is more than one year). In the Event Abstraction process, the old events are kept as an abstracted history instead of being completely discarded. Finally, in step 4, we detect violations of the security policy based on the data inferred in the previous steps and the high-level user-defined security policies.

The components and concepts that are used in the proposed architecture are explained in the rest of this section. Since our approach uses semantic correlation to detect low-level APTs, we first explain the semantic correlation concepts and their limitations.

Semantic correlation

The main concepts of semantic correlation for detecting APT attacks which are retrieved from [16] are as follows:

1. Knowledge Base (KB): In semantic correlation for detecting APT attacks, we employ a knowledge base consisting of the following three boxes:
- TBox: This box defines the system ontology and the relations between the system entities. For example, the ontology of Windows operating system is shown in Fig. 5. As shown in this figure, class Object consists of three subclasses KernelObj, UserObj, and GDIObj. For another example, subject Thread is a part of subject Process.
- ABox: This box consists of four sub-boxes as follows:
  
  Instances or Individual Storage: All instances of the system ontology are stored in Individual Storage. For example, all intercepted events, subject instances, and object instances are stored in this box. In other words, Process $p_i$ and Object $o_j$ are samples of instances, which are stored in Individual Storage.
  
  Memory/Manipulation Storage (MStore): This sub-box uses two functions: Memory or me and Manipulation or ma. Function $me:Subject^{\mathcal {I}} \longrightarrow {\mathcal {P}}(Object^{\mathcal {I}})$ determines the objects that are read explicitly or implicitly by a specific subject. Function $ma:Subject^{\mathcal {I}} \longrightarrow {\mathcal {P}}(Object^{\mathcal {I}})$ determines the objects that are written explicitly or implicitly or deleted by a specific subject. For example $o_i \in me(s_i)$ means subject $s_i$ has read object $o_i$, or $o_j \in ma(s_i)$ means object $o_j$ has been written by subject $s_i$. These two functions are defined for detecting the violation of confidentiality and integrity, respectively.
  
  Security Policy (PStore): The security policy, which is defined in "Preliminaries", is stored in PStore. It is necessary to note that by using the ontology, we can define high-level and more abstract security policies and then infer the low-level security policies. The general format of security policy is shown in Algorithm 1.
  Algorithm 1: General format of high-level security policy [16]
  
  For example, in Fig. 2, the main security policy is $o_1 \notin me(s_2)$, which means subject $s_2$ should not read object $o_1$. For another example, consider Fig. 6: if no data may be exfiltrated from the local network to the Internet, then the security policy can be defined as $LNO(o_i) \wedge PNS(s_j) \longrightarrow o_i \notin me(s_j)$, where:
  
  $LNO \sqsubseteq Object$ and $PNS \sqsubseteq Subject$
  
  In this scenario, data can be exfiltrated using different approaches (e.g., through network buffer or USB drive or CD-ROM). Since we use system ontology, it is not necessary to define several security policies, because all data transmission devices (e.g., network buffer or USB drive or CD-ROM) are a type of PNO objects. For more details about security policy, readers are referred to [16].
  
  Event Relations: Two events can be related to each other by the relations between their subjects, objects, or actions. For example, relation $e_i \sim e_j$, which is defined for two events $e_i$ and $e_j$, is stored in this sub-box. This sub-box contains the events that are related to each other based on some relation rules. The relation rules are described in the rest of this section in the RBox subsection.
- RBox: This box consists of two sub-boxes as follows:
  
  Relation Rules: As mentioned before, two events can be related to each other based on their subjects, objects, or actions. All types of relation rules are described in Table 1. For example, as shown in this table, relation $e_i \overset{wr}{\sim } e_j$ means $a_i=W$ (Write) and $a_j=R$ (Read).
  
  According to this table, we can define approximately 500 relation rules for event correlation (precisely $(3+1) \times 4 \times (6\times 6) = 576$ rules, some of which are meaningless). For example, relation rule $e_i \overset{tewrip}{\sim } e_j$ combines the three event relations $e_i \overset{te}{\sim } e_j$, $e_i \overset{wr}{\sim } e_j$, and $e_i \overset{ip}{\sim } e_j$.
  
  Indirect Access Rules: Some event relations result in indirect change to the value of me and ma of subjects (e.g., as shown in Fig. 2). The related rules, which are used to detect indirect changes to the value of me and ma, are defined as Indirect Access Rules. These rules, which can be used for detecting low-level APTs, are defined in “Expanding: knowledge‑based inference” section.
2. Inference Engine: The inference engine is the component of semantic correlation that uses the information and rules in the knowledge base to infer the event relations and calculate me and ma for each subject. The low performance of inference engines is a considerable limitation of this approach. In the approach proposed in [16], Protégé-OWL [42] is used as the inference engine to perform reasoning based on description logic. The processing power of Protégé-OWL is limited to a knowledge base with several million frames. Hence, Protégé-OWL is not a suitable inference engine for detecting slow APTs.
3. Policy Checker: In the final step, according to the me and ma functions and the security policy, which is stored in PStore and defined based on the system ontology, the system detects violations of the security policy. More details of the Policy Checker are explained in the “Step 3: big event set processing” section.
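
How MStore and the Policy Checker fit together can be sketched as follows. This is a minimal illustration, not the implementation of [16]: the ontology is reduced to a hypothetical dictionary of class memberships, `LNO` and `PNS` are assumed to denote local-network objects and public-network subjects, the action label `"D"` for deletion and all individual names are invented, and only explicit reads and writes are recorded (implicit ones would be added by the inference engine).

```python
from collections import defaultdict

# Hypothetical TBox/ABox fragment: class name -> member individuals.
ONTOLOGY = {
    "LNO": {"payroll.db", "design.doc"},  # assumed: local-network objects
    "PNS": {"browser.exe"},               # assumed: public-network subjects
}


def build_me_ma(events):
    """Fill me (objects read) and ma (objects written or deleted) per subject."""
    me, ma = defaultdict(set), defaultdict(set)
    for subject, obj, action, _time in events:
        if action == "R":
            me[subject].add(obj)
        elif action in ("W", "D"):
            ma[subject].add(obj)
    return me, ma


def check_confidentiality(me, object_class, subject_class, ontology=ONTOLOGY):
    """Violations of: object_class(o) and subject_class(s) -> o not in me(s)."""
    return sorted(
        (s, o)
        for s in ontology[subject_class]
        for o in me.get(s, set()) & ontology[object_class]
    )


events = [
    ("browser.exe", "payroll.db", "R", 1),
    ("editor.exe", "design.doc", "W", 2),
    ("browser.exe", "cache.tmp", "R", 3),
]
me, ma = build_me_ma(events)
violations = check_confidentiality(me, "LNO", "PNS")
```

Because the policy is stated over classes rather than individuals, a single rule covers every object and subject that the ontology places in those classes.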

Table 1 All types of event relations [16]
