CN111738335A

CN111738335A - A neural network-based anomaly detection method for time series data

Info

Publication number: CN111738335A
Application number: CN202010577504.5A
Authority: CN
Inventors: 周春姐; 李阿丽; 张振兴
Original assignee: Ludong University
Current assignee: Ludong University
Priority date: 2020-06-23
Filing date: 2020-06-23
Publication date: 2020-10-02

Abstract

The invention discloses a time series data abnormity detection method based on a neural network, which comprises the following steps: collecting data, and performing an anomaly detection process based on the time series data of the neural network and performance verification of an anomaly detection method. The invention has the advantages that: (1) the anomaly detector does not make any assumption on the potential mechanism of the anomaly type and the anomaly pattern, but can obtain related concepts from the training data set through learning; (2) the anomaly detector does not need to select a threshold value, so that the tedious work of setting the threshold value is avoided, and good anomaly detection performance can be obtained; (3) with the accumulation of the anomaly detection experience, the anomaly detector continuously carries out dynamic improvement to learn new anomalies, so that the knowledge accumulation of the anomaly detection by the anomaly detector is enhanced; (4) the anomaly detection method provided by the invention can be widely applied to the fields of human body disease monitoring, traffic accident discovery, equipment fault diagnosis, network intrusion detection and the like in the industries of health care, intelligent transportation, large-scale production systems and the like.

Description

A neural network-based anomaly detection method for time series data

技术领域technical field

本发明涉及一种时间序列异常检测方法，具体涉及一种基于神经网络的时间序列数据异常检测方法，属于人工智能和大数据应用技术领域。The invention relates to a time series abnormality detection method, in particular to a time series data abnormality detection method based on a neural network, and belongs to the technical field of artificial intelligence and big data application.

背景技术Background technique

异常检测是广泛存在于很多领域的热门话题。例如健康医疗中的人体疾病监测，智能交通中的交通事故发现，大型生产系统中的设备故障诊断、网络入侵检测等众多领域，可见，异常检测非常重要。理想情况下的异常检测方法应该适用于各类不同场景，并且可以轻松操作。Anomaly detection is a hot topic that exists widely in many fields. For example, human disease monitoring in health care, traffic accident detection in intelligent transportation, equipment fault diagnosis in large-scale production systems, network intrusion detection and many other fields, it can be seen that anomaly detection is very important. Ideally, anomaly detection methods should be applicable to a variety of different scenarios and can be easily manipulated.

但是，现有的异常检测方法无法满足需求。文献（Chandola V, Banerjee A, andKumar V. Anomaly detection: A survey. ACM Computing Survey, 41(3): 1-58,2009）对异常检测方法进行了深入总结，发现不同的异常检测方法对异常模式都做出了各种强假设（Gandhimathi L, Murugaboopathi G. A novel hybrid intrusion detectionusing flow-based anomaly detection and cross-layer features in wirelesssensor network. Automatic Control and Computer Sciences, 54(1):62-69, 2020），如基于分布式的方法，但是在假设条件不成立的情况下，异常检测方法可能无法取得令人满意的结果（Feng F, Liu X, and Yong B. Anomaly detection in ad-hoc networksbased on deep learning model: A plug and play device. Ad Hoc Networks, 84:82-89, 2019）。另一方面，异常检测方法并不总是很容易操作。2015年，雅虎发布了他们的时间序列异常检测系统EGADS（Laptev N, Amizadeh S, and Flint I. Generic andscalable framework for automated time-series anomaly detection. ACM SIGKDDInternational Conference on Knowledge Discovery and Data Mining, 1939-1947,2015），在系统内，实施并集成了一组方法以生成异常检测结果。但是如此复杂的系统要求工程师不仅要了解组件，还要理解方法集，以便能够为每个组件调整参数。此外，业界使用的方法很少考虑异常模式的演变（Carreno A, Inza I, and Lozano J. Analyzing rareevent, anomaly, novelty and outlier detection terms under the supervisedclassification framework. Artificial Intelligence Review，53(5): 3575- 3594,2020），这导致静态异常检测参数在动态场景下的性能较差。However, existing anomaly detection methods cannot meet the demand. The literature (Chandola V, Banerjee A, and Kumar V. Anomaly detection: A survey. ACM Computing Survey, 41(3): 1-58, 2009) gave an in-depth summary of anomaly detection methods and found that different anomaly detection methods have Various strong assumptions have been made (Gandhimathi L, Murugaboopathi G. A novel hybrid intrusion detection using flow-based anomaly detection and cross-layer features in wireless sensor network. Automatic Control and Computer Sciences, 54(1):62-69, 2020 ), such as distributed-based methods, but anomaly detection methods may not achieve satisfactory results when assumptions do not hold (Feng F, Liu X, and Yong B. Anomaly detection in ad-hoc networks based on deep learning model: A plug and play device. Ad Hoc Networks, 84:82-89, 2019). On the other hand, anomaly detection methods are not always easy to operate. In 2015, Yahoo released their time series anomaly detection system EGADS (Laptev N, Amizadeh S, and Flint I. Generic andscalable framework for automated time-series anomaly detection. ACM SIGKDDInternational Conference on Knowledge Discovery and Data Mining, 1939-1947, 2015), within the system, a set of methods are implemented and integrated to generate anomaly detection results. But such complex systems require engineers to understand not only the components, but also the methodology set to be able to tune parameters for each component. Furthermore, methods used in the industry rarely consider the evolution of anomalous patterns (Carreno A, Inza I, and Lozano J. Analyzing rareevent, anomaly, novelty and outlier detection terms under the supervisedclassification framework. Artificial Intelligence Review, 53(5): 3575- 3594, 2020), which leads to poor performance of static anomaly detection parameters in dynamic scenarios.

综上，现有方法存在以下不足：To sum up, the existing methods have the following shortcomings:

（i）现有异常检测方法对异常类型和异常模式都做出了各种强假设，当假设条件不满足时，无法得到满意的结果；(i) Existing anomaly detection methods make various strong assumptions on both the type of anomaly and the anomaly pattern, and when the assumptions are not satisfied, satisfactory results cannot be obtained;

（ii）现有的异常检测方法构建复杂，需要频繁调整参数，可操作性不强；(ii) The existing anomaly detection methods are complicated to construct, require frequent parameter adjustment, and are not very maneuverable;

（iii）现有方法很少考虑异常模式的演变，在动态场景下的性能较差。(iii) Existing methods rarely consider the evolution of anomalous patterns and perform poorly in dynamic scenarios.

基于此，本发明提出了一种基于神经网络的异常检测方法，不对异常模式的潜在机制做出任何假设；避免阈值设置的繁琐工作，以获得良好的异常检测性能；随着异常检测经验的增长而不断学习改进，从而对时间序列数据做出异常检测。Based on this, the present invention proposes a neural network-based anomaly detection method, which does not make any assumptions about the underlying mechanism of anomaly patterns; avoids the tedious work of threshold setting to obtain good anomaly detection performance; with the growth of anomaly detection experience And continuous learning and improvement, so as to make anomaly detection on time series data.

发明内容SUMMARY OF THE INVENTION

为了消除突发事件或紧急状况等引起的异常数据，本发明提出了一种基于神经网络的时间序列数据异常检测方法，通过使用强化学习框架不断进行培训。异常检测方法的过程描述如图2所示，该过程包括三个主要组件：异常检测器，累积的异常值和经验值，它们都是自学习的。In order to eliminate abnormal data caused by emergencies or emergencies, the present invention proposes a time series data abnormality detection method based on neural network, which is continuously trained by using a reinforcement learning framework. The process description of the anomaly detection method is shown in Fig. 2. The process consists of three main components: an anomaly detector, accumulated outliers, and empirical values, which are all self-learning.

为了实现上述目标，本发明采用如下的技术方案：In order to achieve above-mentioned goal, the present invention adopts following technical scheme:

一种基于神经网络的时间序列数据异常检测方法，其特征在于，包括以下步骤：A method for detecting abnormality in time series data based on neural network, characterized in that it comprises the following steps:

步骤1：收集数据Step 1: Collect data

收集具有一定持续时间的某类时间序列数据，和各种可能引起数据突变的事件，然后通过移动终端或其他设备将收集的数据上传到云端，此部分为我们的现有技术；Collect certain types of time series data with a certain duration and various events that may cause data mutation, and then upload the collected data to the cloud through mobile terminals or other devices. This part is our existing technology;

步骤2：基于神经网络的时间序列数据异常检测过程Step 2: Anomaly detection process of time series data based on neural network

本发明提出了一种时间序列数据异常检测方法，异常检测器由递归神经网络驱动，并采用强化学习方法来实现自学习过程，例如通过采用X学习方法来训练用于估计X(z,a)的循环神经网络，通过从经验值中自我学习从而不断改进异常检测器，获得良好的异常检测性能。所提出的异常检测方法具有以下特征：不对异常类型和模式做任何假设，不需要选择阈值，随着异常检测经验的增长而不断学习改进，从而可以广泛应用于众多领域；The present invention proposes a time series data anomaly detection method. The anomaly detector is driven by a recurrent neural network and adopts a reinforcement learning method to realize the self-learning process. For example, the X learning method is used for training to estimate X(z, a) The recurrent neural network can continuously improve the anomaly detector by self-learning from the experience value, and obtain good anomaly detection performance. The proposed anomaly detection method has the following characteristics: it does not make any assumptions about the type and pattern of anomalies, does not need to select a threshold, and continuously learns and improves with the growth of anomaly detection experience, so it can be widely used in many fields;

步骤3：异常检测方法的性能验证Step 3: Performance Verification of Anomaly Detection Method

我们对多种类型时间序列数据集进行了异常检测（如慢性心力衰竭患者的体征数据、公交网络的服务数据、生产系统中的设备能源数据等），以消除由突发事件或紧急状况引起的异常数据，首先通过训练某持续时间段的数据记录集，得到异常检测经验，然后在实时数据集中验证了本发明中异常检测方法的性能，该方法能够识别目标时间序列的均值，点异常和异常模式的偏移，在测试数据集中能够获得高质量的结果，其准确度约为100％。We performed anomaly detection on multiple types of time series datasets (such as physical signs data of chronic heart failure patients, service data of public transport networks, equipment energy data in production systems, etc.) For abnormal data, firstly, the abnormality detection experience is obtained by training the data record set of a certain duration, and then the performance of the abnormality detection method in the present invention is verified in the real-time data set. The method can identify the mean value, point abnormality and abnormality of the target time series Shifting of the pattern, high-quality results can be obtained in the test dataset with an accuracy of about 100%.

本发明的有益之处在于：The benefits of the present invention are:

（1）异常检测器不对异常类型、异常模式的潜在机制做任何假设，但可以通过学习从训练数据集中获得相关概念；(1) The anomaly detector does not make any assumptions about the underlying mechanism of anomaly types and anomaly patterns, but can obtain relevant concepts from the training dataset through learning;

（2）异常检测器不需要选择阈值，避免了阈值设置的繁琐工作，从而可以获得良好的异常检测性能；(2) The anomaly detector does not need to select a threshold, which avoids the tedious work of threshold setting, so that good anomaly detection performance can be obtained;

（3）随着异常检测经验的积累，异常检测器不断进行动态改进，学习新的异常，从而增强其对异常检测的知识积累；(3) With the accumulation of anomaly detection experience, the anomaly detector continuously improves dynamically and learns new anomalies, thereby enhancing its knowledge accumulation for anomaly detection;

（4）本发明所提的异常检测方法可以广泛应用于健康医疗、智能交通和大型生产系统等行业中的人体疾病监测、交通事故发现、设备故障诊断、网络入侵检测等众多领域。(4) The anomaly detection method proposed in the present invention can be widely used in many fields such as human disease monitoring, traffic accident detection, equipment fault diagnosis, network intrusion detection, etc. in industries such as health care, intelligent transportation, and large-scale production systems.

附图说明Description of drawings

图1是我们的真实科研项目HeartCarer截图；Figure 1 is a screenshot of our real scientific research project HeartCarer;

图2是基于神经网络的时间序列数据异常检测过程；Fig. 2 is the abnormal detection process of time series data based on neural network;

图3是异常检测方法的性能验证。Figure 3 is the performance verification of the anomaly detection method.

具体实施方式Detailed ways

以下结合附图和具体实施例对本发明作具体的介绍。The present invention will be described in detail below with reference to the accompanying drawings and specific embodiments.

以慢性心力衰竭患者体征监测数据为例。Take the monitoring data of signs of chronic heart failure patients as an example.

一、收集数据1. Data collection

通过可穿戴技术收集慢性心力衰竭患者的各种生理体征数据（例如：心率、血压、血糖等）和各类可能引起体征数据突变的事件（例如：情绪变化，饮食情况，精神压力，过度体力消耗，环境因素等），然后通过移动终端或电话线将收集的数据上传到云端。此部分为我们的现有技术。Collect various physiological signs data (such as heart rate, blood pressure, blood sugar, etc.) of patients with chronic heart failure through wearable technology and various events that may cause sudden changes in sign data (such as emotional changes, diet, mental stress, excessive physical exertion) , environmental factors, etc.), and then upload the collected data to the cloud through a mobile terminal or phone line. This part is our prior art.

在本具体实施例中，数据来自于我们的一个真实科研项目——HeartCarer，如图1所示。这是一个面向家庭的远程监控系统，基于云平台，专门用于监控慢性心力衰竭患者并及时进行干预。该远程监控系统通过可穿戴技术监测慢性心力衰竭患者的各种生理体征数据（具体有：心率、血压、血糖等）和各类可能引起体征数据突变的事件（例如：情绪变化，饮食情况，精神压力，过度体力消耗，环境因素等），并通过移动终端或电话线将监测到的数据上传到云端。In this specific embodiment, the data comes from one of our real scientific research projects, HeartCarer, as shown in Figure 1. This is a home-oriented remote monitoring system, based on a cloud platform, dedicated to monitoring chronic heart failure patients and intervening in a timely manner. The remote monitoring system monitors various physiological signs data (specifically: heart rate, blood pressure, blood sugar, etc.) of patients with chronic heart failure through wearable technology and various events that may cause sudden changes in sign data (for example: mood changes, diet, mental stress, excessive physical exertion, environmental factors, etc.), and upload the monitored data to the cloud through a mobile terminal or phone line.

该远程监控系统已经应用于中国6家医疗机构的2607名慢性心力衰竭患者的临床观察研究中。这些慢性心力衰竭患者分别在2015年至2019年期间接受了护理，其中多数超过60岁（63.8±12岁），并且大部分是男性（占70％），这些人的各类信息数据量超过100GB。The remote monitoring system has been applied in a clinical observational study of 2,607 chronic heart failure patients in 6 medical institutions in China. These chronic heart failure patients received care from 2015 to 2019, most of them were over 60 years old (63.8 ± 12 years old), and most of them were male (70%), and the amount of various information data of these people exceeded 100GB .

我们使用OrientDB Cluster来存储大规模矩阵图，使用HBase作为顶点属性，使用Hadoop MR进行数据分析和计算。该集群包括8台运行CentOS 7.4操作系统的服务器，配备12核（24线程）Intel Xeon CPU，运行频率为2.80 GHz，内存为64 GB。We use OrientDB Cluster to store large-scale matrix graphs, HBase for vertex attributes, and Hadoop MR for data analysis and computation. The cluster consists of 8 servers running the CentOS 7.4 operating system with 12-core (24-thread) Intel Xeon CPUs running at 2.80 GHz and 64 GB of memory.

二、基于神经网络的时间序列数据异常检测过程2. Anomaly detection process of time series data based on neural network

本发明提出了一种时间序列数据异常检测方法，异常检测器由递归神经网络驱动，并采用强化学习方法来实现自学习过程。具体的异常检测过程如图2所示，该过程包括三个主要组件：异常检测器，累积的异常值和经验值，其关系描述如下：The invention proposes a time series data abnormality detection method. The abnormality detector is driven by a recurrent neural network, and adopts a reinforcement learning method to realize the self-learning process. The specific anomaly detection process is shown in Figure 2. The process includes three main components: anomaly detector, accumulated outliers and empirical values, and the relationship is described as follows:

经验值Y是一组元组，每个元组表示为

，其中

分别表示在给定时间点有事件a和没有事件a的相应数据记录，r是由事件a引起的瞬时异常。这些事件是由异常检测方法中的异常检测器发现的，因此经验值记录了异常检测器的所有行为。The empirical value Y is a set of tuples, each represented as

,in

represent the corresponding data records with and without event a at a given point in time, respectively, and r is the instantaneous anomaly caused by event a. These events are discovered by the anomaly detector in the anomaly detection method, so the empirical value records all the behavior of the anomaly detector.

我们用条件概率分布π:= p(A|Z)来表示异常检测器，其中A和Z分别是项目中的事件集合和实际数据记录。通常A={0,1}，其中1表示当前的数据记录存在异常，0表示没有异常。公式π(z, a) = p(A=a|Z=z)表示某特定数据记录z存在事件异常的概率。We denote the anomaly detector by the conditional probability distribution π:= p(A|Z), where A and Z are the set of events and actual data records in the project, respectively. Usually A={0,1}, where 1 indicates that the current data record has an exception, and 0 indicates that there is no exception. The formula π(z, a) = p(A=a|Z=z) represents the probability that a particular data record z has an event anomaly.

本发明使用异常检测能力来测量异常检测器的性能

，其定义为：The present invention uses anomaly detection capabilities to measure the performance of anomaly detectors

, which is defined as:

,

其中

是具有异常检测器π的实际数据记录z的概率，而X(z,a)表示在事件a作用下从数据记录z开始的累积异常。也就是说，性能是在使用异常检测器π情况下的平均累积异常。in

is the probability of the actual data record z with an anomaly detector π, and X(z, a) represents the cumulative anomaly starting from the data record z under the action of event a. That is, the performance is the average cumulative anomaly using the anomaly detector π.

如果检测器满足下列条件If the detector meets the following conditions

，

,

本质上该检测器就是最优异常检测器，它可以最大化性能。同时，对于所有

，

大致相同。如果

成立，则π(z,a)=1。也就是说，最优异常检测器π^*完全由累积异常函数X(z,a)确定。Essentially this detector is an optimal anomaly detector that maximizes performance. At the same time, for all

,

roughly the same. if

is established, then π(z, a)=1. That is, the optimal anomaly detector π ^* is completely determined by the cumulative anomaly function X(z,a).

根据上述过程描述，经验值可用于更好地估计X(z,a)，可以通过从经验值中自我学习从而不断改进异常检测器，并且可以采用X学习方法来训练用于估计X(z,a)的循环神经网络，获得良好的异常检测性能。总之，所提出的异常检测方法具有以下特征：（1）不对异常类型和模式做任何假设；（2）不需要选择阈值；（3）随着异常检测经验的增长而不断学习改进，从而可以广泛应用于众多领域。According to the above process description, empirical values can be used to better estimate X(z, a), anomaly detectors can be continuously improved by self-learning from empirical values, and X learning methods can be used to train for estimating X(z, a) a) of the recurrent neural network to obtain good anomaly detection performance. In summary, the proposed anomaly detection method has the following characteristics: (1) does not make any assumptions about anomaly types and patterns; (2) does not need to select a threshold; (3) continuously learns and improves with the growth of anomaly detection experience, so that it can be widely used Applied to many fields.

三、异常检测方法的性能验证3. Performance Verification of Anomaly Detection Methods

如步骤1所述，用于训练的数据集是HeartCarer基准数据集，其中包括2607名慢性心力衰竭患者的各种生理体征数据，各类信息数据量超过100GB。As mentioned in step 1, the dataset used for training is the HeartCarer benchmark dataset, which includes various physiological sign data of 2607 chronic heart failure patients, and the amount of various information data exceeds 100GB.

使用滑动窗口方法将每个时间序列转换为一组多维数据实例。X学习中的事件为

，其中0表示没有异常，1表示有异常。为了增强模型训练的过程，使用了二叉树策略，即通过对先前的数据记录z执行不同的操作0和1，将生成的两个数据记录

和

都添加到用于训练的经验集中。也就是说，在训练的过程中将两条记录

和

添加到经验集中。通过在数据集Z中执行不同的操作，我们可以获得奖励r₀和r₁。Transform each time series into a set of multidimensional data instances using a sliding window approach. The events in X learning are

, where 0 means no exception and 1 means there is an exception. To enhance the process of model training, a binary tree strategy is used, that is, by performing

different operations

0 and 1 on the previous data record z, the two data records generated are

and

are added to the experience set used for training. That is to say, during the training process, the two records are

and

Added to experience set. By performing different operations in dataset Z, we can obtain rewards r ₀ and r ₁ .

如图3所示，我们对慢性心力衰竭患者的各类体征数据集进行异常检测，以消除由情绪变化或环境因素等引起的异常数据。首先通过训练前三年的体征数据记录集，得到异常检测经验，然后在实时体征数据集中验证了本发明中异常检测方法的性能，图3（a）和3（b）分别显示了在心率指标和血压血糖指标的异常检测性能。As shown in Figure 3, we perform anomaly detection on various sign datasets of chronic heart failure patients to eliminate anomalous data caused by emotional changes or environmental factors, etc. First, the abnormality detection experience is obtained by training the first three-year physical sign data record set, and then the performance of the abnormality detection method in the present invention is verified in the real-time physical sign data set. and abnormal detection performance of blood pressure and blood glucose indicators.

从图3中我们可以看到：From Figure 3 we can see:

（1）灰线表示原始体征数据记录，黑线表示异常检测中的突发事件；(1) The gray line represents the original sign data record, and the black line represents the emergencies in anomaly detection;

（2）心率指标与慢性心力衰竭有直接关联，其测试时间间隔设为120分钟，血压血糖等指标有间接关联，其间隔设为240分钟；(2) Heart rate indicators are directly related to chronic heart failure, and the test time interval is set to 120 minutes, and indicators such as blood pressure and blood sugar are indirectly related, and the interval is set to 240 minutes;

（3）本发明中的异常检测方法能够识别目标时间序列的均值，点异常和异常模式的偏移；(3) The anomaly detection method in the present invention can identify the mean value of the target time series, the point anomaly and the offset of the anomaly pattern;

（4）该异常检测方法在测试数据集中能够获得高质量的结果，其准确度约为100％。(4) This anomaly detection method is able to obtain high-quality results in the test dataset with an accuracy of about 100%.

需要说明的是，上述实施例不以任何形式限制本发明，凡采用等同替换或等效变换的方式所获得的技术方案，均落在本发明的保护范围内。It should be noted that the above embodiments do not limit the present invention in any form, and all technical solutions obtained by means of equivalent replacement or equivalent transformation fall within the protection scope of the present invention.

Claims

1. a method for detecting abnormality in time series data based on neural network, is characterized in that, comprises the following steps:

Step 1: Collect data

Collect certain types of time series data with a certain duration and various events that may cause data mutation, and then upload the collected data to the cloud through mobile terminals or other devices. This part is our existing technology;

Step 2: Anomaly detection process of time series data based on neural network

The present invention proposes a time series data anomaly detection method based on neural network, the anomaly detector is driven by a recurrent neural network, and adopts a reinforcement learning method to realize the self-learning process, for example, by using the X learning method to train to estimate X ( z, a) of the recurrent neural network, the abnormality detector does not make any assumptions about abnormality types and abnormality patterns, and continuously improves the abnormality detector by self-learning from experience values. The abnormality detection method proposed in the present invention can be widely used in health care , human disease monitoring, traffic accident detection, equipment fault diagnosis, network intrusion detection and many other fields in industries such as intelligent transportation and large-scale production systems;

Step 3: Performance Verification of Anomaly Detection Method

We performed anomaly detection on multiple types of time series datasets (such as physical signs data of chronic heart failure patients, service data of public transport networks, equipment energy data in production systems, etc.) For abnormal data, firstly, the abnormality detection experience is obtained by training the data record set of a certain duration, and then the performance of the abnormality detection method in the present invention is verified in the real-time data set. The method can identify the mean value, point abnormality and abnormality of the target time series Shifting of the pattern, high-quality results can be obtained in the test dataset with an accuracy of about 100%.

2 . The method for detecting anomalies in time series data based on neural network according to claim 1 , wherein, in step 2, the anomaly detector does not make any assumptions about anomaly types and anomaly patterns. 3 .

3. The method for detecting anomaly in time series data based on neural network according to claim 1, characterized in that, in step 3, the anomaly detector does not need to select a threshold value and does not need to adjust parameters.