[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN111738335A - A neural network-based anomaly detection method for time series data - Google Patents

A neural network-based anomaly detection method for time series data Download PDF

Info

Publication number
CN111738335A
CN111738335A CN202010577504.5A CN202010577504A CN111738335A CN 111738335 A CN111738335 A CN 111738335A CN 202010577504 A CN202010577504 A CN 202010577504A CN 111738335 A CN111738335 A CN 111738335A
Authority
CN
China
Prior art keywords
anomaly
time series
data
neural network
anomaly detection
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010577504.5A
Other languages
Chinese (zh)
Inventor
周春姐
李阿丽
张振兴
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ludong University
Original Assignee
Ludong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ludong University filed Critical Ludong University
Priority to CN202010577504.5A priority Critical patent/CN111738335A/en
Publication of CN111738335A publication Critical patent/CN111738335A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/243Classification techniques relating to the number of classes
    • G06F18/2433Single-class perspective, e.g. one-against-all classification; Novelty detection; Outlier detection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computational Linguistics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Measuring And Recording Apparatus For Diagnosis (AREA)

Abstract

The invention discloses a time series data abnormity detection method based on a neural network, which comprises the following steps: collecting data, and performing an anomaly detection process based on the time series data of the neural network and performance verification of an anomaly detection method. The invention has the advantages that: (1) the anomaly detector does not make any assumption on the potential mechanism of the anomaly type and the anomaly pattern, but can obtain related concepts from the training data set through learning; (2) the anomaly detector does not need to select a threshold value, so that the tedious work of setting the threshold value is avoided, and good anomaly detection performance can be obtained; (3) with the accumulation of the anomaly detection experience, the anomaly detector continuously carries out dynamic improvement to learn new anomalies, so that the knowledge accumulation of the anomaly detection by the anomaly detector is enhanced; (4) the anomaly detection method provided by the invention can be widely applied to the fields of human body disease monitoring, traffic accident discovery, equipment fault diagnosis, network intrusion detection and the like in the industries of health care, intelligent transportation, large-scale production systems and the like.

Description

一种基于神经网络的时间序列数据异常检测方法A neural network-based anomaly detection method for time series data

技术领域technical field

本发明涉及一种时间序列异常检测方法,具体涉及一种基于神经网络的时间序列数据异常检测方法,属于人工智能和大数据应用技术领域。The invention relates to a time series abnormality detection method, in particular to a time series data abnormality detection method based on a neural network, and belongs to the technical field of artificial intelligence and big data application.

背景技术Background technique

异常检测是广泛存在于很多领域的热门话题。例如健康医疗中的人体疾病监测,智能交通中的交通事故发现,大型生产系统中的设备故障诊断、网络入侵检测等众多领域,可见,异常检测非常重要。理想情况下的异常检测方法应该适用于各类不同场景,并且可以轻松操作。Anomaly detection is a hot topic that exists widely in many fields. For example, human disease monitoring in health care, traffic accident detection in intelligent transportation, equipment fault diagnosis in large-scale production systems, network intrusion detection and many other fields, it can be seen that anomaly detection is very important. Ideally, anomaly detection methods should be applicable to a variety of different scenarios and can be easily manipulated.

但是,现有的异常检测方法无法满足需求。文献(Chandola V, Banerjee A, andKumar V. Anomaly detection: A survey. ACM Computing Survey, 41(3): 1-58,2009)对异常检测方法进行了深入总结,发现不同的异常检测方法对异常模式都做出了各种强假设(Gandhimathi L, Murugaboopathi G. A novel hybrid intrusion detectionusing flow-based anomaly detection and cross-layer features in wirelesssensor network. Automatic Control and Computer Sciences, 54(1):62-69, 2020),如基于分布式的方法,但是在假设条件不成立的情况下,异常检测方法可能无法取得令人满意的结果(Feng F, Liu X, and Yong B. Anomaly detection in ad-hoc networksbased on deep learning model: A plug and play device. Ad Hoc Networks, 84:82-89, 2019)。另一方面,异常检测方法并不总是很容易操作。2015年,雅虎发布了他们的时间序列异常检测系统EGADS(Laptev N, Amizadeh S, and Flint I. Generic andscalable framework for automated time-series anomaly detection. ACM SIGKDDInternational Conference on Knowledge Discovery and Data Mining, 1939-1947,2015),在系统内,实施并集成了一组方法以生成异常检测结果。但是如此复杂的系统要求工程师不仅要了解组件,还要理解方法集,以便能够为每个组件调整参数。此外,业界使用的方法很少考虑异常模式的演变(Carreno A, Inza I, and Lozano J. Analyzing rareevent, anomaly, novelty and outlier detection terms under the supervisedclassification framework. Artificial Intelligence Review,53(5): 3575- 3594,2020),这导致静态异常检测参数在动态场景下的性能较差。However, existing anomaly detection methods cannot meet the demand. The literature (Chandola V, Banerjee A, and Kumar V. Anomaly detection: A survey. ACM Computing Survey, 41(3): 1-58, 2009) gave an in-depth summary of anomaly detection methods and found that different anomaly detection methods have Various strong assumptions have been made (Gandhimathi L, Murugaboopathi G. A novel hybrid intrusion detection using flow-based anomaly detection and cross-layer features in wireless sensor network. Automatic Control and Computer Sciences, 54(1):62-69, 2020 ), such as distributed-based methods, but anomaly detection methods may not achieve satisfactory results when assumptions do not hold (Feng F, Liu X, and Yong B. Anomaly detection in ad-hoc networks based on deep learning model: A plug and play device. Ad Hoc Networks, 84:82-89, 2019). On the other hand, anomaly detection methods are not always easy to operate. In 2015, Yahoo released their time series anomaly detection system EGADS (Laptev N, Amizadeh S, and Flint I. Generic andscalable framework for automated time-series anomaly detection. ACM SIGKDDInternational Conference on Knowledge Discovery and Data Mining, 1939-1947, 2015), within the system, a set of methods are implemented and integrated to generate anomaly detection results. But such complex systems require engineers to understand not only the components, but also the methodology set to be able to tune parameters for each component. Furthermore, methods used in the industry rarely consider the evolution of anomalous patterns (Carreno A, Inza I, and Lozano J. Analyzing rareevent, anomaly, novelty and outlier detection terms under the supervisedclassification framework. Artificial Intelligence Review, 53(5): 3575- 3594, 2020), which leads to poor performance of static anomaly detection parameters in dynamic scenarios.

综上,现有方法存在以下不足:To sum up, the existing methods have the following shortcomings:

(i)现有异常检测方法对异常类型和异常模式都做出了各种强假设,当假设条件不满足时,无法得到满意的结果;(i) Existing anomaly detection methods make various strong assumptions on both the type of anomaly and the anomaly pattern, and when the assumptions are not satisfied, satisfactory results cannot be obtained;

(ii)现有的异常检测方法构建复杂,需要频繁调整参数,可操作性不强;(ii) The existing anomaly detection methods are complicated to construct, require frequent parameter adjustment, and are not very maneuverable;

(iii)现有方法很少考虑异常模式的演变,在动态场景下的性能较差。(iii) Existing methods rarely consider the evolution of anomalous patterns and perform poorly in dynamic scenarios.

基于此,本发明提出了一种基于神经网络的异常检测方法,不对异常模式的潜在机制做出任何假设;避免阈值设置的繁琐工作,以获得良好的异常检测性能;随着异常检测经验的增长而不断学习改进,从而对时间序列数据做出异常检测。Based on this, the present invention proposes a neural network-based anomaly detection method, which does not make any assumptions about the underlying mechanism of anomaly patterns; avoids the tedious work of threshold setting to obtain good anomaly detection performance; with the growth of anomaly detection experience And continuous learning and improvement, so as to make anomaly detection on time series data.

发明内容SUMMARY OF THE INVENTION

为了消除突发事件或紧急状况等引起的异常数据,本发明提出了一种基于神经网络的时间序列数据异常检测方法,通过使用强化学习框架不断进行培训。异常检测方法的过程描述如图2所示,该过程包括三个主要组件:异常检测器,累积的异常值和经验值,它们都是自学习的。In order to eliminate abnormal data caused by emergencies or emergencies, the present invention proposes a time series data abnormality detection method based on neural network, which is continuously trained by using a reinforcement learning framework. The process description of the anomaly detection method is shown in Fig. 2. The process consists of three main components: an anomaly detector, accumulated outliers, and empirical values, which are all self-learning.

为了实现上述目标,本发明采用如下的技术方案:In order to achieve above-mentioned goal, the present invention adopts following technical scheme:

一种基于神经网络的时间序列数据异常检测方法,其特征在于,包括以下步骤:A method for detecting abnormality in time series data based on neural network, characterized in that it comprises the following steps:

步骤1:收集数据Step 1: Collect data

收集具有一定持续时间的某类时间序列数据,和各种可能引起数据突变的事件,然后通过移动终端或其他设备将收集的数据上传到云端,此部分为我们的现有技术;Collect certain types of time series data with a certain duration and various events that may cause data mutation, and then upload the collected data to the cloud through mobile terminals or other devices. This part is our existing technology;

步骤2:基于神经网络的时间序列数据异常检测过程Step 2: Anomaly detection process of time series data based on neural network

本发明提出了一种时间序列数据异常检测方法,异常检测器由递归神经网络驱动,并采用强化学习方法来实现自学习过程,例如通过采用X学习方法来训练用于估计X(z,a)的循环神经网络,通过从经验值中自我学习从而不断改进异常检测器,获得良好的异常检测性能。所提出的异常检测方法具有以下特征:不对异常类型和模式做任何假设,不需要选择阈值,随着异常检测经验的增长而不断学习改进,从而可以广泛应用于众多领域;The present invention proposes a time series data anomaly detection method. The anomaly detector is driven by a recurrent neural network and adopts a reinforcement learning method to realize the self-learning process. For example, the X learning method is used for training to estimate X(z, a) The recurrent neural network can continuously improve the anomaly detector by self-learning from the experience value, and obtain good anomaly detection performance. The proposed anomaly detection method has the following characteristics: it does not make any assumptions about the type and pattern of anomalies, does not need to select a threshold, and continuously learns and improves with the growth of anomaly detection experience, so it can be widely used in many fields;

步骤3:异常检测方法的性能验证Step 3: Performance Verification of Anomaly Detection Method

我们对多种类型时间序列数据集进行了异常检测(如慢性心力衰竭患者的体征数据、公交网络的服务数据、生产系统中的设备能源数据等),以消除由突发事件或紧急状况引起的异常数据,首先通过训练某持续时间段的数据记录集,得到异常检测经验,然后在实时数据集中验证了本发明中异常检测方法的性能,该方法能够识别目标时间序列的均值,点异常和异常模式的偏移,在测试数据集中能够获得高质量的结果,其准确度约为100%。We performed anomaly detection on multiple types of time series datasets (such as physical signs data of chronic heart failure patients, service data of public transport networks, equipment energy data in production systems, etc.) For abnormal data, firstly, the abnormality detection experience is obtained by training the data record set of a certain duration, and then the performance of the abnormality detection method in the present invention is verified in the real-time data set. The method can identify the mean value, point abnormality and abnormality of the target time series Shifting of the pattern, high-quality results can be obtained in the test dataset with an accuracy of about 100%.

本发明的有益之处在于:The benefits of the present invention are:

(1)异常检测器不对异常类型、异常模式的潜在机制做任何假设,但可以通过学习从训练数据集中获得相关概念;(1) The anomaly detector does not make any assumptions about the underlying mechanism of anomaly types and anomaly patterns, but can obtain relevant concepts from the training dataset through learning;

(2)异常检测器不需要选择阈值,避免了阈值设置的繁琐工作,从而可以获得良好的异常检测性能;(2) The anomaly detector does not need to select a threshold, which avoids the tedious work of threshold setting, so that good anomaly detection performance can be obtained;

(3)随着异常检测经验的积累,异常检测器不断进行动态改进,学习新的异常,从而增强其对异常检测的知识积累;(3) With the accumulation of anomaly detection experience, the anomaly detector continuously improves dynamically and learns new anomalies, thereby enhancing its knowledge accumulation for anomaly detection;

(4)本发明所提的异常检测方法可以广泛应用于健康医疗、智能交通和大型生产系统等行业中的人体疾病监测、交通事故发现、设备故障诊断、网络入侵检测等众多领域。(4) The anomaly detection method proposed in the present invention can be widely used in many fields such as human disease monitoring, traffic accident detection, equipment fault diagnosis, network intrusion detection, etc. in industries such as health care, intelligent transportation, and large-scale production systems.

附图说明Description of drawings

图1是我们的真实科研项目HeartCarer截图;Figure 1 is a screenshot of our real scientific research project HeartCarer;

图2是基于神经网络的时间序列数据异常检测过程;Fig. 2 is the abnormal detection process of time series data based on neural network;

图3是异常检测方法的性能验证。Figure 3 is the performance verification of the anomaly detection method.

具体实施方式Detailed ways

以下结合附图和具体实施例对本发明作具体的介绍。The present invention will be described in detail below with reference to the accompanying drawings and specific embodiments.

以慢性心力衰竭患者体征监测数据为例。Take the monitoring data of signs of chronic heart failure patients as an example.

一、收集数据1. Data collection

通过可穿戴技术收集慢性心力衰竭患者的各种生理体征数据(例如:心率、血压、血糖等)和各类可能引起体征数据突变的事件(例如:情绪变化,饮食情况,精神压力,过度体力消耗,环境因素等),然后通过移动终端或电话线将收集的数据上传到云端。此部分为我们的现有技术。Collect various physiological signs data (such as heart rate, blood pressure, blood sugar, etc.) of patients with chronic heart failure through wearable technology and various events that may cause sudden changes in sign data (such as emotional changes, diet, mental stress, excessive physical exertion) , environmental factors, etc.), and then upload the collected data to the cloud through a mobile terminal or phone line. This part is our prior art.

在本具体实施例中,数据来自于我们的一个真实科研项目——HeartCarer,如图1所示。这是一个面向家庭的远程监控系统,基于云平台,专门用于监控慢性心力衰竭患者并及时进行干预。该远程监控系统通过可穿戴技术监测慢性心力衰竭患者的各种生理体征数据(具体有:心率、血压、血糖等)和各类可能引起体征数据突变的事件(例如:情绪变化,饮食情况,精神压力,过度体力消耗,环境因素等),并通过移动终端或电话线将监测到的数据上传到云端。In this specific embodiment, the data comes from one of our real scientific research projects, HeartCarer, as shown in Figure 1. This is a home-oriented remote monitoring system, based on a cloud platform, dedicated to monitoring chronic heart failure patients and intervening in a timely manner. The remote monitoring system monitors various physiological signs data (specifically: heart rate, blood pressure, blood sugar, etc.) of patients with chronic heart failure through wearable technology and various events that may cause sudden changes in sign data (for example: mood changes, diet, mental stress, excessive physical exertion, environmental factors, etc.), and upload the monitored data to the cloud through a mobile terminal or phone line.

该远程监控系统已经应用于中国6家医疗机构的2607名慢性心力衰竭患者的临床观察研究中。这些慢性心力衰竭患者分别在2015年至2019年期间接受了护理,其中多数超过60岁(63.8±12岁),并且大部分是男性(占70%),这些人的各类信息数据量超过100GB。The remote monitoring system has been applied in a clinical observational study of 2,607 chronic heart failure patients in 6 medical institutions in China. These chronic heart failure patients received care from 2015 to 2019, most of them were over 60 years old (63.8 ± 12 years old), and most of them were male (70%), and the amount of various information data of these people exceeded 100GB .

我们使用OrientDB Cluster来存储大规模矩阵图,使用HBase作为顶点属性,使用Hadoop MR进行数据分析和计算。该集群包括8台运行CentOS 7.4操作系统的服务器,配备12核(24线程)Intel Xeon CPU,运行频率为2.80 GHz,内存为64 GB。We use OrientDB Cluster to store large-scale matrix graphs, HBase for vertex attributes, and Hadoop MR for data analysis and computation. The cluster consists of 8 servers running the CentOS 7.4 operating system with 12-core (24-thread) Intel Xeon CPUs running at 2.80 GHz and 64 GB of memory.

二、基于神经网络的时间序列数据异常检测过程2. Anomaly detection process of time series data based on neural network

本发明提出了一种时间序列数据异常检测方法,异常检测器由递归神经网络驱动,并采用强化学习方法来实现自学习过程。具体的异常检测过程如图2所示,该过程包括三个主要组件:异常检测器,累积的异常值和经验值,其关系描述如下:The invention proposes a time series data abnormality detection method. The abnormality detector is driven by a recurrent neural network, and adopts a reinforcement learning method to realize the self-learning process. The specific anomaly detection process is shown in Figure 2. The process includes three main components: anomaly detector, accumulated outliers and empirical values, and the relationship is described as follows:

经验值Y是一组元组,每个元组表示为

Figure 575741DEST_PATH_IMAGE001
,其中
Figure 820778DEST_PATH_IMAGE002
分别表示在给定时间点有事件a和没有事件a的相应数据记录,r是由事件a引起的瞬时异常。这些事件是由异常检测方法中的异常检测器发现的,因此经验值记录了异常检测器的所有行为。The empirical value Y is a set of tuples, each represented as
Figure 575741DEST_PATH_IMAGE001
,in
Figure 820778DEST_PATH_IMAGE002
represent the corresponding data records with and without event a at a given point in time, respectively, and r is the instantaneous anomaly caused by event a. These events are discovered by the anomaly detector in the anomaly detection method, so the empirical value records all the behavior of the anomaly detector.

我们用条件概率分布π:= p(A|Z)来表示异常检测器,其中A和Z分别是项目中的事件集合和实际数据记录。通常A={0,1},其中1表示当前的数据记录存在异常,0表示没有异常。公式π(z, a) = p(A=a|Z=z)表示某特定数据记录z存在事件异常的概率。We denote the anomaly detector by the conditional probability distribution π:= p(A|Z), where A and Z are the set of events and actual data records in the project, respectively. Usually A={0,1}, where 1 indicates that the current data record has an exception, and 0 indicates that there is no exception. The formula π(z, a) = p(A=a|Z=z) represents the probability that a particular data record z has an event anomaly.

本发明使用异常检测能力来测量异常检测器的性能

Figure 958104DEST_PATH_IMAGE003
,其定义为:The present invention uses anomaly detection capabilities to measure the performance of anomaly detectors
Figure 958104DEST_PATH_IMAGE003
, which is defined as:

Figure 477947DEST_PATH_IMAGE004
,
Figure 477947DEST_PATH_IMAGE004
,

其中

Figure DEST_PATH_IMAGE005
是具有异常检测器π的实际数据记录z的概率,而X(z,a)表示在事件a作用下从数据记录z开始的累积异常。也就是说,性能是在使用异常检测器π情况下的平均累积异常。in
Figure DEST_PATH_IMAGE005
is the probability of the actual data record z with an anomaly detector π, and X(z, a) represents the cumulative anomaly starting from the data record z under the action of event a. That is, the performance is the average cumulative anomaly using the anomaly detector π.

如果检测器满足下列条件If the detector meets the following conditions

Figure 121681DEST_PATH_IMAGE006
Figure 121681DEST_PATH_IMAGE006
,

本质上该检测器就是最优异常检测器,它可以最大化性能。同时,对于所有

Figure 803198DEST_PATH_IMAGE007
Figure 398127DEST_PATH_IMAGE005
大致相同。如果
Figure 954617DEST_PATH_IMAGE008
成立,则π(z,a)=1。也就是说,最优异常检测器π*完全由累积异常函数X(z,a)确定。Essentially this detector is an optimal anomaly detector that maximizes performance. At the same time, for all
Figure 803198DEST_PATH_IMAGE007
,
Figure 398127DEST_PATH_IMAGE005
roughly the same. if
Figure 954617DEST_PATH_IMAGE008
is established, then π(z, a)=1. That is, the optimal anomaly detector π * is completely determined by the cumulative anomaly function X(z,a).

根据上述过程描述,经验值可用于更好地估计X(z,a),可以通过从经验值中自我学习从而不断改进异常检测器,并且可以采用X学习方法来训练用于估计X(z,a)的循环神经网络,获得良好的异常检测性能。总之,所提出的异常检测方法具有以下特征:(1)不对异常类型和模式做任何假设;(2)不需要选择阈值;(3)随着异常检测经验的增长而不断学习改进,从而可以广泛应用于众多领域。According to the above process description, empirical values can be used to better estimate X(z, a), anomaly detectors can be continuously improved by self-learning from empirical values, and X learning methods can be used to train for estimating X(z, a) a) of the recurrent neural network to obtain good anomaly detection performance. In summary, the proposed anomaly detection method has the following characteristics: (1) does not make any assumptions about anomaly types and patterns; (2) does not need to select a threshold; (3) continuously learns and improves with the growth of anomaly detection experience, so that it can be widely used Applied to many fields.

三、异常检测方法的性能验证3. Performance Verification of Anomaly Detection Methods

如步骤1所述,用于训练的数据集是HeartCarer基准数据集,其中包括2607名慢性心力衰竭患者的各种生理体征数据,各类信息数据量超过100GB。As mentioned in step 1, the dataset used for training is the HeartCarer benchmark dataset, which includes various physiological sign data of 2607 chronic heart failure patients, and the amount of various information data exceeds 100GB.

使用滑动窗口方法将每个时间序列转换为一组多维数据实例。X学习中的事件为

Figure 889075DEST_PATH_IMAGE009
,其中0表示没有异常,1表示有异常。为了增强模型训练的过程,使用了二叉树策略,即通过对先前的数据记录z执行不同的操作0和1,将生成的两个数据记录
Figure 741493DEST_PATH_IMAGE010
Figure DEST_PATH_IMAGE011
都添加到用于训练的经验集中。也就是说,在训练的过程中将两条记录
Figure 325184DEST_PATH_IMAGE012
Figure 921250DEST_PATH_IMAGE013
添加到经验集中。通过在数据集Z中执行不同的操作,我们可以获得奖励r0和r1。Transform each time series into a set of multidimensional data instances using a sliding window approach. The events in X learning are
Figure 889075DEST_PATH_IMAGE009
, where 0 means no exception and 1 means there is an exception. To enhance the process of model training, a binary tree strategy is used, that is, by performing different operations 0 and 1 on the previous data record z, the two data records generated are
Figure 741493DEST_PATH_IMAGE010
and
Figure DEST_PATH_IMAGE011
are added to the experience set used for training. That is to say, during the training process, the two records are
Figure 325184DEST_PATH_IMAGE012
and
Figure 921250DEST_PATH_IMAGE013
Added to experience set. By performing different operations in dataset Z, we can obtain rewards r 0 and r 1 .

如图3所示,我们对慢性心力衰竭患者的各类体征数据集进行异常检测,以消除由情绪变化或环境因素等引起的异常数据。首先通过训练前三年的体征数据记录集,得到异常检测经验,然后在实时体征数据集中验证了本发明中异常检测方法的性能,图3(a)和3(b)分别显示了在心率指标和血压血糖指标的异常检测性能。As shown in Figure 3, we perform anomaly detection on various sign datasets of chronic heart failure patients to eliminate anomalous data caused by emotional changes or environmental factors, etc. First, the abnormality detection experience is obtained by training the first three-year physical sign data record set, and then the performance of the abnormality detection method in the present invention is verified in the real-time physical sign data set. and abnormal detection performance of blood pressure and blood glucose indicators.

从图3中我们可以看到:From Figure 3 we can see:

(1)灰线表示原始体征数据记录,黑线表示异常检测中的突发事件;(1) The gray line represents the original sign data record, and the black line represents the emergencies in anomaly detection;

(2)心率指标与慢性心力衰竭有直接关联,其测试时间间隔设为120分钟,血压血糖等指标有间接关联,其间隔设为240分钟;(2) Heart rate indicators are directly related to chronic heart failure, and the test time interval is set to 120 minutes, and indicators such as blood pressure and blood sugar are indirectly related, and the interval is set to 240 minutes;

(3)本发明中的异常检测方法能够识别目标时间序列的均值,点异常和异常模式的偏移;(3) The anomaly detection method in the present invention can identify the mean value of the target time series, the point anomaly and the offset of the anomaly pattern;

(4)该异常检测方法在测试数据集中能够获得高质量的结果,其准确度约为100%。(4) This anomaly detection method is able to obtain high-quality results in the test dataset with an accuracy of about 100%.

需要说明的是,上述实施例不以任何形式限制本发明,凡采用等同替换或等效变换的方式所获得的技术方案,均落在本发明的保护范围内。It should be noted that the above embodiments do not limit the present invention in any form, and all technical solutions obtained by means of equivalent replacement or equivalent transformation fall within the protection scope of the present invention.

Claims (3)

1.一种基于神经网络的时间序列数据异常检测方法,其特征在于,包括以下步骤:1. a method for detecting abnormality in time series data based on neural network, is characterized in that, comprises the following steps: 步骤1:收集数据Step 1: Collect data 收集具有一定持续时间的某类时间序列数据,和各种可能引起数据突变的事件,然后通过移动终端或其他设备将收集的数据上传到云端,此部分为我们的现有技术;Collect certain types of time series data with a certain duration and various events that may cause data mutation, and then upload the collected data to the cloud through mobile terminals or other devices. This part is our existing technology; 步骤2:基于神经网络的时间序列数据异常检测过程Step 2: Anomaly detection process of time series data based on neural network 本发明提出了一种基于神经网络的时间序列数据异常检测方法,异常检测器由递归神经网络驱动,并采用强化学习方法来实现自学习过程,例如通过采用X学习方法来训练用于估计X(z,a)的循环神经网络,异常检测器不对异常类型和异常模式做任何假设,通过从经验值中自我学习从而不断改进异常检测器,本发明所提的异常检测方法可以广泛应用于健康医疗、智能交通和大型生产系统等行业中的人体疾病监测、交通事故发现、设备故障诊断、网络入侵检测等众多领域;The present invention proposes a time series data anomaly detection method based on neural network, the anomaly detector is driven by a recurrent neural network, and adopts a reinforcement learning method to realize the self-learning process, for example, by using the X learning method to train to estimate X ( z, a) of the recurrent neural network, the abnormality detector does not make any assumptions about abnormality types and abnormality patterns, and continuously improves the abnormality detector by self-learning from experience values. The abnormality detection method proposed in the present invention can be widely used in health care , human disease monitoring, traffic accident detection, equipment fault diagnosis, network intrusion detection and many other fields in industries such as intelligent transportation and large-scale production systems; 步骤3:异常检测方法的性能验证Step 3: Performance Verification of Anomaly Detection Method 我们对多种类型时间序列数据集进行了异常检测(如慢性心力衰竭患者的体征数据、公交网络的服务数据、生产系统中的设备能源数据等),以消除由突发事件或紧急状况引起的异常数据,首先通过训练某持续时间段的数据记录集,得到异常检测经验,然后在实时数据集中验证了本发明中异常检测方法的性能,该方法能够识别目标时间序列的均值,点异常和异常模式的偏移,在测试数据集中能够获得高质量的结果,其准确度约为100%。We performed anomaly detection on multiple types of time series datasets (such as physical signs data of chronic heart failure patients, service data of public transport networks, equipment energy data in production systems, etc.) For abnormal data, firstly, the abnormality detection experience is obtained by training the data record set of a certain duration, and then the performance of the abnormality detection method in the present invention is verified in the real-time data set. The method can identify the mean value, point abnormality and abnormality of the target time series Shifting of the pattern, high-quality results can be obtained in the test dataset with an accuracy of about 100%. 2.根据权利要求1所述的基于神经网络的时间序列数据异常检测方法,其特征在于,在步骤2中,异常检测器不对异常类型和异常模式做任何假设。2 . The method for detecting anomalies in time series data based on neural network according to claim 1 , wherein, in step 2, the anomaly detector does not make any assumptions about anomaly types and anomaly patterns. 3 . 3.根据权利要求1所述的基于神经网络的时间序列数据异常检测方法,其特征在于,在步骤3中,异常检测器不需要选择阈值,不需要调整参数。3. The method for detecting anomaly in time series data based on neural network according to claim 1, characterized in that, in step 3, the anomaly detector does not need to select a threshold value and does not need to adjust parameters.
CN202010577504.5A 2020-06-23 2020-06-23 A neural network-based anomaly detection method for time series data Pending CN111738335A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010577504.5A CN111738335A (en) 2020-06-23 2020-06-23 A neural network-based anomaly detection method for time series data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010577504.5A CN111738335A (en) 2020-06-23 2020-06-23 A neural network-based anomaly detection method for time series data

Publications (1)

Publication Number Publication Date
CN111738335A true CN111738335A (en) 2020-10-02

Family

ID=72650511

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010577504.5A Pending CN111738335A (en) 2020-06-23 2020-06-23 A neural network-based anomaly detection method for time series data

Country Status (1)

Country Link
CN (1) CN111738335A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115359870A (en) * 2022-10-20 2022-11-18 之江实验室 Disease diagnosis and treatment process abnormity identification system based on hierarchical graph neural network

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109819446A (en) * 2019-03-14 2019-05-28 湖南大学 Spatial access authentication method and software-defined edge computing system for mobile Internet of Things
CN110287439A (en) * 2019-06-27 2019-09-27 电子科技大学 A network behavior anomaly detection method based on LSTM
EP3561815A1 (en) * 2018-04-27 2019-10-30 Tata Consultancy Services Limited A unified platform for domain adaptable human behaviour inference
CN110865625A (en) * 2018-08-28 2020-03-06 中国科学院沈阳自动化研究所 Process data anomaly detection method based on time series
CN111191934A (en) * 2019-12-31 2020-05-22 北京理工大学 Multi-target cloud workflow scheduling method based on reinforcement learning strategy
CN111190804A (en) * 2019-12-28 2020-05-22 同济大学 A multi-level deep learning log fault detection method for cloud native systems

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3561815A1 (en) * 2018-04-27 2019-10-30 Tata Consultancy Services Limited A unified platform for domain adaptable human behaviour inference
CN110865625A (en) * 2018-08-28 2020-03-06 中国科学院沈阳自动化研究所 Process data anomaly detection method based on time series
CN109819446A (en) * 2019-03-14 2019-05-28 湖南大学 Spatial access authentication method and software-defined edge computing system for mobile Internet of Things
CN110287439A (en) * 2019-06-27 2019-09-27 电子科技大学 A network behavior anomaly detection method based on LSTM
CN111190804A (en) * 2019-12-28 2020-05-22 同济大学 A multi-level deep learning log fault detection method for cloud native systems
CN111191934A (en) * 2019-12-31 2020-05-22 北京理工大学 Multi-target cloud workflow scheduling method based on reinforcement learning strategy

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
CHENGQIANG HUANG ET AL.: ""Towards Experienced Anomaly Detector through Reinforcement Learning"", 《THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE》 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115359870A (en) * 2022-10-20 2022-11-18 之江实验室 Disease diagnosis and treatment process abnormity identification system based on hierarchical graph neural network

Similar Documents

Publication Publication Date Title
Fouad et al. Analyzing patient health information based on IoT sensor with AI for improving patient assistance in the future direction
CN112365978B (en) Method and device for establishing a model for early risk assessment of tachycardia events
EP3783619A1 (en) Human body health assessment method and system based on sleep big data
CN111081379B (en) Disease probability decision method and system thereof
WO2022233121A1 (en) Unsupervised medical behavior compliance assessment method based on electronic medical record
CN111540471A (en) Health state tracking and early warning method and system based on user health data
CN111248859A (en) Automatic detection method of sleep apnea based on convolutional neural network
CN108549875A (en) A kind of brain electricity epileptic attack detection method based on the perception of depth channel attention
CN118672193A (en) Intelligent management and control system and method for inspection and maintenance operation of natural gas purification device
CN113764103A (en) Female reproductive health management system and method based on Internet
CN111738335A (en) A neural network-based anomaly detection method for time series data
CN118692706B (en) AI-based intelligent analysis system and method for urinary tract functional status
CN118486468B (en) Patient care intelligent early warning system and method based on 5G Internet of things technology
CN118692705A (en) Physical health status monitoring method and system based on big data
CN110495896B (en) Wearable knee joint monitoring device and monitoring method based on GPRS communication
SrinivasVellela et al. Fusion-Infused Hypnocare: Unveiling Real-Time Instantaneous Heart Rates for Remote Diagnosis of Sleep Apnea
Kumar et al. Disease prediction using machine learning algorithms KNN and CNN
CN118038548A (en) Abnormal behavior detection method, device, electronic equipment and storage medium
Sintiya et al. SARIMA and Holt-Winters Seasonal Methods for Time Series Forecasting in Tuberculosis Case
CN116561704A (en) A Trusted Federation Modeling Method Based on Multimodal Fusion
CN117373587A (en) Real-time full-course information acquisition system and method for orthopedics patient
CN117297549A (en) Anesthesia state monitoring method and system based on neural network
CN117409984A (en) Diabetes follow-up management system, device, electronic equipment and storage medium
Wu [Retracted] Design of Intelligent Nursing System Based on Artificial Intelligence
US12057231B1 (en) Forecasting arterial embolic and bleeding events

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20201002