[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN116631186B - Highway traffic accident risk assessment method and system based on dangerous driving event data - Google Patents

Highway traffic accident risk assessment method and system based on dangerous driving event data Download PDF

Info

Publication number
CN116631186B
CN116631186B CN202310569137.8A CN202310569137A CN116631186B CN 116631186 B CN116631186 B CN 116631186B CN 202310569137 A CN202310569137 A CN 202310569137A CN 116631186 B CN116631186 B CN 116631186B
Authority
CN
China
Prior art keywords
dangerous driving
accident
data
layer
driving event
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202310569137.8A
Other languages
Chinese (zh)
Other versions
CN116631186A (en
Inventor
陆建
霍宗鑫
马潇驰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Southeast University
Original Assignee
Southeast University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Southeast University filed Critical Southeast University
Priority to CN202310569137.8A priority Critical patent/CN116631186B/en
Publication of CN116631186A publication Critical patent/CN116631186A/en
Application granted granted Critical
Publication of CN116631186B publication Critical patent/CN116631186B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/50Context or environment of the image
    • G06V20/52Surveillance or monitoring of activities, e.g. for recognising suspicious objects
    • G06V20/54Surveillance or monitoring of activities, e.g. for recognising suspicious objects of traffic, e.g. cars on the road, trains or boats
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0635Risk analysis of enterprise or organisation activities
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/774Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0125Traffic data processing
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0137Measuring and analyzing of parameters relative to traffic conditions for specific applications
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/017Detecting movement of traffic to be counted or controlled identifying vehicles
    • G08G1/0175Detecting movement of traffic to be counted or controlled identifying vehicles by photographing vehicles, e.g. when violating traffic rules
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/052Detecting movement of traffic to be counted or controlled with provision for determining speed or overspeed
    • G08G1/054Detecting movement of traffic to be counted or controlled with provision for determining speed or overspeed photographing overspeeding vehicles
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Evolutionary Computation (AREA)
  • Human Resources & Organizations (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Computing Systems (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Medical Informatics (AREA)
  • Analytical Chemistry (AREA)
  • Chemical & Material Sciences (AREA)
  • Tourism & Hospitality (AREA)
  • Strategic Management (AREA)
  • Databases & Information Systems (AREA)
  • Economics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • Development Economics (AREA)
  • Educational Administration (AREA)
  • Molecular Biology (AREA)
  • Data Mining & Analysis (AREA)
  • Game Theory and Decision Science (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Biophysics (AREA)
  • General Engineering & Computer Science (AREA)
  • Primary Health Care (AREA)
  • Biomedical Technology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Traffic Control Systems (AREA)

Abstract

The invention discloses a highway traffic accident risk assessment method and system based on dangerous driving event data, wherein the method comprises the following steps: acquiring expressway traffic accident data and four types of dangerous driving event data; searching and counting the frequency of dangerous driving events on road sections before accidents and when no accidents occur and the average value of the maximum acceleration and the maximum speed of vehicles in all the same type of dangerous driving events respectively; constructing a sample data set by taking the dangerous driving event information as an independent variable and taking whether an accident is a dependent variable; processing the data into an image form, constructing and training a deep learning model, and taking into consideration the internal relation of dangerous driving event data in different event types and upstream and downstream space positions, so as to mine the rule of dangerous driving events on a road section before an accident occurs; and finally, acquiring dangerous driving event data of the whole highway section in real time, and evaluating the traffic accident risk in real time according to the trained model.

Description

基于危险驾驶事件数据的高速公路交通事故风险评估方法、 系统Highway traffic accident risk assessment method and system based on dangerous driving event data

技术领域Technical Field

本发明提出一种基于危险驾驶事件数据的高速公路交通事故风险评估方法及系统,属于道路交通安全技术领域。The invention provides a highway traffic accident risk assessment method and system based on dangerous driving event data, belonging to the technical field of road traffic safety.

背景技术Background technique

高速公路的交通安全问题一直是政府关注的重点。近年来,高速公路交通事故风险的实时评估成为研究热点,其对于开展主动式交通安全管理、减少交通事故发生率具有极其重要的意义。The traffic safety of highways has always been the focus of government attention. In recent years, the real-time assessment of highway traffic accident risks has become a research hotspot, which is of great significance for carrying out active traffic safety management and reducing the incidence of traffic accidents.

现有高速公路交通事故风险评估方法通常使用固定检测器(例如ETC门架、地面环形线圈等)采集的交通流数据判断事故风险,但这类数据受到固定检测器布设位置和布设密度的限制,难以动态反映全路段各位置处的交通运行状态,进而影响事故风险评估准确性。随着数据采集技术的进步,车辆危险驾驶行为数据越来越容易获取,出现了一系列危险驾驶行为监测、风险评估和预警方法。但这些方法均是以单一个体车辆为研究对象进行风险评估,缺乏从道路交通管理者的角度、以路段事故风险为研究对象、基于路段上车辆危险驾驶事件数据进行交通事故风险评估的方法。Existing highway traffic accident risk assessment methods usually use traffic flow data collected by fixed detectors (such as ETC gantries, ground ring coils, etc.) to determine accident risks. However, this type of data is limited by the location and density of fixed detectors, and it is difficult to dynamically reflect the traffic operation status at all locations on the entire road section, which in turn affects the accuracy of accident risk assessment. With the advancement of data collection technology, it is becoming easier to obtain data on dangerous driving behaviors of vehicles, and a series of dangerous driving behavior monitoring, risk assessment and early warning methods have emerged. However, these methods all use a single individual vehicle as the research object for risk assessment, and lack methods for traffic accident risk assessment from the perspective of road traffic managers, with the accident risk of the road section as the research object, and based on the data of dangerous driving events of vehicles on the road section.

在事故风险评估模型方面,现有方法大多采用统计分析或机器学习模型挖掘事故发生前路段交通流等因素的特征规律,部分方法还采用了循环神经网络(RNN)深度学习模型用以考虑交通流数据的时间相关性进行事故风险建模。但这些方法均是利用一维向量形式数据表示事故发生前道路交通流等因素的情况,难以同时考虑数据在时间、空间和影响因素等多个维度的内在联系。卷积神经网络(CNN)深度学习模型能够高效地提取图像数据中的局部模式,综合考虑数据在多个维度的相关性进行模型训练和预测,且计算代价比RNN小得多。In terms of accident risk assessment models, most existing methods use statistical analysis or machine learning models to mine the characteristic laws of factors such as road traffic flow before an accident occurs. Some methods also use recurrent neural network (RNN) deep learning models to consider the time correlation of traffic flow data for accident risk modeling. However, these methods all use one-dimensional vector data to represent factors such as road traffic flow before an accident occurs, and it is difficult to simultaneously consider the inherent connections of data in multiple dimensions such as time, space, and influencing factors. The convolutional neural network (CNN) deep learning model can efficiently extract local patterns in image data, comprehensively consider the correlation of data in multiple dimensions for model training and prediction, and the computational cost is much lower than that of RNN.

由于危险驾驶事件数据是通过车载导航软件采集到的,其采集范围能够覆盖高速公路全路段的各个位置且不受天气条件的影响。因此,使用道路上车辆的危险驾驶事件数据并将其处理成图像形式,输入到CNN深度学习模型中,能够综合考虑数据中蕴含的多维度相关性,在无需安装新设备的条件下实现对高速公路各路段的交通事故风险进行实时精细化评估。Since the dangerous driving incident data is collected through the vehicle navigation software, its collection range can cover all locations of the entire highway section and is not affected by weather conditions. Therefore, using the dangerous driving incident data of vehicles on the road and processing it into image form, and inputting it into the CNN deep learning model, it can comprehensively consider the multi-dimensional correlation contained in the data and realize real-time and refined assessment of traffic accident risks on various sections of the highway without installing new equipment.

发明内容Summary of the invention

本发明所要解决的技术问题:针对现有技术的不足,本发明使用高速公路上车辆的危险驾驶事件数据,按照不同危险驾驶事件类型和空间位置关系将其处理成图像形式,利用包含卷积神经网络(CNN)等模块的深度学习模型挖掘交通事故发生前路段上的危险驾驶事件规律,提出了一种基于危险驾驶事件数据的高速公路交通事故风险评估方法、系统,旨在解决现有高速公路交通事故风险评估方法受到固定检测器布设位置和布设密度的限制难以精细化评估高速公路各位置处的事故风险、难以提取和学习到数据在不同影响因素和空间位置方面的内在联系导致风险评估精度有限的问题。Technical problem to be solved by the present invention: In view of the shortcomings of the prior art, the present invention uses the dangerous driving event data of vehicles on the highway, processes it into image form according to different dangerous driving event types and spatial position relationships, and uses a deep learning model including modules such as convolutional neural network (CNN) to mine the laws of dangerous driving events on the road section before the traffic accident occurs. A highway traffic accident risk assessment method and system based on dangerous driving event data are proposed, aiming to solve the problem that the existing highway traffic accident risk assessment method is limited by the location and density of fixed detectors, making it difficult to finely assess the accident risks at various locations on the highway, and difficult to extract and learn the intrinsic connections between data in different influencing factors and spatial positions, resulting in limited risk assessment accuracy.

本发明为解决以上技术问题而采用以下技术方案:The present invention adopts the following technical solutions to solve the above technical problems:

本发明提出一种基于危险驾驶事件数据的高速公路交通事故风险评估方法,包括以下步骤:The present invention proposes a highway traffic accident risk assessment method based on dangerous driving event data, comprising the following steps:

S1:确定研究路段和统计时间范围,分别获取以下两类信息:S1: Determine the research section and statistical time range, and obtain the following two types of information:

(1)统计时间范围内高速公路研究路段上车辆的危险驾驶事件数据,包括危险驾驶事件发生时间,发生地点的经度、纬度和上下行方向,危险驾驶事件类型(急加速、急减速、急左变道和急右变道),危险驾驶事件中车辆的最大加速度和最大速度。(1) Data on dangerous driving incidents of vehicles on the research section of the expressway within the statistical time range, including the time of occurrence of the dangerous driving incident, the longitude, latitude and up and down directions of the location of the incident, the type of dangerous driving incident (sudden acceleration, sudden deceleration, sudden left lane change and sudden right lane change), and the maximum acceleration and maximum speed of the vehicle during the dangerous driving incident.

(2)统计时间范围内高速公路研究路段上发生的交通事故数据,包括事故的起始时间、结束时间,事故发生地点的经度、纬度和上下行方向。(2) Traffic accident data occurring on the research section of the expressway within the statistical time range, including the start and end time of the accident, the longitude, latitude, and up and down directions of the accident location.

S2、针对每起交通事故,查找该事故发生前事故地点上下游路段的危险驾驶事件数据,以危险驾驶事件数据为自变量,以发生事故为因变量,构建事故案例数据集;S2. For each traffic accident, find the dangerous driving event data of the upstream and downstream sections of the accident site before the accident, take the dangerous driving event data as the independent variable and the accident as the dependent variable, and construct the accident case data set;

S3、查找未发生交通事故时上下游路段的危险驾驶事件数据,以危险驾驶事件数据为自变量,以未发生事故为因变量,构建非事故案例数据集作为对照组,将事故案例数据集和非事故案例数据集合并,得到样本数据集;S3, searching for dangerous driving event data of upstream and downstream sections when no traffic accidents occurred, taking the dangerous driving event data as the independent variable and the absence of accidents as the dependent variable, constructing a non-accident case data set as a control group, merging the accident case data set and the non-accident case data set to obtain a sample data set;

S4、对数据进行标准化处理:依据不同危险驾驶事件类型和空间位置关系将危险驾驶事件数据处理成图像形式,构建深度学习模型进行训练和调参,直至模型效果最优;S4. Standardize the data: process the dangerous driving event data into images according to different dangerous driving event types and spatial position relationships, build a deep learning model for training and parameter adjustment until the model effect is optimal;

S5、实时获取高速公路全路段上车辆的危险驾驶事件数据,依据S2计算得出各种危险驾驶事件发生频次以及危险驾驶事件中车辆最大加速度和最大速度的平均值,将其输入S4构建的深度学习模型,计算得出各路段的交通事故风险评估值和风险水平。S5: Real-time acquisition of dangerous driving incident data of vehicles on all sections of the expressway. Based on S2, the frequency of occurrence of various dangerous driving incidents and the average values of the maximum acceleration and maximum speed of vehicles in dangerous driving incidents are calculated, and the data are input into the deep learning model constructed by S4 to calculate the traffic accident risk assessment value and risk level of each section.

具体地,步骤S2具体是:Specifically, step S2 is:

S2.1:根据危险驾驶事件和交通事故位置的经度、纬度和上下行方向,将它们标记到高速公路对应路段上,并建立路段间的上下游关系。S2.1: According to the longitude, latitude, and up and down directions of the dangerous driving incidents and traffic accidents, mark them on the corresponding sections of the highway and establish the upstream and downstream relationships between the sections.

S2.2:依据每起交通事故的时空位置,提取事故发生前t时间范围内事故地点上下游l空间范围内的急加速、急减速、急左变道和急右变道四种危险驾驶事件数据。S2.2: Based on the spatiotemporal location of each traffic accident, extract data on four types of dangerous driving events: sudden acceleration, sudden deceleration, sudden left lane change, and sudden right lane change within a spatial range l upstream and downstream of the accident location within a time range t before the accident.

S2.3:针对每起交通事故,分别计算以下内容:S2.3: For each traffic accident, calculate the following:

(1)计算S2.2中时空范围内的四种危险驾驶事件发生频次,记为:(1) Calculate the frequency of occurrence of the four types of dangerous driving events within the time and space range in S2.2, recorded as:

Times=[L_up,L_down,R_up,R_down,A_up,A_down,B_up,B_down]Times = [L_up, L_down, R_up, R_down, A_up, A_down, B_up, B_down]

其中,每个变量的第一部分表示危险驾驶事件类型,即L、R、A和B分别表示事故发生前t时间范围内急左变道、急右变道、急加速和急减速驾驶事件发生的频次。每个变量的第二部分表示危险驾驶事件发生地点,即up和down分别表示事故地点上游和下游l空间范围内的路段。The first part of each variable represents the type of dangerous driving event, i.e., L, R, A, and B represent the frequency of sudden left lane change, sudden right lane change, sudden acceleration, and sudden deceleration in the time range t before the accident. The second part of each variable represents the location where the dangerous driving event occurred, i.e., up and down represent the road sections within the spatial range l upstream and downstream of the accident location, respectively.

(2)计算S2.2中时空范围内同一类型危险驾驶事件中车辆的最大加速度和最大速度的平均值,记为:(2) Calculate the average value of the maximum acceleration and maximum speed of the vehicle in the same type of dangerous driving incident within the time and space range in S2.2, and record it as:

Acceleration=[La_up,La_down,Ra_up,Ra_down,Aa_up,Aa_down,Ba_up,Ba_down]Acceleration = [La_up, La_down, Ra_up, Ra_down, Aa_up, Aa_down, Ba_up, Ba_down]

Speed=[Ls_up,Ls_down,Rs_up,Rs_down,As_up,As_down,Bs_up,Bs_down]Speed = [Ls_up, Ls_down, Rs_up, Rs_down, As_up, As_down, Bs_up, Bs_down]

其中,每个变量的第一部分内的La和Ls分别表示所有急左变道事件中车辆的最大加速度和最大速度的平均值,其余Ra,Rs,Aa,As,Ba,Bs的含义同理。每个变量的第二部分表示危险驾驶事件发生地点。Among them, La and Ls in the first part of each variable represent the average values of the maximum acceleration and maximum speed of the vehicle in all sharp left lane change events, and the meanings of the remaining Ra, Rs, Aa, As, Ba, and Bs are similar. The second part of each variable represents the location where the dangerous driving event occurred.

S2.4:将每起交通事故对应的Times,Acceleration和Speed数据进行合并,得到具有24个特征变量的危险驾驶事件数据。汇总所有交通事故对应的危险驾驶事件数据,将其作为自变量,记为:S2.4: Combine the Times, Acceleration and Speed data corresponding to each traffic accident to obtain dangerous driving event data with 24 characteristic variables. Summarize the dangerous driving event data corresponding to all traffic accidents and use them as independent variables, recorded as:

Xi=[Timesi,Accelerationi,Speedi] Xi =[Times i , Acceleration i , Speed i ]

其中,Timesi表示第i起交通事故对应的四种危险驾驶事件发生频次,Accelerationi和Speedi的含义同理。Among them, Times i represents the frequency of occurrence of four dangerous driving events corresponding to the i-th traffic accident, and Acceleration i and Speed i have the same meaning.

对于每起事故案例数据,赋予标签y=1,将其作为因变量,记为Y=[yi],yi=1。将自变量和因变量进行匹配,得到事故案例数据集。For each accident case data, a label y = 1 is assigned and it is used as the dependent variable, denoted as Y = [y i ], y i = 1. The independent variable and the dependent variable are matched to obtain the accident case data set.

具体地,步骤S3具体是:Specifically, step S3 is:

S3.1:针对每起交通事故,依据S2.2、S2.3和S2.4中的方法,提取并计算事故发生前t0、t0~2t0、…、(n-1)t0~nt0时间范围内事故地点上下游l空间范围内的危险驾驶事件数据,汇总得到初始非事故案例数据。其中n是正整数,控制着对照组选择的比例,可根据实际情况选择。S3.1: For each traffic accident, according to the methods in S2.2, S2.3 and S2.4, extract and calculate the dangerous driving event data within the spatial range of l upstream and downstream of the accident site within the time range of t 0 before the accident, t 0 ~ 2t 0 , ..., (n-1)t 0 ~nt 0 , and summarize the initial non-accident case data. Where n is a positive integer, which controls the proportion of the control group selection and can be selected according to actual conditions.

为避免上述时间范围内存在交通事故造成的影响,检查全部非事故案例数据的时间,删除时间介于任一交通事故发生前t时间到事故发生后t′时间内的非事故案例数据。删除有信息缺失的案例,得到最终非事故案例数据,将其作为自变量,记为:In order to avoid the impact of traffic accidents within the above time range, check the time of all non-accident case data and delete the non-accident case data between t time before any traffic accident and t′ time after the accident. Delete the cases with missing information to obtain the final non-accident case data, which is used as the independent variable and recorded as:

Xj=[Timesj,Accelerationj,Speedj]。X j = [Times j , Acceleration j , Speed j ].

其中,Timesj表示第j起非事故案例对应的四种危险驾驶事件发生频次,Accelerationj和Speedj的含义同理。Among them, Times j represents the frequency of occurrence of four dangerous driving events corresponding to the jth non-accident case, and Acceleration j and Speed j have the same meaning.

对于每起非事故案例数据,赋予标签y=0,将其作为因变量,记为Y=[yj],yj=0。将自变量和因变量进行匹配,得到非事故案例数据集。For each non-accident case data, the label y = 0 is assigned and used as the dependent variable, denoted as Y = [y j ], y j = 0. The independent variable and the dependent variable are matched to obtain the non-accident case data set.

S3.2:将事故案例数据集和非事故案例数据集合并,得到样本数据集,记为:S3.2: Combine the accident case dataset and the non-accident case dataset to obtain a sample dataset, recorded as:

X=[Timesk,Accelerationk,Speedk]X=[Times k ,Acceleration k ,Speed k ]

=[L_upk,L_downk,R_upk,R_downk,A_upk,A_downk,B_upk,B_downk,La_upk,La_downk,Ra_upk,Ra_downk,Aa_upk,Aa_downk,Ba_upk,Ba_downk,Ls_upk,Ls_downk,Rs_upk,Rs_downk,As_upk,As_downk,Bs_upk,Bs_downk] =[L_up k ,L_down k ,R_up k ,R_down k ,A_up k ,A_down k ,B_up k ,B_down k ,La_up k ,La_down k ,Ra_up k ,Ra_down k ,Aa_up k ,Aa_down k ,Ba_up k ,Ba_down k ,Ls_up k ,Ls_down k ,Rs_up k ,Rs_down k ,As_up k ,As_down k ,Bs_up k ,Bs_down k ]

Y=[yk],yk=0或1Y=[y k ], y k =0 or 1

具体地,步骤S4具体是:Specifically, step S4 is:

S4.1:计算各个自变量的平均值和标准差,按下式对所有数据进行标准化:S4.1: Calculate the mean and standard deviation of each independent variable and standardize all data as follows:

xnorm=(x-μ)/σx norm = (x-μ)/σ

其中,μ为自变量的平均值,σ为自变量的标准差。Among them, μ is the mean of the independent variable and σ is the standard deviation of the independent variable.

S4.2:将每条危险驾驶事件数据处理成如下的四通道一维图像形式。各通道内的变量如下:S4.2: Process each dangerous driving event data into the following four-channel one-dimensional image format. The variables in each channel are as follows:

通道1:L_up,La_up,Ls_up,L_down,La_down,Ls_downChannel 1: L_up, La_up, Ls_up, L_down, La_down, Ls_down

通道2:R_up,Ra_up,Rs_up,R_down,Ra_down,Rs_downChannel 2: R_up, Ra_up, Rs_up, R_down, Ra_down, Rs_down

通道3:A_up,Aa_up,As_up,A_down,Aa_down,As_downChannel 3: A_up, Aa_up, As_up, A_down, Aa_down, As_down

通道4:B_up,Ba_up,Bs_up,B_down,Ba_down,Bs_downChannel 4: B_up, Ba_up, Bs_up, B_down, Ba_down, Bs_down

其中,同一驾驶行为类型的6个变量位于同一通道,按上下游路段关系(由up至down)以及危险驾驶事件发生频次、平均最大加速度和平均最大速度的顺序进行排列。不同类型危险驾驶事件的变量位于4个不同的通道。这种数据形式有利于模型提取和学习到数据在不同危险驾驶事件类型和上下游空间位置方面的内在联系,进而挖掘出数据中的隐藏模式。Among them, the six variables of the same driving behavior type are located in the same channel, and are arranged in the order of upstream and downstream road section relationship (from up to down) and the frequency of dangerous driving events, average maximum acceleration and average maximum speed. The variables of different types of dangerous driving events are located in four different channels. This data format is conducive to the model to extract and learn the intrinsic connection between the data in different types of dangerous driving events and upstream and downstream spatial locations, and then to mine the hidden patterns in the data.

S4.3:建立深度学习模型。依次构建卷积层、池化层、展平层、全连接层、输出层。S4.2中得到的四通道一维图像数据作为深度学习模型的输入数据首先被输入到卷积层中。S4.3: Establish a deep learning model. Convolutional layer, pooling layer, flattening layer, fully connected layer, and output layer are constructed in sequence. The four-channel one-dimensional image data obtained in S4.2 is first input into the convolutional layer as the input data of the deep learning model.

所述的卷积层(Conv1D layer)的输入维度为I1,输出维度为O1,具有m1个尺寸为n1的卷积核。这些卷积核以s1的步长沿着输入数据通道内变量排列的方向(例如由L_up到Ls_down)进行一维卷积运算。卷积层的激活函数为relu。该模块的输出作为池化层的输入。The convolution layer (Conv1D layer) has an input dimension of I 1 and an output dimension of O 1 , and has m 1 convolution kernels of size n 1. These convolution kernels perform one-dimensional convolution operations along the direction of the variable arrangement in the input data channel (for example, from L_up to Ls_down) with a step size of s 1. The activation function of the convolution layer is relu. The output of this module is used as the input of the pooling layer.

所述的池化层(AveragePooling1D layer)的输入维度为I2,输出维度为O2,以s2的步长进行范围为n2的一维平均池化运算。该模块的输出作为展平层的输入。The input dimension of the pooling layer (AveragePooling1D layer) is I 2 , the output dimension is O 2 , and a one-dimensional average pooling operation with a range of n 2 is performed with a step size of s 2. The output of this module is used as the input of the flattening layer.

所述的展平层(Flatten layer)的输入维度为I3,输出维度为O3。其将池化层输出的多通道一维图像数据转换成一维向量形式,并采用了比率为p的随机丢弃方法丢弃数据来避免过拟合。该模块的输出作为全连接层的输入。The input dimension of the flatten layer is I 3 and the output dimension is O 3 . It converts the multi-channel one-dimensional image data output by the pooling layer into a one-dimensional vector form, and uses a random drop method with a ratio of p to discard data to avoid overfitting. The output of this module is used as the input of the fully connected layer.

所述的全连接层(Dense layer)的输入维度为I4,输出维度为O4,具有m2个神经元。全连接层的激活函数为relu。同时采用了比率为pdrop的随机丢弃方法丢弃神经元。该模块的输出作为输出层的输入。The input dimension of the fully connected layer is I 4 , the output dimension is O 4 , and it has m 2 neurons. The activation function of the fully connected layer is relu. At the same time, a random drop method with a ratio of p drop is used to drop neurons. The output of this module is used as the input of the output layer.

所述的输出层(Output layer)具有1个神经元。输出层的激活函数为sigmoid,该函数将输出值转换到(0,1)区间。因此,整个深度学习模型的输出数据的含义为当前研究路段发生交通事故的概率。The output layer has 1 neuron. The activation function of the output layer is sigmoid, which converts the output value to the interval (0,1). Therefore, the output data of the entire deep learning model means the probability of a traffic accident occurring on the current research section.

S4.4:基于样本数据集对建立的深度学习模型进行训练。以受试者工作特性曲线下的面积(AUC,即Area Under Curve)、灵敏度(Sensitivity)和特异度(Specificity)为评估指标,采用十折交叉验证方法、二元交叉熵损失函数和Nadam优化器基于样本数据集对建立的深度学习模型进行调优,得到效果最优的深度学习模型。S4.4: Train the established deep learning model based on the sample data set. Using the area under the receiver operating characteristic curve (AUC), sensitivity, and specificity as evaluation indicators, the ten-fold cross-validation method, binary cross entropy loss function, and Nadam optimizer are used to tune the established deep learning model based on the sample data set to obtain the deep learning model with the best effect.

具体地,步骤S5具体是:Specifically, step S5 is:

S5.1:实时获取高速公路全路段上车辆的急加速、急减速、急左变道和急右变道四种危险驾驶事件数据。依据S2计算得出高速公路各位置前t时间范围内四种危险驾驶事件发生频次Times′以及所有同类型危险驾驶事件中车辆最大加速度Acceleration′和最大速度Speed′的平均值。汇总得到输入数据。S5.1: Real-time acquisition of four types of dangerous driving events data on the entire highway: sudden acceleration, sudden deceleration, sudden left lane change and sudden right lane change. Based on S2, the frequency of occurrence of the four types of dangerous driving events Times′ within the time range before t at each location on the highway and the average value of the maximum acceleration Acceleration′ and maximum speed Speed′ of the vehicle in all the same types of dangerous driving events are calculated. The input data is summarized.

S5.2:将数据输入S4构建的深度学习模型,得到当前高速公路各位置发生交通事故的概率p∈(0,1),即为交通事故风险评估值。S5.2: Input the data into the deep learning model constructed in S4 to obtain the probability p∈(0,1) of a traffic accident occurring at each location on the current highway, which is the traffic accident risk assessment value.

当p∈(0,threshold)时,交通事故风险水平为低风险,无需干预。When p∈(0,threshold), the traffic accident risk level is low and no intervention is required.

当p∈[threshold,1)时,交通事故风险水平为高风险,需要采取相应措施管控交通安全风险。When p∈[threshold,1), the traffic accident risk level is high, and corresponding measures need to be taken to control traffic safety risks.

其中,threshold∈(0,1)。Among them, threshold∈(0,1).

本发明还提出一种基于危险驾驶事件数据的高速公路交通事故风险评估系统,包括:The present invention also proposes a highway traffic accident risk assessment system based on dangerous driving event data, comprising:

数据获取模块,用于确定研究路段和统计时间范围,获取统计时间范围高速公路研究路段上的危险驾驶事件数据和研究路段上发生的交通事故数据;A data acquisition module is used to determine the research section and statistical time range, and to acquire the dangerous driving incident data and traffic accident data on the research section of the expressway within the statistical time range;

样本构建模块,用于针对每起交通事故,查找该事故发生前事故地点上下游路段的危险驾驶事件数据,以危险驾驶事件数据为自变量,以发生事故为因变量,构建事故案例数据集;然后查找未发生交通事故时上下游路段的危险驾驶事件数据,以危险驾驶事件数据为自变量,以未发生事故为因变量,构建非事故案例数据集作为对照组,将事故案例数据集和非事故案例数据集合并,得到样本数据集;The sample construction module is used to find the dangerous driving event data of the upstream and downstream sections of the accident site before each traffic accident, and use the dangerous driving event data as the independent variable and the accident as the dependent variable to construct the accident case data set; then find the dangerous driving event data of the upstream and downstream sections when no traffic accident occurs, use the dangerous driving event data as the independent variable and the accident as the dependent variable to construct a non-accident case data set as a control group, and merge the accident case data set and the non-accident case data set to obtain the sample data set;

模型训练模块,用于对数据进行标准化处理,依据不同危险驾驶事件类型和上下游空间关系将数据处理成图像形式,构建深度学习模型进行训练和调参,直至模型效果最优;The model training module is used to standardize the data, process the data into images according to different types of dangerous driving events and upstream and downstream spatial relationships, and build a deep learning model for training and parameter adjustment until the model effect is optimal;

路段事故风险评估模块,用于实时获取高速公路全路段上车辆的危险驾驶事件数据,计算危险驾驶事件发生频次以及所有同类型危险驾驶事件中车辆最大加速度和最大速度的平均值,将其输入构建的深度学习模型,计算得出各路段交通事故风险评估值和风险水平。The road section accident risk assessment module is used to obtain real-time data on dangerous driving incidents of vehicles on all sections of the highway, calculate the frequency of dangerous driving incidents and the average value of the maximum acceleration and maximum speed of vehicles in all similar dangerous driving incidents, input them into the constructed deep learning model, and calculate the traffic accident risk assessment value and risk level of each section.

本发明采用以上技术方案,与现有技术相比具有以下有益技术效果:The present invention adopts the above technical solution, which has the following beneficial technical effects compared with the prior art:

本发明提出的基于危险驾驶事件数据的高速公路交通事故风险评估方法不受固定检测器布设位置和布设密度的限制,能够覆盖高速公路全路段且不受天气条件的影响,在无需安装新设备的条件下即可对高速公路各个路段处的交通事故风险进行实时精细化评估。The highway traffic accident risk assessment method based on dangerous driving event data proposed in the present invention is not limited by the location and density of fixed detectors, can cover the entire section of the highway and is not affected by weather conditions. It can perform real-time and refined assessment of traffic accident risks at each section of the highway without installing new equipment.

同时,本发明提出的方法综合考虑了危险驾驶事件数据在不同事件类型和上下游空间位置方面的内在联系,依据道路上危险驾驶事件发生频次和危险驾驶事件中的车辆运动参数两方面因素对路段事故风险进行评估,能够提升风险评估的准确性,为高速公路交通管理部门管控交通安全风险提供了新的理论和技术支持。At the same time, the method proposed in the present invention comprehensively considers the intrinsic connection between dangerous driving event data in terms of different event types and upstream and downstream spatial positions, and evaluates the accident risk of road sections based on two factors: the frequency of dangerous driving events on the road and the vehicle motion parameters in dangerous driving events. This can improve the accuracy of risk assessment and provide new theoretical and technical support for highway traffic management departments to control traffic safety risks.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1为本发明中高速公路交通事故风险评估方法流程图。FIG1 is a flow chart of a highway traffic accident risk assessment method according to the present invention.

图2为本发明中交通事故发生前的危险驾驶事件数据的提取规则示意图。FIG. 2 is a schematic diagram of rules for extracting dangerous driving event data before a traffic accident occurs in the present invention.

图3为本发明中深度学习模型框架示意图。FIG3 is a schematic diagram of the deep learning model framework in the present invention.

具体实施方式Detailed ways

下面将结合附图详细、完整地描述本发明实施例中的技术方案。此外,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的其他实施例,都属于本发明保护的范围。The technical solutions in the embodiments of the present invention will be described in detail and completely in conjunction with the accompanying drawings. In addition, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments. Based on the embodiments of the present invention, other embodiments obtained by ordinary technicians in this field without creative work are all within the scope of protection of the present invention.

图1所示为本发明的一种基于危险驾驶事件数据的高速公路交通事故风险评估方法的流程示意图,该方法可以应用于有交通事故数据和包括车辆运动参数的危险驾驶事件数据的高速公路交通事故风险评估。具体步骤如下:FIG1 is a flow chart of a highway traffic accident risk assessment method based on dangerous driving event data of the present invention. The method can be applied to highway traffic accident risk assessment with traffic accident data and dangerous driving event data including vehicle motion parameters. The specific steps are as follows:

S1:确定研究路段和统计时间范围,分别获取以下两类信息:S1: Determine the research section and statistical time range, and obtain the following two types of information:

(1)统计时间范围内高速公路研究路段上车辆的危险驾驶事件数据,包括危险驾驶事件发生时间,发生地点的经度、纬度和上下行方向,危险驾驶事件类型(急加速、急减速、急左变道和急右变道),危险驾驶事件中车辆的最大加速度和最大速度。(1) Data on dangerous driving incidents of vehicles on the research section of the expressway within the statistical time range, including the time of occurrence of the dangerous driving incident, the longitude, latitude and up and down directions of the location of the incident, the type of dangerous driving incident (sudden acceleration, sudden deceleration, sudden left lane change and sudden right lane change), and the maximum acceleration and maximum speed of the vehicle during the dangerous driving incident.

(2)统计时间范围内高速公路研究路段上发生的交通事故数据,包括事故的起始时间、结束时间,事故发生地点的经度、纬度和上下行方向。(2) Traffic accident data occurring on the research section of the expressway within the statistical time range, including the start and end time of the accident, the longitude, latitude, and up and down directions of the accident location.

S2:针对每起交通事故,查找该事故发生前事故地点上下游路段的危险驾驶事件数据。以危险驾驶事件数据为自变量,以发生事故为因变量,构建事故案例数据集。具体步骤如下:S2: For each traffic accident, find the dangerous driving event data of the upstream and downstream sections of the accident site before the accident. Use the dangerous driving event data as the independent variable and the accident as the dependent variable to construct the accident case data set. The specific steps are as follows:

S2.1:根据危险驾驶事件和交通事故位置的经度、纬度和上下行方向,将它们标记到高速公路对应路段上,并建立路段间的上下游关系。S2.1: According to the longitude, latitude, and up and down directions of the dangerous driving incidents and traffic accidents, mark them on the corresponding sections of the highway and establish the upstream and downstream relationships between the sections.

S2.2:如图2所示,依据每起交通事故的时空位置,提取事故发生前t分钟范围内事故地点上下游l米范围内的急加速、急减速、急左变道和急右变道四种危险驾驶事件数据,即图2中椭圆形虚线框圈出的部分。通常t≤30,l≤1000。S2.2: As shown in Figure 2, based on the spatiotemporal location of each traffic accident, extract the data of four dangerous driving events: sudden acceleration, sudden deceleration, sudden left lane change and sudden right lane change within a range of 1 meter upstream and downstream of the accident location within t minutes before the accident, i.e. the part circled by the oval dotted box in Figure 2. Usually t≤30,l≤1000.

S2.3:针对每起交通事故,分别计算以下内容:S2.3: For each traffic accident, calculate the following:

(1)计算图2所示时空范围内的四种危险驾驶事件发生频次,记为:(1) Calculate the frequency of occurrence of four types of dangerous driving events within the time and space range shown in Figure 2, recorded as:

Times=[L_up,L_down,R_up,R_down,A_up,A_down,B_up,B_down]Times = [L_up, L_down, R_up, R_down, A_up, A_down, B_up, B_down]

其中,每个变量的第一部分表示危险驾驶事件类型,即L、R、A和B分别表示事故发生前时间范围内急左变道、急右变道、急加速和急减速驾驶事件发生的频次。每个变量的第二部分表示危险驾驶事件发生地点,即up和down分别表示事故地点上游和下游空间范围内的路段。The first part of each variable represents the type of dangerous driving event, i.e., L, R, A, and B represent the frequency of sudden left lane change, sudden right lane change, sudden acceleration, and sudden deceleration in the time range before the accident. The second part of each variable represents the location where the dangerous driving event occurred, i.e., up and down represent the road sections within the spatial range upstream and downstream of the accident location, respectively.

(2)计算图2所示时空范围内同一类型危险驾驶事件中车辆的最大加速度和最大速度的平均值,记为:(2) Calculate the average value of the maximum acceleration and maximum speed of the vehicle in the same type of dangerous driving incident within the time and space range shown in Figure 2, and record it as:

Acceleration=[La_up,La_down,Ra_up,Ra_down,Aa_up,Aa_down,Ba_up,Ba_down]Acceleration = [La_up, La_down, Ra_up, Ra_down, Aa_up, Aa_down, Ba_up, Ba_down]

Speed=[Ls_up,Ls_down,Rs_up,Rs_down,As_up,As_down,Bs_up,Bs_down]Speed = [Ls_up, Ls_down, Rs_up, Rs_down, As_up, As_down, Bs_up, Bs_down]

其中,每个变量的第一部分内的La和Ls分别表示所有急左变道事件中车辆的最大加速度和最大速度的平均值,其余Ra,Rs,Aa,As,Ba,Bs的含义同理。每个变量的第二部分表示危险驾驶事件发生地点。Among them, La and Ls in the first part of each variable represent the average values of the maximum acceleration and maximum speed of the vehicle in all sharp left lane change events, and the meanings of the remaining Ra, Rs, Aa, As, Ba, and Bs are similar. The second part of each variable represents the location where the dangerous driving event occurred.

S2.4:将每起交通事故对应的Times,Acceleration和Speed数据进行合并,得到具有24个特征变量的危险驾驶事件数据。汇总所有交通事故对应的危险驾驶事件数据,将其作为自变量,记为:S2.4: Combine the Times, Acceleration and Speed data corresponding to each traffic accident to obtain dangerous driving event data with 24 characteristic variables. Summarize the dangerous driving event data corresponding to all traffic accidents and use them as independent variables, recorded as:

Xi=[Timesi,Accelerationi,Speedi] Xi =[Times i , Acceleration i , Speed i ]

其中,Timesi表示第i起交通事故对应的四种危险驾驶事件发生频次,Accelerationi和Speedi的含义同理。Among them, Times i represents the frequency of occurrence of four dangerous driving events corresponding to the i-th traffic accident, and Acceleration i and Speed i have the same meaning.

对于每起事故案例数据,赋予标签y=1,将其作为因变量,记为Y=[yi],yi=1。将自变量和因变量进行匹配,得到事故案例数据集。For each accident case data, a label y = 1 is assigned and it is used as the dependent variable, denoted as Y = [y i ], y i = 1. The independent variable and the dependent variable are matched to obtain the accident case data set.

S3:查找未发生交通事故时上下游路段的危险驾驶事件数据。以危险驾驶事件数据为自变量,以未发生事故为因变量,构建非事故案例数据集作为对照组。将事故案例数据集和非事故案例数据集合并,得到样本数据集。具体步骤如下:S3: Find the dangerous driving event data of the upstream and downstream sections when no traffic accidents occurred. Take the dangerous driving event data as the independent variable and the non-accident data as the dependent variable to construct a non-accident case data set as the control group. Merge the accident case data set and the non-accident case data set to obtain a sample data set. The specific steps are as follows:

S3.1:针对每起交通事故,依据S2.2、S2.3和S2.4中的方法,提取并计算事故发生前t0、t0~2t0、…、(n-1)t0~nt0时间范围内事故地点上下游l空间范围内的危险驾驶事件数据,汇总得到初始非事故案例数据。通常t0≥1且t0是正整数。n是正整数,控制着对照组选择的比例,可根据实际情况选择。S3.1: For each traffic accident, according to the methods in S2.2, S2.3 and S2.4, extract and calculate the dangerous driving event data within the spatial range of l upstream and downstream of the accident site within the time range of t 0 before the accident, t 0 ~2t 0 , ..., (n-1)t 0 ~nt 0 , and summarize the initial non-accident case data. Usually t 0 ≥1 and t 0 is a positive integer. n is a positive integer that controls the proportion of the control group selection and can be selected according to actual conditions.

为避免上述时间范围内存在交通事故造成的影响,检查全部非事故案例数据的时间,删除时间介于任一交通事故发生前t时间到事故发生后t′时间内的非事故案例数据。删除有信息缺失的案例,得到最终非事故案例数据,将其作为自变量,记为:In order to avoid the impact of traffic accidents within the above time range, check the time of all non-accident case data and delete the non-accident case data between t time before any traffic accident and t′ time after the accident. Delete the cases with missing information to obtain the final non-accident case data, which is used as the independent variable and recorded as:

Xj=[Timesj,Accelerationj,Speedj]。X j =[Times j ,Acceleration j ,Speed j ].

其中,Timesj表示第j起非事故案例对应的四种危险驾驶事件发生频次,Accelerationj和Speedj的含义同理。Among them, Times j represents the frequency of occurrence of four dangerous driving events corresponding to the jth non-accident case, and Acceleration j and Speed j have the same meaning.

对于每起非事故案例数据,赋予标签y=0,将其作为因变量,记为Y=[yj],yj=0。将自变量和因变量进行匹配,得到非事故案例数据集。For each non-accident case data, the label y = 0 is assigned and used as the dependent variable, denoted as Y = [y j ], y j = 0. The independent variable and the dependent variable are matched to obtain the non-accident case data set.

S3.2:将事故案例数据集和非事故案例数据集合并,得到样本数据集,记为:S3.2: Combine the accident case dataset and the non-accident case dataset to obtain a sample dataset, recorded as:

X=[Timesk,Accelerationk,Speedk]X=[Times k ,Acceleration k ,Speed k ]

=[L_upk,L_downk,R_upk,R_downk,A_upk,A_downk,B_upk,B_downk,La_upk,La_downk,Ra_upk,Ra_downk,Aa_upk,Aa_downk,Ba_upk,Ba_downk,Ls_upk,Ls_downk,Rs_upk,Rs_downk,As_upk,As_downk,Bs_upk,Bs_downk] =[L_up k ,L_down k ,R_up k ,R_down k ,A_up k ,A_down k ,B_up k ,B_down k ,La_up k ,La_down k ,Ra_up k ,Ra_down k ,Aa_up k ,Aa_down k ,Ba_up k ,Ba_down k ,Ls_up k ,Ls_down k ,Rs_up k ,Rs_down k ,As_up k ,As_down k ,Bs_up k ,Bs_down k ]

Y=[yk],yk=0或1Y=[y k ], y k =0 or 1

S4:对数据进行标准化处理。依据不同危险驾驶事件类型和空间位置关系将危险驾驶事件数据处理成图像形式,构建深度学习模型进行训练和调参,直至模型效果最优。具体步骤如下:S4: Standardize the data. Process the dangerous driving event data into images according to different dangerous driving event types and spatial position relationships, build a deep learning model for training and parameter adjustment until the model effect is optimal. The specific steps are as follows:

S4.1:计算各个自变量的平均值和标准差,按下式对所有数据进行标准化:S4.1: Calculate the mean and standard deviation of each independent variable and standardize all data as follows:

xnorm=(x-μ)/σx norm = (x-μ)/σ

其中,μ为自变量的平均值,σ为自变量的标准差。Among them, μ is the mean of the independent variable and σ is the standard deviation of the independent variable.

S4.2:将每条危险驾驶事件数据处理成四通道一维图像形式,如图3中的输入层所示(由于空间所限,部分变量名称未标出)。各通道内的变量如下:S4.2: Process each dangerous driving event data into a four-channel one-dimensional image format, as shown in the input layer of Figure 3 (due to space limitations, some variable names are not marked). The variables in each channel are as follows:

通道1:L_up,La_up,Ls_up,L_down,La_down,Ls_downChannel 1: L_up, La_up, Ls_up, L_down, La_down, Ls_down

通道2:R_up,Ra_up,Rs_up,R_down,Ra_down,Rs_downChannel 2: R_up, Ra_up, Rs_up, R_down, Ra_down, Rs_down

通道3:A_up,Aa_up,As_up,A_down,Aa_down,As_downChannel 3: A_up, Aa_up, As_up, A_down, Aa_down, As_down

通道4:B_up,Ba_up,Bs_up,B_down,Ba_down,Bs_downChannel 4: B_up, Ba_up, Bs_up, B_down, Ba_down, Bs_down

其中,同一驾驶行为类型的6个变量位于同一通道,按上下游路段关系(由up至down)以及危险驾驶事件发生频次、平均最大加速度和平均最大速度的顺序进行排列。不同类型危险驾驶事件的变量位于4个不同的通道。这种数据形式有利于模型提取和学习到数据在不同危险驾驶事件类型和上下游空间位置方面的内在联系,进而挖掘出数据中的隐藏模式。Among them, the six variables of the same driving behavior type are located in the same channel, and are arranged in the order of upstream and downstream road section relationship (from up to down) and the frequency of dangerous driving events, average maximum acceleration and average maximum speed. The variables of different types of dangerous driving events are located in four different channels. This data format is conducive to the model to extract and learn the intrinsic connection between the data in different types of dangerous driving events and upstream and downstream spatial locations, and then to mine the hidden patterns in the data.

S4.3:如图3所示,依次构建卷积层、池化层、展平层、全连接层、输出层。S4.2中得到的四通道一维图像数据作为深度学习模型的输入数据首先被输入到卷积层中。S4.3: As shown in Figure 3, the convolution layer, pooling layer, flattening layer, fully connected layer, and output layer are constructed in sequence. The four-channel one-dimensional image data obtained in S4.2 is first input into the convolution layer as the input data of the deep learning model.

所述的卷积层(Conv1D layer)的输入维度为I1,输出维度为O1,具有m1个尺寸为n1的卷积核。这些卷积核以s1的步长沿着输入数据通道内变量排列的方向(图3输入层箭头方向)进行一维卷积运算。卷积层的激活函数为relu。该模块的输出作为池化层的输入。The convolution layer (Conv1D layer) has an input dimension of I 1 and an output dimension of O 1 , and has m 1 convolution kernels of size n 1. These convolution kernels perform one-dimensional convolution operations along the direction of the variable arrangement in the input data channel (the direction of the input layer arrow in Figure 3) with a step size of s 1. The activation function of the convolution layer is relu. The output of this module is used as the input of the pooling layer.

所述的池化层(AveragePooling1D layer)的输入维度为I2,输出维度为O2,以s2的步长进行范围为n2的一维平均池化运算,以便降低数据维度,避免过拟合。该模块的输出作为展平层的输入。The pooling layer (AveragePooling1D layer) has an input dimension of I 2 and an output dimension of O 2 , and performs a one-dimensional average pooling operation with a range of n 2 with a step size of s 2 to reduce the data dimension and avoid overfitting. The output of this module is used as the input of the flattening layer.

所述的展平层(Flatten layer)的输入维度为I3,输出维度为O3。其将池化层输出的多通道一维图像数据转换成一维向量形式,并采用了比率为p的随机丢弃方法丢弃数据,来避免过拟合。该模块的输出作为全连接层的输入。The input dimension of the flatten layer is I 3 and the output dimension is O 3 . It converts the multi-channel one-dimensional image data output by the pooling layer into a one-dimensional vector form, and uses a random drop method with a ratio of p to discard data to avoid overfitting. The output of this module is used as the input of the fully connected layer.

所述的全连接层(Dense layer)的输入维度为I4,输出维度为O4,具有m2个神经元。全连接层的激活函数为relu。同时,采用了比率为pdrop的随机丢弃方法丢弃神经元。该模块的输出作为输出层的输入。The input dimension of the fully connected layer is I 4 , the output dimension is O 4 , and it has m 2 neurons. The activation function of the fully connected layer is relu. At the same time, a random drop method with a ratio of p drop is used to drop neurons. The output of this module is used as the input of the output layer.

所述的输出层(Output layer)具有1个神经元。输出层的激活函数为sigmoid,该函数将输出值转换到(0,1)区间。因此,整个深度学习模型的输出数据的含义为当前研究路段发生交通事故的概率。The output layer has 1 neuron. The activation function of the output layer is sigmoid, which converts the output value to the interval (0,1). Therefore, the output data of the entire deep learning model means the probability of a traffic accident occurring on the current research section.

S4.4:基于样本数据集对建立的深度学习模型进行训练。以受试者工作特性曲线下的面积(AUC,即Area Under Curve)、灵敏度(Sensitivity)和特异度(Specificity)为评估指标对深度学习模型效果进行评估。具体地,采用约登指数法计算得出各分类阈值下的约登指数,计算公式如下:S4.4: Train the established deep learning model based on the sample data set. The area under the receiver operating characteristic curve (AUC), sensitivity and specificity are used as evaluation indicators to evaluate the effect of the deep learning model. Specifically, the Youden index method is used to calculate the Youden index under each classification threshold, and the calculation formula is as follows:

Youden=Sensitivity+Specificity-1Youden=Sensitivity+Specificity-1

其中,TP表示实际和预测均为事故的数目,FN表示实际为事故但预测为非事故的数目,TN表示实际和预测均为非事故的数目,FP表示实际为非事故但预测为事故的数目。选择约登指数(Youden)最大时对应的阈值作为模型的分类阈值(threshold),该阈值下模型的灵敏度、特异度作为模型评估指标。模型的AUC、灵敏度和特异度的值越接近1,表明模型预测性能越好。Among them, TP represents the number of accidents that are both actual and predicted, FN represents the number of accidents that are actually predicted to be non-accidents, TN represents the number of non-accidents that are actually predicted to be, and FP represents the number of non-accidents that are actually predicted to be accidents. The threshold corresponding to the maximum Youden index is selected as the classification threshold of the model, and the sensitivity and specificity of the model under this threshold are used as model evaluation indicators. The closer the values of the model's AUC, sensitivity, and specificity are to 1, the better the model's prediction performance is.

基于上述三个评估指标,采用十折交叉验证方法、二元交叉熵损失函数和Nadam优化器基于样本数据集对建立的深度学习模型进行调优,得到效果最优的深度学习模型。Based on the above three evaluation indicators, the ten-fold cross validation method, binary cross entropy loss function and Nadam optimizer are used to tune the established deep learning model based on the sample data set to obtain the deep learning model with the best effect.

S5:实时获取高速公路全路段车辆的危险驾驶事件数据,将其输入S4构建的深度学习模型,计算得出各路段交通事故风险评估值和风险水平。具体步骤如下:S5: Real-time acquisition of dangerous driving incident data of vehicles on all sections of the highway, inputting it into the deep learning model constructed in S4, and calculating the traffic accident risk assessment value and risk level of each section. The specific steps are as follows:

S5.1:实时获取高速公路全路段上车辆的急加速、急减速、急左变道和急右变道四种危险驾驶事件数据,建立路段间的上下游关系。依据S2中计算方法得出高速公路各位置前t时间范围内四种危险驾驶事件发生频次Times′以及所有同类型危险驾驶事件中车辆最大加速度Acceleration′和最大速度Speed′的平均值,汇总得到模型的输入数据X。S5.1: Obtain the data of four dangerous driving events, namely, sudden acceleration, sudden deceleration, sudden left lane change and sudden right lane change, on all sections of the highway in real time, and establish the upstream and downstream relationship between sections. According to the calculation method in S2, obtain the frequency of occurrence of the four dangerous driving events Times′ within the time range before t at each position of the highway, as well as the average value of the maximum acceleration Acceleration′ and the maximum speed Speed′ of the vehicle in all the same type of dangerous driving events, and summarize them to obtain the input data X of the model.

S5.2:将X输入S4构建的深度学习模型,得到当前高速公路各位置发生交通事故的概率p∈(0,1),即为交通事故风险评估值。S5.2: Input X into the deep learning model constructed in S4 to obtain the probability p∈(0,1) of a traffic accident occurring at each location on the current highway, which is the traffic accident risk assessment value.

当p∈(0,threshold)时,交通事故风险水平为低风险,无需干预。When p∈(0,threshold), the traffic accident risk level is low and no intervention is required.

当p∈[threshold,1)时,交通事故风险水平为高风险,需要采取相应措施管控安全风险。When p∈[threshold,1), the traffic accident risk level is high, and corresponding measures need to be taken to control safety risks.

其中,threshold∈(0,1)。Among them, threshold∈(0,1).

具体案例Specific case

为展示本发明实施例提供的基于危险驾驶事件数据的高速公路交通事故风险评估方法在真实场景中的适用性,给出以下具体案例进行进一步说明:In order to demonstrate the applicability of the highway traffic accident risk assessment method based on dangerous driving event data provided by the embodiment of the present invention in real scenarios, the following specific case is given for further explanation:

以中国G25高速公路溧阳-长兴段为例,数据收集时间范围为2020年9月26日至2020年10月2日。采集该路段上车辆的危险驾驶事件数据,数据采集的时间间隔为1秒。数据字段包括危险驾驶事件发生时间,事件所在地的经度、纬度和上下行方向,危险驾驶事件类型(急加速、急减速、急左变道和急右变道),危险驾驶事件中车辆的运动参数(最大加速度和最大速度)。收集时间范围内该路段上的交通事故数据,数据字段包括事故的起始时间、结束时间,事故发生地点的经度、纬度和上下行方向。经整理,统计范围内研究路段上共记录371起交通事故。Taking the Liyang-Changxing section of China's G25 Expressway as an example, the data collection time range is from September 26, 2020 to October 2, 2020. The dangerous driving incident data of vehicles on this section are collected, and the time interval for data collection is 1 second. The data fields include the time of occurrence of the dangerous driving incident, the longitude, latitude and uplink and downlink directions of the location of the incident, the type of dangerous driving incident (sudden acceleration, sudden deceleration, sudden left lane change and sudden right lane change), and the motion parameters of the vehicle in the dangerous driving incident (maximum acceleration and maximum speed). The traffic accident data on this section within the time range are collected, and the data fields include the start time and end time of the accident, the longitude, latitude and uplink and downlink directions of the location of the accident. After sorting, a total of 371 traffic accidents were recorded on the research section within the statistical scope.

由于记录的交通事故发生时间往往略晚于实际发生时间,危险驾驶事件数据汇总的时间间隔较短会影响预测的准确性。此外,汇总时间间隔较短时,每个间隔内的危险驾驶事件数量较少且不同间隔内分布差异较大,不利于建模。因此,本案例选择30分钟作为数据汇总的时间间隔。此外,选择250米作为危险驾驶事件数据选取的空间间隔,便于精细地提取对交通事故影响较大的危险驾驶事件。即t=30,l=250。Since the recorded time of traffic accidents is often slightly later than the actual time of occurrence, a short time interval for the aggregation of dangerous driving event data will affect the accuracy of the prediction. In addition, when the aggregation time interval is short, the number of dangerous driving events in each interval is small and the distribution difference in different intervals is large, which is not conducive to modeling. Therefore, this case selects 30 minutes as the time interval for data aggregation. In addition, 250 meters is selected as the spatial interval for the selection of dangerous driving event data, which is convenient for fine extraction of dangerous driving events that have a greater impact on traffic accidents. That is, t = 30, l = 250.

依据具体实施方式S2的步骤提取了事故发生前上下游路段发生的危险驾驶事件数据并进行数据融合,得到265条事故案例。对于每起交通事故,依据具体实施方式S3的步骤采用提取并计算事故发生前0~1、1~2、2~3小时(即t0=1)范围内事故地点上下游250米内的危险驾驶事件数据作为该交通事故对应的非事故案例,汇总得到初始非事故案例数据。According to step S2 of the specific implementation method, the dangerous driving event data that occurred in the upstream and downstream sections before the accident was extracted and data fusion was performed to obtain 265 accident cases. For each traffic accident, according to step S3 of the specific implementation method, the dangerous driving event data within 250 meters upstream and downstream of the accident site within the range of 0 to 1, 1 to 2, and 2 to 3 hours before the accident (i.e., t 0 =1) was extracted and calculated as the non-accident case corresponding to the traffic accident, and the initial non-accident case data was obtained by summarizing.

为避免事故案例的影响,检查全部非事故案例的时间,删除时间介于任一交通事故发生前30分钟到事故发生后180分钟内的非事故案例数据。删除有信息缺失的案例,得到最终的722条非事故案例数据,形成了事故与非事故比例约为1:3的样本数据集。To avoid the influence of accident cases, the time of all non-accident cases was checked, and the non-accident case data with a time between 30 minutes before and 180 minutes after any traffic accident was deleted. Cases with missing information were deleted, and the final 722 non-accident case data were obtained, forming a sample data set with an accident-to-non-accident ratio of approximately 1:3.

依据具体实施方式S4的步骤对数据进行标准化处理,将每条危险驾驶事件数据处理成四通道一维图像形式,建立了包括一维卷积模块的深度学习模型。该模型的具体参数如下:According to the step S4 of the specific implementation method, the data is standardized, each dangerous driving event data is processed into a four-channel one-dimensional image form, and a deep learning model including a one-dimensional convolution module is established. The specific parameters of the model are as follows:

卷积层的输入维度为(N,6,4),具有512个尺寸为3的卷积核。这些卷积核以1的步长进行一维卷积运算。因此,输出维度为(N,4,512)。其中,N为训练集样本数。The input dimension of the convolutional layer is (N, 6, 4), with 512 convolution kernels of size 3. These convolution kernels perform one-dimensional convolution operations with a stride of 1. Therefore, the output dimension is (N, 4, 512), where N is the number of samples in the training set.

池化层以1的步长进行范围为3的一维平均池化运算。因此,输出维度为(N,2,512)。The pooling layer performs a one-dimensional average pooling operation with a range of 3 and a stride of 1. Therefore, the output dimension is (N, 2, 512).

展平层将池化层输出的多通道一维图像数据转换成一维向量形式,并采用了比率为0.5的随机丢弃方法丢弃数据,来避免过拟合。因此,输出维度为(N,1024)。The flattening layer converts the multi-channel one-dimensional image data output by the pooling layer into a one-dimensional vector form, and uses a random dropout method with a ratio of 0.5 to discard data to avoid overfitting. Therefore, the output dimension is (N, 1024).

全连接层具有32个神经元,激活函数为relu。采用了比率为0.5的随机丢弃方法丢弃神经元。因此,输出维度为(N,32)。The fully connected layer has 32 neurons and the activation function is relu. Neurons are dropped randomly with a rate of 0.5. Therefore, the output dimension is (N,32).

输出层具有1个神经元,输出维度为(N,1),即为一维向量。输出层的激活函数为sigmoid。输出数据的含义为当前研究路段发生交通事故的概率。The output layer has 1 neuron, and the output dimension is (N, 1), which is a one-dimensional vector. The activation function of the output layer is sigmoid. The output data means the probability of a traffic accident occurring on the current research section.

采用十折交叉验证方法、二元交叉熵损失函数和Nadam优化器训练深度学习模型并进行超参数调优,得到效果最优的深度学习模型。The ten-fold cross validation method, binary cross entropy loss function and Nadam optimizer were used to train the deep learning model and perform hyperparameter tuning to obtain the deep learning model with the best effect.

将验证集数据作为实时获取的高速公路全路段车辆的危险驾驶事件数据,依据具体实施方式S5的步骤,将输入到训练好的深度学习模型计算交通事故风险评估值。实验显示,与逻辑回归、支持向量机和人工神经网络模型三个基准模型(输入数据仅支持一维向量形式)相比,本发明提出的深度学习模型(输入数据为图像形式)的AUC值最高,在最优阈值下能够识别出71.3%的事故情况和75.1%的非事故情况,是唯一一个灵敏度和特异度均大于70%的模型。同时该深度学习模型在较小的误报率情况下(小于30%)具有最高的灵敏度。具体案例的实验结果证明了本发明提出的基于危险驾驶事件数据进行高速公路事故风险评估方法的综合性能最好,能够提取和学习到危险驾驶事件数据在不同事件类型和上下游空间位置方面的内在联系,有效提升了风险评估的准确性,具有较高的应用价值。The validation set data is used as the real-time data of dangerous driving events of vehicles on the entire section of the highway, and according to the step S5 of the specific implementation method, it is input into the trained deep learning model to calculate the traffic accident risk assessment value. Experiments show that compared with the three benchmark models of logistic regression, support vector machine and artificial neural network model (input data only supports one-dimensional vector form), the deep learning model proposed in the present invention (input data is in image form) has the highest AUC value, and can identify 71.3% of accident situations and 75.1% of non-accident situations under the optimal threshold. It is the only model with a sensitivity and specificity greater than 70%. At the same time, the deep learning model has the highest sensitivity under a small false alarm rate (less than 30%). The experimental results of the specific case prove that the comprehensive performance of the highway accident risk assessment method based on dangerous driving event data proposed in the present invention is the best, and it can extract and learn the inherent connection between dangerous driving event data in different event types and upstream and downstream spatial positions, effectively improve the accuracy of risk assessment, and has a high application value.

以上所述仅为本发明的具体实施方式,只是用于帮助理解本发明的原理及其核心思想,并不用于限制本发明。在本发明的思想和原则之内,任何本领域的技术人员对本发明描述的技术方案所做的修改、等同替换等均应包含在本发明的保护范围之内。The above description is only a specific embodiment of the present invention, which is only used to help understand the principle and core idea of the present invention, and is not used to limit the present invention. Within the idea and principle of the present invention, any modification, equivalent replacement, etc. made by any technician in the field to the technical solution described in the present invention should be included in the protection scope of the present invention.

Claims (3)

1. The highway traffic accident risk assessment method based on the dangerous driving event data is characterized by comprising the following steps of:
S1, determining a research road section and a statistical time range, and acquiring vehicle dangerous driving event related data on the expressway research road section and traffic accident data on the research road section in the statistical time range;
S2, aiming at each traffic accident, searching dangerous driving event data of an upstream road section and a downstream road section of an accident site before the accident occurs, taking the dangerous driving event data as independent variables, and taking the accident as the dependent variables to construct an accident case data set;
S3, searching dangerous driving event data of an upstream road section and a downstream road section when no traffic accident occurs, taking the dangerous driving event data as an independent variable, taking the non-occurring accident as a dependent variable, constructing a non-accident case data set as a comparison group, and combining the accident case data set and the non-accident case data set to obtain a sample data set;
s4, carrying out standardized processing on the data: processing dangerous driving event data into an image form according to different dangerous driving event types and spatial position relations, constructing a deep learning model, and training and parameter adjustment until the model effect is optimal;
s5, acquiring dangerous driving event data of vehicles on all road sections of the expressway in real time, calculating to obtain occurrence frequency of various dangerous driving events and average value of maximum acceleration and maximum speed of the vehicles in the dangerous driving events according to the S2, inputting the average value into a deep learning model constructed in the S4, and calculating to obtain traffic accident risk assessment values and risk levels of all road sections;
The data acquired in step S1 includes:
(1) Dangerous driving event related data of vehicles on an expressway, comprising: the occurrence time of dangerous driving events, the longitude, latitude and uplink and downlink directions of the occurrence place, the type of the dangerous driving events, and the maximum acceleration and maximum speed of the vehicle in the dangerous driving events; dangerous driving event types include: rapid acceleration, rapid deceleration, rapid left lane change and rapid right lane change;
(2) Highway traffic accident data comprising: the starting time and the ending time of the accident, the longitude and the latitude of the accident place and the uplink and downlink directions;
The step S2 specifically comprises the following sub-steps:
S201, marking dangerous driving events and traffic accident positions on corresponding road sections of the expressway according to longitude, latitude and uplink and downlink directions, and establishing an uplink and downlink relation between the road sections;
S202, according to the space-time position of each traffic accident, four dangerous driving event data of rapid acceleration, rapid deceleration, rapid left lane change and rapid right lane change in the space range of the upstream and downstream of the accident site in the time range of t before the accident occurs are extracted;
S203, for each traffic accident, respectively calculating the following contents:
(1) Calculating occurrence frequencies of four dangerous driving events in the space-time range in the step S202, and marking as:
Times=[L_up,L_down,R_up,R_down,A_up,A_down,B_up,B_down]
Wherein a first part of each variable represents a dangerous driving event type, namely L, R, A and B respectively represent the frequency of occurrence of sudden left lane change, sudden right lane change, sudden acceleration and sudden deceleration driving events in a time range t before an accident occurs, and a second part of each variable represents a dangerous driving event occurrence place, namely up and down respectively represent road sections in a space range of an upstream and a downstream of the accident place;
(2) Calculating the average value of the maximum acceleration and the maximum speed of the vehicle in the same type of dangerous driving event in the space-time range in the step S202, and marking as:
Acceleration=[La_up,La_down,Ra_up,Ra_down,Aa_up,Aa_down,Ba_up,Ba_down]
Speed=[Ls_up,Ls_down,Rs_up,Rs_down,As_up,As_down,Bs_up,Bs_down]
Wherein La, ls in the first portion of each variable represent average values of maximum acceleration and maximum velocity of the vehicle in all rapid left lane change events, ra, rs represent average values of maximum acceleration and maximum velocity of the vehicle in all rapid right lane change events, aa, as represent average values of maximum acceleration and maximum velocity of the vehicle in all rapid acceleration events, ba, bs represent average values of maximum acceleration and maximum velocity of the vehicle in all rapid deceleration events, respectively;
S204: combining the time, acception and Speed data corresponding to each traffic accident to obtain dangerous driving event data with 24 characteristic variables, summarizing the dangerous driving event data corresponding to all the traffic accidents, and recording the dangerous driving event data as independent variables as: x i=[Timesi,Accelerationi,Speedi ], wherein time i represents the occurrence frequency of four dangerous driving events corresponding to the ith traffic accident, and acception i and Speed i represent the average value of the maximum Acceleration and the maximum Speed of the vehicle in the four dangerous driving events corresponding to the ith traffic accident respectively;
Each accident case data is given a label y=1, the label y=1 is taken as a dependent variable, the dependent variable is marked as Y= [ Y i],yi =1, and the independent variable and the dependent variable are matched to obtain an accident case data set;
the step S3 specifically comprises the following sub-steps:
s301: for each traffic accident, extracting and calculating dangerous driving event data in the space range of the upstream and downstream of the accident site in the time range of t 0、t0~2t0、…、(n-1)t0~nt0 before the accident occurs, and summarizing to obtain initial non-accident case data, wherein n is a positive integer for controlling the proportion selected by a control group;
Checking the time of all initial non-accident cases, and deleting non-accident case data with the time between t time before any traffic accident and t' time after the accident; deleting the case with information missing to obtain final non-accident case data, and recording the final non-accident case data as an independent variable: x j=[Timesj,Accelerationj,Speedj ], wherein time j represents the occurrence frequency of four dangerous driving events corresponding to the jth non-accident case, and acceletion j and Speed j represent the average value of the maximum Acceleration and the maximum Speed of the vehicle in the four dangerous driving events corresponding to the jth non-accident case, respectively;
Each piece of non-accident case data is given a label y=0, the label y=0 is taken as a dependent variable, the dependent variable is marked as Y= [ Y j],yj =0, and the independent variable and the dependent variable are matched to obtain a non-accident case data set;
S302, combining the accident case data set and the non-accident case data set to obtain a sample data set, and marking as:
X=[Timesk,Accelerationk,Speedk]
=[L_upk,L_downk,R_upk,R_downk,A_upk,A_downk,B_upk,B_downk,La_upk,La_downk,Ra_upk,Ra_downk,Aa_upk,Aa_downk,Ba_upk,Ba_downk,Ls_upk,Ls_downk,Rs_upk,Rs_downk,As_upk,As_downk,Bs_upk,Bs_downk]
Y= [ Y k],yk =0 or 1;
The step S4 specifically comprises the following sub-steps:
s401, carrying out standardization treatment on data according to x norm = (x-mu)/sigma, wherein mu is an average value, and sigma is a standard deviation;
S402, processing each dangerous driving event data into a four-channel one-dimensional image form, wherein variables in each channel are as follows:
channels 1:L_up, la_up, ls_up, L_down, la_down, ls_down
Channel 2 R_up, ra_up, rs_up, R_down, ra_down, rs_down
Channels 3: A_up, aa_up, as_up, A_down, aa_down, as_down
Channels 4:B_up, ba_up, bs_up, B_down, ba_down, bs_down
The method comprises the steps that 6 variables of the same driving behavior type are located in the same channel, and are arranged according to the relation between an upstream road section and a downstream road section and the sequence of occurrence frequency, average maximum acceleration and average maximum speed of dangerous driving events, and the variables of different types of dangerous driving events are located in 4 different channels;
S403, establishing a deep learning model: sequentially constructing a convolution layer, a pooling layer, a flattening layer, a full-connection layer and an output layer, and inputting the four-channel one-dimensional image data obtained in the step S402 into the convolution layer as input data of a deep learning model; the convolution layer is provided with m 1 convolution kernels with the size of n 1, the convolution kernels carry out one-dimensional convolution operation along the variable arrangement direction in an input data channel in the step length of s 1, the activation function is relu, and the output of the convolution layer is used as the input of the pooling layer;
The pooling layer performs one-dimensional average pooling operation with the range of n 2 by the step length of s 2, and the output of the pooling layer is used as the input of the flattening layer;
The flattening layer converts the multi-channel one-dimensional image data output by the pooling layer into a one-dimensional vector form, a random discarding method with the ratio of p is adopted to avoid overfitting, and the output of the flattening layer is used as the input of the full-connection layer;
The full-connection layer is provided with m 2 neurons, the activation function is relu, the random discarding method with the ratio of p is adopted to avoid overfitting, the random discarding method with the ratio of p drop is adopted, and the output of the full-connection layer is used as the input of the output layer;
The output of the full-connection layer is provided with 1 neuron, the activation function of the output layer is sigmoid, and the meaning of the output data is the probability of traffic accidents in the current research road section;
S404, training and optimizing the established deep learning model based on the sample data set: taking the area, sensitivity and specificity of the test subject under the working characteristic curve as evaluation indexes, and adopting a ten-fold cross validation method, a binary cross entropy loss function and Nadam optimizer for optimization to obtain a deep learning model with optimal effect;
The step S5 specifically comprises the following steps:
s501, obtaining data of four dangerous driving events of rapid Acceleration, rapid deceleration, rapid left lane change and rapid right lane change of a vehicle on a whole highway section in real time, calculating the occurrence frequency Times ' of the four dangerous driving events in a time range of t before each position of the highway according to the step S2, and collecting average values of the maximum Acceleration accelization ' and the maximum Speed ' of the vehicle in all the same type of dangerous driving events to obtain input data of a model;
S502: inputting data into the deep learning model constructed in the step S4, and calculating the probability p E (0, 1) of the occurrence of the traffic accident at each position of the current expressway, namely, the traffic accident risk assessment value;
When p is E (0, threshold), the traffic accident risk level is low risk, and no intervention is needed;
When p is epsilon (threshold, 1), the traffic accident risk level is high risk, and corresponding measures are needed to be adopted to manage and control the safety risk;
Wherein threshold E (0, 1).
2. A highway traffic accident risk assessment system based on the method of claim 1, comprising:
The data acquisition module is used for determining a research road section and a statistical time range and acquiring dangerous driving event data on the expressway research road section and traffic accident data on the research road section within the statistical time range;
The sample construction module is used for searching dangerous driving event data of an upstream road section and a downstream road section of an accident site before the accident occurs according to each traffic accident, taking the dangerous driving event data as independent variables and taking the accident as the dependent variables, and constructing an accident case data set; searching dangerous driving event data of an upstream road section and a downstream road section when no traffic accident occurs, taking the dangerous driving event data as an independent variable, taking the non-occurring accident as a dependent variable, constructing a non-accident case data set as a comparison group, and combining the accident case data set and the non-accident case data set to obtain a sample data set;
The model training module is used for carrying out standardized processing on the data, processing the data into an image form according to different dangerous driving event types and upstream and downstream spatial relations, constructing a deep learning model, and carrying out training and parameter adjustment until the model effect is optimal;
The road section accident risk assessment module is used for acquiring dangerous driving event data of vehicles on the whole road section of the expressway in real time, calculating the occurrence frequency of dangerous driving events and the average value of the maximum acceleration and the maximum speed of the vehicles in all the same type of dangerous driving events, inputting the average value into the constructed deep learning model, and calculating to obtain the traffic accident risk assessment value and the risk level of each road section.
3. The system of claim 2, wherein the model training module builds a deep learning model by sequentially building a convolutional layer, a pooling layer, a flattening layer, a fully-connected layer, and an output layer, wherein
The convolution layer is provided with m 1 convolution kernels with the size of n 1, the convolution kernels carry out one-dimensional convolution operation along the variable arrangement direction in the input data channel in the step length of s 1, the activation function is relu, and the output of the convolution layer is used as the input of the pooling layer;
The pooling layer performs one-dimensional average pooling operation with the range of n 2 by the step length of s 2, and the output of the pooling layer is used as the input of the flattening layer;
The flattening layer converts the multi-channel one-dimensional image data output by the pooling layer into a one-dimensional vector form, a random discarding method with the ratio of p is adopted to avoid overfitting, and the output of the flattening layer is used as the input of the full-connection layer;
The full-connection layer is provided with m 2 neurons, the activation function is relu, the random discarding method with the ratio of p is adopted to avoid overfitting, the random discarding method with the ratio of p drop is adopted, and the output of the flattening layer is used as the input of the output layer;
The output of the full-connection layer is provided with 1 neuron, the activation function of the output layer is sigmoid, and the meaning of the output data is the probability of traffic accidents in the current research road section.
CN202310569137.8A 2023-05-19 2023-05-19 Highway traffic accident risk assessment method and system based on dangerous driving event data Active CN116631186B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310569137.8A CN116631186B (en) 2023-05-19 2023-05-19 Highway traffic accident risk assessment method and system based on dangerous driving event data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310569137.8A CN116631186B (en) 2023-05-19 2023-05-19 Highway traffic accident risk assessment method and system based on dangerous driving event data

Publications (2)

Publication Number Publication Date
CN116631186A CN116631186A (en) 2023-08-22
CN116631186B true CN116631186B (en) 2024-05-28

Family

ID=87591311

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310569137.8A Active CN116631186B (en) 2023-05-19 2023-05-19 Highway traffic accident risk assessment method and system based on dangerous driving event data

Country Status (1)

Country Link
CN (1) CN116631186B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117351708A (en) * 2023-10-08 2024-01-05 北京迈道科技有限公司 A highway safety operation management early warning method, system and storage medium
CN117273964B (en) * 2023-11-21 2024-02-02 国任财产保险股份有限公司 Intelligent vehicle insurance policy generation system and method for self-adaptive driving data
CN118135802B (en) * 2024-05-10 2024-06-28 四川高路文化旅游发展有限责任公司 Bridge road management and control system and method based on deep learning network
CN118968771B (en) * 2024-10-16 2024-12-31 清华大学 Multi-level traffic accident prediction and correction system, method and product

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106355877A (en) * 2016-08-23 2017-01-25 重庆大学 Method of simulated estimate of expressway-traffic-accident-affected areas based on particle filter algorithm
CN108198415A (en) * 2017-12-28 2018-06-22 同济大学 A kind of city expressway accident forecast method based on deep learning
CN110210090A (en) * 2019-05-22 2019-09-06 淮阴工学院 Motor vehicle accident analysis method based on uncertainty theory and genetic algorithm
CN110866677A (en) * 2019-10-25 2020-03-06 东南大学 A method for evaluating the relative risk of drivers based on benchmarking analysis
CN112037513A (en) * 2020-09-01 2020-12-04 清华大学 A real-time traffic safety index dynamic comprehensive evaluation system and its construction method
CN112201038A (en) * 2020-09-28 2021-01-08 同济大学 A road network risk assessment method based on the risk of bad driving behavior of bicycles
CN112837533A (en) * 2021-01-08 2021-05-25 合肥工业大学 A Highway Accident Frequency Prediction Method Considering Time-varying Characteristics of Risk Factors
CN112990545A (en) * 2021-02-08 2021-06-18 东南大学 Traffic safety state prediction method for expressway intersection area
WO2021133789A1 (en) * 2019-12-27 2021-07-01 Lyft, Inc. Systems and methods for incident detection using inference models
CN113689071A (en) * 2021-07-15 2021-11-23 东南大学 An active intervention method for bad driving behavior based on multi-driver risk assessment
CN113762734A (en) * 2021-08-17 2021-12-07 淮阴工学院 A method and system for risk assessment of dangerous chemicals vehicles on highways
KR102416711B1 (en) * 2022-02-21 2022-07-05 주식회사 카비 Accident risk prediction device for each driver using artificial intelligence and accident risk prediction method using the same

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020046662A2 (en) * 2018-08-22 2020-03-05 Cubic Corporation Connected and autonomous vehicle (cav) behavioral adaptive driving

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106355877A (en) * 2016-08-23 2017-01-25 重庆大学 Method of simulated estimate of expressway-traffic-accident-affected areas based on particle filter algorithm
CN108198415A (en) * 2017-12-28 2018-06-22 同济大学 A kind of city expressway accident forecast method based on deep learning
CN110210090A (en) * 2019-05-22 2019-09-06 淮阴工学院 Motor vehicle accident analysis method based on uncertainty theory and genetic algorithm
CN110866677A (en) * 2019-10-25 2020-03-06 东南大学 A method for evaluating the relative risk of drivers based on benchmarking analysis
WO2021133789A1 (en) * 2019-12-27 2021-07-01 Lyft, Inc. Systems and methods for incident detection using inference models
CN112037513A (en) * 2020-09-01 2020-12-04 清华大学 A real-time traffic safety index dynamic comprehensive evaluation system and its construction method
CN112201038A (en) * 2020-09-28 2021-01-08 同济大学 A road network risk assessment method based on the risk of bad driving behavior of bicycles
CN112837533A (en) * 2021-01-08 2021-05-25 合肥工业大学 A Highway Accident Frequency Prediction Method Considering Time-varying Characteristics of Risk Factors
CN112990545A (en) * 2021-02-08 2021-06-18 东南大学 Traffic safety state prediction method for expressway intersection area
CN113689071A (en) * 2021-07-15 2021-11-23 东南大学 An active intervention method for bad driving behavior based on multi-driver risk assessment
CN113762734A (en) * 2021-08-17 2021-12-07 淮阴工学院 A method and system for risk assessment of dangerous chemicals vehicles on highways
KR102416711B1 (en) * 2022-02-21 2022-07-05 주식회사 카비 Accident risk prediction device for each driver using artificial intelligence and accident risk prediction method using the same

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
信号交叉口左转待行区综合效益评估;李静;王占永;蔡铭;;中山大学学报(自然科学版);20190515(03);全文 *

Also Published As

Publication number Publication date
CN116631186A (en) 2023-08-22

Similar Documents

Publication Publication Date Title
CN116631186B (en) Highway traffic accident risk assessment method and system based on dangerous driving event data
CN116504076A (en) Expressway traffic flow prediction method based on ETC portal data
CN111275962B (en) Vehicle track data aggregation effect prediction method and device
CN111680745A (en) A method and system for sudden congestion discrimination based on multi-source traffic big data fusion
CN115148019A (en) Early warning method and system based on holiday congestion prediction algorithm
CN102436740A (en) Automatic detection method for highway traffic incident
CN113963539A (en) Highway traffic accident identification method, module and system
CN116597642A (en) Traffic jam condition prediction method and system
CN117238126A (en) Traffic accident risk assessment method under continuous flow road scene
CN118430270B (en) A public transportation safety early warning method and system based on real-time platform data fusion
CN118627407A (en) Waterlogging model creation method and system for urban water management
CN118155431A (en) Road section flow monitoring and controlling method and system based on radar detection
CN119399446A (en) A smart community target recognition and tracking method based on big data
CN114446052B (en) Expressway event detection method and device based on multi-level fusion of multi-source data
CN119441782A (en) An intelligent analysis method for abnormal behavior of platform vehicles based on deep learning
CN118378901B (en) Dynamic evaluation method and equipment for influence degree of highway traffic accident
CN119380539A (en) A method and system for real-time warning and dynamic clearing of highway traffic congestion
CN115035328A (en) Converter image incremental automatic machine learning system and its establishment and training method
CN114444872A (en) Method and system for analyzing driving behavior of driver by adopting artificial intelligence algorithm
CN118966865A (en) A subway station emergency classification and emergency coordination method and system
CN115374855B (en) Clustering-based method for grading railway emergency
CN117196153A (en) A smart traffic comprehensive management system based on traffic big data
CN118798664B (en) A method and system for early warning of emergencies based on emergency portraits
CN120088989B (en) A method and system for warning abnormal events based on multi-modal perception of traffic network
CN119049291B (en) A road traffic management method and system based on digital twin

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant