CN102881162B - Data processing and fusion method for large-scale traffic information - Google Patents
Data processing and fusion method for large-scale traffic information Download PDFInfo
- Publication number
- CN102881162B CN102881162B CN201210375074.4A CN201210375074A CN102881162B CN 102881162 B CN102881162 B CN 102881162B CN 201210375074 A CN201210375074 A CN 201210375074A CN 102881162 B CN102881162 B CN 102881162B
- Authority
- CN
- China
- Prior art keywords
- data
- traffic
- fusion
- traffic information
- information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Landscapes
- Traffic Control Systems (AREA)
Abstract
本发明公开了一种大规模交通信息的数据处理及融合方法,属于交通信息实时处理技术,包括:根据测试车及各个传感器采集到的多源交通数据获得真值系统的标准数据,并确定参数的动态赋值方法;剔除传感器采集到的数据集合中的异常数据,并进行历史数据的补偿;对完成数据补偿的多源交通数据实时分级信息融合。本发明通过建立真值系统获得各种采集方式正确的初始赋值参数,对真实采集到的数据进行异常剔除,缺失数据根据历史数据进行合理填补以保证数据的准确性和完整性,对不同类别的采集方式获得的数据进行分级逐级融合处理以保证数据的可靠性,融合过程的快速性,并在融合过程中考虑交通事件、交通管制、占地施工、交通事故给数据带来的影响。
The invention discloses a large-scale traffic information data processing and fusion method, which belongs to the traffic information real-time processing technology, including: obtaining the standard data of the true value system according to the multi-source traffic data collected by the test vehicle and various sensors, and determining the parameters The dynamic assignment method; eliminate the abnormal data in the data collection collected by the sensor, and compensate the historical data; the real-time classification information fusion of the multi-source traffic data that has completed the data compensation. The present invention obtains the correct initial assignment parameters of various acquisition methods by establishing a truth system, and removes abnormalities from the actually collected data, and fills in missing data reasonably according to historical data to ensure the accuracy and integrity of the data. The data obtained by the collection method is fused step by step to ensure the reliability of the data and the speed of the fusion process, and the impact of traffic events, traffic control, land occupation construction, and traffic accidents on the data is considered during the fusion process.
Description
技术领域technical field
本发明涉及智能交通系统的交通信息实时处理技术领域,特别涉及一种大规模交通信息的数据处理及融合方法。The invention relates to the technical field of real-time processing of traffic information of an intelligent traffic system, in particular to a data processing and fusion method of large-scale traffic information.
背景技术Background technique
信息融合最早应用于军事领域,是组合多源信息和数据完成目标检测、关联、状态评估的多层次、多方面的过程。这种信息融合的目的是获得准确的目标识别、完整而及时的战场态势和威胁评估。随着传感器技术、计算机科学和信息技术的发展,各种面向复杂应用背景的多传感器系统大量涌现,使得多渠道的信息获取、处理和融合成为可能,并且在金融管理、心理评估和预测、医疗诊断、气象预报、组织管理决策、机器人视觉、交通管制、遥感遥测等诸多领域,人们都认识到把多个数据源中的信息综合起来能够提高工作的成绩。因此,多源信息融合技术在军事领域和民用领域得到了广泛的重视和成功的应用,其理论和方法已成为智能信息处理及控制的一个重要研究方向。Information fusion was first applied in the military field. It is a multi-level and multi-faceted process of combining multi-source information and data to complete target detection, association, and status assessment. The purpose of this information fusion is to obtain accurate target identification, complete and timely battlefield situation and threat assessment. With the development of sensor technology, computer science and information technology, a large number of multi-sensor systems for complex application backgrounds have emerged, making multi-channel information acquisition, processing and fusion possible. In many fields such as diagnosis, weather forecast, organization and management decision-making, robot vision, traffic control, remote sensing and telemetry, people have realized that combining information from multiple data sources can improve work performance. Therefore, multi-source information fusion technology has been widely valued and successfully applied in military and civilian fields, and its theory and method have become an important research direction of intelligent information processing and control.
现有技术更集中在数据级信息融合,其直接对未经预处理的传感器原始观测数据进行综合和分析。优点是保持了尽可能多的客体信息,基本不发生信息丢失或遗漏;缺点是处理数据太多,耗费时间太长,实时性差。另一种信息融合是特征级信息融合,亦称文件级信息融合,是对已经过传感器初步预处理之后,在传感器实现基本特征提取、提供文件报告的基础上执行的综合分析处理。其优点是既保持足够数量的重要信息,又已经过可容许的数据压缩,大大稀释了数据量,可以提高处理过程的实时性;而且特别有价值的是在模式识别、图像分析、计算机视觉等现代高技术应用中,实际都以特征提取为基础,都已在这方面开展大量工作。特征级信息融合的缺点是,不可避免地会有某些信息损失,因而需对传感器预处理提出较严格的要求。Existing technologies are more focused on data-level information fusion, which directly synthesizes and analyzes raw sensor observation data without preprocessing. The advantage is that it keeps as much object information as possible, and there is basically no information loss or omission; the disadvantage is that it processes too much data, takes too long, and has poor real-time performance. Another type of information fusion is feature-level information fusion, also known as file-level information fusion. It is a comprehensive analysis and processing performed on the basis of the basic feature extraction of the sensor and the provision of file reports after the initial preprocessing of the sensor. Its advantage is that it not only maintains a sufficient amount of important information, but also has undergone allowable data compression, which greatly dilutes the amount of data and can improve the real-time performance of the processing process; and it is particularly valuable in pattern recognition, image analysis, computer vision, etc. In modern high-tech applications, feature extraction is actually the basis, and a lot of work has been carried out in this area. The disadvantage of feature-level information fusion is that there will inevitably be some information loss, so it is necessary to put forward stricter requirements for sensor preprocessing.
由于交通数据的多源性,必须运用一种有效的方法合理协调多源数据,充分综合有用信息,提高在多变环境中正确决策的能力。数据融合技术是一种可满足该要求的良好工具。交通数据的多源性和复杂性要求交通信息处理必须拥有数据融合技术,这样才能提供交通信息的稳健性。Due to the multi-source nature of traffic data, an effective method must be used to coordinate multi-source data reasonably, fully integrate useful information, and improve the ability to make correct decisions in a changing environment. Data fusion technology is a good tool to meet this requirement. The multi-source and complexity of traffic data requires that traffic information processing must have data fusion technology, so as to provide the robustness of traffic information.
现有的交通数据融合技术更集中在数据级信息融合,融合过程未对数据根据传感器的稳定性进行分级,并且未对传感器采集到的数据进行校验剔除,数据逐级融合,在融合过程中并未考虑交通事件、交通管制、占地施工、交通事故给数据带来的影响。The existing traffic data fusion technology is more focused on data-level information fusion. The fusion process does not classify the data according to the stability of the sensor, and does not check and eliminate the data collected by the sensor. The data is fused step by step. The impact of traffic incidents, traffic control, land occupation construction, and traffic accidents on the data is not considered.
发明内容Contents of the invention
有鉴于此,本发明的目的在于提供一种大规模交通信息的数据处理及融合方法,用于实现交通信息数据的高效融合。In view of this, the object of the present invention is to provide a large-scale traffic information data processing and fusion method for realizing efficient fusion of traffic information data.
本发明提供了一种大规模交通信息的数据处理及融合方法,包括:The present invention provides a data processing and fusion method of large-scale traffic information, including:
根据测试车及各个传感器采集到的多源交通数据获得真值系统的标准数据,并确定参数的动态赋值方法;剔除所述传感器采集到的数据集合中的异常数据,并进行历史数据的补偿;对完成数据补偿的多源交通数据实时分级信息融合;Obtain the standard data of the truth system according to the multi-source traffic data collected by the test vehicle and each sensor, and determine the dynamic assignment method of the parameters; eliminate the abnormal data in the data set collected by the sensors, and perform historical data compensation; Real-time hierarchical information fusion of multi-source traffic data that has completed data compensation;
所述获得真值系统的标准数据的步骤具体包括:The step of obtaining the standard data of the truth system specifically includes:
通过多辆高采样率的测试车,在不同时段多次测试,并获得时段t的平均值,作为标准值:Through multiple test vehicles with high sampling rate, multiple tests in different periods, and the average value of period t is obtained as the standard value:
其中,Xi,j(t)表示第i辆测试车的第j次在时段t测试的测量值,n辆车进行m次测量;Among them, Xi, j(t) represents the measurement value of the i-th test vehicle in the j-th test in the time period t, and n vehicles perform m measurements;
所述确定参数的动态赋值方法的步骤具体包括:The steps of the dynamic assignment method for determining parameters specifically include:
将时段t测试车采集到的数据及各个传感器采集到的数据,拆分出出租车数据、公交车数据、微波/线圈数据、视频监控数据以及车牌识别数据,作为时段t的各类交通信息数据;Split the data collected by the test vehicle and various sensors in time period t into taxi data, bus data, microwave/coil data, video surveillance data and license plate recognition data as various traffic information data in time period t ;
分别以出租车数据、公交车数据、微波/线圈数据、视频监控数据及车牌识别数据在时段t的平均值为标准值计算其标准差:The standard deviation is calculated by taking the average value of taxi data, bus data, microwave/coil data, video surveillance data and license plate recognition data in time period t as the standard value:
其中,E(X)为t时段测试得到所有数据的平均值,Xi表示各类数据的第i个测试数据;Among them, E(X) is the average value of all data obtained in the t-period test, and Xi represents the i-th test data of various types of data;
使用所述标准值及标准差,分别获得每类数据的正态分布图,确定可信区间,根据正态曲线,计算数据在可信区间的概率:Using the standard value and standard deviation, obtain the normal distribution diagram of each type of data respectively, determine the credible interval, and calculate the probability of the data in the credible interval according to the normal curve:
其中,a,b分别表示测试数据偏离标准值X的左右可接受数据的界限值,D(X)为标准差,E(X)为平均值;Among them, a and b respectively represent the limit value of acceptable data around which the test data deviates from the standard value X, D(X) is the standard deviation, and E(X) is the average value;
以所述数据在可信区间的概率对每类数据基本概率赋值函数m(A)进行赋值,赋值过程为m(A)=P(a<A<b),A表示每类数据,P(a<A<b)表示每类数据在可信区间的概率。The basic probability assignment function m(A) of each type of data is assigned with the probability of the data in the credible interval, the assignment process is m(A)=P(a<A<b), A represents each type of data, P( a<A<b) represents the probability of each type of data in the credible interval.
本发明通过建立真值系统获得各种采集方式正确的初始赋值参数,对真实采集到的数据进行异常剔除,缺失数据根据历史数据进行合理填补以保证数据的准确性和完整性,对不同类别的采集方式获得的数据进行分级逐级融合处理以保证数据的可靠性,融合过程的快速性,并在融合过程中考虑交通事件、交通管制、占地施工、交通事故给数据带来的影响。经该融合过程,实现融合的快速、准确、可靠等效果,真值系统解决了现有技术初始参数赋值不准确的问题;对异常数据剔除、缺失数据填补克服了现有技术采集终端性能不稳定,以及采集终端损坏的难题;采用D-S理论对处理后的数据进行分级融合处理,解决了数据融合的多源性问题;还充分考虑交通事件、交通管制、占地施工、交通事故,克服了因以上因素对数据造成的影响。The present invention obtains the correct initial assignment parameters of various acquisition methods by establishing a truth system, and removes abnormalities from the actually collected data, and fills in missing data reasonably according to historical data to ensure the accuracy and integrity of the data. The data obtained by the collection method is fused step by step to ensure the reliability of the data and the speed of the fusion process, and the impact of traffic events, traffic control, land occupation construction, and traffic accidents on the data is considered during the fusion process. Through this fusion process, the fast, accurate, and reliable effects of fusion are realized. The truth system solves the problem of inaccurate initial parameter assignment in the prior art; the removal of abnormal data and filling of missing data overcomes the unstable performance of the acquisition terminal in the prior art , and the problem of damage to the collection terminal; the D-S theory is used to perform hierarchical fusion processing on the processed data, which solves the multi-source problem of data fusion; also fully considers traffic incidents, traffic control, land occupation construction, traffic accidents, and overcomes the The impact of the above factors on the data.
附图说明Description of drawings
图1为本发明实施例提供的大规模交通信息的数据处理及融合方法流程图;FIG. 1 is a flow chart of a data processing and fusion method for large-scale traffic information provided by an embodiment of the present invention;
图2为本发明实施例中对参数动态赋值的方法流程图。Fig. 2 is a flowchart of a method for dynamically assigning values to parameters in an embodiment of the present invention.
具体实施方式Detailed ways
为使本发明的目的、技术方案和优点更加清楚,下面结合附图对本发明作进一步的详细描述。In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings.
本发明实施例提供的交通数据融合过程能够实现融合的快速、准确、可靠等效果,真值系统解决了现有技术初始参数赋值不准确的问题;对异常数据剔除、缺失数据填补克服了现有技术采集终端性能不稳定,以及采集终端损坏的难题;采用D-S理论对处理后的数据进行分级融合处理,解决了数据融合的多源性问题;还充分考虑交通事件、交通管制、占地施工、交通事故,克服了因以上因素对数据造成的影响。The traffic data fusion process provided by the embodiment of the present invention can achieve fast, accurate, reliable and other effects of fusion. The truth system solves the problem of inaccurate initial parameter assignment in the prior art; it overcomes the existing problem of eliminating abnormal data and filling missing data. The performance of technical acquisition terminals is unstable, and the problem of damage to acquisition terminals; the D-S theory is used to perform hierarchical fusion processing on the processed data, which solves the multi-source problem of data fusion; it also fully considers traffic incidents, traffic control, land occupation construction, Traffic accidents overcome the influence of the above factors on the data.
图1为本发明实施例提供的大规模交通信息的数据处理及融合方法流程图,包括以下步骤:Fig. 1 is a flow chart of a data processing and fusion method for large-scale traffic information provided by an embodiment of the present invention, including the following steps:
步骤101、获得真值系统,并确定D-S(Dempster-Sharer证据理论多源数据融合方法)中参数的动态赋值方法。Step 101, obtain the truth system, and determine the dynamic assignment method of parameters in D-S (Dempster-Sharer evidence theory multi-source data fusion method).
所述真值系统标准数据是通过多辆高采样率的测试车,在不同时段多次测试,并获得时段t的平均值,作为标准值:The standard data of the true value system is tested multiple times at different time periods by multiple test vehicles with high sampling rates, and the average value of the time period t is obtained as the standard value:
Xi,j(t)代表第i辆测试车的第j次在时段t测试的测量值,n辆车进行m次测量。Xi ,j (t) represents the measurement value of the jth test of the i-th test vehicle at the time period t, and n vehicles make m measurements.
图2为对参数动态赋值的方法流程图,包括以下步骤:Figure 2 is a flowchart of a method for dynamically assigning values to parameters, including the following steps:
步骤1011、将时段t测试车采集到的数据及各个传感器采集到的数据,拆分出出租车数据、公交车数据、微波/线圈数据、视频监控数据,以及车牌识别数据,作为时段t各类交通信息数据。由于不同的采集设备均能采集到同一类型的数据,但是各个采集设备采集精度及其受影响因素不同,因此,在后续步骤中会出现分步融合。Step 1011, split the data collected by the test vehicle and the data collected by each sensor in time period t into taxi data, bus data, microwave/coil data, video surveillance data, and license plate recognition data, as various types of time period t traffic information data. Since different collection devices can collect the same type of data, but the collection accuracy of each collection device and its influencing factors are different, therefore, there will be step-by-step fusion in the subsequent steps.
步骤1012、分别以出租车数据、公交车数据、微波/线圈数据、视频监控数据及车牌识别数据在时段t的平均值为标准值计算其标准差:Step 1012, calculate the standard deviation using the average value of taxi data, bus data, microwave/coil data, video surveillance data and license plate recognition data in time period t as the standard value:
其中,E(X)为t时段测试得到所有数据的平均值,Xi表示各类数据的第i个测试数据。Among them, E(X) is the average value of all data obtained in the t-period test, and Xi represents the i-th test data of various types of data.
标准差反应数据的离散程度,即数据的稳定程度。The standard deviation reflects the degree of dispersion of the data, that is, the degree of stability of the data.
步骤1013、使用所述标准差以及标准值,分别获得每类数据的正态分布图,确定可信区间(即生成数据接收区间),根据正态曲线,计算数据在可信区间的概率:Step 1013, use the standard deviation and standard value to obtain the normal distribution map of each type of data respectively, determine the credible interval (that is, generate the data receiving interval), and calculate the probability of the data in the credible interval according to the normal curve:
其中,a,b分别表示测试数据偏离标准值X的左右可接受数据的界限值,D(X)为标准差,E(X)为平均值。Among them, a and b respectively represent the limit value of the acceptable data that the test data deviates from the standard value X, D(X) is the standard deviation, and E(X) is the average value.
步骤1014、以所述数据在可信区间的概率对每类数据基本概率赋值函数m(A)赋值。例如出租车数据基本概率赋值函数m(A)赋值过程为m(A)=P(a<A<b),A代表出租车数据,P(a<A<b)代表出租车采集数据在可信区间的概率,如上计算。Step 1014, assign a value to the basic probability assignment function m(A) of each type of data with the probability that the data is in the credible interval. For example, the basic probability assignment function m(A) of taxi data has an assignment process of m(A)=P(a<A<b), where A represents the taxi data, and P(a<A<b) represents the data collected by the taxi in the available The probability of the confidence interval, calculated as above.
步骤102、剔除传感器采集到的数据集合中的异常数据。步骤101是测试车在正常情况下多次测试获得的数据,为数据融合过程中的各类数据初始参数赋值做准备。而在现实情况中由于各种因素会导致数据的异常,因此,实际传感器采集到数据之后首先要进行异常数据剔除,以下是对实际数据进行的处理。Step 102, eliminating abnormal data in the data set collected by the sensor. Step 101 is the data obtained from multiple tests of the test vehicle under normal conditions, and prepares for the initial parameter assignment of various data in the data fusion process. In the real situation, various factors will lead to abnormal data. Therefore, after the actual sensor collects the data, the abnormal data must be eliminated first. The following is the processing of the actual data.
分别计算时段t对应分类(包括出租车数据、公交车数据、微波/线圈数据、视频监控数据及车牌识别数据)交通信息数据X(X是实际环境下采集到的数据,101步骤叙述X是在测试车及采集器一切正常的环境下的数据)与历史信息数据H。H是对过去一年的交通信息数据(包括出租车数据、公交车数据、微波/线圈数据、视频监控数据及车牌识别数据)的存储。计算X、H的数学期望E(X)、E(H),计算时段t各类交通信息数据X与过去一年同期历史信息数据H的标准差D(X)、D(H),如果:Calculate the time period t corresponding to the classification (including taxi data, bus data, microwave/coil data, video surveillance data and license plate recognition data) traffic information data X (X is the data collected in the actual environment, and step 101 describes X in The data under the normal environment of the test vehicle and the collector) and the historical information data H. H is the storage of traffic information data (including taxi data, bus data, microwave/coil data, video surveillance data and license plate recognition data) in the past year. Calculate the mathematical expectations E(X) and E(H) of X and H, and calculate the standard deviation D(X) and D(H) of various traffic information data X and the historical information data H of the same period in the past year during the calculation period t, if:
(1)针对时段t各个传感器以及测试车采集到的交通信息数据,数学期望E(X)与数学期望E(H)差值在设定阀值th范围内,且D(X)与D(H)差值同样在设定阀值th′内,则认为该时刻数据没有发生异常;(1) For the traffic information data collected by each sensor and test vehicle in the time period t, the difference between the mathematical expectation E(X) and the mathematical expectation E(H) is within the set threshold th, and D(X) and D( H) If the difference is also within the set threshold th', it is considered that there is no abnormality in the data at this moment;
(2)针对时段t各个传感器以及测试车采集到的交通信息数据,数学期望E(X)与数学期望E(H)超出设定阀值th,但D(X)与D(H)差值在设定阀值th′内,则等待下一时段的交通信息数据X″生成;如果下一时段的交通信息数据D(X)与D(X″)有较大差异,超出设定阀值th′,则认为时段t的交通信息发生异常,并将异常数据剔除掉。(2) For the traffic information data collected by each sensor and test vehicle in time period t, the mathematical expectation E(X) and mathematical expectation E(H) exceed the set threshold th, but the difference between D(X) and D(H) Within the set threshold th', wait for the traffic information data X" of the next period to be generated; if there is a large difference between the traffic information data D(X) and D(X") of the next period, it exceeds the set threshold th′, it is considered that the traffic information of time period t is abnormal, and the abnormal data will be removed.
在较佳实施例中,th阀值设定为th=E(H)/2,th′=D(X)/2。In a preferred embodiment, the th threshold is set as th=E(H)/2, th'=D(X)/2.
步骤103、历史数据的补偿。每天各个传感器以及测试车采集到的交通信息数据经过步骤102后得到的数据存储起来,作为历史数据,为未来出现数据缺失时作为补偿数据。Step 103, compensation of historical data. The traffic information data collected by each sensor and the test vehicle every day is stored in step 102 as historical data, and used as compensation data in case of data loss in the future.
根据历史数据,分别计算相同天内各类交通信息数据的路网平均速度,包括出租车数据路网平均速度、公交车数据平均速度、微波/线圈数据平均速度,生成交通趋势曲线。According to the historical data, calculate the average speed of the road network of various traffic information data in the same day, including the average speed of the road network of taxi data, the average speed of bus data, and the average speed of microwave/coil data, and generate a traffic trend curve.
每天各个传感器以及测试车采集到数据后,及时对数据进行初步处理(剔除数据中的异常数据)过程中,查找相邻Z个时间段内的包含的Q个交通数据文件(即前面分类存储的具有相似交通趋势曲线的交通数据),计算每一个时间段交通数据文件的路网平均速度,并将该平均速度视为一个点,平滑连接Z个点生成交通信息曲线片段。将该曲线片段与相同时间段内的历史数据进行匹配,找出最为相似的P条曲线作为辅助信息源,其中Z、P和Q是根据算法、历史统计、实践、经验、系统性能得出的函数值,Z表示时间段,Q表示Z个时间段内生成的文件量,P表示最相似的曲线数量。相邻Z个时间段,以一小时为单位,24个时间段。一般数据采集每五分钟生成一个数据,一个时间段会生成12个数据文件,Q按24个时间段算会生成288个取值,P取值为1即可。After each sensor and test vehicle collects data every day, during the process of preliminary data processing (removing abnormal data in the data) in time, search for Q traffic data files contained in adjacent Z time periods (that is, the previously classified storage) Traffic data with similar traffic trend curves), calculate the average speed of the road network of the traffic data file in each time period, and regard the average speed as a point, and smoothly connect Z points to generate a traffic information curve segment. Match the curve segment with the historical data in the same time period, and find the most similar P curves as auxiliary information sources, where Z, P and Q are obtained based on algorithms, historical statistics, practice, experience, and system performance Function value, Z represents the time period, Q represents the number of files generated in Z time period, and P represents the number of most similar curves. Adjacent Z time periods, in units of one hour, 24 time periods. General data collection generates one data every five minutes, 12 data files will be generated in one time period, Q will generate 288 values based on 24 time periods, and the value of P will be 1.
根据各个传感器以及测试车采集到的各类交通信息数据的时间属性,从以上步骤中获得的辅助信息源中查找与交通数据缺失的路链具有相同唯一编号、相同时间属性的历史数据对补偿区域中空缺信息路链进行填补。其中时间属性包含星期周期和该时刻所处的时间窗特性。之所以数据会出现缺失,一方面是由于异常数据剔除有可能造成数据的不连续,另一方面是由于各个数据源设备由于外界因素或人为因素短暂失效造成数据不能及时采集或采集误差较大。而后者尤为重要。According to the time attributes of various traffic information data collected by each sensor and test vehicle, from the auxiliary information sources obtained in the above steps, find the historical data pair with the same unique number and the same time attributes as the road link with missing traffic data. Compensation area Fill in the gaps in the information link. The time attribute includes the period of the week and the time window characteristic of the moment. The reason why the data will be missing is that on the one hand, the elimination of abnormal data may cause data discontinuity, and on the other hand, due to the short-term failure of each data source device due to external factors or human factors, the data cannot be collected in time or the collection error is large. And the latter is especially important.
步骤104、对完成数据补偿的多源交通数据(此处的多源数据包括经过剔除以及数据补偿后的出租车数据、公交车数据、微波/线圈数据、视频监控数据,以及车牌识别数据)实时分级信息融合。前面三个步骤是实时分级信息融合的基础,步骤101产生真值系统为分级信息融合确定各类别数据融合时的融合参数,并为步骤102异常数据的剔除提供标准数据,步骤103对于数据缺失的路链或数据缺失时间段进行数据补偿,以步骤102与步骤103产生的数据作为多源交通信息数据的数据源,以下步骤中使用的数据均是经过步骤101、102、103处理的数据。Step 104, the multi-source traffic data (the multi-source data here includes taxi data, bus data, microwave/coil data, video surveillance data, and license plate recognition data after elimination and data compensation) is completed in real time Hierarchical information fusion. The first three steps are the basis of real-time classification information fusion. Step 101 generates the true value system for the classification information fusion to determine the fusion parameters of each type of data fusion, and provides standard data for the removal of abnormal data in step 102. Step 103 is for missing data. Data compensation is performed for road links or data missing time periods, and the data generated in steps 102 and 103 are used as the data source of multi-source traffic information data. The data used in the following steps are all processed data in steps 101, 102, and 103.
为了保障数据精度和覆盖率,首先融合大规模多样的出租车数据和公交车数据,再融合微波/线圈数据以及车牌识别数据。步骤如下:In order to ensure data accuracy and coverage, large-scale and diverse taxi data and bus data are first fused, and then microwave/coil data and license plate recognition data are fused. Proceed as follows:
D-S证据理论中最基本的概念是建立辨识框架,记作U,辨识框架中的元素满足互不相容的条件。命题A对函数m的赋值m(A)是集合到的影射,若函数m满足如下条件:The most basic concept in D-S evidence theory is to establish an identification frame, which is denoted as U, and the elements in the identification frame meet the mutually incompatible conditions. The assignment m(A) of proposition A to function m is the mapping to set, if function m satisfies the following conditions:
m(Ф)=0m(Ф)=0
对
则称m(A)为A的基本概率赋值函数(BPAF)。其中Ф表示空集,m(A)表示对命题A的精确信任程度,表示了对A的直接支持。Then m(A) is called the basic probability assignment function (BPAF) of A. Among them, Ф represents the empty set, and m(A) represents the precise degree of belief in proposition A, expressing the direct support for A.
在以下步骤中,对于出租车数据、公交车数据、微波/线圈数据、视频监控数据以及车牌识别数据的初始化,同样采用步骤1014的方式对基本概率赋值函数赋值。步骤1013得到的是在正常情况下通过测试车获得的对各类数据在可信区间的概率,较少考虑到突发事件、道路维修等对采集到的数据值范围的影响,因此在以下融合过程中,还要考虑到突发事件,道路维修等。In the following steps, for the initialization of taxi data, bus data, microwave/coil data, video surveillance data and license plate recognition data, the method of step 1014 is also used to assign values to the basic probability assignment function. Step 1013 obtains the probability of all kinds of data in the credible interval obtained by the test vehicle under normal circumstances, less consideration is given to the impact of emergencies, road maintenance, etc. on the range of collected data values, so the following fusion In the process, emergencies, road maintenance, etc. should also be considered.
步骤1041、首先融合出租车与公交车数据。由于GPS等设备的高使用率,可以保障数据精度和覆盖率。Step 1041, firstly integrate taxi and bus data. Due to the high usage rate of devices such as GPS, data accuracy and coverage can be guaranteed.
根据步骤1014得到的出租车数据和公交车数据获得数据的基本概率赋值函数的初始值,表示为m1(A)和m2(A);The initial value of the basic probability assignment function obtained according to the taxi data obtained in step 1014 and the bus data is expressed as m 1 (A) and m 2 (A);
记融合的数据为U,出租车数据与公交车数据分别为U1和U2;Note that the fused data is U, and the taxi data and bus data are U1 and U2 respectively;
U即为出租车和公交车信息融合后的所得值,融合后的U的精确信任程度为:U is the value obtained after the fusion of taxi and bus information, and the precise trust degree of U after fusion is:
步骤1042、融入微波/线圈数据。微波/线圈数据进一步保障数据的稳定性和可靠性。Step 1042, integrating microwave/coil data. The microwave/coil data further guarantees the stability and reliability of the data.
根据步骤1014得到的微波/线圈数据的基本概率赋值函数的初始值m3(A);The initial value m 3 (A) of the basic probability assignment function of the microwave/coil data obtained according to step 1014;
记融合的数据为K,步骤1041获得融合数据为K1(即1014获得的U值),微波/线圈数据为K2;Note that the fusion data is K, the fusion data obtained in step 1041 is K 1 (i.e. the U value obtained in 1014), and the microwave/coil data is K 2 ;
K为出租车、公交车以及微波/线圈数据融合后的所得值,融合后的K的精确信任程度为:K is the value obtained after the fusion of taxi, bus and microwave/coil data. The precise trust degree of K after fusion is:
步骤1043、融入车牌识别数据。交叉口区域一般会装有高清摄像头,融入高清摄像头车牌识别数据可以保证卡口路段通过的数据的精确可靠性。方法同步骤1042。Step 1043, integrating the license plate recognition data. High-definition cameras are generally installed in the intersection area, and the integration of high-definition camera license plate recognition data can ensure the accuracy and reliability of the data passing through the checkpoint road section. The method is the same as step 1042.
步骤1044、融入主干路、快速路、次干路信号灯配时数据。交叉口区域一般会装有高清摄像头,交叉路口由于车辆汇集,对车辆行驶速度有较大影响,融入高清摄像头车牌识别数据可以保证卡口路段通过的数据的精确可靠性,主干路、快速路、次干路路段间数据受信号灯配时数据的影响较大,分别融入高清摄像头数据与信号灯配时数据可进一步提高数据准确性。由于信号灯的干扰,车辆在路链上的行驶时间和速度会有所变化,采集到的数据值的范围应适当放大,即步骤1013计算时应适当放宽区间[a,b]的范围,融合方法同步骤1042。Step 1044, integrating the signal light timing data of trunk roads, expressways, and secondary trunk roads. High-definition cameras are generally installed in the intersection area. Due to the collection of vehicles at the intersection, it has a great impact on the speed of the vehicle. The integration of high-definition camera license plate recognition data can ensure the accuracy and reliability of the data passing through the bayonet road section. Trunk roads, expressways, The data between sub-trunk road sections is greatly affected by the timing data of signal lights, and the data accuracy can be further improved by integrating high-definition camera data and signal light timing data respectively. Due to the interference of signal lights, the driving time and speed of vehicles on the road link will change, and the range of collected data values should be appropriately enlarged, that is, the range of interval [a, b] should be appropriately relaxed when calculating in step 1013. The fusion method Same as step 1042.
步骤1045、融入交通事件、交通管制、站路施工、交通事故数据。由于交通事件、交通管制、站路施工、交通事故对数据的影响,对出租车,公交车,微波/线圈等采集到的数据值的范围应适当放大,即步骤1013计算时应适当放宽区间[a,b]的范围,同上。Step 1045, integrating traffic incidents, traffic control, station road construction, and traffic accident data. Due to the impact of traffic events, traffic control, station road construction, and traffic accidents on the data, the range of data values collected by taxis, buses, microwaves/coils, etc. should be appropriately enlarged, that is, the interval should be appropriately relaxed when calculating in step 1013 [ a, b] range, ibid.
以上方法充分考虑了多种采集方式的稳定性、准确性和可靠性。建立真值系统获得各种采集方式正确的初始赋值参数,对真实采集到的数据进行异常剔除,缺失数据根据历史数据进行合理填补以保证数据的准确性和完整性,对不同类别的采集方式获得的数据进行分级逐级融合处理以保证数据的可靠性,融合过程的快速性,并在融合过程中考虑交通事件、交通管制、占地施工、交通事故给数据带来的影响。The above methods have fully considered the stability, accuracy and reliability of various acquisition methods. Establish a truth system to obtain the correct initial assignment parameters for various collection methods, remove abnormalities from the real collected data, and fill in missing data reasonably according to historical data to ensure the accuracy and integrity of the data. The data is fused step by step to ensure the reliability of the data and the rapidity of the fusion process, and the impact of traffic incidents, traffic control, land occupation construction, and traffic accidents on the data is considered during the fusion process.
经以上步骤的融合过程,实现融合的快速、准确、可靠等效果,真值系统解决了现有技术初始参数赋值不准确的问题;对异常数据剔除、缺失数据填补克服了现有技术采集终端性能不稳定,以及采集终端损坏的难题;采用D-S理论对处理后的数据进行分级融合处理,解决了数据融合的多源性问题。充分考虑交通事件、交通管制、占地施工、交通事故,克服了因以上因素对数据造成的影响。Through the fusion process of the above steps, the fast, accurate and reliable effects of fusion are realized. The truth system solves the problem of inaccurate initial parameter assignment in the prior art; the removal of abnormal data and filling of missing data overcomes the performance of the acquisition terminal in the prior art Instability, and the problem of damage to the collection terminal; the D-S theory is used to perform hierarchical fusion processing on the processed data, which solves the multi-source problem of data fusion. Fully consider traffic incidents, traffic control, land occupation construction, and traffic accidents, and overcome the influence of the above factors on the data.
总之,以上所述仅为本发明的较佳实施例而已,并非用于限定本发明的保护范围。In a word, the above descriptions are only preferred embodiments of the present invention, and are not intended to limit the protection scope of the present invention.
Claims (7)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210375074.4A CN102881162B (en) | 2012-09-29 | 2012-09-29 | Data processing and fusion method for large-scale traffic information |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210375074.4A CN102881162B (en) | 2012-09-29 | 2012-09-29 | Data processing and fusion method for large-scale traffic information |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102881162A CN102881162A (en) | 2013-01-16 |
CN102881162B true CN102881162B (en) | 2015-03-11 |
Family
ID=47482468
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201210375074.4A Expired - Fee Related CN102881162B (en) | 2012-09-29 | 2012-09-29 | Data processing and fusion method for large-scale traffic information |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102881162B (en) |
Families Citing this family (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9396433B2 (en) | 2013-05-29 | 2016-07-19 | International Business Machines Corporation | Determining related data points from multi-modal inputs |
CN103686085A (en) * | 2013-12-11 | 2014-03-26 | 东方网力科技股份有限公司 | Method, device and system for processing monitoring video data |
CN104318772B (en) * | 2014-10-31 | 2016-08-24 | 重庆大学 | Freeway traffic flow data quality checking method |
CN105374209B (en) * | 2015-11-05 | 2018-02-06 | 北京航空航天大学 | A kind of urban area road network running status characteristics information extraction method |
CN105374210B (en) * | 2015-12-03 | 2017-08-04 | 中山大学 | Acquisition method of bayonet detection rate based on floating car data |
CN105489014B (en) * | 2016-01-13 | 2017-09-15 | 北京理工大学 | A kind of compensation method based on principal component analysis transport information |
CN106408940B (en) * | 2016-11-02 | 2023-04-14 | 南京慧尔视智能科技有限公司 | Traffic detection method and device based on microwave and video data fusion |
CN106530693B (en) * | 2016-11-02 | 2019-02-05 | 浙江大学 | Rolling estimation method of urban expressway traffic state based on fusion data |
CN106506272B (en) * | 2016-11-04 | 2019-12-27 | 迈锐数据(北京)有限公司 | Method and system for correcting road data |
CN107146409B (en) * | 2017-06-01 | 2019-11-19 | 东方网力科技股份有限公司 | The identification of equipment detection time exception and true time difference evaluation method in road network |
CN107919016B (en) * | 2017-11-15 | 2020-02-18 | 杭州远眺科技有限公司 | Traffic flow parameter missing filling method based on multi-source detector data |
CN108257385B (en) * | 2018-03-19 | 2021-03-16 | 北京工业大学 | A Discrimination Method of Abnormal Events Based on Public Transportation |
CN108777064A (en) * | 2018-05-24 | 2018-11-09 | 深圳市益鑫智能科技有限公司 | A kind of traffic behavior assessment system based on information fusion |
CN109544901A (en) * | 2018-11-26 | 2019-03-29 | 南京行者易智能交通科技有限公司 | A kind of Research on Intelligent Scheduling of Public Traffic Vehicles method and device based on history passenger flow big data |
CN109723610A (en) * | 2018-12-05 | 2019-05-07 | 新奥数能科技有限公司 | Generating set rate of load condensate lacks value complement and recruits method and device |
CN110264744B (en) * | 2019-06-13 | 2022-05-27 | 同济大学 | Traffic flow prediction algorithm based on multivariate data |
CN112735140B (en) * | 2019-10-28 | 2022-05-20 | 阿波罗智联(北京)科技有限公司 | Data processing method and device, electronic equipment and storage medium |
CN111141950B (en) * | 2019-12-03 | 2022-06-14 | 深圳供电局有限公司 | A method for judging and repairing abnormality of time-sharing meter code of metering automation system |
CN113804252A (en) * | 2021-09-10 | 2021-12-17 | 广州市吉华勘测股份有限公司 | High formwork supporting safety monitoring method, device, equipment and storage medium |
CN115935289B (en) * | 2022-12-12 | 2024-04-30 | 国网青海省电力公司海北供电公司 | Station domain polymorphic data fusion method |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101034005A (en) * | 2007-04-12 | 2007-09-12 | 上海交通大学 | Switching method between videos for detecting vehicles during day-night |
CN101034406A (en) * | 2007-04-04 | 2007-09-12 | 同济大学 | Multicomponent data source traffic information fusion method based on distributed architecture |
EP1921589A2 (en) * | 2006-11-10 | 2008-05-14 | Hitachi, Ltd. | Traffic information interpolation system |
CN101281685A (en) * | 2008-01-30 | 2008-10-08 | 吉林大学 | Adaptive Signal Coordination Control Method for Regional Mixed Traffic |
CN101286269A (en) * | 2008-05-26 | 2008-10-15 | 北京捷讯畅达科技发展有限公司 | Traffic flow forecasting system combining dynamic real time traffic data |
-
2012
- 2012-09-29 CN CN201210375074.4A patent/CN102881162B/en not_active Expired - Fee Related
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1921589A2 (en) * | 2006-11-10 | 2008-05-14 | Hitachi, Ltd. | Traffic information interpolation system |
CN101034406A (en) * | 2007-04-04 | 2007-09-12 | 同济大学 | Multicomponent data source traffic information fusion method based on distributed architecture |
CN101034005A (en) * | 2007-04-12 | 2007-09-12 | 上海交通大学 | Switching method between videos for detecting vehicles during day-night |
CN101281685A (en) * | 2008-01-30 | 2008-10-08 | 吉林大学 | Adaptive Signal Coordination Control Method for Regional Mixed Traffic |
CN101286269A (en) * | 2008-05-26 | 2008-10-15 | 北京捷讯畅达科技发展有限公司 | Traffic flow forecasting system combining dynamic real time traffic data |
Also Published As
Publication number | Publication date |
---|---|
CN102881162A (en) | 2013-01-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102881162B (en) | Data processing and fusion method for large-scale traffic information | |
CN112700470B (en) | Target detection and track extraction method based on traffic video stream | |
CN109923595B (en) | An urban road traffic anomaly detection method based on floating car data | |
CN110264586A (en) | L3 grades of automated driving system driving path data acquisitions, analysis and method for uploading | |
CN103473459A (en) | Method of processing and fusing multisystem big data | |
CN118609036B (en) | Management method of mobile vendors in urban operation based on image analysis | |
CN116758762A (en) | Control method based on big data | |
CN115188187A (en) | A roadside perception data quality monitoring system and method based on vehicle-road collaboration | |
CN115527364B (en) | Traffic accident tracing method and system based on radar data fusion | |
CN118366312B (en) | Traffic detection system and method | |
KR102400842B1 (en) | Service methods for providing information on traffic accidents | |
Bai et al. | A multimodal data-driven approach for driving risk assessment | |
CN116597404A (en) | Sustainable road anomaly detection method and system based on multi-source sensor fusion | |
Refai et al. | The study of vehicle classification equipment with solutions to improve accuracy in Oklahoma. | |
CN118918707B (en) | Comprehensive quality determination method for radar fusion system | |
CN119441782A (en) | An intelligent analysis method for abnormal behavior of platform vehicles based on deep learning | |
CN108920655B (en) | Method and device for quantifying space-time coverage range of road weather information system | |
Bäumler et al. | Generating representative test scenarios: The fuse for representativity (Fuse4Rep) process model for collecting and analysing traffic observation data | |
CN116913039A (en) | Microclimate disaster early warning system and method | |
CN116259173A (en) | A multi-detection mode traffic event fusion method based on D-S evidence theory | |
CN115695418A (en) | Information processing method and device, electronic device and storage medium | |
Rashid et al. | Automated traffic measurement system based on FCD and image processing | |
CN114005041A (en) | Road disease identification control method and equipment based on UAVRS and BIM | |
Das et al. | Vehicular propagation velocity forecasting using open CV | |
CN119179408B (en) | Traveling event label dotting system and method for intelligent network-connected automobile road traffic regulation compliance test |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20150311 Termination date: 20210929 |