[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN107526786A - The method and system that place name address date based on multi-source data is integrated - Google Patents

The method and system that place name address date based on multi-source data is integrated Download PDF

Info

Publication number
CN107526786A
CN107526786A CN201710645011.9A CN201710645011A CN107526786A CN 107526786 A CN107526786 A CN 107526786A CN 201710645011 A CN201710645011 A CN 201710645011A CN 107526786 A CN107526786 A CN 107526786A
Authority
CN
China
Prior art keywords
data
place name
address date
source
name address
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710645011.9A
Other languages
Chinese (zh)
Inventor
孙海峰
徐忠建
朱必亮
李俊
陈朴
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangsu Speed Information Polytron Technologies Inc
Original Assignee
Jiangsu Speed Information Polytron Technologies Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangsu Speed Information Polytron Technologies Inc filed Critical Jiangsu Speed Information Polytron Technologies Inc
Priority to CN201710645011.9A priority Critical patent/CN107526786A/en
Publication of CN107526786A publication Critical patent/CN107526786A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Tourism & Hospitality (AREA)
  • Economics (AREA)
  • Strategic Management (AREA)
  • Primary Health Care (AREA)
  • General Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Human Resources & Organizations (AREA)
  • General Health & Medical Sciences (AREA)
  • Quality & Reliability (AREA)
  • Health & Medical Sciences (AREA)
  • Educational Administration (AREA)
  • Development Economics (AREA)
  • Remote Sensing (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present invention relates to a kind of place name address date integration system based on multi-source data, comprise the following steps:(1) data are collected, choose data model and organizational structure design:The place name of separate sources, address and interest point data structure, attribute field, georeferencing are integrated into a sets of data, choose data model, design organization structure;(2) data prediction:The place name address date form of multi-source is standardized, is unified for manageable form;(3) duplicate data inquiry, rejecting:The place name address date of multi-source is carried out repeating point inquiry;(4) data fusion:Multi-source data is matched and integrated;(5) data are audited:Batch examination & verification is carried out to fused data using GIS software, by the way of automatic examination & verification and manual examination and verification combine;Stored if examination & verification is qualified to database, build outcome data, if examination & verification is unqualified, returned data fusion steps re-start data fusion, until examination & verification is qualified.

Description

The method and system that place name address date based on multi-source data is integrated
Technical field
The present invention relates to geographic information services technical field, more particularly to a kind of place name address date based on multi-source data Integration system and method.
Background technology
With the city management work such as public safety, emergency cooperative, intelligent transportation, city management, environmental renovation, protection against and mitigation of earthquake disasters Make constantly to propose the supportability of dimensional information's basic installation new requirement, build unified, authority, the place name number of addresses of the trend of the times According to storehouse, the development of exploitation level of primary spatial data can not only be substantially improved, and its government department, different industries His information resources share is served by having important reference and reference value, help to start basic control survey it is shared, Service and the new model of application.Particularly under geographic information public service platform construction promotion, by real-time performance place name The inquiry of location information, browse, order that application demand is further strong, abundant, the trend of the times place name address base will be that government and the public carry The application services such as inquiry, positioning, statistics and thematic information spatial match for meeting self-demand, it is total to for all kinds of spatial informations Enjoy exchange and basis is provided, play the pivotal role during geography information frame data is built in digital city and smart city.
Place name address As-Is analysis:1) data source is extensive:Place name address date is related to multiple functional departments, such as state Soil, civil administration, house property, public security, combustion gas, industry and commerce, statistics, quality inspection, land tax etc..Therefore analyzed from the angle in data resource source, Its source department is numerous, as shown in Figure 1;2) standard disunity, form are various;Each functional department due to itself focus not Together, caused place name address date form is also various, and unified specification is lacked in process of construction and is instructed, and causes existing at present each Class address date does not possess higher normalization.It is in particular in not advising for the name of file, the setting of field and address descriptor Plasticity and diversity.Such as:Land departments place name address date derives from topographic map DWG forms, and the administration for industry and commerce's data source is in stepping on Numeration is according to EXCEL forms;3) spatial data lacks:In the place name address date for each functional department collected, only Department of Civil Affairs, public affairs The doorplate of peace office and Bureau of Surveying and Mapping, geographical name data belong to GIS spatial data, place name address date all Yes-No space numbers of other departments According to only simple address descriptive information to it, it is necessary to carry out coordinate imparting;4) poor compatibility, data sharing can not be realized;By It is compatible very poor between no unified place name address base database technology standards and norms, disparate databases, it can not realize Data resource is shared, and limits the application of urban addresses information of place names management system and shares.
Therefore, it is necessary to develop it is a kind of can integrate the multidisciplinary place name address date such as civil administration, house property, public security, territory, The place name address date of unified standard is established, realizes the place name address based on multi-source data of the efficient management of magnanimity address date The method of Data Integration.
The content of the invention
The technical problem to be solved in the present invention is to provide one kind, and can to integrate civil administration, house property, public security, territory etc. multidisciplinary Place name address date, establish the place name address date of unified standard, realize magnanimity address date efficient management based on more The method that the place name address date of source data is integrated.
In order to solve the above-mentioned technical problem, the technical solution adopted by the present invention is:A kind of place name based on multi-source data The method of location Data Integration, comprises the following steps:
(1) data are collected, choose data model and organizational structure design:By the place name of separate sources, address and emerging Interesting point data structure, attribute field, georeferencing are integrated into a sets of data, choose data model, design organization structure;
(2) data prediction:The place name address date form of multi-source is standardized, is unified for manageable Form;
(3) duplicate data inquiry, rejecting:The place name address date of multi-source is carried out repeating point inquiry, the weight that will be inquired Complex point is rejected;
(4) data fusion:Multi-source data is cleaned, matched and integrated;
(5) data are audited:Batch examination & verification is carried out to fused data using GIS software, using automatic examination & verification and people The mode that work examination & verification combines;Stored if examination & verification is qualified to database, build outcome data, if examination & verification is unqualified, returned data Fusion steps re-start data fusion, until examination & verification is qualified.
Using above-mentioned technical proposal, by user by multi-source place name address date input system software, by system software pair Multi-source data forms the unification of standard by flows such as data prediction, the rejecting of emphasis data, data fusion, data examination & verifications Authority data, establish the place name address date of standard.Wherein, the place name of separate sources, address and interest point data structure, category Property field, georeferencing it is all inconsistent, be integrated into a sets of data, it is necessary to have rational data model and identical Institutional framework, to realize the unified management of data.Herein with reference to the model definition of related geographical entity, data model is divided into base This attribute and extended attribute;Base attribute is shared field, and extended attribute sets different attributes according to different entities type Content, stored with data tableau format, the two is identified and linked by unique pel code.The data model both meets to unite One management requires that and can enough retains the particular attribute of different pieces of information;1) base attribute:According to the requirement of data, it is set Base attribute, including element name, address, type codes, longitude, latitude, Sort Code, pel identification code etc.;2) extended attribute: Place name, address and point of interest have various features attribute, can not be described with unified data structure, and extended attribute item can be with Spread is carried out according to various data types, ensures the integrality and scalability of data message;In application according to Classification adds various extended attribute items, and fixed again as needed during actual job, the attribute as road famous cake needs to extend can So that including road number, category of roads, road width etc., Water system scale, affiliated basin can be included in water system point extended attribute Deng professional attributes.Address library data relative priority is more single, can suitably be extended according to being actually needed;What point of interest was related to Quantity is more, and classification is complicated, the peculiar abundant information of every kind of classification, therefore the extended attribute of point of interest can be according to three different fractions Class category feature is extended, but will typically be included telephone number, network address, postcode, data acquisition time, acquisition units, be adopted Collect the information such as people;In addition, the inquiry for repeating point mainly has 2 kinds of methods;Method one is to combine locus, by separate sources data It is attached according to name field, finds out title identical point, reference is screened after being exported.The deficiency of this method It is that can only find out the completely the same point of title, the different repetition point of many titles can not be found out, so needing to enter data Row fuzzy query.Method two is to utilize FME softwares, data fuzzy query module is built, by a certain key element and its certain distance model All key elements in enclosing are matched one by one, take matching degree highest key element, and are matched angle value and match the name of key element Title is write on inside its attribute.Wherein, the distance of matching can be configured according to actual conditions, for place name, park, industry park Area, residential quarters etc. refer to the bigger point of scope, and matching distance can be set somewhat a little louder, such as 500m~1000m or so;And For in general POI types, matching distance can be arranged between 50m~100m scopes., can with reference to matching degree and matching title Whether be identical element between quick interpretation key element, further according to the references such as image and data source Up-to-date state itself, precision, The factors such as attribute integrality, correctness, selection attribute information is complete, positional precision is high, the relatively good point of Up-to-date state, so as to reject Repeat point.This method carries out data duplicate checking by fuzzy matching, while can find the completely the same repetition point of title out, has There are higher practicality and correctness.
The present invention further improvement is that, include the step of data fusion in the step (4):
1) data prediction:The data of extended formatting are converted into shape formatted datas, it is stand-by;
2) geographic element feature extraction:According to《Cadastral Management Information System graph data standard》Data point are carried out by feature Class, then to without character code data, manually carrying out interpretation, carrying out data classification;
3) data encoding is changed:According to《Cadastral Management Information System graph data standard》With《Fundamental Geographic Information System key element Sorting code number》Corresponding relation carries out code conversion;
4) data edition:With extracting tape symbol characteristic, wire, the feature skeleton line of area feature and point-like respectively The characteristic point of thing;
5) topology editor:Integrate the topological relation between key element, construction face key element and grid;
6) attributes match and assignment:Attribute information is matched and assigned to each key element;
7) Coordinate Conversion:It is not WGS84 vector data progress Coordinate Conversion for coordinate.
The present invention further improvement is that, the inquiry of the repetition point in the step (3) mainly has 2 kinds of methods:Method one It is to combine locus, separate sources data is attached according to name field, title identical point are found out, after being exported Ginseng is turned over the data of examining and screened;Method two is to utilize FME softwares, builds data fuzzy query module, key element is being matched with it All key elements in the range of distance are matched one by one, take matching degree highest key element, and are matched angle value and matched The title of key element is write on inside its attribute;
The present invention further improvement is that, data model in the step (1) according to the model definition of geographical entity, point For base attribute and extended attribute.
The present invention further improvement is that, the base attribute include element name, address, type codes, longitude, latitude, Sort Code and pel identification code;According to the requirement of data, the base attribute of data is set.
The present invention further improvement is that, the matching in the method two of the inquiry of the repetition point in the step (3) Distance can be configured according to actual conditions, be compared for place name, park, industrial park, this kind of scope that refers in residential quarters Big point, matching distance can be set a little louder;And for POI types, matching distance can be arranged on 50m~100m scopes it Between.
The present invention further improvement is that, the place name address date sources of the collections data in the step (1) is including more Place name address date, interest point data and the third party's data of department, wherein it is multidisciplinary including statistics bureau, public security bureau, Department of Qulity Supervision, Local Tax Bureau, Department of Civil Affairs, live to found the bureau, industrial and commercial bureau, housing bureau and Land and Resources Bureau.It is multidisciplinary to use place name address including other Department.
The present invention further improvement is that, reference in the step (3) includes various resolution image figures, document Data, 1:10000 and 1:50000 digital line draws map (DLG) and atlas and network data.
The present invention further improvement is that, the topology editor in the step 5) specifically includes line feature and region feature.
What the present invention also technical problems to be solved were to provide that a kind of place name address date based on multi-source data integrates is System.
In order to solve the above-mentioned technical problem, the technical solution adopted in the present invention is:The place name based on multi-source data Location data integrated system, it is characterised in that being somebody's turn to do the place name address date integration system based on multi-source data includes data prediction Module, data cleansing module, data fusion module, data-auditing module and data model module;The data preprocessing module, Data-auditing module, data fusion module, data cleansing module and data model module are electrically connected with the control module And it is in bidirectional data transfers;The data model module is used for the place name address data model for establishing standardization;The data are pre- Processing module is used to the place name address date form of multi-source being standardized, and is unified for manageable form;It is described Data cleansing module is used to the place name address date of multi-source is carried out to repeat point inquiry and rejects repetition point;The data fusion Module is used to multi-source data is matched and integrated;The data-auditing module is used for using GIS software to fused Data carry out batch examination & verification, by the way of automatic examination & verification and manual examination and verification combine.Multi-source data place name address date system Software is developed using JAVA language, and wherein data model module is responsible for establishing the place name address data model of standardization;Data are pre- Processing module is responsible for the place name address date of different-format carrying out unified conversion, and data cleansing module is responsible for the number according to setting Pretreated data are carried out according to model to repeat point data rejecting;Data fusion module is according to one by the data after cleaning Fixed logical construction carries out Data Integration;Data-auditing module is responsible for carrying out quality examination work to the data after fusion.
Compared with prior art, the beneficial effects of the invention are as follows:
1) the Data Integration flow for the standard established, it is various to solve current data class, the multifarious problem of form;
2) the Data Integration flow and technical scheme of a set of standardization are provided, greatly reduce the process manually participated in, is saved The Data Integration time is saved;
3) isomeric data integration technology is used, quick, efficient technology solution is provided for the updating maintenance of later data Scheme, solves the problem that place name address quickly updates.
Brief description of the drawings
Technical scheme is further described below in conjunction with the accompanying drawings:
Fig. 1 is the data source figure for the method that the place name address date based on multi-source data of the present invention is integrated;
Fig. 2 is the flow chart for the method that the place name address date based on multi-source data of the present invention is integrated;
Fig. 3 is the flow of the data fusion in the method that the place name address date based on multi-source data of the present invention is integrated Figure;
Fig. 4 is the hardware block diagram for the system that the place name address date based on multi-source data of the present invention is integrated.
Embodiment
In order to deepen the understanding of the present invention, the present invention is done below in conjunction with drawings and examples and further retouched in detail State, the embodiment is only used for explaining the present invention, and protection scope of the present invention is not formed and limited.
Embodiment:The method that the place name address date based on multi-source data is integrated, comprises the following steps:
(1) data are collected, choose data model and organizational structure design:By the place name of separate sources, address and emerging Interesting point data structure, attribute field, georeferencing are integrated into a sets of data, choose data model, design organization structure;
(2) data prediction:The place name address date form of multi-source is standardized, is unified for manageable Form;
(3) duplicate data inquiry, rejecting:The place name address date of multi-source is carried out repeating point inquiry, repeats the inquiry of point Mainly there are 2 kinds of methods:Method one is to combine locus, and separate sources data are attached according to name field, find out name Claim identical point, reference is screened after being exported;Method two is to utilize FME softwares, builds data fuzzy query mould Block, key element is matched one by one with its all key element in the range of matching distance, take matching degree highest key element, and will It matches angle value and write on the title for matching key element inside its attribute;The repetition inquired point is rejected;
(4) data fusion:Multi-source data is cleaned, matched and integrated;
(5) data are audited:Batch examination & verification is carried out to fused data using GIS software, using automatic examination & verification and people The mode that work examination & verification combines;Stored if examination & verification is qualified to database, build outcome data, if examination & verification is unqualified, returned data Fusion steps re-start data fusion, until examination & verification is qualified;
The step of data fusion in the step (4), includes:
1) data prediction:The data of extended formatting are converted into shape formatted datas, it is stand-by;
2) geographic element feature extraction:According to《Cadastral Management Information System graph data standard》Data point are carried out by feature Class, then to without character code data, manually carrying out interpretation, carrying out data classification;
3) data encoding is changed:According to《Cadastral Management Information System graph data standard》With《Fundamental Geographic Information System key element Sorting code number》Corresponding relation carries out code conversion;
4) data edition:With extracting tape symbol characteristic, wire, the feature skeleton line of area feature and point-like respectively The characteristic point of thing;
5) topology editor:Integrate the topological relation between key element, construction face key element and grid;
6) attributes match and assignment:Attribute information is matched and assigned to each key element;
7) Coordinate Conversion:It is not WGS84 vector data progress Coordinate Conversion for coordinate;
Data model in the step (1) is divided into base attribute and extended attribute according to the model definition of geographical entity; The base attribute includes element name, address, type codes, longitude, latitude, Sort Code and pel identification code;According to data Requirement, set the base attributes of data;The matching in the method two of the inquiry of repetition point in the step (3) Distance can be configured according to actual conditions, be compared for place name, park, industrial park, this kind of scope that refers in residential quarters Big point, matching distance can be set a little louder;And for POI types, matching distance can be arranged on 50m~100m scopes it Between;The place name address date source of collection data in the step (1) includes multidisciplinary place name address date, interest point According to third party's data, wherein it is multidisciplinary including statistics bureau, public security bureau, Department of Qulity Supervision, Local Tax Bureau, Department of Civil Affairs, live to found the bureau, industrial and commercial Office, housing bureau and Land and Resources Bureau;Reference in the step (3) includes various resolution image figures, document information, 1:10000 With 1:50000 digital line draws map (DLG) and atlas and network data;Topology editor in the step 5) specifically includes line spy Seek peace region feature.
Using above-mentioned technical proposal, by user by multi-source place name address date input system software, by system software pair Multi-source data forms the unification of standard by flows such as data prediction, the rejecting of emphasis data, data fusion, data examination & verifications Authority data, establish the place name address date of standard.Wherein, the place name of separate sources, address and interest point data structure, category Property field, georeferencing it is all inconsistent, be integrated into a sets of data, it is necessary to have rational data model and identical Institutional framework, to realize the unified management of data.Herein with reference to the model definition of related geographical entity, data model is divided into base This attribute and extended attribute;Base attribute is shared field, and extended attribute sets different attributes according to different entities type Content, stored with data tableau format, the two is identified and linked by unique pel code.The data model both meets to unite One management requires that and can enough retains the particular attribute of different pieces of information;1) base attribute:According to the requirement of data, it is set Base attribute, including element name, address, type codes, longitude, latitude, Sort Code, pel identification code etc.;2) extended attribute: Place name, address and point of interest have various features attribute, can not be described with unified data structure, and extended attribute item can be with Spread is carried out according to various data types, ensures the integrality and scalability of data message;In application according to Classification adds various extended attribute items, and fixed again as needed during actual job, the attribute as road famous cake needs to extend can So that including road number, category of roads, road width etc., Water system scale, affiliated basin can be included in water system point extended attribute Deng professional attributes.Address library data relative priority is more single, can suitably be extended according to being actually needed;What point of interest was related to Quantity is more, and classification is complicated, the peculiar abundant information of every kind of classification, therefore the extended attribute of point of interest can be according to three different fractions Class category feature is extended, but will typically be included telephone number, network address, postcode, data acquisition time, acquisition units, be adopted Collect the information such as people;In addition, the inquiry for repeating point mainly has 2 kinds of methods;Method one is to combine locus, by separate sources data It is attached according to name field, finds out title identical point, reference is screened after being exported.The deficiency of this method It is that can only find out the completely the same point of title, the different repetition point of many titles can not be found out, so needing to enter data Row fuzzy query.Method two is to utilize FME softwares, data fuzzy query module is built, by a certain key element and its certain distance model All key elements in enclosing are matched one by one, take matching degree highest key element, and are matched angle value and match the name of key element Title is write on inside its attribute.Wherein, the distance of matching can be configured according to actual conditions, for place name, park, industry park Area, residential quarters etc. refer to the bigger point of scope, and matching distance can be set somewhat a little louder, such as 500m~1000m or so;And For in general POI types, matching distance can be arranged between 50m~100m scopes., can with reference to matching degree and matching title Whether be identical element between quick interpretation key element, further according to the references such as image and data source Up-to-date state itself, precision, The factors such as attribute integrality, correctness, selection attribute information is complete, positional precision is high, the relatively good point of Up-to-date state, so as to reject Repeat point.This method carries out data duplicate checking by fuzzy matching, while can find the completely the same repetition point of title out, has There are higher practicality and correctness.
The place name address date integration system based on multi-source data, it is characterised in that should the place name based on multi-source data Address date integration system includes data preprocessing module, data cleansing module, data fusion module, data-auditing module sum According to model module;The data preprocessing module, data-auditing module, data fusion module, data cleansing module and data mould Pattern block is electrically connected with the control module and is in bidirectional data transfers;The data model module, which is used to establish, to be standardized Place name address data model;The data preprocessing module is used to the place name address date form of multi-source being standardized place Reason, is unified for manageable form;The data cleansing module is looked into for carrying out repetition point to the place name address date of multi-source Ask and point will be repeated and reject;The data fusion module is used to multi-source data is matched and integrated;The data audit mould Block is used to carry out batch examination & verification to fused data using GIS software, the side combined using automatic examination & verification and manual examination and verification Formula.The software of multi-source data place name address date system is developed using JAVA language, and wherein data model module is responsible for establishing mark The place name address data model of standardization;Data preprocessing module is responsible for the place name address date of different-format carrying out unified turn Change, data cleansing module is responsible for that pretreated data are carried out according to the data model of setting to repeat point data rejecting;Number It is that the data after cleaning are subjected to Data Integration according to certain logical construction according to Fusion Module;Data-auditing module is responsible for melting Data after conjunction carry out quality examination work.
For the ordinary skill in the art, simply the present invention is exemplarily described for specific embodiment, Obviously present invention specific implementation is not subject to the restrictions described above, and is entered as long as employing the inventive concept and technical scheme of the present invention The improvement of capable various unsubstantialities, or it is not improved by the present invention design and technical scheme directly apply to other occasions , within protection scope of the present invention.

Claims (10)

1. a kind of method that place name address date based on multi-source data is integrated, it is characterised in that comprise the following steps:
(1) data are collected, choose data model and organizational structure design:By the place name of separate sources, address and point of interest Data structure, attribute field, georeferencing are integrated into a sets of data, choose data model, design organization structure;
(2) data prediction:The place name address date form of multi-source is standardized, is unified for manageable lattice Formula;
(3) duplicate data inquiry, rejecting:The place name address date of multi-source is carried out repeating point inquiry, by the repetition inquired point Reject;
(4) data fusion:Multi-source data is matched and integrated;
(5) data are audited:Batch examination & verification is carried out to fused data using GIS software, examined using automatic examination & verification and manually The mode that core combines;Stored if examination & verification is qualified to database, build outcome data, if examination & verification is unqualified, returned data fusion Step re-starts data fusion, until examination & verification is qualified.
2. the method that the place name address date according to claim 1 based on multi-source data is integrated, it is characterised in that described The step of data fusion in step (4), includes:
1) data prediction:The data of extended formatting are converted into shape formatted datas, it is stand-by;
2) geographic element feature extraction:According to《Cadastral Management Information System graph data standard》Data classification is carried out by feature, Again to without character code data, manually carrying out interpretation, carrying out data classification;
3) data encoding is changed:According to《Cadastral Management Information System graph data standard》With《Fundamental Geographic Information System element category Coding》Corresponding relation carries out code conversion;
4) data edition:Tape symbol characteristic, wire, the feature skeleton line of area feature and punctual geo-objects are extracted respectively Characteristic point;
5) topology editor:Integrate the topological relation between key element, construction face key element and grid;
6) attributes match and assignment:Attribute information is matched and assigned to each key element;
7) Coordinate Conversion:It is not WGS84 vector data progress Coordinate Conversion for coordinate.
3. the method that the place name address date according to claim 2 based on multi-source data is integrated, it is characterised in that described Data model in step (1) is divided into base attribute and extended attribute according to the model definition of geographical entity.
4. the method that the place name address date according to claim 3 based on multi-source data is integrated, it is characterised in that described The inquiry that step (3) repeats point has 2 kinds of methods:Method one is to combine locus, by separate sources data according to name field It is attached, finds out title identical point, reference is screened after being exported;Method two is to utilize FME softwares, structure Data fuzzy query module, key element is matched one by one with its all key element in the range of matching distance, takes matching degree Highest key element, and matched angle value and write on the title for matching key element inside its attribute.
5. the method that the place name address date according to claim 4 based on multi-source data is integrated, it is characterised in that described The matching distance in the method two of the inquiry of repetition point in step (3) can be configured according to actual conditions, for The bigger point of place name, park, industrial park, this kind of reference scope in residential quarters, matching distance can be set a little louder;And for POI types, matching distance can be arranged between 50m~100m scopes.
6. the method that the place name address date according to claim 4 based on multi-source data is integrated, it is characterised in that described The place name address date source of collection data in step (1) includes multidisciplinary place name address date, interest point data and the Tripartite's data, wherein it is multidisciplinary including statistics bureau, public security bureau, Department of Qulity Supervision, Local Tax Bureau, Department of Civil Affairs, live found the bureau, industrial and commercial bureau, real estate management Office and Land and Resources Bureau.
7. the method that the place name address date according to claim 4 based on multi-source data is integrated, it is characterised in that described Data model in step (1) is divided into base attribute and extended attribute according to the model definition of geographical entity.
8. the method that the place name address date according to claim 4 based on multi-source data is integrated, it is characterised in that described Topology editor in step 5) specifically includes line feature and region feature.
9. the method that the place name address date according to claim 4 based on multi-source data is integrated, it is characterised in that described Reference in step (3) includes various resolution image figures, document information, 1:10000 and 1:50000 digital line draws map And atlas and network data (DLG).
10. a kind of place name address date integration system based on multi-source data, it is characterised in that should the place name based on multi-source data Address date integration system includes data preprocessing module, data cleansing module, data fusion module, data-auditing module sum According to model module;The data preprocessing module, data-auditing module, data fusion module, data cleansing module and data mould Pattern block is electrically connected with the control module and is in bidirectional data transfers;The data model module, which is used to establish, to be standardized Place name address data model;The data preprocessing module is used to the place name address date form of multi-source being standardized place Reason, is unified for manageable form;The data cleansing module is looked into for carrying out repetition point to the place name address date of multi-source Ask and point will be repeated and reject;The data fusion module is used to multi-source data is matched and integrated;The data audit mould Block is used to carry out batch examination & verification to fused data using GIS software, the side combined using automatic examination & verification and manual examination and verification Formula.
CN201710645011.9A 2017-08-01 2017-08-01 The method and system that place name address date based on multi-source data is integrated Pending CN107526786A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710645011.9A CN107526786A (en) 2017-08-01 2017-08-01 The method and system that place name address date based on multi-source data is integrated

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710645011.9A CN107526786A (en) 2017-08-01 2017-08-01 The method and system that place name address date based on multi-source data is integrated

Publications (1)

Publication Number Publication Date
CN107526786A true CN107526786A (en) 2017-12-29

Family

ID=60680550

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710645011.9A Pending CN107526786A (en) 2017-08-01 2017-08-01 The method and system that place name address date based on multi-source data is integrated

Country Status (1)

Country Link
CN (1) CN107526786A (en)

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108399192A (en) * 2018-01-25 2018-08-14 链家网(北京)科技有限公司 A kind of cell information matching process and device
CN108573039A (en) * 2018-04-04 2018-09-25 烟台海颐软件股份有限公司 A kind of target identification method assembled based on multisource spatio-temporal data and system
CN109308294A (en) * 2018-09-13 2019-02-05 浙江省国土勘测规划有限公司 Point of interest input system and method
CN110222139A (en) * 2019-06-14 2019-09-10 北京百度网讯科技有限公司 Road solid data De-weight method, calculates equipment and medium at device
CN111090630A (en) * 2019-12-16 2020-05-01 中科宇图科技股份有限公司 Data fusion processing method based on multi-source spatial point data
CN111104449A (en) * 2019-12-18 2020-05-05 福州市勘测院 Multisource city space-time standard address fusion method based on geographic space portrait mining
CN111143297A (en) * 2019-12-19 2020-05-12 上海三稻智能科技有限公司 System and method for classifying and splicing multi-format mixed data
CN111445309A (en) * 2020-03-26 2020-07-24 四川旅游学院 Social network-based travel service recommendation method
CN111459941A (en) * 2020-04-03 2020-07-28 福州市勘测院 Historical land parcel method based on geocoding index and multi-source data comparison
CN111488409A (en) * 2019-01-25 2020-08-04 阿里巴巴集团控股有限公司 City address library construction method, retrieval method and device
CN111680082A (en) * 2020-04-30 2020-09-18 四川弘智远大科技有限公司 Government financial data acquisition system and data acquisition method based on data integration
CN111723172A (en) * 2020-06-10 2020-09-29 广东世纪高通科技有限公司 Data fusion method and device
WO2020220810A1 (en) * 2019-04-30 2020-11-05 京东城市(南京)科技有限公司 Data fusion method and apparatus
CN112115221A (en) * 2020-09-08 2020-12-22 浙江嘉兴数字城市实验室有限公司 Multi-factor matching fusion method for block data
CN112182091A (en) * 2020-12-03 2021-01-05 光大科技有限公司 Multi-source data integration method, system, storage medium and electronic device
CN112417214A (en) * 2020-11-02 2021-02-26 中关村科学城城市大脑股份有限公司 Fusion method and system for multi-source heterogeneous data of urban brain scene
CN112905728A (en) * 2021-02-26 2021-06-04 中国科学院电子学研究所苏州研究院 Efficient fusion and retrieval system and method for multi-source place name data
CN112988715A (en) * 2021-04-13 2021-06-18 速度时空信息科技股份有限公司 Construction method of global network place name database based on open source mode
CN113127759A (en) * 2021-04-16 2021-07-16 深圳集智数字科技有限公司 Interest point processing method and device, computing equipment and computer readable storage medium
CN113254127A (en) * 2021-05-13 2021-08-13 中国电力工程顾问集团西南电力设计院有限公司 Processing method for large-data-volume graphic elements in power transmission line engineering measurement software
CN113434623A (en) * 2021-06-30 2021-09-24 广东省城乡规划设计研究院有限责任公司 Fusion method based on multi-source heterogeneous space planning data
CN113626408A (en) * 2021-08-05 2021-11-09 广州城市信息研究所有限公司 City information database construction method and map display method
CN113656493A (en) * 2021-07-23 2021-11-16 贵州图智信息技术有限公司 Method and system for constructing digital twin city multi-bank fusion

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8996523B1 (en) * 2011-05-24 2015-03-31 Google Inc. Forming quality street addresses from multiple providers
CN105740257A (en) * 2014-12-09 2016-07-06 朗新科技股份有限公司 Method and system for establishing standard geographic name address base
CN106850788A (en) * 2017-01-22 2017-06-13 中国科学院电子学研究所苏州研究院 Towards the integrated framework and integrated approach of multi-source heterogeneous geographic information resources

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8996523B1 (en) * 2011-05-24 2015-03-31 Google Inc. Forming quality street addresses from multiple providers
CN105740257A (en) * 2014-12-09 2016-07-06 朗新科技股份有限公司 Method and system for establishing standard geographic name address base
CN106850788A (en) * 2017-01-22 2017-06-13 中国科学院电子学研究所苏州研究院 Towards the integrated framework and integrated approach of multi-source heterogeneous geographic information resources

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
王银花: "多源地名地址和兴趣点数据整合方法研究", 《地理空间信息》 *

Cited By (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108399192B (en) * 2018-01-25 2020-07-24 贝壳找房(北京)科技有限公司 Cell information matching method and device
CN108399192A (en) * 2018-01-25 2018-08-14 链家网(北京)科技有限公司 A kind of cell information matching process and device
CN108573039A (en) * 2018-04-04 2018-09-25 烟台海颐软件股份有限公司 A kind of target identification method assembled based on multisource spatio-temporal data and system
CN109308294A (en) * 2018-09-13 2019-02-05 浙江省国土勘测规划有限公司 Point of interest input system and method
CN111488409A (en) * 2019-01-25 2020-08-04 阿里巴巴集团控股有限公司 City address library construction method, retrieval method and device
WO2020220810A1 (en) * 2019-04-30 2020-11-05 京东城市(南京)科技有限公司 Data fusion method and apparatus
CN110222139A (en) * 2019-06-14 2019-09-10 北京百度网讯科技有限公司 Road solid data De-weight method, calculates equipment and medium at device
CN110222139B (en) * 2019-06-14 2021-07-09 北京百度网讯科技有限公司 Road entity data duplication eliminating method, device, computing equipment and medium
CN111090630A (en) * 2019-12-16 2020-05-01 中科宇图科技股份有限公司 Data fusion processing method based on multi-source spatial point data
CN111104449A (en) * 2019-12-18 2020-05-05 福州市勘测院 Multisource city space-time standard address fusion method based on geographic space portrait mining
CN111143297A (en) * 2019-12-19 2020-05-12 上海三稻智能科技有限公司 System and method for classifying and splicing multi-format mixed data
CN111143297B (en) * 2019-12-19 2023-05-19 上海三稻智能科技有限公司 Multi-format mixed data classification and splicing system and method
CN111445309A (en) * 2020-03-26 2020-07-24 四川旅游学院 Social network-based travel service recommendation method
CN111445309B (en) * 2020-03-26 2023-05-30 四川旅游学院 Tourism service recommendation method based on social network
CN111459941A (en) * 2020-04-03 2020-07-28 福州市勘测院 Historical land parcel method based on geocoding index and multi-source data comparison
CN111680082A (en) * 2020-04-30 2020-09-18 四川弘智远大科技有限公司 Government financial data acquisition system and data acquisition method based on data integration
CN111680082B (en) * 2020-04-30 2023-08-18 四川弘智远大科技有限公司 Government financial data acquisition system and method based on data integration
CN111723172A (en) * 2020-06-10 2020-09-29 广东世纪高通科技有限公司 Data fusion method and device
CN112115221A (en) * 2020-09-08 2020-12-22 浙江嘉兴数字城市实验室有限公司 Multi-factor matching fusion method for block data
CN112417214A (en) * 2020-11-02 2021-02-26 中关村科学城城市大脑股份有限公司 Fusion method and system for multi-source heterogeneous data of urban brain scene
CN112182091A (en) * 2020-12-03 2021-01-05 光大科技有限公司 Multi-source data integration method, system, storage medium and electronic device
CN112905728A (en) * 2021-02-26 2021-06-04 中国科学院电子学研究所苏州研究院 Efficient fusion and retrieval system and method for multi-source place name data
CN112988715A (en) * 2021-04-13 2021-06-18 速度时空信息科技股份有限公司 Construction method of global network place name database based on open source mode
CN112988715B (en) * 2021-04-13 2021-08-13 速度时空信息科技股份有限公司 Construction method of global network place name database based on open source mode
CN113127759A (en) * 2021-04-16 2021-07-16 深圳集智数字科技有限公司 Interest point processing method and device, computing equipment and computer readable storage medium
CN113254127A (en) * 2021-05-13 2021-08-13 中国电力工程顾问集团西南电力设计院有限公司 Processing method for large-data-volume graphic elements in power transmission line engineering measurement software
CN113434623A (en) * 2021-06-30 2021-09-24 广东省城乡规划设计研究院有限责任公司 Fusion method based on multi-source heterogeneous space planning data
CN113434623B (en) * 2021-06-30 2022-02-15 广东省城乡规划设计研究院有限责任公司 Fusion method based on multi-source heterogeneous space planning data
CN113656493A (en) * 2021-07-23 2021-11-16 贵州图智信息技术有限公司 Method and system for constructing digital twin city multi-bank fusion
CN113626408A (en) * 2021-08-05 2021-11-09 广州城市信息研究所有限公司 City information database construction method and map display method
CN113626408B (en) * 2021-08-05 2022-04-12 广州城市信息研究所有限公司 City information database construction method and map display method

Similar Documents

Publication Publication Date Title
CN107526786A (en) The method and system that place name address date based on multi-source data is integrated
CN111222661B (en) Urban planning implementation effect analysis and evaluation method
CN101350012B (en) Method and system for matching address
CN112347222B (en) Method and system for converting non-standard address into standard address based on knowledge base reasoning
JP5856618B2 (en) Geospatial database integration method and device
CN109102193A (en) Geography designs ecological red line and delimit and management system and database, evaluation model
CN111221867B (en) Protective building information management system
CN114692236B (en) Big data-oriented territorial space planning base map base number processing method
CN112988715B (en) Construction method of global network place name database based on open source mode
CN109508363A (en) Water conservancy big data service platform and its working method based on GIS
CN116341967A (en) Park green scheme evaluation and optimization method, device and equipment based on GIS model and storage medium
CN112365391A (en) Land diversity measurement method based on 'homeland survey' data
CN111813819B (en) Space-time big data-based place name and address online matching method
CN114661744B (en) Terrain database updating method and system based on deep learning
Olszewski et al. Methodology of creating the new generation of official topographic maps in Poland
Droj GIS and remote sensing in environmental management
CN113672788A (en) Urban building function classification method based on multi-source data and weight coefficient method
CN109977190B (en) Large-scale vector map data-oriented area query processing method and device
Nod et al. Methods for measuring the spatial mobility of tourists using a network theory approach
Bond et al. The role of geographic information systems in survey analysis
CN114003678A (en) Data distribution method, dangerous waste management method based on data distribution method and road emergency management method
Paraskevopoulos et al. Exploring the urban types of built density, network centrality, and functional mixture in the city of athens
Lai et al. Computing Places and Human Activity in Data-absent Informal Urban Settlements
Kostourou Visualising Change in 4D: Working With Quantitative and Qualitative Data in Cartographic Studies
Tran et al. Exploiting WebGis technology to build an environmental database to support the environmental management of Ho Chi Minh city

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 210042 8 Blocks 699-22 Xuanwu Avenue, Xuanwu District, Nanjing City, Jiangsu Province

Applicant after: Speed Space-time Information Technology Co., Ltd.

Address before: 210000 8 -22, 699 Xuanwu Road, Xuanwu District, Nanjing, Jiangsu.

Applicant before: Jiangsu speed information Polytron Technologies Inc

CB02 Change of applicant information
RJ01 Rejection of invention patent application after publication

Application publication date: 20171229

RJ01 Rejection of invention patent application after publication