[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN104133861A - Method for intelligently resolving excel format international air ticket freight rate lists - Google Patents

Method for intelligently resolving excel format international air ticket freight rate lists Download PDF

Info

Publication number
CN104133861A
CN104133861A CN201410336305.XA CN201410336305A CN104133861A CN 104133861 A CN104133861 A CN 104133861A CN 201410336305 A CN201410336305 A CN 201410336305A CN 104133861 A CN104133861 A CN 104133861A
Authority
CN
China
Prior art keywords
freight rate
information
voyage
freight
list
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410336305.XA
Other languages
Chinese (zh)
Other versions
CN104133861B (en
Inventor
黄翰
叶树锦
卢尔昂
郝志峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
South China University of Technology SCUT
Original Assignee
South China University of Technology SCUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by South China University of Technology SCUT filed Critical South China University of Technology SCUT
Priority to CN201410336305.XA priority Critical patent/CN104133861B/en
Publication of CN104133861A publication Critical patent/CN104133861A/en
Application granted granted Critical
Publication of CN104133861B publication Critical patent/CN104133861B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a method for intelligently resolving excel format international air ticket freight rate lists. According to the method, general characteristics are analyzed from different freight rate lists in different formats, and different information retrieval extraction rules are analyzed and concluded for different freight rate information, so the required freight rate information is extracted. The freight rate information mainly includes freight rate list basic information, air range information, air range freight rate, additional freight rate and the like. Finally, the freight rate lists are split into a plurality of pieces of air range information according to differences of departure places, transit stations, destinations, single-trip or round-trip modes, cabins and the like, and are stored into a unified format. The method provided by the invention has the advantage that the unified air range information can be accurately and fast extracted from the freight rate lists.

Description

The method of the international air ticket freight rate of a kind of intelligently parsing excel form list
Technical field
The present invention relates generally to information retrieval technical field, be specifically related to the method for the international air ticket freight rate of a kind of intelligently parsing excel form list.
Background technology
Along with the raising of quality of life and the development of communications and transportation, there is now increasing people to start to select airplane trip, so travelling merchants group needs to process by being faced with a large amount of air ticket freight rate lists.Traditional processing mode, is to read freight rate list by artificially, and by the valency list information of reading input system manually.Yet the form of different freight rate lists is far from each other, even the different price list of identical boat department all exists many difference.Therefore by manual type, process, usually need to expend many manpowers and time.
Based on above situation, we have proposed the method for the international air ticket freight rate of a kind of intelligently parsing excel form list, have replaced the mode of manual entry, thereby have saved in large quantities manpower and time.
Summary of the invention
The present invention is directed to the deficiency of current manual extraction freight rate list infotech, a kind of intelligently parsing is provided and has extracted the method for excel freight rate list information.The object of the invention is by intelligently parsing freight rate list, extract freight rate information, valency list is split into many voyage information, and save as unified formatted output.Concrete technical scheme is as described below.
A method for the international air ticket freight rate of intelligently parsing excel form list, comprises the following steps:
(1) a large amount of existing valency lists are analyzed;
(2) valency individual palpation different-format is classified;
(3) classified valency list is resolved respectively, according to the Position Approximate at voyage attribute place, locking range of search;
(4) in the range of search of locking, the retrieval extracting rule of the freight rate information of the required extraction of analytic induction.
(5) in the range of search of locking, single-frame scan, the gauge outfit sign of lookup table, and record voyage attribute and the place line number of gauge outfit;
(6) at gauge outfit position next line, start single-frame to scan, find out all cells that comprise pricing information, each pricing information can split into a voyage;
(7), according to the retrieval extracting rule of analytic induction, find out voyage attribute corresponding to all prices in step (6), and preserve with unified form;
(8) repeating step (5), to (7), until can not find next gauge outfit sign, splits complete to all forms.
More specifically, the analysis described in step (1) is: according to the freight rate information that will extract, and the similarities and differences between initial analysis different price list.The freight rate information that wherein will extract comprises, departure place, destination, terminal, travel type, the maximum residence time, the minimum residence time, and the voyage attributes such as freight rate of being grown up.
More specifically, the sorting technique in step (2) is: according to the initial analysis of step (1), the valency list with larger general character that information needed storage mode is close or identical is sorted out.
More specifically, the object of step (3) is to dwindle range of search, improves retrieval rate.
More specifically, during step (4) is described, the retrieval extracting rule of analyzing the freight rate information of required extraction comprises following steps:
(a) whether there are common fixing key word or crucial phrase near finding out the information of required extraction, using it as retrieval symbol;
(b) determine the position relationship between required information extraction and retrieval symbol.
More specifically, in described step (5), gauge outfit contains following voyage attribute conventionally:
(a) route, wherein comprises three character codes of departure place and terminal (if any terminal), and separates with "-";
(b) voyage type, comprises one way and roundtrip two classes;
(c) booking freight space, freight space information, by freight space representation, is single capitalization English letter;
(d) term of validity.
More specifically, in step (6), the lookup method of pricing information is: from gauge outfit position next line, single-frame scan, run into the cell of pure digi-tal, be the cell of storage pricing information, total number of recorded unit lattice.
More specifically, a corresponding voyage in pricing information unit, step (7) splits voyage according to the price unit finding in step (6):
(a) utilize API that Java carries to find the retrieval symbol of institute's analytic induction in step (4);
(b) according to analysis in step (4), sum up, the relation between required freight rate information and retrieval symbol, finds out all freight rate information that pricing information unit is corresponding.
(c) to each pricing information unit repeating step (a) and (b), until form is split into many voyages according to consolidation form.
Compared with prior art, tool of the present invention has the following advantages and technique effect: the present invention is by excel freight rate list is sorted out, and summarizes respectively retrieval extracting rule, thereby intelligently parsing extracts required freight rate information, improves retrieval rate.
Accompanying drawing explanation
Fig. 1 is the method flow schematic diagram of the international air ticket freight rate of a kind of intelligently parsing excel form of the present invention list.
Embodiment
In order to allow those skilled in the art can understand better technical scheme of the present invention, below in conjunction with accompanying drawing, the invention will be further elaborated.
As shown in Figure 1, the method that the present invention has disclosed the international air ticket freight rate of a kind of intelligently parsing excel form list comprises the following steps:
(1) a large amount of existing valency lists are analyzed: according to the information that will extract, the similarities and differences between initial analysis different price list.The information that wherein will extract comprises, departure place, destination, terminal, travel type, the maximum residence time, the minimum residence time, and the voyage attributes such as freight rate of being grown up.
(2) valency individual palpation different-format is classified, the valency list with larger general character that freight rate information storage mode is close or identical is sorted out.
(3) classified valency list is resolved respectively, according to the Position Approximate at voyage attribute place, locking range of search, to dwindle range of search, improves retrieval rate.
(4), in the range of search of locking, analyze the retrieval extracting rule of the freight rate information of required extraction:
(a) whether there are common fixing key word or crucial phrase near finding out the information of required extraction, using it as retrieval symbol;
(b) determine the position relationship between required information extraction and retrieval symbol.
(5) in the range of search of locking, single-frame scan, the gauge outfit sign of lookup table, and record voyage attribute and the place line number of gauge outfit, gauge outfit contains following voyage attribute conventionally:
(a) route, wherein comprises three character codes of departure place and terminal (if any terminal), and separates with "-";
(b) voyage type, comprises one way and roundtrip two classes;
(c) booking freight space, freight space information, by freight space representation, is single capitalization English letter;
(d) term of validity.
(6) lookup method of pricing information unit is: from gauge outfit position next line, single-frame scan, run into the cell of pure digi-tal, be the cell of storage pricing information, total number of recorded unit lattice.
(7) find out voyage attribute corresponding to all prices in step (6), a corresponding fractionation in pricing information unit, comprises the following steps:
(a) utilize API that Java carries to find the retrieval symbol of institute's analytic induction in step (4);
(b) according to analysis in step (4), sum up, the relation between required freight rate information and retrieval symbol, finds out all freight rate information that pricing information unit is corresponding.
(c) to each pricing information unit repeating step (a) and (b), until valency individual palpation is split into many voyages according to consolidation form.
(8) repeating step (5), to (7), until can not find next gauge outfit sign, splits complete to all forms.
The present embodiment is more excellent embodiment of the present invention; it should be noted that; in the situation that not deviating from spirit of the present invention and essence thereof; those of ordinary skill in the art are when making according to the present invention various corresponding changes and distortion, but these changes and distortion all should belong to the protection domain of the appended claim of the present invention.

Claims (4)

1. a method for the international air ticket freight rate of intelligently parsing excel form list, is characterized in that, comprises the following steps:
(1) a large amount of existing valency lists are analyzed: according to the freight rate information that will extract, the similarities and differences between initial analysis different price list, the freight rate information that wherein will extract comprises departure place, destination, terminal, travel type, the maximum residence time, the minimum residence time and adult's freight rate voyage attribute;
(2) valency individual palpation different-format is classified, sorting technique is: according to the initial analysis of step (1), the valency list with larger general character that information needed storage mode is close or identical is sorted out;
(3) classified valency list is resolved respectively, according to the Position Approximate at voyage attribute place, locking range of search;
(4) in the range of search of locking, the retrieval extracting rule of the freight rate information of the required extraction of analytic induction, specifically comprises following steps:
(a) whether there are common fixing key word or crucial phrase near finding out the information of required extraction, using it as retrieval symbol;
(b) determine the position relationship between required information extraction and retrieval symbol;
(5) in the range of search of locking, single-frame scan, the gauge outfit sign of lookup table, and record voyage attribute and the place line number of gauge outfit;
(6) at gauge outfit position next line, start single-frame to scan, find out all cells that comprise pricing information, each pricing information splits into a voyage;
(7), according to the retrieval extracting rule of analytic induction, find out voyage attribute corresponding to all prices in step (6), and preserve with unified form;
(8) repeating step (5), to (7), until can not find next gauge outfit sign, splits complete to all forms.
2. the method for the international air ticket freight rate of intelligently parsing excel form list according to claim 1, is characterized in that: in described step (5), gauge outfit contains following voyage attribute:
(a) route, wherein comprises departure place and terminal three character codes, and separates with "-";
(b) voyage type, comprises one way and roundtrip two classes;
(c) booking freight space, freight space information, by freight space representation, is single capitalization English letter;
(d) term of validity.
3. the method for the international air ticket freight rate of intelligently parsing excel form list according to claim 1, it is characterized in that: the lookup method of the described pricing information of step (6) is: from gauge outfit position next line, single-frame scan, run into the cell of pure digi-tal, be the cell of storage pricing information, total number of recorded unit lattice.
4. the method for the international air ticket freight rate of intelligently parsing excel form list according to claim 1, it is characterized in that: a corresponding voyage in pricing information unit, the price unit finding according to step (6) in step (7) splits voyage, comprises the following steps:
(a) utilize API that Java carries to find the retrieval symbol of institute's analytic induction in step (4);
(b) according to analysis in step (4), sum up, the relation between required freight rate information and retrieval symbol, finds out all freight rate information that pricing information unit is corresponding;
(c) to each pricing information unit repeating step (a) and (b), until form is split into many voyages according to consolidation form.
CN201410336305.XA 2014-07-16 2014-07-16 A kind of method of the international air ticket freight rate list of intelligently parsing excel forms Active CN104133861B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410336305.XA CN104133861B (en) 2014-07-16 2014-07-16 A kind of method of the international air ticket freight rate list of intelligently parsing excel forms

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410336305.XA CN104133861B (en) 2014-07-16 2014-07-16 A kind of method of the international air ticket freight rate list of intelligently parsing excel forms

Publications (2)

Publication Number Publication Date
CN104133861A true CN104133861A (en) 2014-11-05
CN104133861B CN104133861B (en) 2017-08-25

Family

ID=51806539

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410336305.XA Active CN104133861B (en) 2014-07-16 2014-07-16 A kind of method of the international air ticket freight rate list of intelligently parsing excel forms

Country Status (1)

Country Link
CN (1) CN104133861B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107392660A (en) * 2017-07-14 2017-11-24 深圳市活力天汇科技股份有限公司 A kind of budget fare lookup method
CN110717794A (en) * 2019-10-21 2020-01-21 中国民航信息网络股份有限公司 Freight rate calculation processing method and device
CN110737665A (en) * 2019-10-21 2020-01-31 中国民航信息网络股份有限公司 data processing method and device
CN110751521A (en) * 2019-10-21 2020-02-04 中国民航信息网络股份有限公司 International freight rate determination method and device for single-trip single-flight section
CN117408338A (en) * 2023-12-14 2024-01-16 神州医疗科技股份有限公司 Method and system for constructing knowledge graph of traditional Chinese medicine decoction pieces based on Chinese pharmacopoeia

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040064500A1 (en) * 2001-11-20 2004-04-01 Kolar Jennifer Lynn System and method for unified extraction of media objects
CN101388036A (en) * 2008-10-08 2009-03-18 金蝶软件(中国)有限公司 Data table summarizing method and device
CN101706809A (en) * 2009-11-17 2010-05-12 北京灵图软件技术有限公司 Method, device and system for processing multi-source map data
CN102375859A (en) * 2010-08-25 2012-03-14 阿里巴巴集团控股有限公司 Method and equipment for processing information

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040064500A1 (en) * 2001-11-20 2004-04-01 Kolar Jennifer Lynn System and method for unified extraction of media objects
CN101388036A (en) * 2008-10-08 2009-03-18 金蝶软件(中国)有限公司 Data table summarizing method and device
CN101706809A (en) * 2009-11-17 2010-05-12 北京灵图软件技术有限公司 Method, device and system for processing multi-source map data
CN102375859A (en) * 2010-08-25 2012-03-14 阿里巴巴集团控股有限公司 Method and equipment for processing information

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107392660A (en) * 2017-07-14 2017-11-24 深圳市活力天汇科技股份有限公司 A kind of budget fare lookup method
CN110717794A (en) * 2019-10-21 2020-01-21 中国民航信息网络股份有限公司 Freight rate calculation processing method and device
CN110737665A (en) * 2019-10-21 2020-01-31 中国民航信息网络股份有限公司 data processing method and device
CN110751521A (en) * 2019-10-21 2020-02-04 中国民航信息网络股份有限公司 International freight rate determination method and device for single-trip single-flight section
CN110737665B (en) * 2019-10-21 2023-07-18 中国民航信息网络股份有限公司 Data processing method and device
CN117408338A (en) * 2023-12-14 2024-01-16 神州医疗科技股份有限公司 Method and system for constructing knowledge graph of traditional Chinese medicine decoction pieces based on Chinese pharmacopoeia
CN117408338B (en) * 2023-12-14 2024-03-12 神州医疗科技股份有限公司 Method and system for constructing knowledge graph of traditional Chinese medicine decoction pieces based on Chinese pharmacopoeia

Also Published As

Publication number Publication date
CN104133861B (en) 2017-08-25

Similar Documents

Publication Publication Date Title
CN104133861A (en) Method for intelligently resolving excel format international air ticket freight rate lists
Ma et al. Transit smart card data mining for passenger origin information extraction
CN108614898B (en) Document analysis method and device
US7937338B2 (en) System and method for identifying document structure and associated metainformation
CN103703459A (en) Method and system for text message normalization based on character transformation and unsupervised of web data
CN107145516B (en) Text clustering method and system
Xu et al. A supervoxel approach to the segmentation of individual trees from LiDAR point clouds
CN107577702B (en) Method for distinguishing traffic information in social media
CN104731958A (en) User-demand-oriented cloud manufacturing service recommendation method
CN101853246A (en) Method and device for converting document format
CN108763212A (en) A kind of address information extraction method and device
CN111931077B (en) Data processing method, device, electronic equipment and storage medium
US20130318110A1 (en) System for data extraction and processing
CN113761202B (en) Optimizing system for mapping unstructured financial Excel form to database
CN110765231A (en) Chapter event extraction method based on common-finger fusion
CN110968730B (en) Audio mark processing method, device, computer equipment and storage medium
CN112560468A (en) Meteorological early warning text processing method, related device and computer program product
CN103902918B (en) Method and device for rapidly extracting text from Word document
CN103778141A (en) Mixed PDF book catalogue automatic extracting algorithm
CN112749283A (en) Entity relationship joint extraction method for legal field
CN102063413A (en) Fast text composition method of mobile terminal
CN105512996A (en) Method and system for determining most common place-of-departure
CN108228778B (en) BPA power flow data separation equivalent conversion method based on MATLAB platform
CN103218420A (en) Method and device for extracting page titles
KR20200084166A (en) Address abbreviated waybill

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant