CN104133861A - Method for intelligently resolving excel format international air ticket freight rate lists
- Publication number: CN104133861A
- Application number: CN201410336305.XA
- Authority: CN (China)
- Prior art keywords: freight rate, information, voyage, freight, list
- Legal status: Granted (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a method for intelligently parsing international air fare sheets in Excel format. Common characteristics are identified across fare sheets in different formats, and separate retrieval and extraction rules are derived for each kind of fare information, so that the required fare information can be extracted. The fare information mainly comprises basic fare sheet information, itinerary information, itinerary fares, additional fares and the like. Finally, each fare sheet is split into multiple itinerary records according to differences in origin, transfer point, destination, one-way or round-trip type, cabin class and the like, and the records are stored in a unified format. The advantage of the method is that uniform itinerary information can be extracted from fare sheets quickly and accurately.
Description
Technical field
The present invention relates generally to the technical field of information retrieval, and specifically to a method for intelligently parsing international air fare sheets in Excel format.
Background
With rising living standards and the development of transportation, more and more people choose to travel by air, so travel agencies face a large volume of air fare sheets to process. The traditional approach is for staff to read each fare sheet and enter its information into a system by hand. However, the formats of different fare sheets vary widely, and even different fare sheets from the same airline differ in many respects. Manual processing therefore usually consumes considerable labor and time.
In view of this, we propose a method for intelligently parsing international air fare sheets in Excel format, which replaces manual entry and thereby saves a great deal of labor and time.
Summary of the invention
To address the shortcomings of the current practice of extracting fare sheet information manually, the present invention provides a method for intelligently parsing Excel fare sheets and extracting their information. The object of the invention is to parse a fare sheet intelligently, extract its fare information, split the sheet into multiple itinerary records, and output them in a unified format. The specific technical solution is as follows.
A method for intelligently parsing international air fare sheets in Excel format comprises the following steps:
(1) analyze a large number of existing fare sheets;
(2) classify the fare sheets by format;
(3) parse each class of fare sheet separately, locking the search range according to the approximate position of the itinerary attributes;
(4) within the locked search range, analyze and derive the retrieval and extraction rules for the fare information to be extracted;
(5) within the locked search range, scan cell by cell to find the header marker of a table, and record the itinerary attributes in the header and the row number of the header;
(6) starting from the row below the header, scan cell by cell to find all cells containing price information; each price can be split out as one itinerary;
(7) according to the derived retrieval and extraction rules, find the itinerary attributes corresponding to every price found in step (6), and save them in a unified format;
(8) repeat steps (5) to (7) until no further header marker can be found, at which point all tables have been split.
More specifically, the analysis in step (1) is a preliminary analysis of the similarities and differences between fare sheets with respect to the fare information to be extracted. That information comprises itinerary attributes such as origin, destination, transfer point, trip type, maximum stay, minimum stay, and adult fare.
More specifically, the classification in step (2) groups together, on the basis of the preliminary analysis in step (1), fare sheets that have much in common, that is, sheets that store the required information in the same or a similar way.
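The text does not say how the grouping in step (2) is carried out, and it may well be a manual step. As one possible illustration only, the sketch below groups sheets by the set of marker keywords they contain. The keyword list `MARKERS`, the function names, and the representation of a sheet as a list of rows are all assumptions made for this example.

```python
from collections import defaultdict

# Hypothetical marker keywords; the patent does not list the actual ones.
MARKERS = ("ROUTE", "CLASS", "VALIDITY", "MIN STAY", "MAX STAY")


def layout_signature(grid):
    """Return the set of marker keywords that occur anywhere in the sheet."""
    found = set()
    for row in grid:
        for cell in row:
            text = str(cell).upper() if cell is not None else ""
            for marker in MARKERS:
                if marker in text:
                    found.add(marker)
    return frozenset(found)


def classify(sheets):
    """Group sheet names by layout signature (sheets: name -> grid)."""
    groups = defaultdict(list)
    for name, grid in sheets.items():
        groups[layout_signature(grid)].append(name)
    return dict(groups)
```

Sheets that share a signature would then be handled by the same set of extraction rules; a real classifier would probably also need to compare where the keywords appear, not only whether they appear.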
More specifically, the purpose of step (3) is to narrow the search range and thereby speed up retrieval.
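A minimal sketch of what scanning inside a locked range could look like, assuming the sheet has already been loaded as a list of rows and that the range is a rectangle with inclusive bounds. The function name `iter_cells` and the rectangle representation are illustrative choices, not taken from the patent.

```python
def iter_cells(grid, top, bottom, left, right):
    """Yield (row, col, value) for cells inside the locked range.

    Bounds are inclusive and are clipped to the actual size of the grid,
    so an over-generous range does not raise an IndexError.
    """
    last_row = min(bottom, len(grid) - 1)
    for r in range(max(top, 0), last_row + 1):
        row = grid[r]
        last_col = min(right, len(row) - 1)
        for c in range(max(left, 0), last_col + 1):
            yield r, c, row[c]
```

Restricting later scans to such a rectangle means that cells outside the region where a given class of sheet keeps its itinerary attributes are never visited.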
More specifically, deriving the retrieval and extraction rules for the required fare information in step (4) comprises the following steps:
(a) determine whether a common, fixed keyword or key phrase appears near the information to be extracted, and if so use it as the search marker;
(b) determine the positional relationship between the information to be extracted and the search marker.
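Steps (a) and (b) amount to a rule made of a marker and a relative position. The sketch below shows one way to represent and apply such a rule, assuming the positional relationship can be expressed as a fixed row and column offset from the marker cell. The class `ExtractionRule`, its field names, and the sample markers are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ExtractionRule:
    field: str   # name of the fare attribute to extract
    marker: str  # fixed keyword expected near the value
    d_row: int   # row offset from the marker cell to the value cell
    d_col: int   # column offset from the marker cell to the value cell


def find_marker(grid, marker):
    """Return (row, col) of the first cell containing the marker, or None."""
    for r, row in enumerate(grid):
        for c, cell in enumerate(row):
            if isinstance(cell, str) and marker.lower() in cell.lower():
                return r, c
    return None


def apply_rule(grid, rule):
    """Return the value at the rule's offset from its marker, or None."""
    pos = find_marker(grid, rule.marker)
    if pos is None:
        return None
    r, c = pos[0] + rule.d_row, pos[1] + rule.d_col
    if 0 <= r < len(grid) and 0 <= c < len(grid[r]):
        return grid[r][c]
    return None
```

For example, a rule `ExtractionRule("max_stay", "max stay", 0, 1)` reads the cell immediately to the right of a cell containing "Max stay".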
More specifically, in step (5) the header usually contains the following itinerary attributes:
(a) route, comprising the three-letter codes of the origin and of the transfer point (if there is one), separated by "-";
(b) trip type, either one-way or round-trip;
(c) booking class, denoted by a class code consisting of a single uppercase English letter;
(d) validity period.
More specifically, price information is found in step (6) as follows: starting from the row below the header, scan cell by cell; any cell whose content is purely numeric is a cell storing price information. Record the total number of such cells.
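The price-cell test can be sketched as follows. "Purely numeric" is interpreted here as either a numeric cell value or a string made up only of ASCII digits; whether decimal strings such as "5200.50" should also count is not stated in the text, so this sketch excludes them. The function names are illustrative.

```python
from numbers import Real


def is_price(cell):
    """Treat numeric cells and digit-only strings as prices."""
    if isinstance(cell, bool):  # bool is a subclass of int; exclude it
        return False
    if isinstance(cell, Real):
        return True
    if isinstance(cell, str):
        text = cell.strip()
        return text.isascii() and text.isdigit()
    return False


def find_price_cells(grid, header_row):
    """Scan row by row below the header and collect (row, col) of prices."""
    return [
        (r, c)
        for r in range(header_row + 1, len(grid))
        for c, cell in enumerate(grid[r])
        if is_price(cell)
    ]
```

The length of the returned list is the total number of price cells that the step asks to record.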
More specifically, one price cell corresponds to one itinerary. Step (7) splits out the itineraries from the price cells found in step (6):
(a) use Java's built-in API to locate the search markers derived in step (4);
(b) using the relationship between the required fare information and the search markers, as derived in step (4), find all the fare information corresponding to the price cell;
(c) repeat steps (a) and (b) for each price cell until the table has been split into multiple itineraries in the unified format.
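The text mentions Java's built-in API; for brevity the sketch below uses Python instead and shows only the splitting idea: every price cell becomes one record whose attributes are looked up at fixed positions relative to the cell. The assumed layout (route in the first column of the same row, booking class in the header row of the same column) and the record field names are illustrative, not the patent's actual rules.

```python
def split_into_records(grid, header_row, price_cells, route_col=0):
    """Build one unified-format record per price cell.

    Assumed positional relationships (hypothetical):
      - the route label sits in column `route_col` of the price's row;
      - the booking class sits in the header row, in the price's column.
    """
    records = []
    for r, c in price_cells:
        records.append(
            {
                "route": grid[r][route_col],
                "booking_class": grid[header_row][c],
                "adult_fare": float(grid[r][c]),
            }
        )
    return records
```

With a three-column table holding routes in the first column and classes "Y" and "B" in the header, each numeric cell yields one dictionary, so a row with two prices produces two itinerary records.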
Compared with the prior art, the invention has the following advantage and technical effect: by classifying Excel fare sheets and deriving retrieval and extraction rules for each class, it intelligently parses the sheets, extracts the required fare information, and speeds up retrieval.
Brief description of the drawings
Fig. 1 is a flow diagram of the method of the invention for intelligently parsing international air fare sheets in Excel format.
Embodiment
To help those skilled in the art better understand the technical solution of the invention, the invention is further described below with reference to the accompanying drawing.
As shown in Fig. 1, the method disclosed by the invention for intelligently parsing international air fare sheets in Excel format comprises the following steps:
(1) Analyze a large number of existing fare sheets: make a preliminary analysis of the similarities and differences between fare sheets with respect to the information to be extracted. That information comprises itinerary attributes such as origin, destination, transfer point, trip type, maximum stay, minimum stay, and adult fare.
(2) Classify the fare sheets by format, grouping together sheets that have much in common, that is, sheets that store fare information in the same or a similar way.
(3) Parse each class of fare sheet separately, locking the search range according to the approximate position of the itinerary attributes, so as to narrow the search range and speed up retrieval.
(4) Within the locked search range, derive the retrieval and extraction rules for the fare information to be extracted:
(a) determine whether a common, fixed keyword or key phrase appears near the information to be extracted, and if so use it as the search marker;
(b) determine the positional relationship between the information to be extracted and the search marker.
(5) Within the locked search range, scan cell by cell to find the header marker of a table, and record the itinerary attributes in the header and the row number of the header. The header usually contains the following itinerary attributes:
(a) route, comprising the three-letter codes of the origin and of the transfer point (if there is one), separated by "-";
(b) trip type, either one-way or round-trip;
(c) booking class, denoted by a class code consisting of a single uppercase English letter;
(d) validity period.
(6) Find the price cells as follows: starting from the row below the header, scan cell by cell; any cell whose content is purely numeric is a cell storing price information. Record the total number of such cells.
(7) Find the itinerary attributes corresponding to every price found in step (6), with each price cell yielding one split-out itinerary, through the following steps:
(a) use Java's built-in API to locate the search markers derived in step (4);
(b) using the relationship between the required fare information and the search markers, as derived in step (4), find all the fare information corresponding to the price cell;
(c) repeat steps (a) and (b) for each price cell until the fare sheet has been split into multiple itineraries in the unified format.
(8) Repeat steps (5) to (7) until no further header marker can be found, at which point all tables have been split.
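Steps (5) to (8) can be pictured as a loop over the tables in one sheet. The sketch below is a simplified single-pass version under several assumptions that do not come from the patent: the sheet is a list of rows, a header row is any row containing a cell equal to the hypothetical marker `ROUTE`, the route sits in the first column, and the booking class sits in the header above each price.

```python
from numbers import Real

HEADER_MARKER = "ROUTE"  # assumed header marker, not taken from the patent


def is_header_row(row):
    """A row counts as a header if some cell equals the marker."""
    return any(
        isinstance(c, str) and c.strip().upper() == HEADER_MARKER for c in row
    )


def is_price(cell):
    """Numeric cells and digit-only strings are treated as prices."""
    if isinstance(cell, bool):
        return False
    if isinstance(cell, Real):
        return True
    return isinstance(cell, str) and cell.strip().isascii() and cell.strip().isdigit()


def parse_sheet(grid):
    """Split every table in the sheet into unified itinerary records."""
    records = []
    header = None
    for row in grid:
        if is_header_row(row):
            header = row  # a new table starts here
            continue
        if header is None:
            continue  # text above the first table is ignored
        for c, cell in enumerate(row):
            if is_price(cell) and c < len(header):
                records.append(
                    {
                        "route": row[0],
                        "booking_class": header[c],
                        "adult_fare": float(cell),
                    }
                )
    return records
```

Each new header row switches the attribute lookup to the next table, and the loop ends when the rows run out, which corresponds to no further header marker being found.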
This embodiment is a preferred embodiment of the invention. It should be noted that those of ordinary skill in the art may make various corresponding changes and modifications according to the invention without departing from its spirit and essence, and all such changes and modifications fall within the scope of protection of the appended claims.
Claims (4)
1. A method for intelligently parsing international air fare sheets in Excel format, characterized in that it comprises the following steps:
(1) analyzing a large number of existing fare sheets: making a preliminary analysis of the similarities and differences between fare sheets with respect to the fare information to be extracted, wherein the fare information to be extracted comprises the itinerary attributes of origin, destination, transfer point, trip type, maximum stay, minimum stay, and adult fare;
(2) classifying the fare sheets by format, wherein, on the basis of the preliminary analysis of step (1), fare sheets that have much in common, storing the required information in the same or a similar way, are grouped together;
(3) parsing each class of fare sheet separately, and locking the search range according to the approximate position of the itinerary attributes;
(4) within the locked search range, analyzing and deriving the retrieval and extraction rules for the fare information to be extracted, specifically comprising the following steps:
(a) determining whether a common, fixed keyword or key phrase appears near the information to be extracted, and if so using it as the search marker;
(b) determining the positional relationship between the information to be extracted and the search marker;
(5) within the locked search range, scanning cell by cell to find the header marker of a table, and recording the itinerary attributes in the header and the row number of the header;
(6) starting from the row below the header, scanning cell by cell to find all cells containing price information, each price being split out as one itinerary;
(7) according to the derived retrieval and extraction rules, finding the itinerary attributes corresponding to every price found in step (6), and saving them in a unified format;
(8) repeating steps (5) to (7) until no further header marker can be found, at which point all tables have been split.
2. the method for the international air ticket freight rate of intelligently parsing excel form list according to claim 1, is characterized in that: in described step (5), gauge outfit contains following voyage attribute:
(a) route, wherein comprises departure place and terminal three character codes, and separates with "-";
(b) voyage type, comprises one way and roundtrip two classes;
(c) booking freight space, freight space information, by freight space representation, is single capitalization English letter;
(d) term of validity.
3. the method for the international air ticket freight rate of intelligently parsing excel form list according to claim 1, it is characterized in that: the lookup method of the described pricing information of step (6) is: from gauge outfit position next line, single-frame scan, run into the cell of pure digi-tal, be the cell of storage pricing information, total number of recorded unit lattice.
4. the method for the international air ticket freight rate of intelligently parsing excel form list according to claim 1, it is characterized in that: a corresponding voyage in pricing information unit, the price unit finding according to step (6) in step (7) splits voyage, comprises the following steps:
(a) utilize API that Java carries to find the retrieval symbol of institute's analytic induction in step (4);
(b) according to analysis in step (4), sum up, the relation between required freight rate information and retrieval symbol, finds out all freight rate information that pricing information unit is corresponding;
(c) to each pricing information unit repeating step (a) and (b), until form is split into many voyages according to consolidation form.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410336305.XA CN104133861B (en) | 2014-07-16 | 2014-07-16 | A kind of method of the international air ticket freight rate list of intelligently parsing excel forms |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410336305.XA CN104133861B (en) | 2014-07-16 | 2014-07-16 | A kind of method of the international air ticket freight rate list of intelligently parsing excel forms |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104133861A (en) | 2014-11-05 |
CN104133861B (en) | 2017-08-25 |
Family
ID=51806539
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410336305.XA Active CN104133861B (en) | 2014-07-16 | 2014-07-16 | A kind of method of the international air ticket freight rate list of intelligently parsing excel forms |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104133861B (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040064500A1 (en) * | 2001-11-20 | 2004-04-01 | Kolar Jennifer Lynn | System and method for unified extraction of media objects |
CN101388036A (en) * | 2008-10-08 | 2009-03-18 | 金蝶软件(中国)有限公司 | Data table summarizing method and device |
CN101706809A (en) * | 2009-11-17 | 2010-05-12 | 北京灵图软件技术有限公司 | Method, device and system for processing multi-source map data |
CN102375859A (en) * | 2010-08-25 | 2012-03-14 | 阿里巴巴集团控股有限公司 | Method and equipment for processing information |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107392660A (en) * | 2017-07-14 | 2017-11-24 | 深圳市活力天汇科技股份有限公司 | A kind of budget fare lookup method |
CN110717794A (en) * | 2019-10-21 | 2020-01-21 | 中国民航信息网络股份有限公司 | Freight rate calculation processing method and device |
CN110737665A (en) * | 2019-10-21 | 2020-01-31 | 中国民航信息网络股份有限公司 | data processing method and device |
CN110751521A (en) * | 2019-10-21 | 2020-02-04 | 中国民航信息网络股份有限公司 | International freight rate determination method and device for single-trip single-flight section |
CN110737665B (en) * | 2019-10-21 | 2023-07-18 | 中国民航信息网络股份有限公司 | Data processing method and device |
CN117408338A (en) * | 2023-12-14 | 2024-01-16 | 神州医疗科技股份有限公司 | Method and system for constructing knowledge graph of traditional Chinese medicine decoction pieces based on Chinese pharmacopoeia |
CN117408338B (en) * | 2023-12-14 | 2024-03-12 | 神州医疗科技股份有限公司 | Method and system for constructing knowledge graph of traditional Chinese medicine decoction pieces based on Chinese pharmacopoeia |
Also Published As
Publication number | Publication date |
---|---|
CN104133861B (en) | 2017-08-25 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||