CN103412904A - PDF (portable document format) file comparison method and PDF file comparison system - Google Patents
PDF (portable document format) file comparison method and PDF file comparison system Download PDFInfo
- Publication number
- CN103412904A CN103412904A CN2013103298395A CN201310329839A CN103412904A CN 103412904 A CN103412904 A CN 103412904A CN 2013103298395 A CN2013103298395 A CN 2013103298395A CN 201310329839 A CN201310329839 A CN 201310329839A CN 103412904 A CN103412904 A CN 103412904A
- Authority
- CN
- China
- Prior art keywords
- paragraph
- pdf document
- expression vector
- computing machine
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a PDF (portable document format) file comparison method and a PDF file comparison system and relates to the field of computers. The method includes the following steps: 110, converting each page in two PDF files into a computer presentation carrier; 120, extracting a remaining paragraph in a first PDF file as a target paragraph; 130, judging whether a remaining paragraph exists within a preset range in a second PDF file or not, if yes, matching within the preset range to acquire the most similar paragraph of the target paragraph, marking characters, identical in the two paragraphs, on the computer presentation carriers corresponding to the target paragraph and the most similar paragraph, and removing the target paragraph and the most similar paragraph; if not, removing the target paragraph; 140, judging whether a remaining paragraph exists in the first PDF file or not, if yes, executing step 120; if not, outputting the computer presentation carriers marked with identical characters in the computer presentation carriers corresponding to the two PDF files. By the method and the system, comparison efficiency and comparison accuracy of PDF files are improved.
Description
Technical field
The present invention relates to field of computer technology, particularly a kind of PDF(Portable Document Format, portable file layout) file control methods and system.
Background technology
The tenderer often can occur in the construction project bidding and tendering process gang up the target behavior of enclosing.To enclose mark and be exactly a tenderer in order in the assessment of bids, winning, to use a plurality of tenderer's identity to produce many parts of biddings documents, get the bid to help own acquisition.Enclosing mark is a kind of opportunistic behavior, and it usually can cause middle marked price to exceed normal range, has had a strong impact on fairness and the seriousness of bid, has damaged bid inviter and other tenderers' interests.But, in the process of building field bid, enclose the mark behavior and compare and more be difficult to be discovered by the people with other lawbreaking activities.In the electronics bid evaluation system, bidding documents is generally PDF, so the bidding documents contrast can be carried out based on the pdf document contrast.
Existing PDF bidding documents contrast work, mainly by manually completing.And the method for artificial contrast only is only applicable to content bidding documents seldom, when the content of bidding documents is a lot of, can increase greatly staff's work load, and the reduced time is long, low to specific efficiency, contrast accuracy low.
Summary of the invention
The technical matters that (one) will solve
The technical problem to be solved in the present invention is: how to provide a kind of pdf document control methods and system, to improve specific efficiency.
(2) technical scheme
For solving the problems of the technologies described above, the invention provides a kind of pdf document control methods, comprising:
110: respectively the every one page in the first pdf document is converted to a computing machine and is expression vector, respectively the every one page in the second pdf document is converted to a computing machine and is expression vector;
120: one that extracts in described the first pdf document remains paragraph as the target paragraph;
130: judge in described the second pdf document whether have the residue paragraph in preset range, if exist, coupling obtains the most similar paragraph of described target paragraph in described preset range, at target paragraph and computing machine corresponding to the most similar described paragraph, be the expression vector subscript and know two words that paragraph is identical, remove described target paragraph and the most similar described paragraph; Otherwise, remove described target paragraph;
140: judge whether described the first pdf document exists the residue paragraph, if exist, carry out described step 120; Otherwise, export computing machine corresponding to described the first pdf document and be expression vector acceptance of the bid and know and have the computing machine of identical word to be expression vector, export computing machine corresponding to described the second pdf document and be expression vector acceptance of the bid knowledge and have the computing machine of identical word to be expression vector.
Wherein, described method also comprises: according to computing machine corresponding to described the first pdf document, be the quantity of same text identified in expression vector, export the identical degree value of described the first pdf document and described the second pdf document.
Wherein, the computing formula of described identical degree value L is as follows:
L=S/(A+B-S);
Wherein, S means that computing machine corresponding to described the first pdf document is the quantity of same text identified in expression vector, and A means the word quantity of described the first pdf document, and B means the word quantity of described the second pdf document.
Wherein, described preset range is [F
min, F
max], and F
minAnd F
maxComputing formula as follows:
F
min=P
m-Y;
F
max=P
m+Y;
Wherein, F
minMean the lower limit page number corresponding to preset range described in described the second pdf document, F
maxMean the upper limit page number corresponding to preset range described in described the second pdf document, P
mThe page number that means the page of target paragraph place described in described the first pdf document, Y are normal value.
Wherein, Y equals 3 or 5.
Wherein, coupling obtains the most similar paragraph of described target paragraph in described preset range, specifically comprises:
By described target paragraph successively with described preset range in each paragraph be complementary, obtain each paragraph in described preset range and the quantity of described target paragraph same text;
In described preset range to the most similar paragraph of the maximum paragraph of the quantity of described target paragraph same text as described target paragraph.
Wherein, before described step 140, also comprise:
Judge in the page object at target paragraph place described in described the first pdf document whether have the residue paragraph, if exist, carry out described step 140, otherwise, export computing machine corresponding to described page object and be expression vector.
The present invention also provides a kind of pdf document comparison system, comprising:
Converting unit, be expression vector for respectively every one page of the first pdf document being converted to a computing machine, respectively the every one page in the second pdf document is converted to a computing machine and is expression vector;
Extraction unit, for extracting one of described the first pdf document residue paragraph as the target paragraph;
The first judging unit, for judging in described the second pdf document preset range whether have the residue paragraph, if exist, coupling obtains the most similar paragraph of described target paragraph in described preset range, at target paragraph and computing machine corresponding to the most similar described paragraph, be the expression vector subscript and know two words that paragraph is identical, remove described target paragraph and the most similar described paragraph; Otherwise, remove described target paragraph;
Whether the second judging unit, exist the residue paragraph be used to judging described the first pdf document, if exist, one that notifies described extraction unit to extract in described the first pdf document remains paragraph as the target paragraph; Otherwise, export computing machine corresponding to described the first pdf document and be expression vector acceptance of the bid and know and have the computing machine of identical word to be expression vector, export computing machine corresponding to described the second pdf document and be expression vector acceptance of the bid knowledge and have the computing machine of identical word to be expression vector.
Wherein, described system also comprises: identical degree unit, for according to computing machine corresponding to described the first pdf document, being the quantity of the identified same text of expression vector, export the identical degree value of described the first pdf document and described the second pdf document.
Wherein, described system also comprises: middle output unit, for the page object that judges target paragraph place described in described the first pdf document, whether there is the residue paragraph, and if there is no, export computing machine corresponding to described page object and be expression vector.
(3) beneficial effect
The described pdf document control methods of the embodiment of the present invention and system, the page of in advance two pdf documents (the first pdf document and the second pdf document) being take is expression vector as Conversion of measurement unit as computing machine, automatically the paragraph of take carries out the contrast of two pdf documents as unit, and identical word in two pdf documents is identified, then export comparing result, significantly improved pdf document to specific efficiency and the contrast accuracy.
The accompanying drawing explanation
Fig. 1 is the described pdf document control methods of the embodiment of the present invention 1 process flow diagram;
Fig. 2 is the described pdf document control methods of the embodiment of the present invention 2 process flow diagram;
Fig. 3 is the modular structure schematic diagram of the described pdf document comparison system of the embodiment of the present invention 3;
Fig. 4 is the modular structure schematic diagram of the described pdf document comparison system of the embodiment of the present invention 4.
Embodiment
Below in conjunction with drawings and Examples, the specific embodiment of the present invention is described in further detail.Following examples are used for the present invention is described, but are not used for limiting the scope of the invention.
Embodiment 1
Fig. 1 is the described pdf document control methods of the embodiment of the present invention 1 process flow diagram, and as shown in Figure 1, described method comprises:
110: respectively the every one page in the first pdf document is converted to a computing machine and is expression vector, respectively the every one page in the second pdf document is converted to a computing machine and is expression vector.
Concrete, described the first pdf document and described the second pdf document can be two parts of PDF biddings documents to be contrasted.After conversion, the corresponding computing machine of every one page in the first pdf document is expression vector, the corresponding computing machine of every one page in the second pdf document is expression vector, by being converted to computing machine, pdf document is expression vector, can facilitate follow-up identical word to be identified, and final comparing result is showed to the user.Wherein, the computing machine here is expression vector and refers to and can by computing machine, present the form of expression of comparing result, such as picture, document etc.Preferably, it is picture that described computing machine is expression vector, and form is unfixing, and the forms such as .bmp file .png file or .jpg file all can be realized.
120: one that extracts in described the first pdf document remains paragraph as the target paragraph.
Specifically, can be according to original sequencing in described the first pdf document successively from residue paragraph of each extraction described the first pdf document, the residue paragraph here removes the backward remaining paragraph of target phase after referring to and carrying out subsequent step.
130: judge in described the second pdf document whether have the residue paragraph in preset range, if exist, coupling obtains the most similar paragraph of described target paragraph in described preset range, at target paragraph and computing machine corresponding to the most similar described paragraph, be the expression vector subscript and know two words that paragraph is identical, remove described target paragraph and the most similar described paragraph; Otherwise, remove described target paragraph.
Concrete, described preset range can be expressed as [F
min, F
max], and F
minAnd F
maxComputing formula as follows:
F
min=P
m-Y;
F
max=P
m+Y;
Wherein, F
minMean the lower limit page number corresponding to preset range described in described the second pdf document, F
maxMean the upper limit page number corresponding to preset range described in described the second pdf document, P
mThe page number that means the page of target paragraph place described in described the first pdf document, Y are normal value, generally can be set to 3,5 etc.That is to say, described preset range is the page number P of described target paragraph place page
mIn the Y page scope of front and back.Suppose that Y is 3, as the page number P of described target paragraph place page
mBe 10, described preset range is 7 to 13 pages, as the page number P of described target paragraph place page
mBe 1 o'clock, because its front does not have the page number, described preset range is 1 to 4 page.
Wherein, coupling obtains the most similar paragraph of described target paragraph in described preset range, specifically comprises:
By described target paragraph successively with described preset range in each paragraph be complementary, obtain each paragraph in described preset range and the quantity of described target paragraph same text;
In described preset range to the most similar paragraph of the maximum paragraph of the quantity of described target paragraph same text as described target paragraph.
Wherein, at current paragraph and computing machine corresponding to the most similar described paragraph, be the expression vector subscript and know two words that paragraph is identical, can adopt highlighted mode to identify, also can adopt the modes such as frame, predetermined color to identify.
140: judge whether described the first pdf document exists the residue paragraph, if exist, carry out described step 120; Otherwise, export computing machine corresponding to described the first pdf document and be expression vector acceptance of the bid and know and have the computing machine of identical word to be expression vector, export computing machine corresponding to described the second pdf document and be expression vector acceptance of the bid knowledge and have the computing machine of identical word to be expression vector.
Concrete, when there is not the residue paragraph in described the first pdf document, be that in described the first pdf document, each paragraph has all completed contrast with the second pdf document, mean to have contrasted, at this moment can export comparing result by display screen, namely exporting computing machine corresponding to described the first pdf document is expression vector acceptance of the bid and knows and have the computing machine of identical word to be expression vector, exporting computing machine corresponding to described the second pdf document is expression vector acceptance of the bid and knows and have the computing machine of identical word to be expression vector, and can show control knob below display screen, for the user, selecting computing machine that same page is not corresponding to be expression vector checks.
The described method of the present embodiment, the page of in advance two pdf documents (the first pdf document and the second pdf document) being take is expression vector as Conversion of measurement unit as computing machine, automatically the paragraph of take carries out the contrast of two pdf documents as unit, and identical file in two pdf documents is identified, then export comparing result, significantly improved pdf document to specific efficiency and the contrast accuracy.
Embodiment 2
Fig. 2 is the process flow diagram of the described pdf document control methods of the embodiment of the present invention 2, and as shown in Figure 2, described method is substantially the same manner as Example 1, and its difference is:
Before described step 140, also comprise:
210: judge in the page object at target paragraph place described in described the first pdf document whether have the residue paragraph, if exist, carry out described step 140, otherwise, export computing machine corresponding to described page object and be expression vector.
By this step is set, can be in two pdf document comparison process, to the user, show that described the first pdf document has completed the content of the corresponding page of contrast in advance, thereby make the user before two pdf documents have contrasted, can roughly understand the identical degree of two pdf documents.
In addition, in the present embodiment, after described 140, also comprise step 220: the quantity of identified same text in being expression vector according to computing machine corresponding to described the first pdf document, export the identical degree value of described the first pdf document and described the second pdf document.
Wherein, the computing formula of described identical degree value L is as follows:
L=S/(A+B-S);
Wherein, S means that computing machine corresponding to described the first pdf document is the quantity of same text identified in expression vector, and A means the word quantity of described the first pdf document, and B means the word quantity of described the second pdf document.
By exporting the identical degree value of described the first pdf document and described the second pdf document, can make the identical degree of two pdf documents more clear and directly perceived.
Embodiment 3
Fig. 3 is the modular structure schematic diagram of the described pdf document comparison system of the embodiment of the present invention 3, and as shown in Figure 3, described system 300 comprises: converting unit 310, extraction unit 320, the first judging unit 330, the second judging unit 340.
Described converting unit 310, be expression vector for respectively every one page of the first pdf document being converted to a computing machine, respectively the every one page in the second pdf document is converted to a computing machine and is expression vector.
Described extraction unit 320, for extracting one of described the first pdf document residue paragraph as the target paragraph.
Described the first judging unit 330, for judging in described the second pdf document preset range whether have the residue paragraph, if exist, coupling obtains the most similar paragraph of described target paragraph in described preset range, at target paragraph and computing machine corresponding to the most similar described paragraph, be the expression vector subscript and know two words that paragraph is identical, remove described target paragraph and the most similar described paragraph; Otherwise, remove described target paragraph;
Whether described the second judging unit 340, exist the residue paragraph be used to judging described the first pdf document, if exist, one that notifies described extraction unit 320 to extract in described the first pdf document remains paragraph as the target paragraph; Otherwise, export computing machine corresponding to described the first pdf document and be expression vector acceptance of the bid and know and have the computing machine of identical word to be expression vector, export computing machine corresponding to described the second pdf document and be expression vector acceptance of the bid knowledge and have the computing machine of identical word to be expression vector.
Embodiment 4
Fig. 4 is the modular structure schematic diagram of the described pdf document comparison system of the embodiment of the present invention 4, as shown in Figure 4, the described system of the present embodiment is substantially the same manner as Example 3, and its difference is, the described system 300 of the present embodiment also comprises: middle output unit 410 and identical degree unit 420.
Whether output unit 410 in the middle of described, exist the residue paragraph for the page object that judges target paragraph place described in described the first pdf document, if there is no, exports computing machine corresponding to described page object and be expression vector.
Described identical degree unit 420, for according to computing machine corresponding to described the first pdf document, being the quantity of the identified same text of expression vector, export the identical degree value of described the first pdf document and described the second pdf document.
The described pdf document control methods of the embodiment of the present invention and system, the page of in advance two pdf documents (the first pdf document and the second pdf document) being take is expression vector as Conversion of measurement unit as computing machine, automatically the paragraph of take carries out the contrast of two pdf documents as unit, and identical word in two pdf documents is identified, then export comparing result, significantly improved pdf document to specific efficiency and the contrast accuracy.Simultaneously, described method and system, can be before having contrasted fully, the output comparing result, and after having contrasted fully, the identical degree value of two pdf documents of output, make the user can more clearly check sooner comparing result.
Above embodiment is only be used to illustrating the present invention; and be not limitation of the present invention; the those of ordinary skill in relevant technologies field; without departing from the spirit and scope of the present invention; can also make a variety of changes and modification; therefore all technical schemes that are equal to also belong to category of the present invention, and scope of patent protection of the present invention should be defined by the claims.
Claims (10)
1. portable file layout pdf document control methods, is characterized in that, comprising:
110: respectively the every one page in the first pdf document is converted to a computing machine and is expression vector, respectively the every one page in the second pdf document is converted to a computing machine and is expression vector;
120: one that extracts in described the first pdf document remains paragraph as the target paragraph;
130: judge in described the second pdf document whether have the residue paragraph in preset range, if exist, coupling obtains the most similar paragraph of described target paragraph in described preset range, at target paragraph and computing machine corresponding to the most similar described paragraph, be the expression vector subscript and know two words that paragraph is identical, remove described target paragraph and the most similar described paragraph; Otherwise, remove described target paragraph;
140: judge whether described the first pdf document exists the residue paragraph, if exist, carry out described step 120; Otherwise, export computing machine corresponding to described the first pdf document and be expression vector acceptance of the bid and know and have the computing machine of identical word to be expression vector, export computing machine corresponding to described the second pdf document and be expression vector acceptance of the bid knowledge and have the computing machine of identical word to be expression vector.
2. the method for claim 1, it is characterized in that, described method also comprises: according to computing machine corresponding to described the first pdf document, be the quantity of same text identified in expression vector, export the identical degree value of described the first pdf document and described the second pdf document.
3. method as claimed in claim 2, is characterized in that, the computing formula of described identical degree value L is as follows:
L=S/(A+B-S);
Wherein, S means that computing machine corresponding to described the first pdf document is the quantity of same text identified in expression vector, and A means the word quantity of described the first pdf document, and B means the word quantity of described the second pdf document.
4. the method for claim 1, is characterized in that, described preset range is [F
min, F
max], and F
minAnd F
maxComputing formula as follows:
F
min=P
m-Y;
F
max=P
m+Y;
Wherein, F
minMean the lower limit page number corresponding to preset range described in described the second pdf document, F
maxMean the upper limit page number corresponding to preset range described in described the second pdf document, P
mThe page number that means the page of target paragraph place described in described the first pdf document, Y are normal value.
5. method as claimed in claim 4, is characterized in that, Y equals 3 or 5.
6. the method for claim 1, is characterized in that, coupling obtains the most similar paragraph of described target paragraph in described preset range, specifically comprises:
By described target paragraph successively with described preset range in each paragraph be complementary, obtain each paragraph in described preset range and the quantity of described target paragraph same text;
In described preset range to the most similar paragraph of the maximum paragraph of the quantity of described target paragraph same text as described target paragraph.
7. the method for claim 1, is characterized in that, also comprises before described step 140:
Judge in the page object at target paragraph place described in described the first pdf document whether have the residue paragraph, if exist, carry out described step 140, otherwise, export computing machine corresponding to described page object and be expression vector.
8. a pdf document comparison system, is characterized in that, comprising:
Converting unit, be expression vector for respectively every one page of the first pdf document being converted to a computing machine, respectively the every one page in the second pdf document is converted to a computing machine and is expression vector;
Extraction unit, for extracting one of described the first pdf document residue paragraph as the target paragraph;
The first judging unit, for judging in described the second pdf document preset range whether have the residue paragraph, if exist, coupling obtains the most similar paragraph of described target paragraph in described preset range, at target paragraph and computing machine corresponding to the most similar described paragraph, be the expression vector subscript and know two words that paragraph is identical, remove described target paragraph and the most similar described paragraph; Otherwise, remove described target paragraph;
Whether the second judging unit, exist the residue paragraph be used to judging described the first pdf document, if exist, one that notifies described extraction unit to extract in described the first pdf document remains paragraph as the target paragraph; Otherwise, export computing machine corresponding to described the first pdf document and be expression vector acceptance of the bid and know and have the computing machine of identical word to be expression vector, export computing machine corresponding to described the second pdf document and be expression vector acceptance of the bid knowledge and have the computing machine of identical word to be expression vector.
9. system as claimed in claim 8, it is characterized in that, described system also comprises: identical degree unit, for according to computing machine corresponding to described the first pdf document, being the quantity of the identified same text of expression vector, export the identical degree value of described the first pdf document and described the second pdf document.
10. system as claimed in claim 8, it is characterized in that, described system also comprises: middle output unit, for judging whether the page object at target paragraph place described in described the first pdf document exists the residue paragraph, if there is no, computing machine corresponding to the described page object of output is expression vector.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2013103298395A CN103412904A (en) | 2013-07-31 | 2013-07-31 | PDF (portable document format) file comparison method and PDF file comparison system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2013103298395A CN103412904A (en) | 2013-07-31 | 2013-07-31 | PDF (portable document format) file comparison method and PDF file comparison system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN103412904A true CN103412904A (en) | 2013-11-27 |
Family
ID=49605916
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2013103298395A Pending CN103412904A (en) | 2013-07-31 | 2013-07-31 | PDF (portable document format) file comparison method and PDF file comparison system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103412904A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106648321A (en) * | 2016-12-20 | 2017-05-10 | 天脉聚源(北京)教育科技有限公司 | Control method and device of pages |
CN109934712A (en) * | 2019-01-30 | 2019-06-25 | 网联清算有限公司 | Account checking method, account checking apparatus and electronic equipment applied to distributed system |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101404037A (en) * | 2008-11-18 | 2009-04-08 | 西安交通大学 | Method for detecting and positioning electronic text contents plagiary |
CN101957809A (en) * | 2010-10-14 | 2011-01-26 | 传神联合(北京)信息技术有限公司 | Anti-plagiarism method |
CN103049467A (en) * | 2011-10-12 | 2013-04-17 | 杨纯青 | Chinese digital anti-plagiarism detection and comparison system and method |
-
2013
- 2013-07-31 CN CN2013103298395A patent/CN103412904A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101404037A (en) * | 2008-11-18 | 2009-04-08 | 西安交通大学 | Method for detecting and positioning electronic text contents plagiary |
CN101957809A (en) * | 2010-10-14 | 2011-01-26 | 传神联合(北京)信息技术有限公司 | Anti-plagiarism method |
CN103049467A (en) * | 2011-10-12 | 2013-04-17 | 杨纯青 | Chinese digital anti-plagiarism detection and comparison system and method |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106648321A (en) * | 2016-12-20 | 2017-05-10 | 天脉聚源(北京)教育科技有限公司 | Control method and device of pages |
CN109934712A (en) * | 2019-01-30 | 2019-06-25 | 网联清算有限公司 | Account checking method, account checking apparatus and electronic equipment applied to distributed system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108734110B (en) | Text paragraph identification and comparison method and system based on longest public subsequence | |
CN105095160B (en) | A kind of document conversion reading method and system | |
CN108415887A (en) | A kind of method that pdf document is converted to OFD files | |
WO2019041527A1 (en) | Method of extracting chart in document, electronic device and computer-readable storage medium | |
CN102289497A (en) | Document preview image generating system and method | |
CN101777060A (en) | Automatic evaluation method and system of webpage visual quality | |
CN102855243A (en) | Method and device for extracting document structure | |
TWI536798B (en) | Image filing method | |
CN103488999A (en) | Invoice data recording method | |
CN110765739A (en) | Method for extracting table data and chapter structure from PDF document | |
CN103902918B (en) | Method and device for rapidly extracting text from Word document | |
CN102841940B (en) | Document summary extracting method based on data reconstruction | |
CN102937994A (en) | Similar document query method based on stop words | |
CN102141998A (en) | Automatic evaluation method for webpage vision complexity | |
CN103186880B (en) | Generate the method and apparatus of thumbnail | |
CN102842143B (en) | Method for adding data in a working area | |
CN103412904A (en) | PDF (portable document format) file comparison method and PDF file comparison system | |
CN106354731A (en) | Document inspection method and device | |
CN110598623B (en) | Method and device for cutting and extracting picture, computer equipment and storage medium | |
US9817913B2 (en) | Method and apparatus for collecting, merging and presenting content | |
CN102200966A (en) | Method for extracting and processing layout information | |
JP2011123825A (en) | Character recognition method, character recognition device, and character recognition program | |
CN103412905A (en) | PDF (Portable document format) file comparison method and system | |
CN103186513B (en) | A kind of method of document format conversion and device | |
CN106406560A (en) | Method and system for outputting vector fonts of mechanical engineering characters in desktop operation system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20131127 |