CN101916255B - Html内容对比装置及方法 - Google Patents
Html内容对比装置及方法 Download PDFInfo
- Publication number
- CN101916255B CN101916255B CN2010102240001A CN201010224000A CN101916255B CN 101916255 B CN101916255 B CN 101916255B CN 2010102240001 A CN2010102240001 A CN 2010102240001A CN 201010224000 A CN201010224000 A CN 201010224000A CN 101916255 B CN101916255 B CN 101916255B
- Authority
- CN
- China
- Prior art keywords
- text
- difference
- label
- html
- contrast
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2010102240001A CN101916255B (zh) | 2010-07-02 | 2010-07-02 | Html内容对比装置及方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2010102240001A CN101916255B (zh) | 2010-07-02 | 2010-07-02 | Html内容对比装置及方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101916255A CN101916255A (zh) | 2010-12-15 |
CN101916255B true CN101916255B (zh) | 2012-02-15 |
Family
ID=43323767
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2010102240001A Active CN101916255B (zh) | 2010-07-02 | 2010-07-02 | Html内容对比装置及方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101916255B (zh) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102737012B (zh) * | 2011-04-06 | 2015-09-30 | 赛恩倍吉科技顾问(深圳)有限公司 | 文本信息对比方法及系统 |
CN102420851B (zh) * | 2011-11-10 | 2015-05-06 | 百度在线网络技术(北京)有限公司 | Http服务监控方法及系统 |
CN102929999A (zh) * | 2012-10-25 | 2013-02-13 | 北京数码大方科技股份有限公司 | 对比数据异同的方法及装置 |
CN103825632B (zh) * | 2012-11-16 | 2016-08-03 | 纬创资通股份有限公司 | 应用近场通信的信息快速同步方法 |
CN104424194B (zh) * | 2013-08-20 | 2017-10-03 | 广州汽车集团股份有限公司 | CANdb网络文件异同的比较方法及其系统 |
CN103500169B (zh) * | 2013-09-02 | 2017-02-08 | 用友网络科技股份有限公司 | 文件对比装置和文件对比方法 |
CN105589813B (zh) * | 2015-07-02 | 2018-12-25 | 中国银联股份有限公司 | 一种电子文档版本变化跟踪方法 |
CN106933782A (zh) * | 2015-12-30 | 2017-07-07 | 阿里巴巴集团控股有限公司 | 一种文本资源文件的比对方法及装置 |
CN108090165B (zh) * | 2017-12-13 | 2021-12-28 | 美林数据技术股份有限公司 | 一种基于嵌入式图数据库的图谱变化差异的获取方法 |
CN108021952A (zh) * | 2017-12-29 | 2018-05-11 | 广州品唯软件有限公司 | 一种多格式文本对比方法及装置 |
CN108614725B (zh) * | 2018-05-11 | 2020-09-01 | 维沃移动通信有限公司 | 一种界面显示方法及终端 |
CN111061975B (zh) * | 2019-12-13 | 2021-09-07 | 腾讯科技(深圳)有限公司 | 一种页面中无关内容的处理方法、装置 |
CN112507660A (zh) * | 2020-12-07 | 2021-03-16 | 厦门美亚亿安信息科技有限公司 | 一种用于复合文档的同源判定、差异化显示方法和系统 |
CN115357286B (zh) * | 2022-08-03 | 2023-11-10 | 中信建投证券股份有限公司 | 一种程序文件对比方法、装置、电子设备及存储介质 |
CN115544969B (zh) * | 2022-11-29 | 2023-03-21 | 明度智云(浙江)科技有限公司 | 基于超文本标记语言的页面对比方法、设备及介质 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101046858A (zh) * | 2006-03-29 | 2007-10-03 | 腾讯科技(深圳)有限公司 | 电子信息比较系统和方法以及反垃圾邮件系统 |
JP4046000B2 (ja) * | 2003-04-16 | 2008-02-13 | 日本電信電話株式会社 | 構造化文書の抽出方法及び装置及びプログラム |
US7373586B2 (en) * | 2004-09-03 | 2008-05-13 | International Business Machines Corporation | Differencing and merging tree-structured documents |
-
2010
- 2010-07-02 CN CN2010102240001A patent/CN101916255B/zh active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4046000B2 (ja) * | 2003-04-16 | 2008-02-13 | 日本電信電話株式会社 | 構造化文書の抽出方法及び装置及びプログラム |
US7373586B2 (en) * | 2004-09-03 | 2008-05-13 | International Business Machines Corporation | Differencing and merging tree-structured documents |
CN101046858A (zh) * | 2006-03-29 | 2007-10-03 | 腾讯科技(深圳)有限公司 | 电子信息比较系统和方法以及反垃圾邮件系统 |
Also Published As
Publication number | Publication date |
---|---|
CN101916255A (zh) | 2010-12-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101916255B (zh) | Html内容对比装置及方法 | |
US8196036B2 (en) | Method and system for converting hypertext markup language web page to plain text | |
US9218322B2 (en) | Producing web page content | |
KR20170123331A (ko) | 정보 추출 방법 및 장치 | |
US20170357913A1 (en) | Automated customized web portal template generation systems and methods | |
CN101025738B (zh) | 一种免模板动态网站生成方法 | |
US11003442B2 (en) | Application programming interface documentation annotation | |
US20150067476A1 (en) | Title and body extraction from web page | |
US8910039B2 (en) | File format conversion by automatically converting to an intermediate form for manual editing in a multi-column graphical user interface | |
US20090164888A1 (en) | Automated Content-Based Adjustment of Formatting and Application Behavior | |
CN109543126B (zh) | 基于块文字占比的网页正文信息提取方法 | |
US20170060986A1 (en) | Systems and methods for detection of content of a predefined content category in a network document | |
CN102253979A (zh) | 基于视觉的web页面萃取方法 | |
CN105677654A (zh) | 广告过滤方法及装置 | |
CN102819561A (zh) | 一种基于网页的图片显示方法和装置 | |
Insa Cabrera et al. | Using the words/leafs ratio in the DOM tree for content extraction | |
US11334644B2 (en) | Methods and systems for three-way merges of object representations | |
CN109033282A (zh) | 一种基于抽取模板的网页正文抽取方法及装置 | |
CN104731815B (zh) | 一种网页元素的绘制方法及装置 | |
CN105740355B (zh) | 基于聚集文本密度的网页正文提取方法及装置 | |
EP2599013A1 (en) | Visual separator detection in web pages by using code analysis | |
CN102207974A (zh) | 一种上下文web页面合并方法 | |
CN108959204B (zh) | 互联网金融项目信息抽取方法和系统 | |
CN105117434A (zh) | 一种网页分类方法和系统 | |
US20120221545A1 (en) | Isolating desired content, metadata, or both from social media |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C56 | Change in the name or address of the patentee | ||
CP01 | Change in the name or title of a patent holder |
Address after: 100191, Beijing, Haidian District, Xueyuan Road Datang Telecom research, four floor, 2 floor Patentee after: BEIJING HUDONG BAIKE NETWORK TECHNOLOGY CO.,LTD. Address before: 100191, Beijing, Haidian District, Xueyuan Road Datang Telecom research, four floor, 2 floor Patentee before: Hudong Online (Beijing) Technology Co.,Ltd. |
|
CP01 | Change in the name or title of a patent holder |
Address after: 100191, Beijing, Haidian District, Xueyuan Road Datang Telecom research, four floor, 2 floor Patentee after: Beijing Interactive Encyclopedia Network Technology Co.,Ltd. Address before: 100191, Beijing, Haidian District, Xueyuan Road Datang Telecom research, four floor, 2 floor Patentee before: BEIJING HUDONG BAIKE NETWORK TECHNOLOGY CO.,LTD. |
|
CP01 | Change in the name or title of a patent holder | ||
TR01 | Transfer of patent right |
Effective date of registration: 20191008 Address after: 100041, room 2, building 3, building 30, Xing Xing street, Shijingshan District, Beijing, Patentee after: BEIJING BYTEDANCE NETWORK TECHNOLOGY Co.,Ltd. Address before: 100191, Beijing, Haidian District, Xueyuan Road Datang Telecom research, four floor, 2 floor Patentee before: Beijing Interactive Encyclopedia Network Technology Co.,Ltd. |
|
TR01 | Transfer of patent right |