CN109710833B - Method and apparatus for determining content nodes - Google Patents
Method and apparatus for determining content nodes
- Publication number
- CN109710833B
- Authority
- CN
- China
- Prior art keywords
- node
- content
- suspected
- nodes
- determining
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811645127.3A CN109710833B (zh) | 2018-12-29 | 2018-12-29 | Method and apparatus for determining content nodes |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811645127.3A CN109710833B (zh) | 2018-12-29 | 2018-12-29 | Method and apparatus for determining content nodes |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109710833A (zh) | 2019-05-03 |
CN109710833B (zh) | 2021-07-16 |
Family
ID=66259725
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811645127.3A Active CN109710833B (zh) | Method and apparatus for determining content nodes | 2018-12-29 | 2018-12-29 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109710833B (zh) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116029284B (zh) * | 2023-03-27 | 2023-07-21 | 上海蜜度信息技术有限公司 | Chinese substring extraction method, system, storage medium, and electronic device |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
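CN101872350A (zh) * | 2009-04-24 | 2010-10-27 | 富士通株式会社 | Method and apparatus for extracting web page body text |
KR20110060428A (ko) * | 2009-11-30 | 2011-06-08 | 동국대학교 산학협력단 | Prefix-tree-based indexing method and apparatus, and recording medium therefor |
CN102270206A (zh) * | 2010-06-03 | 2011-12-07 | 北京迅捷英翔网络科技有限公司 | Method and apparatus for crawling valid web page content |
CN102750390A (zh) * | 2012-07-05 | 2012-10-24 | 翁时锋 | Method for automatically extracting news web page elements |
CN107590219A (zh) * | 2017-09-04 | 2018-01-16 | 电子科技大学 | Method for extracting person-topic-related information from web pages |
CN108268433A (zh) * | 2018-02-26 | 2018-07-10 | 杭州数梦工场科技有限公司 | Method and apparatus for extracting titles from web page articles |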
Also Published As
Publication number | Publication date |
---|---|
CN109710833A (zh) | 2019-05-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
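US8549138B2 (en) | | Web test generation |
US10769143B1 (en) | | Composite index on hierarchical nodes in the hierarchical data model within case model |
CN112579623B (zh) | | Method, apparatus, storage medium, and device for storing data |
CN110020236B (zh) | | Web page parsing method, apparatus, storage medium, processor, and device |
CN106503003A (zh) | | Method and apparatus for compressing and decompressing Extensible Markup Language (XML) documents |
CN107015986B (zh) | | Method and apparatus for crawling web pages with a crawler |
CN111368227A (zh) | | URL processing method and apparatus |
US10782942B1 (en) | | Rapid onboarding of data from diverse data sources into standardized objects with parser and unit test generation |
CN105824647A (zh) | | Form page generation method and apparatus |
CN110008393B (zh) | | Method and device for obtaining website information |
CN109710833B (zh) | | Method and apparatus for determining content nodes |
CN110019497B (zh) | | Data reading method and apparatus |
CN114297204A (zh) | | Data storage and retrieval method and apparatus for heterogeneous data sources |
CN117289905B (zh) | | Application software development method and apparatus, storage medium, and electronic device |
CN111125087B (zh) | | Data storage method and apparatus |
US20180046712A1 (en) | | Artificial intelligence content detection system |
CN115437930B (zh) | | Method for identifying web application fingerprint information and related device |
CN110019357B (zh) | | Database query script generation method and apparatus |
CN110019295B (zh) | | Database retrieval method, apparatus, system, and storage medium |
US10509659B1 (en) | | Input processing logic to produce outputs for downstream systems using configurations |
CN113722278B (zh) | | PDF-file-based knowledge element extraction method, device, and medium |
CN116610700A (zh) | | Query statement detection method and apparatus, and storage medium |
CN110929188A (zh) | | Server-side page rendering method and apparatus |
CN116415156A (zh) | | Document similarity calculation method, device, and medium |
CN115796146A (zh) | | File comparison method and apparatus |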
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | PB01 | Publication | |
 | SE01 | Entry into force of request for substantive examination | |
 | GR01 | Patent grant | |
 | PE01 | Entry into force of the registration of the contract for pledge of patent right | Denomination of invention: Method and apparatus for determining content nodes; Effective date of registration: 20220824; Granted publication date: 20210716; Pledgee: China Minsheng Banking Corp Shanghai branch; Pledgor: SHANGHAI MDATA INFORMATION TECHNOLOGY Co.,Ltd.; Registration number: Y2022310000198 |
 | PC01 | Cancellation of the registration of the contract for pledge of patent right | Date of cancellation: 20230901; Granted publication date: 20210716; Pledgee: China Minsheng Banking Corp Shanghai branch; Pledgor: SHANGHAI MDATA INFORMATION TECHNOLOGY Co.,Ltd.; Registration number: Y2022310000198 |
 | CP03 | Change of name, title or address | Address after: Room 301ab, No.10, Lane 198, Zhangheng Road, China (Shanghai) Pilot Free Trade Zone, Pudong New Area, Shanghai 201204; Patentee after: Shanghai Mido Technology Co.,Ltd.; Address before: 201800 room j71, 8 / F, 1112 Hanggui Road, Anting Town, Jiading District, Shanghai; Patentee before: SHANGHAI MDATA INFORMATION TECHNOLOGY Co.,Ltd. |