CN107885870B - Business document formula extraction method and device - Google Patents
Business document formula extraction method and device
- Publication number
- CN107885870B (application CN201711189981.9A)
- Authority
- CN
- China
- Prior art keywords
- information
- sentence
- feature
- independent variable
- extraction model
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/36—Creation of semantic tools, e.g. ontology or thesauri
- G06F16/367—Ontology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/253—Grammatical analysis; Style critique
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/258—Heading extraction; Automatic titling; Numbering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Animal Behavior & Ethology (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711189981.9A CN107885870B (zh) | 2017-11-24 | 2017-11-24 | Business document formula extraction method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711189981.9A CN107885870B (zh) | 2017-11-24 | 2017-11-24 | Business document formula extraction method and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107885870A (zh) | 2018-04-06 |
CN107885870B (zh) | 2019-04-16 |
Family
ID=61774858
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711189981.9A Active CN107885870B (zh) | Business document formula extraction method and device | 2017-11-24 | 2017-11-24 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107885870B (zh) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
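CN108304383B (zh) * | 2018-01-29 | 2019-06-25 | Beijing Ultrapower Software Co., Ltd. | Method and device for extracting formula information from business documents |
CN109189385B (zh) * | 2018-08-14 | 2024-09-13 | Ping An Life Insurance Company of China, Ltd. | Algorithm configuration method, device, computer equipment and storage medium |
CN109543516A (zh) * | 2018-10-16 | 2019-03-29 | Shenzhen OneConnect Smart Technology Co., Ltd. | Signing intention judgment method, device, computer equipment and storage medium |
CN109598632A (zh) * | 2018-12-13 | 2019-04-09 | Taikang Insurance Group Co., Ltd. | Insurance business processing method, device, medium and electronic equipment |
US11144719B2 (en) | 2019-11-27 | 2021-10-12 | International Business Machines Corporation | System and method for argument retrieval |
CN111950037A (zh) * | 2020-08-25 | 2020-11-17 | Beijing Topsec Network Security Technology Co., Ltd. | Detection method, device, electronic equipment and storage medium |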
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
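CN101206752A (zh) * | 2007-12-25 | 2008-06-25 | Beijing Kewen Book Industry Information Technology Co., Ltd. | Related-product recommendation system for e-commerce websites and method thereof |
CN101634983A (zh) * | 2008-07-21 | 2010-01-27 | Huawei Technologies Co., Ltd. | Text classification method and device |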
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
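JP2006065404A (ja) * | 2004-08-24 | 2006-03-09 | Mitsubishi Electric Corp | Personal identification device |
CN101013421B (zh) * | 2007-02-02 | 2012-06-27 | Tsinghua University | Rule-based automatic analysis method for Chinese basic chunks |
CN104679768B (zh) * | 2013-11-29 | 2019-08-09 | Baidu Online Network Technology (Beijing) Co., Ltd. | Method and device for extracting keywords from documents |
CN105868177A (zh) * | 2016-03-24 | 2016-08-17 | Hebei Normal University | General formula search method |
CN106484663B (zh) * | 2016-10-12 | 2019-05-03 | Tianwen Digital Media Technology (Hunan) Co., Ltd. | Method and device for extracting document content |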
Also Published As
Publication number | Publication date |
---|---|
CN107885870A (zh) | 2018-04-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
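CN107885870B (zh) | | Business document formula extraction method and device |
CN109685056B (zh) | | Method and device for acquiring document information |
CN105630768B (zh) | | Product name recognition method and device based on cascaded conditional random fields |
CN110489560A (zh) | | Method and device for generating small and micro enterprise profiles based on knowledge graph technology |
CN106649394A (zh) | | Fused knowledge base processing method and device, and knowledge base management system |
CN107180025A (zh) | | New word recognition method and device |
CN107704453A (zh) | | Text semantic analysis method, text semantic analysis terminal and storage medium |
CN102693279A (zh) | | Method, device and system for rapidly calculating comment similarity |
CN103282903A (zh) | | Topic extraction device and program |
CN105912645A (zh) | | Intelligent question answering method and device |
US20180330202A1 (en) | | Identifying augmented features based on a bayesian analysis of a text document |
CN107102993A (zh) | | User appeal analysis method and device |
CN110119401A (zh) | | User profile processing method, device, server and storage medium |
CN107798622A (zh) | | Method and device for identifying user intent |
CN109190124A (zh) | | Method and device for word segmentation |
JP2013190985A (ja) | | Knowledge response system, method and computer program |
CN110032736A (zh) | | Text analysis method, device and storage medium |
CN107704869B (zh) | | Corpus data sampling method and model training method |
CN106126497A (zh) | | Method for automatically mining corresponding citing fragments and original content fragments of cited documents |
CN106126496B (zh) | | Information word segmentation method and device |
CN105302859B (zh) | | Internet-based intelligent interaction system |
CN103823868A (zh) | | Event recognition method and event relation extraction method for online encyclopedias |
CN111144116A (zh) | | Method and device for structured extraction of document knowledge |
CN108073678A (zh) | | Document parsing and processing method, system and device applied to big data analysis |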
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| PB01 | Publication | |
| SE01 | Entry into force of request for substantive examination | |
| GR01 | Patent grant | |
| EE01 | Entry into force of recordation of patent licensing contract | Application publication date: 20180406; Assignee: Zhongke Dingfu (Beijing) Science and Technology Development Co., Ltd.; Assignor: Beijing Shenzhou Taiyue Software Co., Ltd.; Contract record no.: X2019990000215; Denomination of invention: Business document formula extraction method and device; Granted publication date: 20190416; License type: Exclusive License; Record date: 20191127 |
| TR01 | Transfer of patent right | Effective date of registration: 20200629; Address after: 230000 Zone B, 19th Floor, Building A1, 3333 Xiyou Road, Hi-tech Zone, Hefei City, Anhui Province; Patentee after: Dingfu Intelligent Technology Co., Ltd; Address before: 100089 Room 601, Block A, Wanliu New Building, No. 28 Wanquanzhuang Road, Haidian District, Beijing; Patentee before: BEIJING ULTRAPOWER SOFTWARE Co.,Ltd. |