CN107491531B - 基于集成学习框架的中文网络评论情感分类方法 - Google Patents
基于集成学习框架的中文网络评论情感分类方法 Download PDFInfo
- Publication number
- CN107491531B CN107491531B CN201710713966.3A CN201710713966A CN107491531B CN 107491531 B CN107491531 B CN 107491531B CN 201710713966 A CN201710713966 A CN 201710713966A CN 107491531 B CN107491531 B CN 107491531B
- Authority
- CN
- China
- Prior art keywords
- attribute
- comment
- feature
- word
- sequence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 53
- 230000008451 emotion Effects 0.000 title claims abstract description 16
- 238000004422 calculation algorithm Methods 0.000 claims abstract description 40
- 239000013598 vector Substances 0.000 claims abstract description 38
- 238000011156 evaluation Methods 0.000 claims abstract description 27
- 239000011159 matrix material Substances 0.000 claims abstract description 23
- 238000012360 testing method Methods 0.000 claims description 41
- 238000005065 mining Methods 0.000 claims description 35
- 238000012549 training Methods 0.000 claims description 32
- 230000002996 emotional effect Effects 0.000 claims description 20
- 238000004364 calculation method Methods 0.000 claims description 14
- 230000011218 segmentation Effects 0.000 claims description 11
- 238000000605 extraction Methods 0.000 claims description 8
- 238000007781 pre-processing Methods 0.000 claims description 4
- 238000012795 verification Methods 0.000 claims 4
- 238000002372 labelling Methods 0.000 claims 2
- 238000001724 coherent Stokes Raman spectroscopy Methods 0.000 claims 1
- 230000004069 differentiation Effects 0.000 claims 1
- 239000000463 material Substances 0.000 claims 1
- 238000012552 review Methods 0.000 abstract description 44
- 238000010276 construction Methods 0.000 abstract 1
- 239000000284 extract Substances 0.000 description 10
- 238000002474 experimental method Methods 0.000 description 7
- 238000010200 validation analysis Methods 0.000 description 6
- 238000013145 classification model Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 238000007635 classification algorithm Methods 0.000 description 3
- 230000000052 comparative effect Effects 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 238000013507 mapping Methods 0.000 description 2
- 238000003058 natural language processing Methods 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 241000288113 Gallirallus australis Species 0.000 description 1
- 238000003066 decision tree Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000007636 ensemble learning method Methods 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000012706 support-vector machine Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Description
Claims (7)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710713966.3A CN107491531B (zh) | 2017-08-18 | 2017-08-18 | 基于集成学习框架的中文网络评论情感分类方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710713966.3A CN107491531B (zh) | 2017-08-18 | 2017-08-18 | 基于集成学习框架的中文网络评论情感分类方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107491531A CN107491531A (zh) | 2017-12-19 |
CN107491531B true CN107491531B (zh) | 2019-05-17 |
Family
ID=60645311
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710713966.3A Active CN107491531B (zh) | 2017-08-18 | 2017-08-18 | 基于集成学习框架的中文网络评论情感分类方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107491531B (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US12079579B2 (en) | 2018-09-19 | 2024-09-03 | Huawei Technologies Co., Ltd. | Intention identification model learning method, apparatus, and device |
Families Citing this family (38)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108417210B (zh) * | 2018-01-10 | 2020-06-26 | 苏州思必驰信息科技有限公司 | 一种词嵌入语言模型训练方法、词语识别方法及系统 |
CN110362809B (zh) * | 2018-03-26 | 2022-06-14 | 阿里巴巴(中国)有限公司 | 文本分析方法及装置 |
CN110362808B (zh) * | 2018-03-26 | 2022-06-14 | 阿里巴巴(中国)有限公司 | 文本分析方法及装置 |
CN110555104B (zh) * | 2018-03-26 | 2022-06-17 | 阿里巴巴(中国)有限公司 | 文本分析方法及装置 |
CN110362810B (zh) * | 2018-03-26 | 2022-06-14 | 阿里巴巴(中国)有限公司 | 文本分析方法及装置 |
CN108804512B (zh) * | 2018-04-20 | 2020-11-24 | 平安科技(深圳)有限公司 | 文本分类模型的生成装置、方法及计算机可读存储介质 |
CN108804524B (zh) * | 2018-04-27 | 2020-03-27 | 成都信息工程大学 | 基于层次化分类体系的情感判别和重要性划分方法 |
CN108647205B (zh) * | 2018-05-02 | 2022-02-15 | 深圳前海微众银行股份有限公司 | 细粒度情感分析模型构建方法、设备及可读存储介质 |
CN108717407B (zh) * | 2018-05-11 | 2022-08-09 | 北京三快在线科技有限公司 | 实体向量确定方法及装置,信息检索方法及装置 |
CN108804416B (zh) * | 2018-05-18 | 2022-08-09 | 大连民族大学 | 基于机器学习的影评情感倾向性分析的训练方法 |
CN108710802A (zh) * | 2018-06-08 | 2018-10-26 | 南京大学 | 一种特征优选的Android勒索软件检测方法 |
CN109145187A (zh) * | 2018-07-23 | 2019-01-04 | 浙江大学 | 基于评论数据的跨平台电商欺诈检测方法和系统 |
CN109213831A (zh) * | 2018-08-14 | 2019-01-15 | 阿里巴巴集团控股有限公司 | 事件检测方法和装置、计算设备及存储介质 |
CN109190121A (zh) * | 2018-09-03 | 2019-01-11 | 重庆工商大学 | 基于汽车本体和词性规则的汽车评论情感分析方法 |
CN111241271B (zh) * | 2018-11-13 | 2023-04-25 | 网智天元科技集团股份有限公司 | 文本情感分类方法、装置及电子设备 |
CN109670039B (zh) * | 2018-11-20 | 2020-10-30 | 华南师范大学 | 基于三部图和聚类分析的半监督电商评论情感分析方法 |
CN110008332B (zh) * | 2019-02-13 | 2020-11-10 | 创新先进技术有限公司 | 通过强化学习提取主干词的方法及装置 |
CN109933648B (zh) * | 2019-02-28 | 2022-07-05 | 北京学之途网络科技有限公司 | 一种真实用户评论的区分方法和区分装置 |
CN110377915B (zh) * | 2019-07-25 | 2022-11-29 | 腾讯科技(深圳)有限公司 | 文本的情感分析方法、装置、存储介质及设备 |
CN110390018A (zh) * | 2019-07-25 | 2019-10-29 | 哈尔滨工业大学 | 一种基于lstm的社交网络评论生成方法 |
CN112685558B (zh) * | 2019-10-18 | 2024-05-17 | 普天信息技术有限公司 | 一种情感分类模型的训练方法及装置 |
CN110838343B (zh) * | 2019-11-15 | 2022-03-01 | 山东中医药大学 | 一种基于多模态指纹图谱的中药药性识别方法及系统 |
CN111126046B (zh) * | 2019-12-06 | 2023-07-14 | 腾讯云计算(北京)有限责任公司 | 语句特征的处理方法和装置、存储介质 |
CN111177392A (zh) * | 2019-12-31 | 2020-05-19 | 腾讯云计算(北京)有限责任公司 | 一种数据处理方法及装置 |
CN111143569B (zh) * | 2019-12-31 | 2023-05-02 | 腾讯科技(深圳)有限公司 | 一种数据处理方法、装置及计算机可读存储介质 |
CN111159409B (zh) * | 2019-12-31 | 2023-06-02 | 腾讯科技(深圳)有限公司 | 基于人工智能的文本分类方法、装置、设备、介质 |
CN111400496B (zh) * | 2020-03-18 | 2023-05-09 | 江苏海洋大学 | 一种面向用户行为分析的大众口碑情感分析方法 |
CN113449100A (zh) * | 2020-03-26 | 2021-09-28 | 北京国双科技有限公司 | 文本的评论性质识别方法、机器学习模型训练方法及装置 |
CN111565322B (zh) * | 2020-05-14 | 2022-03-04 | 北京奇艺世纪科技有限公司 | 一种用户情感倾向信息获得方法、装置及电子设备 |
CN111695359B (zh) * | 2020-06-12 | 2023-10-03 | 腾讯科技(深圳)有限公司 | 生成词向量的方法、装置、计算机存储介质和电子设备 |
CN111931481A (zh) * | 2020-07-03 | 2020-11-13 | 北京新联财通咨询有限公司 | 文本情感识别方法、装置、存储介质及计算机设备 |
CN112417157B (zh) * | 2020-12-15 | 2022-04-26 | 华南师范大学 | 一种基于深度学习网络的文本属性词的情感分类方法 |
CN113158670A (zh) * | 2021-01-21 | 2021-07-23 | 北京明略昭辉科技有限公司 | 一种基于实体情感识别的电商评论意见抽取方法 |
CN112905736B (zh) * | 2021-01-27 | 2023-09-19 | 郑州轻工业大学 | 一种基于量子理论的无监督文本情感分析方法 |
CN112686056B (zh) * | 2021-03-22 | 2021-07-06 | 华南师范大学 | 一种情感分类方法 |
CN112988975A (zh) * | 2021-04-09 | 2021-06-18 | 北京语言大学 | 一种基于albert和知识蒸馏的观点挖掘方法 |
CN113704393A (zh) * | 2021-04-13 | 2021-11-26 | 腾讯科技(深圳)有限公司 | 关键词提取方法、装置、设备及介质 |
CN113792148B (zh) * | 2021-11-15 | 2022-02-11 | 成都晓多科技有限公司 | 一种基于序列到序列的评论方面类别检测方法及系统 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102789498B (zh) * | 2012-07-16 | 2014-08-06 | 钱钢 | 基于集成学习的中文评论文本的情感分类方法与系统 |
CN105354183A (zh) * | 2015-10-19 | 2016-02-24 | Tcl集团股份有限公司 | 一种家电产品互联网评论的分析方法、装置及系统 |
CN105824898A (zh) * | 2016-03-14 | 2016-08-03 | 苏州大学 | 一种网络评论的标签提取方法和装置 |
-
2017
- 2017-08-18 CN CN201710713966.3A patent/CN107491531B/zh active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102789498B (zh) * | 2012-07-16 | 2014-08-06 | 钱钢 | 基于集成学习的中文评论文本的情感分类方法与系统 |
CN105354183A (zh) * | 2015-10-19 | 2016-02-24 | Tcl集团股份有限公司 | 一种家电产品互联网评论的分析方法、装置及系统 |
CN105824898A (zh) * | 2016-03-14 | 2016-08-03 | 苏州大学 | 一种网络评论的标签提取方法和装置 |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US12079579B2 (en) | 2018-09-19 | 2024-09-03 | Huawei Technologies Co., Ltd. | Intention identification model learning method, apparatus, and device |
Also Published As
Publication number | Publication date |
---|---|
CN107491531A (zh) | 2017-12-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107491531B (zh) | 基于集成学习框架的中文网络评论情感分类方法 | |
US8676730B2 (en) | Sentiment classifiers based on feature extraction | |
CN107357837B (zh) | 基于保序子矩阵和频繁序列挖掘的电商评论情感分类方法 | |
Chang et al. | Research on detection methods based on Doc2vec abnormal comments | |
CN108491377A (zh) | 一种基于多维度信息融合的电商产品综合评分方法 | |
CN107590134A (zh) | 文本情感分类方法、存储介质及计算机 | |
CN108038725A (zh) | 一种基于机器学习的电商产品客户满意度分析方法 | |
CN108763214B (zh) | 一种针对商品评论的情感词典自动构建方法 | |
Itani et al. | Classifying sentiment in arabic social networks: Naive search versus naive bayes | |
Suleiman et al. | Comparative study of word embeddings models and their usage in Arabic language applications | |
Mozafari et al. | Emotion detection by using similarity techniques | |
Tyagi et al. | Sentiment analysis of product reviews using support vector machine learning algorithm | |
Haque et al. | Opinion mining from bangla and phonetic bangla reviews using vectorization methods | |
Azim et al. | Text to emotion extraction using supervised machine learning techniques | |
CN112069312A (zh) | 一种基于实体识别的文本分类方法及电子装置 | |
Suchdev et al. | Twitter sentiment analysis using machine learning and knowledge-based approach | |
KR20160149050A (ko) | 텍스트 마이닝을 활용한 순수 기업 선정 장치 및 방법 | |
CN109189919B (zh) | 文本多视角情感分类的方法、系统、终端及存储介质 | |
Angelpreethi et al. | An enhanced architecture for feature based opinion mining from product reviews | |
CN110851593A (zh) | 一种基于位置与语义的复值词向量构建方法 | |
Sani et al. | Sentiment analysis of Hausa language tweet using machine learning approach | |
CN105912720A (zh) | 一种计算机中涉及情感的文本数据分析方法 | |
Suhasini et al. | A Hybrid TF-IDF and N-grams based feature extraction approach for accurate detection of fake news on twitter data | |
Rubtsova et al. | Aspect extraction from reviews using conditional random fields | |
Prakash et al. | Lexicon Based Sentiment Analysis (LBSA) to Improve the Accuracy of Acronyms, Emoticons, and Contextual Words |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20210706 Address after: 210012 4th floor, building C, Wanbo Science Park, 20 Fengxin Road, Yuhuatai District, Nanjing City, Jiangsu Province Patentee after: NANJING SILICON INTELLIGENCE TECHNOLOGY Co.,Ltd. Address before: Room 614-615, No.1, Lane 2277, Zuchongzhi Road, China (Shanghai) pilot Free Trade Zone, Pudong New Area, Shanghai, 200120 Patentee before: Shanghai Airlines Intellectual Property Services Ltd. Effective date of registration: 20210706 Address after: Room 614-615, No.1, Lane 2277, Zuchongzhi Road, China (Shanghai) pilot Free Trade Zone, Pudong New Area, Shanghai, 200120 Patentee after: Shanghai Airlines Intellectual Property Services Ltd. Address before: 510275 science and Technology Department of South China Normal University, Shipai, Tianhe District, Guangzhou City, Guangdong Province Patentee before: SOUTH CHINA NORMAL University |
|
TR01 | Transfer of patent right | ||
CP03 | Change of name, title or address |
Address after: 5th Floor, Building C, Wanbo Science and Technology Park, No. 20 Fengxin Road, Yuhuatai District, Nanjing City, Jiangsu Province, China 210012 Patentee after: Nanjing Silicon based Intelligent Technology Group Co.,Ltd. Country or region after: China Address before: 210012 4th floor, building C, Wanbo Science Park, 20 Fengxin Road, Yuhuatai District, Nanjing City, Jiangsu Province Patentee before: NANJING SILICON INTELLIGENCE TECHNOLOGY Co.,Ltd. Country or region before: China |
|
CP03 | Change of name, title or address |