CN112232276B - An emotion detection method and device based on speech recognition and image recognition - Google Patents
- Publication number
- CN112232276B (application number CN202011213188.XA)
- Authority
- CN
- China
- Prior art keywords
- expression
- emotion
- image
- recognition
- scene
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
ID | Concept | Type | Score | Sections | Count |
---|---|---|---|---|---|
230000008451 | emotion | Effects | 0.000 | title, claims, abstract, description | 172 |
238000001514 | detection method | Methods | 0.000 | title, claims, abstract, description | 96 |
238000012545 | processing | Methods | 0.000 | claims, abstract, description | 27 |
238000000034 | method | Methods | 0.000 | claims, abstract, description | 15 |
230000014509 | gene expression | Effects | 0.000 | claims, description | 228 |
238000012937 | correction | Methods | 0.000 | claims, description | 48 |
239000013598 | vector | Substances | 0.000 | claims, description | 38 |
230000006870 | function | Effects | 0.000 | claims, description | 18 |
238000002372 | labelling | Methods | 0.000 | claims, description | 15 |
238000011161 | development | Methods | 0.000 | claims, description | 12 |
238000004590 | computer program | Methods | 0.000 | claims, description | 6 |
238000000605 | extraction | Methods | 0.000 | claims, description | 6 |
238000013507 | mapping | Methods | 0.000 | claims, description | 6 |
230000008921 | facial expression | Effects | 0.000 | description | 3 |
230000005236 | sound signal | Effects | 0.000 | description | 2 |
206010011469 | Crying | Diseases | 0.000 | description | 1 |
208000013875 | Heart injury | Diseases | 0.000 | description | 1 |
230000009286 | beneficial effect | Effects | 0.000 | description | 1 |
230000007547 | defect | Effects | 0.000 | description | 1 |
230000001815 | facial effect | Effects | 0.000 | description | 1 |
230000010365 | information processing | Effects | 0.000 | description | 1 |
238000012986 | modification | Methods | 0.000 | description | 1 |
230000004048 | modification | Effects | 0.000 | description | 1 |
238000012549 | training | Methods | 0.000 | description | 1 |
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/174—Facial expression recognition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/63—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for estimating an emotional state
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Acoustics & Sound (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Oral & Maxillofacial Surgery (AREA)
- Child & Adolescent Psychology (AREA)
- Hospice & Palliative Care (AREA)
- Psychiatry (AREA)
- Signal Processing (AREA)
- Image Analysis (AREA)
- Image Processing (AREA)
Abstract
Description
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011213188.XA CN112232276B (zh) | 2020-11-04 | 2020-11-04 | An emotion detection method and device based on speech recognition and image recognition |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011213188.XA CN112232276B (zh) | 2020-11-04 | 2020-11-04 | An emotion detection method and device based on speech recognition and image recognition |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112232276A (zh) | 2021-01-15 |
CN112232276B (zh) | 2023-10-13 |
Family
ID=74121979
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011213188.XA Active CN112232276B (zh) | An emotion detection method and device based on speech recognition and image recognition | 2020-11-04 | 2020-11-04 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112232276B (zh) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
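CN112992148A (zh) * | 2021-03-03 | 2021-06-18 | 中国工商银行股份有限公司 | Method and device for speech recognition in video |
CN112990301A (zh) * | 2021-03-10 | 2021-06-18 | 深圳市声扬科技有限公司 | Emotion data labeling method and apparatus, computer device, and storage medium |
CN114065742B (zh) * | 2021-11-19 | 2023-08-25 | 马上消费金融股份有限公司 | A text detection method and device |
CN118428343B (zh) * | 2024-07-03 | 2024-09-27 | 广州讯鸿网络技术有限公司 | An omni-media interactive intelligent customer service interaction method and system |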
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
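WO2020125386A1 (zh) * | 2018-12-18 | 2020-06-25 | 深圳壹账通智能科技有限公司 | Expression recognition method and apparatus, computer device, and storage medium |
WO2020135194A1 (zh) * | 2018-12-26 | 2020-07-02 | 深圳Tcl新技术有限公司 | Voice interaction method based on emotion engine technology, intelligent terminal, and storage medium |
CN111681681A (zh) * | 2020-05-22 | 2020-09-18 | 深圳壹账通智能科技有限公司 | Speech emotion recognition method and apparatus, electronic device, and storage medium |
CN111694959A (zh) * | 2020-06-08 | 2020-09-22 | 谢沛然 | Multimodal emotion recognition method and system for online public opinion based on facial expressions and text information |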
- 2020-11-04: CN application CN202011213188.XA, patent CN112232276B (zh), status: Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
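WO2020125386A1 (zh) * | 2018-12-18 | 2020-06-25 | 深圳壹账通智能科技有限公司 | Expression recognition method and apparatus, computer device, and storage medium |
WO2020135194A1 (zh) * | 2018-12-26 | 2020-07-02 | 深圳Tcl新技术有限公司 | Voice interaction method based on emotion engine technology, intelligent terminal, and storage medium |
CN111368609A (zh) * | 2018-12-26 | 2020-07-03 | 深圳Tcl新技术有限公司 | Voice interaction method based on emotion engine technology, intelligent terminal, and storage medium |
CN111681681A (zh) * | 2020-05-22 | 2020-09-18 | 深圳壹账通智能科技有限公司 | Speech emotion recognition method and apparatus, electronic device, and storage medium |
CN111694959A (zh) * | 2020-06-08 | 2020-09-22 | 谢沛然 | Multimodal emotion recognition method and system for online public opinion based on facial expressions and text information |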
Non-Patent Citations (3)
Title |
---|
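Deep Learning-Based Emotion Recognition from Real-Time Videos; Wenbin Zhou et al.; HCII 2020: Human-Computer Interaction. Multimodal and Natural Interaction; full text * |
Research progress on affective computing technology based on semantic analysis; 饶元; 吴连伟; 王一鸣; 冯聪; Journal of Software (软件学报), No. 08; full text * |
Multimodal emotion recognition in multicultural scenarios; 陈师哲; 王帅; 金琴; Journal of Software (软件学报), No. 04; full text * |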
Also Published As
Publication number | Publication date |
---|---|
CN112232276A (zh) | 2021-01-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
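CN112232276B (zh) | | An emotion detection method and device based on speech recognition and image recognition |
WO2024000867A1 (zh) | | Emotion recognition method, apparatus, device, and storage medium |
CN107818798B (zh) | | Customer service quality evaluation method, apparatus, device, and storage medium |
US10438586B2 (en) | | Voice dialog device and voice dialog method |
CN110428820B (zh) | | A mixed Chinese-English speech recognition method and device |
CN111261162B (zh) | | Speech recognition method, speech recognition apparatus, and storage medium |
CN108305618B (zh) | | Voice acquisition and search method, smart pen, search terminal, and storage medium |
CN112614510B (zh) | | An audio quality assessment method and device |
CN110418204B (zh) | | Video recommendation method, apparatus, device, and storage medium based on micro-expressions |
CN112614489A (zh) | | User pronunciation accuracy evaluation method, apparatus, and electronic device |
CN109872714A (zh) | | A method for improving speech recognition accuracy, electronic device, and storage medium |
CN110956958A (zh) | | Search method, apparatus, terminal device, and storage medium |
CN114495217A (zh) | | Scene analysis method, apparatus, and system based on natural language and expression analysis |
CN110781329A (zh) | | Image search method, apparatus, terminal device, and storage medium |
CN109408175B (zh) | | Real-time interaction method and system in a general-purpose high-performance deep learning computing engine |
CN110910898B (zh) | | A method and device for voice information processing |
CN110827799A (zh) | | Method, apparatus, device, and medium for processing speech signals |
CN111126084A (zh) | | Data processing method, apparatus, electronic device, and storage medium |
CN112597889A (zh) | | An emotion processing method and device based on artificial intelligence |
WO2024093578A1 (zh) | | Speech recognition method, apparatus, electronic device, storage medium, and computer program product |
CN112434953A (zh) | | A customer service staff assessment method and device based on computer data processing |
CN112584238A (zh) | | Film and television resource matching method, apparatus, and smart TV |
CN114267324A (zh) | | Speech generation method, apparatus, device, and storage medium |
CN116959418A (zh) | | An audio processing method and device |
CN114297409A (zh) | | Model training method, information extraction method and apparatus, electronic device, and medium |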
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
TA01 | Transfer of patent application right | Effective date of registration: 20230526; Address after: No. 16-44, No. 10A-10C, 12A, 12B, 13A, 13B, 15-18, Phase II of Wuyue Plaza Project, east of Zhengyang Street and south of Haoyue Road, Lvyuan District, Changchun City, Jilin Province, 130000; Applicant after: Jilin Huayuan Network Technology Co.,Ltd.; Address before: 450000 Wenhua Road, Jinshui District, Zhengzhou City, Henan Province; Applicant before: Zhao Zhen |
TA01 | Transfer of patent application right | Effective date of registration: 20230913; Address after: Room 1001, 1st floor, building B, 555 Dongchuan Road, Minhang District, Shanghai; Applicant after: Shanghai Enterprise Information Technology Co.,Ltd.; Address before: No. 16-44, No. 10A-10C, 12A, 12B, 13A, 13B, 15-18, Phase II of Wuyue Plaza Project, east of Zhengyang Street and south of Haoyue Road, Lvyuan District, Changchun City, Jilin Province, 130000; Applicant before: Jilin Huayuan Network Technology Co.,Ltd. |
GR01 | Patent grant | |
PE01 | Entry into force of the registration of the contract for pledge of patent right | Denomination of invention: An emotion detection method and device based on speech recognition and image recognition; Granted publication date: 20231013; Pledgee: Agricultural Bank of China Limited Shanghai Huangpu Sub branch; Pledgor: Shanghai Enterprise Information Technology Co.,Ltd.; Registration number: Y2024310000041 |