JP6769299B2 - オーディオ符号化装置およびオーディオ符号化方法 - Google Patents
オーディオ符号化装置およびオーディオ符号化方法 Download PDFInfo
- Publication number
- JP6769299B2 JP6769299B2 JP2016254286A JP2016254286A JP6769299B2 JP 6769299 B2 JP6769299 B2 JP 6769299B2 JP 2016254286 A JP2016254286 A JP 2016254286A JP 2016254286 A JP2016254286 A JP 2016254286A JP 6769299 B2 JP6769299 B2 JP 6769299B2
- Authority
- JP
- Japan
- Prior art keywords
- envelope
- information
- peak
- frequency
- tone
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims description 25
- 238000001514 detection method Methods 0.000 claims description 45
- 238000001228 spectrum Methods 0.000 claims description 44
- 238000000605 extraction Methods 0.000 claims description 11
- 239000000284 extract Substances 0.000 claims description 8
- 230000001629 suppression Effects 0.000 claims description 7
- 230000000873 masking effect Effects 0.000 claims description 4
- 238000010586 diagram Methods 0.000 description 24
- 238000005516 engineering process Methods 0.000 description 9
- 230000005236 sound signal Effects 0.000 description 8
- 238000007493 shaping process Methods 0.000 description 4
- 239000000203 mixture Substances 0.000 description 3
- 230000006870 function Effects 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 230000006866 deterioration Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
- G10L19/0208—Subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/24—Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/26—Pre-filtering or post-filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Description
(数1)
(数3)
3:包絡情報抽出部
4:トーン情報検出部
5:包絡情報補正部
7:包絡ピーク検出部
8:補正判定部
9:ピーク抑圧部
50:CPU
52:記憶装置
53:オーディオ符号化プログラム
56:入力装置
58:出力装置
60:DSP
62:インタフェース装置
Claims (6)
- 入力信号から低域の周波数成分を有する低域信号を抽出するフィルタと、
前記入力信号のうち前記低域信号よりも周波数の高い高域信号の包絡線に関する包絡情報を抽出する包絡情報抽出部と、
前記入力信号から高域信号スペクトルに含まれるトーン信号の情報であるトーン情報を検出するトーン情報検出部と、
前記トーン信号の周波数と前記包絡線のピークの周波数との差分に基づき前記包絡情報を補正する包絡情報補正部と、
前記低域信号、前記トーン情報、および補正された前記包絡情報を符号化する符号化部と
を有するオーディオ符号化装置。 - 前記包絡情報補正部は、
前記包絡情報に含まれるピークである包絡ピークを検出する包絡ピーク検出部と、
前記包絡ピークと前記トーン情報に基づき、前記包絡情報を補正するか否かを判定する補正判定部と、
前記補正判定部の判定結果に基づき、前記包絡情報に含まれるピークを抑圧するピーク抑圧部と
を有する、請求項1に記載のオーディオ符号化装置。 - 前記補正判定部は、前記包絡ピークのピーク値、および前記包絡ピークのピーク値における周波数と前記トーン情報のピーク値における周波数との差分値が所定値以上の場合に補正要と判定する、請求項2に記載のオーディオ符号化装置。
- 前記高域信号スペクトルを複数のサブバンドに分割して符号化処理する場合に、隣接する2つの前記サブバンドを前記包絡ピーク検出部における検出範囲として前記包絡ピークを検出する、請求項2に記載のオーディオ符号化装置。
- 前記補正判定部が補正要と判定した場合に、マスキング閾値に基づいて前記包絡ピークのピーク値または前記トーン情報のピーク値を補正する、請求項3に記載のオーディオ符号化装置。
- 入力信号を符号化処理するオーディオ符号化方法であって、コンピュータに、
前記入力信号から低域の周波数成分を有する低域信号を抽出し、
前記入力信号のうち前記低域信号よりも周波数の高い高域信号の包絡線に関する包絡情報を抽出し、
前記入力信号から高域信号スペクトルに含まれるトーン信号の情報であるトーン情報を検出し、
前記トーン信号の周波数と前記包絡線のピークの周波数との差分に基づき前記包絡情報を補正し、
前記低域信号および補正された前記包絡情報を符号化する
処理を実行させる、オーディオ符号化方法。
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2016254286A JP6769299B2 (ja) | 2016-12-27 | 2016-12-27 | オーディオ符号化装置およびオーディオ符号化方法 |
US15/809,623 US10224048B2 (en) | 2016-12-27 | 2017-11-10 | Audio coding device and audio coding method |
EP17201820.2A EP3343560B1 (en) | 2016-12-27 | 2017-11-15 | Audio coding device and audio coding method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2016254286A JP6769299B2 (ja) | 2016-12-27 | 2016-12-27 | オーディオ符号化装置およびオーディオ符号化方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
JP2018106076A JP2018106076A (ja) | 2018-07-05 |
JP6769299B2 true JP6769299B2 (ja) | 2020-10-14 |
Family
ID=60327202
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2016254286A Active JP6769299B2 (ja) | 2016-12-27 | 2016-12-27 | オーディオ符号化装置およびオーディオ符号化方法 |
Country Status (3)
Country | Link |
---|---|
US (1) | US10224048B2 (ja) |
EP (1) | EP3343560B1 (ja) |
JP (1) | JP6769299B2 (ja) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10896684B2 (en) | 2017-07-28 | 2021-01-19 | Fujitsu Limited | Audio encoding apparatus and audio encoding method |
CN111210832B (zh) * | 2018-11-22 | 2024-06-04 | 广州广晟数码技术有限公司 | 基于频谱包络模板的带宽扩展音频编解码方法及装置 |
CN109473116B (zh) * | 2018-12-12 | 2021-07-20 | 思必驰科技股份有限公司 | 语音编码方法、语音解码方法及装置 |
CN113192523B (zh) * | 2020-01-13 | 2024-07-16 | 华为技术有限公司 | 一种音频编解码方法和音频编解码设备 |
CN113192517B (zh) * | 2020-01-13 | 2024-04-26 | 华为技术有限公司 | 一种音频编解码方法和音频编解码设备 |
CN113593586A (zh) * | 2020-04-15 | 2021-11-02 | 华为技术有限公司 | 音频信号编码方法、解码方法、编码设备以及解码设备 |
CN113539281B (zh) * | 2020-04-21 | 2024-09-06 | 华为技术有限公司 | 音频信号编码方法和装置 |
CN113808596A (zh) * | 2020-05-30 | 2021-12-17 | 华为技术有限公司 | 一种音频编码方法和音频编码装置 |
CN113808597B (zh) * | 2020-05-30 | 2024-10-29 | 华为技术有限公司 | 一种音频编码方法和音频编码装置 |
CN113259115B (zh) * | 2021-05-06 | 2022-03-25 | 上海大学 | 一种基于钙钛矿晶体制备密码原语的方法 |
Family Cites Families (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2002352182A1 (en) * | 2001-11-29 | 2003-06-10 | Coding Technologies Ab | Methods for improving high frequency reconstruction |
US7069212B2 (en) * | 2002-09-19 | 2006-06-27 | Matsushita Elecric Industrial Co., Ltd. | Audio decoding apparatus and method for band expansion with aliasing adjustment |
WO2005104094A1 (ja) * | 2004-04-23 | 2005-11-03 | Matsushita Electric Industrial Co., Ltd. | 符号化装置 |
JP2008096567A (ja) * | 2006-10-10 | 2008-04-24 | Matsushita Electric Ind Co Ltd | オーディオ符号化装置およびオーディオ符号化方法ならびにプログラム |
JP5071479B2 (ja) * | 2007-07-04 | 2012-11-14 | 富士通株式会社 | 符号化装置、符号化方法および符号化プログラム |
US8041577B2 (en) * | 2007-08-13 | 2011-10-18 | Mitsubishi Electric Research Laboratories, Inc. | Method for expanding audio signal bandwidth |
US20090201983A1 (en) * | 2008-02-07 | 2009-08-13 | Motorola, Inc. | Method and apparatus for estimating high-band energy in a bandwidth extension system |
US8560330B2 (en) * | 2010-07-19 | 2013-10-15 | Futurewei Technologies, Inc. | Energy envelope perceptual correction for high band coding |
JP5743137B2 (ja) * | 2011-01-14 | 2015-07-01 | ソニー株式会社 | 信号処理装置および方法、並びにプログラム |
US20130006644A1 (en) * | 2011-06-30 | 2013-01-03 | Zte Corporation | Method and device for spectral band replication, and method and system for audio decoding |
WO2014115225A1 (ja) * | 2013-01-22 | 2014-07-31 | パナソニック株式会社 | 帯域幅拡張パラメータ生成装置、符号化装置、復号装置、帯域幅拡張パラメータ生成方法、符号化方法、および、復号方法 |
EP3731226A1 (en) * | 2013-06-11 | 2020-10-28 | FRAUNHOFER-GESELLSCHAFT zur Förderung der angewandten Forschung e.V. | Device and method for bandwidth extension for acoustic signals |
EP2830061A1 (en) * | 2013-07-22 | 2015-01-28 | Fraunhofer Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for encoding and decoding an encoded audio signal using temporal noise/patch shaping |
-
2016
- 2016-12-27 JP JP2016254286A patent/JP6769299B2/ja active Active
-
2017
- 2017-11-10 US US15/809,623 patent/US10224048B2/en active Active
- 2017-11-15 EP EP17201820.2A patent/EP3343560B1/en active Active
Also Published As
Publication number | Publication date |
---|---|
US10224048B2 (en) | 2019-03-05 |
EP3343560B1 (en) | 2019-08-14 |
EP3343560A1 (en) | 2018-07-04 |
US20180182403A1 (en) | 2018-06-28 |
JP2018106076A (ja) | 2018-07-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6769299B2 (ja) | オーディオ符号化装置およびオーディオ符号化方法 | |
EP1334484B1 (en) | Enhancing the performance of coding systems that use high frequency reconstruction methods | |
EP1840874B1 (en) | Audio encoding device, audio encoding method, and audio encoding program | |
JP6542717B2 (ja) | 高度なスペクトラム拡張を使用して量子化ノイズを低減するための圧縮伸張装置および方法 | |
WO2010024371A1 (ja) | 周波数帯域拡大装置及び方法、符号化装置及び方法、復号化装置及び方法、並びにプログラム | |
RU2733533C1 (ru) | Устройство и способы для обработки аудиосигнала | |
KR101375582B1 (ko) | 대역폭 확장 부호화 및 복호화 방법 및 장치 | |
KR20070045993A (ko) | 오디오 처리 | |
JP2006048043A (ja) | オーディオデータの高周波数の復元方法及びその装置 | |
JPWO2004010415A1 (ja) | オーディオ復号装置と復号方法およびプログラム | |
WO2016002551A1 (ja) | 信号処理装置及び信号処理方法 | |
JP2011059714A (ja) | 信号符号化装置及び方法、信号復号装置及び方法、並びにプログラム及び記録媒体 | |
EP3179476B1 (en) | Coding device and method, and program | |
JP5365380B2 (ja) | 音響信号処理装置、その処理方法およびプログラム | |
JP5817499B2 (ja) | 復号装置、符号化装置、符号化復号システム、復号方法、符号化方法、復号プログラム、及び符号化プログラム | |
JP4313993B2 (ja) | オーディオ復号化装置およびオーディオ復号化方法 | |
CN105324815A (zh) | 信号处理装置和信号处理方法 | |
US20130085762A1 (en) | Audio encoding device | |
US10896684B2 (en) | Audio encoding apparatus and audio encoding method | |
KR20080084043A (ko) | 노이즈를 포함하는 오디오 신호를 저비트율로부호화/복호화하는 방법 및 이를 위한 장치 | |
CN112771610A (zh) | 用压扩对密集瞬态事件进行译码 | |
JP5569476B2 (ja) | 信号符号化装置及び方法、信号復号装置及び方法、並びにプログラム及び記録媒体 | |
KR20100062063A (ko) | 오디오 신호 디코딩 방법, 이를 적용한 오디오 디코더, 기록매체 및 av 기기 | |
JP2008250347A (ja) | 信号処理方法、信号処理装置及びプログラム |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
RD01 | Notification of change of attorney |
Free format text: JAPANESE INTERMEDIATE CODE: A7421 Effective date: 20180528 |
|
A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20190910 |
|
A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20200423 |
|
A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20200609 |
|
A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20200805 |
|
TRDD | Decision of grant or rejection written | ||
A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20200825 |
|
A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20200907 |
|
R150 | Certificate of patent or registration of utility model |
Ref document number: 6769299 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 |