CN110322421A - A multimedia-based information processing method - Google Patents
- Publication number
- CN110322421A (application number CN201910647346.3A)
- Authority
- CN
- China
- Prior art keywords
- image
- signal
- information processing
- carries out
- multimedia
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/20—Image enhancement or restoration using local operators
- G06T5/30—Erosion or dilatation, e.g. thinning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/11—Region-based segmentation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/44—Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/85—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Mathematical Physics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Image Analysis (AREA)
Abstract
The invention discloses a multimedia-based information processing method comprising the following steps: S1, multimedia information sampling; S2, image enhancement; S3, image processing; S4, feature representation and description of the identified region; S5, image segmentation; S6, information coding; S7, information compression; S8, audio information processing; S9, video information processing. The sampled multimedia information is processed in sequence by image enhancement, image processing, region feature description, and image segmentation, so that the multimedia information is integrated and presented in a more systematic and intuitive way. Standardizing the processing allows audio, video, and images to be handled together and helps keep the generated graphical information vivid. Image feature recognition uses Fourier descriptors to identify vibration signals: contour features are transformed from the spatial domain into the frequency domain, which digitizes the contour so that different contours can be better distinguished and objects can be identified.
Description
Technical field
The present invention relates to the technical field of information processing, and specifically to a multimedia-based information processing method.
Background technique
Like all modern technologies, multimedia has two aspects: it is a combination of hardware and software, or of machines and ideas. Multimedia technologies and functions can be conceptually divided into control systems and information. Multimedia is made possible by digital technology. It represents the convergence of digital control and digital media: the computer is the digital control system, and digital media are currently the most advanced form of storing and distributing audio and video. Multimedia information refers to media information expressed as text, still images, moving images, sound, animation, and the like; the term is generally understood to mean information obtained through storage and retrieval technology, especially digital information in computers.
In multimedia signal processing, when existing information processing methods handle information such as images, the graphical information they present is not vivid enough and contours are not clearly distinguished, so object recognition performs poorly. To this end, a multimedia-based information processing method is proposed.
Summary of the invention
The purpose of the present invention is to provide a multimedia-based information processing method that integrates multimedia information and presents it in a more systematic and intuitive way. Standardizing the processing allows audio, video, and images to be handled together and helps keep the generated graphical information vivid. Image feature recognition uses Fourier descriptors to identify vibration signals: contour features are transformed from the spatial domain into the frequency domain, and the frequency-domain information is extracted as the feature vector of the image. One vector thus represents one contour, which digitizes the contour so that different contours can be better distinguished and objects can be identified, thereby solving the problems raised in the background section above.
To achieve the above purpose, the present invention provides the following technical solution: a multimedia-based information processing method comprising the following steps:
S1: multimedia information sampling: multimedia information data are acquired through an external device, under the influence of quantization noise and the receiver noise factor; sampling outputs discontinuous narrow pulses, and to output a digitized signal the instantaneous analog value obtained at the sampling output is held for a period of time;
S2: image enhancement: unclear images are made clearer and features of interest are emphasized; the differences between the features of different objects in the image are enlarged and features of no interest are suppressed, which improves image quality, enriches the information content, strengthens image interpretation and recognition, and meets the needs of analysis;
S3: image processing: the image is strengthened by morphological processing, including image dilation, hole filling, and region segmentation;
S4: feature representation and description of the identified region: the image skeleton is extracted, the Fourier descriptors of the image are extracted, and vibration signals are identified by means of the Fourier descriptors;
S5: image segmentation: the image to be segmented is segmented with emphasis on its key regions so that it can be accurately identified, analyzed, and understood, achieving extraction of the target from the image;
S6: information coding: the extracted target image and the audio signal are encoded and output;
S7: information compression, with the following steps:
a: compression coding of audio signals: audio signals are divided into telephone-quality speech, AM-broadcast-quality audio, and high-fidelity stereo signals; when the signal produced by the source contains redundancy, it is compressed; the input signal is analyzed and synthesized by an encoder into a binary coded signal, and the signal is output by a decoder;
b: compression coding of video signals: the data redundancy caused by the strong spatial and temporal correlation of images is eliminated to compress the signal to meet application requirements; the input signal is analyzed and synthesized by an encoder into a binary coded signal, and the signal is output by a decoder;
S8: audio information processing: ready-made material is used directly or after modification, or the material is created by the user;
S9: video information processing: video information is captured and then edited before it is used.
Preferably, the sampling rate in step S1 is given by:
f_s = 2.5 f_max (1).
Preferably, the image enhancement in step S2 includes frequency-domain methods and spatial-domain methods.
Preferably, the quantization in step S1 converts the continuous-amplitude sampled signal into a digital signal that is discrete in both time and amplitude; the main problem of quantization is quantization error.
Preferably, in step S3, image dilation is based on reflecting the structuring element about its own origin and then shifting that reflection across the image; hole filling uses the imfill function in Matlab to fill holes in binary images, that is, to fill image regions and cavities; region segmentation divides the data to be analyzed into regions, extracts the data segments of interest for further processing, and discards the other data, its main purpose being to reduce the amount of data in subsequent processing.
Preferably, the complex function z(t) of the Fourier descriptor in step S4 is given by:
(2)
where t is the time variable and the series of coefficients in formula (2) is called the Fourier descriptor of curve C;
when the arc length s along the curve is used as the parameter in place of time, with L denoting the length of the curve, the Fourier descriptor is expressed as: (3).
Preferably, the compression coding of audio signals in sub-step a of step S7 is divided into waveform coding, analysis-synthesis coding, and hybrid coding; the frequency range of the audio signal is 300 Hz to 3400 Hz.
Preferably, the video image compression methods in sub-step b of step S7 include lossy compression and lossless compression.
Compared with the prior art, the beneficial effects of the present invention are as follows. The method processes the sampled multimedia information in sequence by image enhancement, image processing, region feature description, and image segmentation, which ensures the completeness of the information processing. The multimedia information is integrated and presented in a more systematic and intuitive way. Standardizing the processing allows audio, video, and images to be handled together and helps keep the generated graphical information vivid. Image feature recognition uses Fourier descriptors to identify vibration signals: contour features are transformed from the spatial domain into the frequency domain, and the frequency-domain information is extracted as the feature vector of the image. One vector thus represents one contour, which digitizes the contour so that different contours can be better distinguished and objects can be identified.
Specific embodiment
To make the objectives, technical solutions, and advantages of the present invention clearer, the present invention is described in further detail below with reference to specific embodiments. It should be understood that the specific embodiments described here merely illustrate the present invention and are not intended to limit it.
A multimedia-based information processing method comprises the following steps:
S1: multimedia information sampling: multimedia information data are acquired through an external device, under the influence of quantization noise and the receiver noise factor; sampling outputs discontinuous narrow pulses, and to output a digitized signal the instantaneous analog value obtained at the sampling output is held for a period of time;
S2: image enhancement: unclear images are made clearer and features of interest are emphasized; the differences between the features of different objects in the image are enlarged and features of no interest are suppressed, which improves image quality, enriches the information content, strengthens image interpretation and recognition, and meets the needs of analysis;
S3: image processing: the image is strengthened by morphological processing, including image dilation, hole filling, and region segmentation;
S4: feature representation and description of the identified region: the image skeleton is extracted, the Fourier descriptors of the image are extracted, and vibration signals are identified by means of the Fourier descriptors;
S5: image segmentation: the image to be segmented is segmented with emphasis on its key regions so that it can be accurately identified, analyzed, and understood, achieving extraction of the target from the image;
S6: information coding: the extracted target image and the audio signal are encoded and output;
S7: information compression, with the following steps:
a: compression coding of audio signals: audio signals are divided into telephone-quality speech, AM-broadcast-quality audio, and high-fidelity stereo signals; when the signal produced by the source contains redundancy, it is compressed; the input signal is analyzed and synthesized by an encoder into a binary coded signal, and the signal is output by a decoder;
b: compression coding of video signals: the data redundancy caused by the strong spatial and temporal correlation of images is eliminated to compress the signal to meet application requirements; the input signal is analyzed and synthesized by an encoder into a binary coded signal, and the signal is output by a decoder;
S8: audio information processing: ready-made material is used directly or after modification, or the material is created by the user;
S9: video information processing: video information is captured and then edited before it is used.
Specifically, the sampling rate in step S1 is given by:
f_s = 2.5 f_max (1).
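As an illustration of formula (1), the following minimal sketch computes the sampling rate for a given highest signal frequency. It assumes that f_max denotes the highest frequency component of the sampled signal, which the text does not state explicitly; the function names are hypothetical. A factor of 2.5 leaves a margin above the Nyquist minimum of 2 f_max.

```python
def sampling_rate(f_max_hz: float, factor: float = 2.5) -> float:
    """Return the sampling rate f_s = factor * f_max in hertz (formula (1))."""
    if f_max_hz <= 0:
        raise ValueError("f_max_hz must be positive")
    return factor * f_max_hz


def nyquist_margin(f_s_hz: float, f_max_hz: float) -> float:
    """Return how many times f_s exceeds the Nyquist minimum 2 * f_max."""
    return f_s_hz / (2.0 * f_max_hz)


# Assumed example: the upper edge of the 300 Hz to 3400 Hz audio band.
f_s = sampling_rate(3400.0)
print(f_s)                          # 8500.0
print(nyquist_margin(f_s, 3400.0))  # 1.25
```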
Specifically, the image enhancement in step S2 includes frequency-domain methods and spatial-domain methods.
Specifically, the quantization in step S1 converts the continuous-amplitude sampled signal into a digital signal that is discrete in both time and amplitude; the main problem of quantization is quantization error.
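The quantization error mentioned above can be made concrete with a uniform quantizer. The sketch below is an assumption for illustration only (the text does not specify a quantizer type, bit depth, or amplitude range): it maps a sample in [-1, 1) to one of 2^bits levels and reconstructs the mid-point of the chosen cell, so the error for in-range samples is bounded by half a quantization step.

```python
def quantize(sample: float, bits: int, full_scale: float = 1.0) -> tuple[int, float]:
    """Uniform quantizer over [-full_scale, full_scale).

    Returns the integer code and the reconstructed amplitude
    (the mid-point of the quantization cell).
    """
    levels = 2 ** bits
    step = 2.0 * full_scale / levels
    # Saturate: samples at or above the top cell map to the top code.
    clamped = max(-full_scale, min(sample, full_scale - step))
    code = int((clamped + full_scale) // step)
    reconstructed = -full_scale + (code + 0.5) * step
    return code, reconstructed


# With 3 bits the step is 0.25, so the error never exceeds 0.125.
step = 2.0 / 2 ** 3
for x in (-1.0, -0.4, 0.0, 0.3, 0.99):
    code, x_hat = quantize(x, bits=3)
    print(x, code, x_hat, abs(x - x_hat) <= step / 2 + 1e-12)
```

Increasing the number of bits halves the step, and therefore the error bound, for every added bit.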
Specifically, in step S3, image dilation is based on reflecting the structuring element about its own origin and then shifting that reflection across the image; hole filling uses the imfill function in Matlab to fill holes in binary images, that is, to fill image regions and cavities; region segmentation divides the data to be analyzed into regions, extracts the data segments of interest for further processing, and discards the other data, its main purpose being to reduce the amount of data in subsequent processing.
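The three operations of step S3 can be sketched on a small binary image. The code below is a pure-Python illustration under stated assumptions, not the Matlab routine named above: `fill_holes` imitates the effect of filling holes in a binary image by flood-filling the background from the border (4-connectivity is an assumption), and `crop_to_foreground` stands in for region segmentation by keeping only the bounding box of the foreground. All function names are hypothetical.

```python
from collections import deque

Image = list[list[int]]  # binary image: 1 = foreground, 0 = background


def dilate(image: Image, offsets: list[tuple[int, int]]) -> Image:
    """Binary dilation: copy every foreground pixel to each offset of the
    structuring element (equivalent to the reflect-and-shift definition)."""
    rows, cols = len(image), len(image[0])
    out = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            if image[r][c]:
                for dr, dc in offsets:
                    rr, cc = r + dr, c + dc
                    if 0 <= rr < rows and 0 <= cc < cols:
                        out[rr][cc] = 1
    return out


def fill_holes(image: Image) -> Image:
    """Set to 1 every background pixel that cannot be reached from the border."""
    rows, cols = len(image), len(image[0])
    reached = [[False] * cols for _ in range(rows)]
    queue = deque()
    for r in range(rows):
        for c in range(cols):
            on_border = r in (0, rows - 1) or c in (0, cols - 1)
            if on_border and image[r][c] == 0:
                reached[r][c] = True
                queue.append((r, c))
    while queue:
        r, c = queue.popleft()
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            rr, cc = r + dr, c + dc
            if (0 <= rr < rows and 0 <= cc < cols
                    and not reached[rr][cc] and image[rr][cc] == 0):
                reached[rr][cc] = True
                queue.append((rr, cc))
    return [[1 if image[r][c] or not reached[r][c] else 0 for c in range(cols)]
            for r in range(rows)]


def crop_to_foreground(image: Image) -> Image:
    """Keep only the bounding box of the foreground and discard the rest."""
    coords = [(r, c) for r, row in enumerate(image) for c, v in enumerate(row) if v]
    if not coords:
        return []
    r0, r1 = min(r for r, _ in coords), max(r for r, _ in coords)
    c0, c1 = min(c for _, c in coords), max(c for _, c in coords)
    return [row[c0:c1 + 1] for row in image[r0:r1 + 1]]


CROSS = [(0, 0), (1, 0), (-1, 0), (0, 1), (0, -1)]
ring = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0, 0],
    [0, 1, 0, 1, 0, 0],
    [0, 1, 1, 1, 0, 0],
    [0, 0, 0, 0, 0, 0],
]
filled = fill_holes(ring)
print(crop_to_foreground(filled))  # [[1, 1, 1], [1, 1, 1], [1, 1, 1]]
print(dilate([[0, 0, 0], [0, 1, 0], [0, 0, 0]], CROSS))  # [[0, 1, 0], [1, 1, 1], [0, 1, 0]]
```

Cropping to the region of interest reduces the 5 x 6 input to a 3 x 3 block, which reflects the stated purpose of region segmentation: less data for the later steps.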
Specifically, the complex function z(t) of the Fourier descriptor in step S4 is given by:
(2)
where t is the time variable and the series of coefficients in formula (2) is called the Fourier descriptor of curve C;
when the arc length s along the curve is used as the parameter in place of time, with L denoting the length of the curve, the Fourier descriptor is expressed as: (3).
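To show how a contour becomes a feature vector, the sketch below uses a common discrete formulation of Fourier descriptors, which is an assumption and not necessarily the exact form of formulas (2) and (3): each contour point (x, y) is treated as the complex number x + iy, and the descriptors are the discrete Fourier coefficients a_n of that sequence. Dropping a_0 removes the dependence on position, dividing by |a_1| removes scale, and taking magnitudes removes rotation and the choice of starting point. The function names are hypothetical, and the contour is assumed to be traversed counter-clockwise so that a_1 is non-zero.

```python
import cmath


def fourier_descriptors(points, count):
    """Return DFT coefficients a_0 .. a_{count-1} of a closed contour.

    Each contour point (x, y) is treated as the complex number x + iy.
    """
    z = [complex(x, y) for x, y in points]
    n_pts = len(z)
    if count > n_pts:
        raise ValueError("count cannot exceed the number of contour points")
    return [
        sum(z[k] * cmath.exp(-2j * cmath.pi * n * k / n_pts) for k in range(n_pts)) / n_pts
        for n in range(count)
    ]


def invariant_signature(points, count=3):
    """Return |a_n| / |a_1| for n = 2 .. count + 1.

    The result does not change under translation, scaling, rotation,
    or a different starting point on the same contour.
    """
    coeffs = fourier_descriptors(points, count + 2)
    scale = abs(coeffs[1])
    if scale == 0.0:
        raise ValueError("a_1 is zero; contour may be degenerate or clockwise")
    return [abs(a) / scale for a in coeffs[2:]]


rectangle = [(0, 0), (2, 0), (4, 0), (4, 1), (4, 2), (2, 2), (0, 2), (0, 1)]
square = [(0, 0), (1, 0), (2, 0), (2, 1), (2, 2), (1, 2), (0, 2), (0, 1)]
# The same rectangle rotated by 90 degrees, scaled by 3, shifted,
# and listed from a different starting point.
moved = [(-3 * y + 5, 3 * x - 2) for x, y in rectangle[3:] + rectangle[:3]]

sig_rectangle = invariant_signature(rectangle)
sig_moved = invariant_signature(moved)
sig_square = invariant_signature(square)
print(sig_rectangle)
print(sig_moved)
print(sig_square)
```

The first two signatures agree up to rounding, while the square yields a different vector, which is the sense in which one vector represents one contour. For the discrete sum to approximate an arc-length parameterization, the contour points would normally be resampled at even spacing along the curve.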
Specifically, the compression coding of audio signals in sub-step a of step S7 is divided into waveform coding, analysis-synthesis coding, and hybrid coding; the frequency range of the audio signal is 300 Hz to 3400 Hz.
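As one example of waveform coding for telephone-band speech, the sketch below applies mu-law companding followed by 8-bit uniform quantization. This particular coder is an illustrative assumption; the text names the coding categories but not a specific algorithm. The value mu = 255 is the one conventionally used in telephony, and the function names are hypothetical.

```python
import math

MU = 255.0  # companding constant conventionally used for 8-bit telephony


def mu_law_compress(x: float) -> float:
    """Map x in [-1, 1] to [-1, 1], expanding the resolution of small amplitudes."""
    x = max(-1.0, min(1.0, x))
    return math.copysign(math.log1p(MU * abs(x)) / math.log1p(MU), x)


def mu_law_expand(y: float) -> float:
    """Inverse of mu_law_compress."""
    return math.copysign(math.expm1(abs(y) * math.log1p(MU)) / MU, y)


def encode(x: float) -> int:
    """Compress, then quantize uniformly to an 8-bit code in 0..255."""
    y = mu_law_compress(x)
    return min(255, int((y + 1.0) / 2.0 * 256))


def decode(code: int) -> float:
    """Reconstruct the mid-point of the code's cell, then expand."""
    y = (code + 0.5) / 256 * 2.0 - 1.0
    return mu_law_expand(y)


for x in (-1.0, -0.25, 0.01, 0.5, 1.0):
    x_hat = decode(encode(x))
    print(x, encode(x), round(x_hat, 5))
```

Small amplitudes are reconstructed with a much smaller absolute error than large ones, which matches how speech is perceived and is the reason companding is used before uniform quantization.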
Specifically, the video image compression methods in sub-step b of step S7 include lossy compression and lossless compression.
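The difference between the two classes can be seen on a single synthetic frame. The sketch below is only an illustration under stated assumptions: it uses Python's standard `zlib` module as a generic lossless coder, and it models lossy compression by discarding the low-order bits of each 8-bit pixel before coding. Neither choice comes from the text, and practical video coders also exploit correlation between frames.

```python
import zlib


def lossless_roundtrip(pixels: bytes) -> tuple[int, bytes]:
    """Compress and decompress without loss; return (compressed size, restored data)."""
    packed = zlib.compress(pixels, 9)
    return len(packed), zlib.decompress(packed)


def lossy_roundtrip(pixels: bytes, drop_bits: int = 4) -> tuple[int, bytes]:
    """Discard the lowest drop_bits of every pixel, then compress losslessly."""
    mask = 0xFF & ~((1 << drop_bits) - 1)
    coarse = bytes(p & mask for p in pixels)
    packed = zlib.compress(coarse, 9)
    return len(packed), zlib.decompress(packed)


# Synthetic 64 x 64 grayscale frame: a horizontal ramp with a small deterministic texture.
frame = bytes((x * 2 + (x * 37 + y * 101) % 7) % 256 for y in range(64) for x in range(64))

size_lossless, restored_exact = lossless_roundtrip(frame)
size_lossy, restored_approx = lossy_roundtrip(frame)
max_error = max(abs(a - b) for a, b in zip(frame, restored_approx))

print(len(frame), size_lossless, size_lossy)
print(restored_exact == frame, max_error)
```

The lossless path restores every pixel exactly, while the lossy path trades a bounded per-pixel error (at most 15 gray levels with four dropped bits) for data that is easier to compress.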
In summary, the method processes the sampled multimedia information in sequence by image enhancement, image processing, region feature description, and image segmentation, which ensures the completeness of the information processing. The multimedia information is integrated and presented in a more systematic and intuitive way. Standardizing the processing allows audio, video, and images to be handled together and helps keep the generated graphical information vivid. Image feature recognition uses Fourier descriptors to identify vibration signals: contour features are transformed from the spatial domain into the frequency domain, and the frequency-domain information is extracted as the feature vector of the image. One vector thus represents one contour, which digitizes the contour so that different contours can be better distinguished and objects can be identified.
The foregoing is only a preferred embodiment of the present invention, but the scope of protection of the present invention is not limited to it. Any equivalent substitution or change made by a person skilled in the art, within the technical scope disclosed by the present invention and according to its technical solution and inventive concept, shall fall within the scope of protection of the present invention.
Claims (8)
1. A multimedia-based information processing method, characterized by comprising the following steps:
S1: multimedia information sampling: multimedia information data are acquired through an external device, under the influence of quantization noise and the receiver noise factor; sampling outputs discontinuous narrow pulses, and to output a digitized signal the instantaneous analog value obtained at the sampling output is held for a period of time;
S2: image enhancement: unclear images are made clearer and features of interest are emphasized; the differences between the features of different objects in the image are enlarged and features of no interest are suppressed, which improves image quality, enriches the information content, strengthens image interpretation and recognition, and meets the needs of analysis;
S3: image processing: the image is strengthened by morphological processing, including image dilation, hole filling, and region segmentation;
S4: feature representation and description of the identified region: the image skeleton is extracted, the Fourier descriptors of the image are extracted, and vibration signals are identified by means of the Fourier descriptors;
S5: image segmentation: the image to be segmented is segmented with emphasis on its key regions so that it can be accurately identified, analyzed, and understood, achieving extraction of the target from the image;
S6: information coding: the extracted target image and the audio signal are encoded and output;
S7: information compression, with the following steps:
a: compression coding of audio signals: audio signals are divided into telephone-quality speech, AM-broadcast-quality audio, and high-fidelity stereo signals; when the signal produced by the source contains redundancy, it is compressed; the input signal is analyzed and synthesized by an encoder into a binary coded signal, and the signal is output by a decoder;
b: compression coding of video signals: the data redundancy caused by the strong spatial and temporal correlation of images is eliminated to compress the signal to meet application requirements; the input signal is analyzed and synthesized by an encoder into a binary coded signal, and the signal is output by a decoder;
S8: audio information processing: ready-made material is used directly or after modification, or the material is created by the user;
S9: video information processing: video information is captured and then edited before it is used.
2. The multimedia-based information processing method according to claim 1, characterized in that the sampling rate in step S1 is given by:
f_s = 2.5 f_max (1).
3. The multimedia-based information processing method according to claim 1, characterized in that the image enhancement in step S2 includes frequency-domain methods and spatial-domain methods.
4. The multimedia-based information processing method according to claim 1, characterized in that the quantization in step S1 converts the continuous-amplitude sampled signal into a digital signal that is discrete in both time and amplitude, the main problem of quantization being quantization error.
5. The multimedia-based information processing method according to claim 1, characterized in that, in step S3, image dilation is based on reflecting the structuring element about its own origin and then shifting that reflection across the image; hole filling uses the imfill function in Matlab to fill holes in binary images, that is, to fill image regions and cavities; and region segmentation divides the data to be analyzed into regions, extracts the data segments of interest for further processing, and discards the other data, its main purpose being to reduce the amount of data in subsequent processing.
6. The multimedia-based information processing method according to claim 1, characterized in that the complex function z(t) of the Fourier descriptor in step S4 is given by:
(2)
where t is the time variable and the series of coefficients in formula (2) is called the Fourier descriptor of curve C;
when the arc length s along the curve is used as the parameter in place of time, with L denoting the length of the curve, the Fourier descriptor is expressed as: (3).
7. The multimedia-based information processing method according to claim 1, characterized in that the compression coding of audio signals in sub-step a of step S7 is divided into waveform coding, analysis-synthesis coding, and hybrid coding, and the frequency range of the audio signal is 300 Hz to 3400 Hz.
8. The multimedia-based information processing method according to claim 1, characterized in that the video image compression methods in sub-step b of step S7 include lossy compression and lossless compression.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910647346.3A CN110322421A (en) | 2019-07-17 | 2019-07-17 | A multimedia-based information processing method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910647346.3A CN110322421A (en) | 2019-07-17 | 2019-07-17 | A multimedia-based information processing method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110322421A (en) | 2019-10-11 |
Family
ID=68123755
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910647346.3A Pending CN110322421A (en) | A multimedia-based information processing method | 2019-07-17 | 2019-07-17 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110322421A (en) |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0616456A2 (en) * | 1993-02-19 | 1994-09-21 | Canon Kabushiki Kaisha | Multimedia communication system, transmitter and receiver therefor |
CN1149795A (en) * | 1995-11-02 | 1997-05-14 | 邝冬英 | Multi media digital transmission broadcasting system |
CN1492632A (en) * | 2002-10-23 | 2004-04-28 | 联想(北京)有限公司 | Multimedia system based on digital household network |
CN2922341Y (en) * | 2006-07-13 | 2007-07-11 | 中兴通讯股份有限公司 | Video meeting terminal capable of realizing high-definition video signal input and output |
CN101621294A (en) * | 2009-07-29 | 2010-01-06 | 北京中星微电子有限公司 | Control logical circuit and successive approximation analog-to-digital converter |
CN106874888A (en) * | 2017-03-13 | 2017-06-20 | 无锡亚天光电科技有限公司 | A kind of feature by distributed optical fiber vibration signal pattern strengthens and signal processing method |
CN106991381A (en) * | 2017-03-13 | 2017-07-28 | 无锡亚天光电科技有限公司 | A kind of distributed optical fiber vibration signal Recognition Algorithm based on two-dimensional matrix feature recognition |
Non-Patent Citations (5)
Title |
---|
作业帮用户: "简述多媒体信息数字化的主要步骤以及每步的主要功能(要快!)", 《HTTPS://WWW.ZYBANG.COM/QUESTION/8A11CF73B49703A8A426CB5346824752.HTML#TOP》 * |
张俊: "浅议多媒体信息处理技术", 《民营科技》 * |
百度百科: "图像增强", 《HTTPS://BAIKE.BAIDU.COM/HISTORY/%E5%9B%BE%E5%83%8F%E5%A2%9E%E5%BC%BA/5199407/130451697》 * |
百度百科: "形状识别", 《HTTPS://BAIKE.BAIDU.COM/ITEM/%E5%BD%A2%E7%8A%B6%E8%AF%86%E5%88%AB/20723894》 * |
陈明: "《多媒体技术基础》", 31 August 2000 * |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | Application publication date: 20191011 | |