Vaca et al., 2019

An open audio processing platform with Zync FPGA

Document ID: 1260083398550807175
Author: Vaca K; Jefferies M; Yang X
Publication year: 2019
Publication venue: 2019 IEEE International Symposium on Measurement and Control in Robotics (ISMCR)

Snippet

This paper presents an open audio processing platform on Zync7000 Field-Programmable Gate Array (FPGA), capable of collecting analog frequency signals through a microphone, and pushing out a data set of frequencies and amplitudes to a UART interface. The validity …

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G06F17/5009—Computer-aided design using simulation
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS
- G10H2210/00—Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
- G10H2210/031—Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS
- G10H2250/00—Aspects of algorithms or signal processing methods without intrinsic musical character, yet specifically adapted for or used in electrophonic musical processing
- G10H2250/471—General musical sound synthesis principles, i.e. sound category-independent synthesis methods
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/14—Fourier, Walsh or analogous domain transformations, e.g. Laplace, Hilbert, Karhunen-Loeve, transforms
- G06F17/141—Discrete Fourier transforms
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS
- G10H1/00—Details of electrophonic musical instruments
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS
- G10H7/00—Instruments in which the tones are synthesised from a data store, e.g. computer organs

Similar Documents

Publication	Publication Date	Title
Bittner et al.	2017	Deep Salience Representations for F0 Estimation in Polyphonic Music.
Cheuk et al.	2020	nnAudio: An on-the-fly GPU audio to spectrogram conversion toolbox using 1D convolutional neural networks
Vaca et al.	2019	An open audio processing platform with Zync FPGA
US10430154B2 (en)	2019-10-01	Tonal/transient structural separation for audio effects
CN108962279A (en)	2018-12-07	New Method for Instrument Recognition and device, electronic equipment, the storage medium of audio data
Tachibana et al.	2014	Harmonic/percussive sound separation based on anisotropic smoothness of spectrograms
Vaca et al.	2019	Real-time automatic music transcription (AMT) with Zync FPGA
Bahoura et al.	2013	Hardware implementation of MFCC feature extraction for respiratory sounds analysis
Qi et al.	2013	Bottleneck features based on gammatone frequency cepstral coefficients.
TWI740315B (en)	2021-09-21	Sound separation method, electronic and computer readable storage medium
CN110534091A (en)	2019-12-03	A kind of people-car interaction method identified based on microserver and intelligent sound
Kadyan et al.	2023	Prosody features based low resource Punjabi children ASR and T-NT classifier using data augmentation
Nakajima et al.	2018	Monaural source enhancement maximizing source-to-distortion ratio via automatic differentiation
Wu et al.	2011	Multipitch estimation by joint modeling of harmonic and transient sounds
Kronvall et al.	2015	Sparse chroma estimation for harmonic audio
Ghosh et al.	2012	A comparative study of performance of FPGA based mel filter bank & bark filter bank
Zhang	2019	Application of audio visual tuning detection software in piano tuning teaching
Hu et al.	2024	A multi-task learning speech synthesis optimization method based on CWT: a case study of Tacotron2
Diel et al.	2024	Efficient FPGA implementation for sound source separation using direction-informed multichannel non-negative matrix factorization
Ehkan et al.	2015	Hardware implementation of MFCC-based feature extraction for speaker recognition
Kumar et al.	2013	Performance evaluation of a wavelet-based pitch detection scheme
de Souza et al.	2022	Multitaper-mel spectrograms for keyword spotting
Tsai et al.	2022	Hardware design for blind source separation using fast time-frequency mask technique
Ezers et al.	2021	Musical Instruments Recognition App
CN114512141B (en)	2024-09-13	Audio separation method, device, equipment, storage medium and program product