Epoch Extraction Based on Integrated Linear Prediction Residual Using Plosion Index
An epoch is defined as the instant of significant excitation within a pitch period of voiced speech. Epoch extraction continues to attract the interest of researchers because of its significance in speech analysis. Existing high-performance epoch ...
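As background for the quantities named in the title, the following is a minimal, illustrative Python sketch (not the authors' implementation) of how a linear prediction residual can be obtained by inverse filtering and how a plosion-index-style peak-to-local-mean ratio can be computed from it. The LPC order, the window parameters `m1` and `m2`, and the synthetic test signal are arbitrary choices for illustration; the paper's integrated LP residual and its exact plosion index definition differ in detail.

```python
import numpy as np
from scipy.signal import lfilter

def lpc_coeffs(x, order):
    """LPC via the autocorrelation (Yule-Walker) method; returns
    [1, -a1, ..., -ap] so that filtering with it yields the residual."""
    r = np.correlate(x, x, mode="full")[len(x) - 1:]
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    a = np.linalg.solve(R + 1e-6 * np.eye(order), r[1:order + 1])
    return np.concatenate(([1.0], -a))

def lp_residual(x, order=12):
    """Inverse-filter the whole signal with a single LPC fit
    (a toy setting; frame-wise analysis would be used in practice)."""
    return lfilter(lpc_coeffs(x, order), [1.0], x)

def plosion_index(x, m1=10, m2=160):
    """Ratio of |x[n]| to the mean of |x| over m2 samples ending m1
    samples before n -- a peak-to-local-average measure in the spirit
    of the plosion index."""
    ax, pi = np.abs(x), np.zeros_like(x)
    for n in range(m1 + m2, len(x)):
        pi[n] = ax[n] / (ax[n - m1 - m2:n - m1].mean() + 1e-12)
    return pi

# Toy usage: an impulse train through an AR filter stands in for voiced speech.
fs = 8000
exc = np.zeros(fs // 2)
exc[::80] = 1.0                                   # 100 Hz impulse train
speech = lfilter([1.0], [1.0, -1.8, 0.95], exc)   # crude vocal-tract-like resonance
pi = plosion_index(lp_residual(speech))
print("strongest residual peaks near samples:", np.sort(np.argsort(pi)[-5:]))
```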
Body Conducted Speech Enhancement by Equalization and Signal Fusion
This paper studies body-conducted speech for noise-robust speech processing purposes. As body-conducted speech is typically limited in bandwidth, signal processing is required to obtain a signal that is both high in quality and low in noise. We propose ...
Soundfield Imaging in the Ray Space
In this work we propose a general approach to acoustic scene analysis based on a novel data structure (ray-space image) that encodes the directional plenacoustic function over a line segment (Observation Window, OW). We define and describe a system for ...
Cross-Lingual Automatic Speech Recognition Using Tandem Features
Automatic speech recognition depends on large amounts of transcribed speech recordings in order to estimate the parameters of the acoustic model. Recording such large speech corpora is time-consuming and expensive; as a result, sufficient quantities of ...
Dominance Based Integration of Spatial and Spectral Features for Speech Enhancement
This paper proposes a versatile technique for integrating two conventional speech enhancement approaches, a spatial clustering approach (SCA) and a factorial model approach (FMA), which are based on two different features of signals, namely spatial and ...
Linearly-Constrained Minimum-Variance Method for Spherical Microphone Arrays Based on Plane-Wave Decomposition of the Sound Field
Speech signals recorded in real environments may be corrupted by ambient noise and reverberation. Therefore, noise reduction and dereverberation algorithms for speech enhancement are typically employed in speech communication systems. Although ...
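For context, a generic narrowband LCMV beamformer has the well-known closed-form solution $w = R^{-1}C(C^{H}R^{-1}C)^{-1}g$, which minimizes output power subject to linear constraints. The Python sketch below illustrates that standard formula with placeholder steering vectors; it does not reproduce the paper's plane-wave-decomposition formulation for spherical microphone arrays.

```python
import numpy as np

def lcmv_weights(R, C, g):
    """Standard LCMV solution: minimize w^H R w subject to C^H w = g,
    giving w = R^{-1} C (C^H R^{-1} C)^{-1} g."""
    Rinv_C = np.linalg.solve(R, C)
    return Rinv_C @ np.linalg.solve(C.conj().T @ Rinv_C, g)

# Toy example: M sensors, a distortionless constraint toward the desired
# source and a null toward an interferer. The steering vectors are arbitrary
# placeholders, not a spherical-harmonic or plane-wave model.
M = 8
rng = np.random.default_rng(0)
d_src = np.exp(1j * np.pi * 0.3 * np.arange(M))   # hypothetical source direction
d_int = np.exp(1j * np.pi * 0.7 * np.arange(M))   # hypothetical interferer direction
N = (rng.standard_normal((M, 1000)) + 1j * rng.standard_normal((M, 1000))) / np.sqrt(2)
R = N @ N.conj().T / 1000 + 0.1 * np.eye(M)       # loaded sample noise covariance

C = np.stack([d_src, d_int], axis=1)              # M x 2 constraint matrix
g = np.array([1.0, 0.0])                          # pass the source, null the interferer
w = lcmv_weights(R, C, g)
print("gain toward source:    ", abs(w.conj() @ d_src))   # ~1
print("gain toward interferer:", abs(w.conj() @ d_int))   # ~0
```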
Source/Filter Factorial Hidden Markov Model, With Application to Pitch and Formant Tracking
Tracking vocal tract formant frequencies $(f_{p})$ and estimating the fundamental frequency $(f_{0})$ are two problems that have been tackled in many speech processing works, often independently, with applications to articulatory parameters ...
A Bag of Systems Representation for Music Auto-Tagging
We present a content-based automatic tagging system for music that relies on a high-level, concise “Bag of Systems” (BoS) representation of the characteristics of a musical piece. The BoS representation leverages a rich dictionary of musical codewords, ...
HMM Based Intermediate Matching Kernel for Classification of Sequential Patterns of Speech Using Support Vector Machines
In this paper, we address the issues in the design of an intermediate matching kernel (IMK) for classification of sequential patterns using a support vector machine (SVM) based classifier for tasks such as speech recognition. Specifically, we address the ...
Geometry-Based Spatial Sound Acquisition Using Distributed Microphone Arrays
Traditional spatial sound acquisition aims at capturing a sound field with multiple microphones such that at the reproduction side a listener can perceive the sound image as it was at the recording location. Standard techniques for spatial sound ...
A Class of Optimal Rectangular Filtering Matrices for Single-Channel Signal Enhancement in the Time Domain
In this paper, we introduce a new class of optimal rectangular filtering matrices for single-channel speech enhancement. The new class of filters exploits the fact that the dimension of the signal subspace is lower than that of the full space. By doing ...
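To give a flavour of time-domain signal-subspace filtering, the sketch below builds a rank-reduced, Wiener-style filtering matrix from the eigendecomposition of a noisy-frame covariance estimate under a white-noise assumption. It is a generic illustration only: the filtering matrix here is square, whereas the rectangular matrices proposed in the paper exploit the reduced subspace dimension differently.

```python
import numpy as np

def subspace_filter_matrix(frames, noise_var, rank):
    """Generic signal-subspace filter: eigendecompose the noisy-frame
    covariance, keep `rank` principal directions, and apply Wiener-like
    gains (lam - sigma^2) / lam in that subspace."""
    Ry = frames @ frames.T / frames.shape[1]       # L x L covariance estimate
    lam, U = np.linalg.eigh(Ry)                    # ascending eigenvalues
    lam, U = lam[::-1], U[:, ::-1]                 # reorder to descending
    gains = np.clip((lam[:rank] - noise_var) / lam[:rank], 0.0, 1.0)
    Ur = U[:, :rank]
    return Ur @ np.diag(gains) @ Ur.T              # L x L filtering matrix

# Toy usage: enhance non-overlapping frames of a noisy sinusoid.
L, fs = 40, 8000
t = np.arange(2 * fs) / fs
clean = np.sin(2 * np.pi * 440 * t)
noise_var = 0.1
noisy = clean + np.sqrt(noise_var) * np.random.default_rng(1).standard_normal(t.size)
frames = np.stack([noisy[i:i + L] for i in range(0, t.size - L + 1, L)], axis=1)
clean_frames = np.stack([clean[i:i + L] for i in range(0, t.size - L + 1, L)], axis=1)
H = subspace_filter_matrix(frames, noise_var, rank=4)
enhanced = H @ frames
print("MSE before:", np.mean((frames - clean_frames) ** 2),
      "after:", np.mean((enhanced - clean_frames) ** 2))
```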
Understanding Effects of Subjectivity in Measuring Chord Estimation Accuracy
To assess the performance of an automatic chord estimation system, reference annotations are indispensable. However, owing to the complexity and sometimes ambiguous harmonic structure of polyphonic music, chord annotations are inherently ...
Investigations on an EM-Style Optimization Algorithm for Discriminative Training of HMMs
Today's speech recognition systems are based on hidden Markov models (HMMs) with Gaussian mixture models whose parameters are estimated using a discriminative training criterion such as Maximum Mutual Information (MMI) or Minimum Phone Error (MPE). ...
Declipping of Audio Signals Using Perceptual Compressed Sensing
The restoration of clipped audio signals, commonly known as declipping, is important to achieve an improved level of audio quality in many audio applications. In this paper, a novel declipping algorithm is presented, jointly based on the theory of ...
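As a simplified illustration of the compressed-sensing view of declipping (without the perceptual weighting that is central to the paper), the sketch below restores hard-clipped samples by finding a sparse DCT representation that agrees with the reliable (unclipped) samples, using a small orthogonal matching pursuit loop. The test signal, clipping level, and sparsity level are arbitrary.

```python
import numpy as np
from scipy.fft import idct

def omp(A, y, k):
    """Plain orthogonal matching pursuit: greedily select k columns of A
    and least-squares fit y on the selected support."""
    residual, support = y.copy(), []
    x = np.zeros(A.shape[1])
    for _ in range(k):
        corr = np.abs(A.T @ residual)
        corr[support] = 0.0                        # do not re-pick chosen atoms
        support.append(int(np.argmax(corr)))
        coef, *_ = np.linalg.lstsq(A[:, support], y, rcond=None)
        residual = y - A[:, support] @ coef
    x[support] = coef
    return x

# Toy signal: two cosines aligned with DCT-II atoms (so it is exactly
# 2-sparse in the DCT domain), hard-clipped at +/- 0.6.
N = 256
n = np.arange(N)
clean = (0.7 * np.cos(np.pi * (2 * n + 1) * 32 / (2 * N))
         + 0.4 * np.cos(np.pi * (2 * n + 1) * 80 / (2 * N)))
clipped = np.clip(clean, -0.6, 0.6)
reliable = np.abs(clipped) < 0.6                   # samples assumed unclipped

D = idct(np.eye(N), axis=0, norm="ortho")          # DCT synthesis dictionary (columns)
coeffs = omp(D[reliable, :], clipped[reliable], k=8)
restored = D @ coeffs
restored[reliable] = clipped[reliable]             # keep reliable samples as observed
print("max error on clipped samples:",
      np.max(np.abs(restored[~reliable] - clean[~reliable])))
```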
List of Reviewers
Lists the reviewers who contributed to IEEE Transactions on Audio, Speech, and Language Processing in 2013.