A multichannel subspace approach with signal presence probability for speech enhancement

Research output: Contribution to journalArticleResearchpeer-review

Abstract

For the last few decades, speech enhancement based on microphone arrays has primarily utilized prior information about system models, e.g., array geometry and source location. However, estimation of the time delay to align microphone inputs is largely affected by reverberation and microphone mismatch. Preprocessing time aligning, e.g., fixed beamforming (the first branch of the generalized sidelobe canceller), is not desirable in general applications. Recently, interest has shifted to linear filtering, which works with only second-order statistics of noisy input and estimated noise. This paper proposes a linear filter design based on a multichannel subspace approach for speech enhancement. The contribution of the proposed multichannel subspace methods is threefold. First, a linear filter is applied to the multichannel frequency domain using a spatiospectral correlation matrix. Next, three types of multichannel signal presence probability (MC-SPP) are derived in the subspace domain. Third, incorporating the MC-SPPs into the gain modification of the linear filter achieves further improved noise reduction performance. Of the gain modifications, the proposed gain modification with subspace probability related to the eigenvector corresponding to the maximum eigenvalue realized the best noise reduction performance. The evaluation on average improved the proposed subspace-based methods by approximately 4 dB in overall SNR while maintaining a similar cepstral distance measured over the minimum variance distortionless response with the state-of-the-art relative transfer function estimation in adverse noisy environments.

Original languageEnglish
JournalMultidimensional Systems and Signal Processing
DOIs
StatePublished - 2019 Jan 1

Fingerprint

Speech Enhancement
Speech enhancement
Microphones
Linear Filter
Subspace
Noise abatement
Noise Reduction
Reverberation
Beamforming
Linear Filtering
Microphone Array
Eigenvalues and eigenfunctions
Subspace Methods
Function Estimation
Transfer functions
Minimum Variance
Filter Design
Time delay
Correlation Matrix
Information systems

Keywords

  • Multichannel subspace approach
  • Signal presence probability
  • Spatiospectral correlation matrix
  • Speech enhancement

Cite this

@article{d92bed845d8e4595a14384d852498023,
title = "A multichannel subspace approach with signal presence probability for speech enhancement",
abstract = "For the last few decades, speech enhancement based on microphone arrays has primarily utilized prior information about system models, e.g., array geometry and source location. However, estimation of the time delay to align microphone inputs is largely affected by reverberation and microphone mismatch. Preprocessing time aligning, e.g., fixed beamforming (the first branch of the generalized sidelobe canceller), is not desirable in general applications. Recently, interest has shifted to linear filtering, which works with only second-order statistics of noisy input and estimated noise. This paper proposes a linear filter design based on a multichannel subspace approach for speech enhancement. The contribution of the proposed multichannel subspace methods is threefold. First, a linear filter is applied to the multichannel frequency domain using a spatiospectral correlation matrix. Next, three types of multichannel signal presence probability (MC-SPP) are derived in the subspace domain. Third, incorporating the MC-SPPs into the gain modification of the linear filter achieves further improved noise reduction performance. Of the gain modifications, the proposed gain modification with subspace probability related to the eigenvector corresponding to the maximum eigenvalue realized the best noise reduction performance. The evaluation on average improved the proposed subspace-based methods by approximately 4 dB in overall SNR while maintaining a similar cepstral distance measured over the minimum variance distortionless response with the state-of-the-art relative transfer function estimation in adverse noisy environments.",
keywords = "Multichannel subspace approach, Signal presence probability, Spatiospectral correlation matrix, Speech enhancement",
author = "Jung-Pyo Hong",
year = "2019",
month = "1",
day = "1",
doi = "10.1007/s11045-019-00640-z",
language = "English",
journal = "Multidimensional Systems and Signal Processing",
issn = "0923-6082",

}

A multichannel subspace approach with signal presence probability for speech enhancement. / Hong, Jung-Pyo.

In: Multidimensional Systems and Signal Processing, 01.01.2019.

Research output: Contribution to journalArticleResearchpeer-review

TY - JOUR

T1 - A multichannel subspace approach with signal presence probability for speech enhancement

AU - Hong, Jung-Pyo

PY - 2019/1/1

Y1 - 2019/1/1

N2 - For the last few decades, speech enhancement based on microphone arrays has primarily utilized prior information about system models, e.g., array geometry and source location. However, estimation of the time delay to align microphone inputs is largely affected by reverberation and microphone mismatch. Preprocessing time aligning, e.g., fixed beamforming (the first branch of the generalized sidelobe canceller), is not desirable in general applications. Recently, interest has shifted to linear filtering, which works with only second-order statistics of noisy input and estimated noise. This paper proposes a linear filter design based on a multichannel subspace approach for speech enhancement. The contribution of the proposed multichannel subspace methods is threefold. First, a linear filter is applied to the multichannel frequency domain using a spatiospectral correlation matrix. Next, three types of multichannel signal presence probability (MC-SPP) are derived in the subspace domain. Third, incorporating the MC-SPPs into the gain modification of the linear filter achieves further improved noise reduction performance. Of the gain modifications, the proposed gain modification with subspace probability related to the eigenvector corresponding to the maximum eigenvalue realized the best noise reduction performance. The evaluation on average improved the proposed subspace-based methods by approximately 4 dB in overall SNR while maintaining a similar cepstral distance measured over the minimum variance distortionless response with the state-of-the-art relative transfer function estimation in adverse noisy environments.

AB - For the last few decades, speech enhancement based on microphone arrays has primarily utilized prior information about system models, e.g., array geometry and source location. However, estimation of the time delay to align microphone inputs is largely affected by reverberation and microphone mismatch. Preprocessing time aligning, e.g., fixed beamforming (the first branch of the generalized sidelobe canceller), is not desirable in general applications. Recently, interest has shifted to linear filtering, which works with only second-order statistics of noisy input and estimated noise. This paper proposes a linear filter design based on a multichannel subspace approach for speech enhancement. The contribution of the proposed multichannel subspace methods is threefold. First, a linear filter is applied to the multichannel frequency domain using a spatiospectral correlation matrix. Next, three types of multichannel signal presence probability (MC-SPP) are derived in the subspace domain. Third, incorporating the MC-SPPs into the gain modification of the linear filter achieves further improved noise reduction performance. Of the gain modifications, the proposed gain modification with subspace probability related to the eigenvector corresponding to the maximum eigenvalue realized the best noise reduction performance. The evaluation on average improved the proposed subspace-based methods by approximately 4 dB in overall SNR while maintaining a similar cepstral distance measured over the minimum variance distortionless response with the state-of-the-art relative transfer function estimation in adverse noisy environments.

KW - Multichannel subspace approach

KW - Signal presence probability

KW - Spatiospectral correlation matrix

KW - Speech enhancement

UR - http://www.scopus.com/inward/record.url?scp=85063230523&partnerID=8YFLogxK

U2 - 10.1007/s11045-019-00640-z

DO - 10.1007/s11045-019-00640-z

M3 - Article

JO - Multidimensional Systems and Signal Processing

JF - Multidimensional Systems and Signal Processing

SN - 0923-6082

ER -