Noise Reduction via Speech Modeling

Conventional methods of de-noising speech signal when SNR values are small positive or even negative fail to preserve speech and limit its distortions. One reason is that in the case of non-stationary noise there is very limited time for noise reduction algorithms to adapt to ever changing noisy conditions. Thus the signal separation (desired signal versus the distractors) limits performance. Alternatives based on speech modeling have been researched for some time and advances in this area bring competitive solutions which generate results of practical value. Among these, two specific approaches are worth mentioning here. They are (cf. Ref. 1):

Linear-Prediction-Based Noise Reduction, and
Hidden-Markov-Model-Based Noise Reduction

Figure 1 illustrates the Speech Enhancement concept of noise reduction via the LP-like approach

The concept of noise reduction via the LP-like approach is referred to in [2]. The original idea of using speech modeling for speech enhancement was based on grouping approximately 40 phonemes in the English language into five classes, such as stops, fricatives, glides, vowels and nasals. (These classes could be considered as a loose precursor or a distant variant of a codebook used in the low-bit-rate codecs such as CELP). Each phoneme is processed by a separate filter designed specifically for the given class. The main purpose of filtering by the separate filters is to remove any intra-class mischaracterization.

The operation of the method illustrated in Figure 1 is as follows. The input signal (speech and noise) is parsed into phonemes. The decision block makes the determination of which class the given phoneme belongs to and then routes the respective segment with speech and noise to the appropriate filter. The filters allow the speech segments belonging to the respective filters to go through while the noise components are filtered out in this process. The filter outputs are combined and the clean speech is routed through the remaining part of the Speech Enhancements System.

More information about speech-model-based approaches to Speech Enhancement is provided in [3].

VOCAL’s Voice Enhancement solutions include noise reduction software solutions that have been tested in typical acoustic environment. These solutions can be modified to fit custom specification and they can be used in conjunction with speech-model-based solutions if required. Contact us to discuss your specific speech application requirements.

REFERENCES

Fundamentals of Noise Reduction, by Chen, J, et el, Handbook of Speech Processing by Benesty, J. et el., Springer, 2008
Single Channel Noise Reduction (Section 4), Sound Capture and Processing, by Tashev, I., A John Wiley and Sons, LTD., Publishing 2009
Model-Based Speech Enhancement
Voice Enhancement Design