Complete Communications Engineering

Post-filtering in echo cancellation systems is important in environments with high levels of non-linear distortion. Non-linear distortion in the echo path can arise from the loudspeaker-microphone enclosure and from overdriven loudspeaker signals. It also arises when echo cancellation is moved away from the loudspeaker-microphone enclosure to a centralized location, such as an IP gateway, because low bit-rate vocoders such as MELP and ITU G.722.2 are then located in the echo path. The adaptive filters of an echo canceller can cancel only the linear portion of the echo path, so the non-linear portion remains and a post-filter is required to remove the residual echo.


Figure 1: Non-linear Acoustic Echo Cancellation (AEC)
The goal of a post-filtering system is to design a filter that, when convolved with the output of the echo canceller, suppresses the residual echo so that only the near-end speech remains. This is very similar to most noise reduction systems. In the frequency domain, Ŝ(ω) = H(ω)(S(ω) + D(ω)), where Ŝ(ω) is the estimate of the near-end speech, S(ω) is the desired near-end speech, H(ω) is the spectral weighting applied by the post-filter, and D(ω) is the spectrum of the residual echo or noise. As in noise reduction, H(ω) is determined by the signal-to-noise ratio,

$$H(\omega) = \frac{\mathrm{SNR}(\omega)}{\mathrm{SNR}(\omega) + 1} \tag{1}$$

or

$$H(\omega) = 1 - \frac{\hat{D}(\omega)}{S(\omega) + D(\omega)} \tag{2}$$

where D̂(ω) is the estimate of the residual echo.
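Forms (1) and (2) give the same gain when the SNR is taken as the ratio of near-end speech power to residual echo power and the residual echo estimate is exact. The following is a minimal Python sketch of the per-bin gain under those assumptions. The function names, the use of power spectra, and the gain floor are illustrative choices, not part of the description above.

```python
def gain_from_snr(snr):
    """Equation (1): per-bin gain from a linear, non-negative SNR."""
    return [s / (s + 1.0) for s in snr]


def gain_from_echo_estimate(output_power, echo_power_est, floor=0.0):
    """Equation (2): per-bin gain from the estimated residual echo power.

    output_power   -- power of the echo canceller output (S + D) per bin
    echo_power_est -- estimated residual echo power per bin
    floor          -- assumed lower limit on the gain (hypothetical parameter)
    """
    gains = []
    for p_out, p_echo in zip(output_power, echo_power_est):
        if p_out <= 0.0:
            gains.append(floor)  # empty bin: nothing to pass through
            continue
        g = 1.0 - p_echo / p_out
        gains.append(min(1.0, max(floor, g)))  # keep the gain in [floor, 1]
    return gains


# Three example bins: speech-dominated, balanced, echo-dominated.
speech_power = [9.0, 1.0, 0.1]
echo_power = [1.0, 1.0, 0.9]
output_power = [s + d for s, d in zip(speech_power, echo_power)]
snr = [s / d for s, d in zip(speech_power, echo_power)]

g1 = gain_from_snr(snr)
g2 = gain_from_echo_estimate(output_power, echo_power)
```

In this example both functions return gains of roughly 0.9, 0.5, and 0.1 for the three bins, so the echo-dominated bin is attenuated the most.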

The main challenge is to get an accurate estimate of the residual echo spectrum. There are several approaches to determining this value. One method is to first determine when the system is in an echo-only state. Once this state has been detected, the ratio of the long-term average of the error power to the long-term average of the excitation power can be used as a coupling factor, β(ω). This coupling factor provides an estimate of the attenuation achieved through the system. Therefore, D̂(ω) = β(ω)X(ω), where X(ω) is the spectrum of the excitation signal, can be used as the estimate of the residual echo in (2). As (2) illustrates, in systems with high levels of non-linear distortion, such as those in packet-based telephony, D̂(ω) will be large and H(ω) will approach 0. Hence, more residual echo will be suppressed.
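The coupling-factor approach can be sketched in Python as follows. This is an illustration under stated assumptions, not a definitive implementation: the class name, the recursive-averaging constant, and the gain floor are hypothetical, the echo-only decision is assumed to come from a separate detector and is passed in as a flag, and all quantities are treated as per-bin power spectra.

```python
class CouplingEstimator:
    """Tracks long-term error and excitation power per frequency bin."""

    def __init__(self, num_bins, smoothing=0.9):
        self.smoothing = smoothing         # assumed recursive-averaging constant
        self.err_power = [0.0] * num_bins  # long-term power of the canceller output
        self.exc_power = [0.0] * num_bins  # long-term power of the excitation signal

    def update(self, err_power, exc_power, echo_only):
        """Update the long-term averages, but only during echo-only frames."""
        if not echo_only:
            return
        a = self.smoothing
        for k, (pe, px) in enumerate(zip(err_power, exc_power)):
            self.err_power[k] = a * self.err_power[k] + (1.0 - a) * pe
            self.exc_power[k] = a * self.exc_power[k] + (1.0 - a) * px

    def coupling(self, eps=1e-12):
        """Return beta per bin: long-term error power over excitation power."""
        return [pe / (px + eps) for pe, px in zip(self.err_power, self.exc_power)]


def post_filter_gain(err_power, exc_power, beta, floor=0.0):
    """Gain of equation (2) with the echo estimate beta * excitation power."""
    gains = []
    for pe, px, b in zip(err_power, exc_power, beta):
        if pe <= 0.0:
            gains.append(floor)  # empty bin: nothing to pass through
            continue
        gains.append(min(1.0, max(floor, 1.0 - b * px / pe)))
    return gains


# Train on echo-only frames: bin 0 has a poorly cancelled echo, bin 1 a well cancelled one.
est = CouplingEstimator(num_bins=2)
for _ in range(200):
    est.update([0.5, 0.01], [1.0, 1.0], echo_only=True)
beta = est.coupling()

# Echo-only frame versus a frame that also contains near-end speech power.
g_echo = post_filter_gain([0.5, 0.01], [1.0, 1.0], beta)
g_double_talk = post_filter_gain([2.5, 2.01], [1.0, 1.0], beta)
```

With these example values, the gain in the echo-only frame is close to 0 in both bins, while in the frame with near-end speech it rises to about 0.8 in the bin with the larger coupling factor and close to 1 in the other.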
