Resolution of the angle of arrival of speech signals impinging on a microphone array is critical for beamforming and noise reduction in online applications. Due to computation and memory constraints of embedded DSPs, an efficient but fast algorithm is required to achieve any meaningful gains for beamforming. In speech processing, frame lengths on orders of milliseconds are typical, further buttressing the need for a quick resolution of algorithms. The problem we wish to tackle is as follow: Given frame recordings of a single source of speech, one frame recording for each microphone, each frame having samples, determine the angle of arrival of the signal assuming a far field model. Figure 1 illustrates a typical microphone array.
Figure 1: Square microphone array.
We limit our analysis to the -D case for brevity of presentation. The general approach is to use cross-correlation to find the delay between paired microphones. GCC-PHAT however requires additions and multiplications because the DFT’s of both signals need to be computed. Here, . A faster approach will be to find the cross-correlations by leveraging the known maximum delay between a pair of microphones. This approach reduces the number of computations to , where L is the maximum number of samples that can be delayed. The maximum delay on a typical DSP platform with a microphone spacing of will be as small as 2 samples using a sampling rate of , thus making our approach at least 4 times faster than the so called GCC-PHAT and other DFT based approaches. For the illustrated microphone array in Figure 1, define the time delay between microphone and microphone as , with and being the arrival times of a common sample data. Define the speed of sound as c. Let be the distance between consecutive microphones and , as labeled on Figure 1, such that . Also let the distance between and be for . Then the time difference of arrival of signals at the microphone arrays obey the following:
Here, the ‘s are estimated using pairwise correlations using:
where the ‘s are sampled data at microphone . The inter-sample delays may also be dealt with by simply using the adjacent bins near the peak. For example, suppose the peak of the correlation was at the sample, then we will use the intersample value of:
As a custom design house, VOCAL Technologies’ angle of arrival algorithms are applicable to a wide range of the microphone arrays that exist in reverberation environments. The selection of the algorithm is based on the requirements of the application and the available hardware configuration.